1
|
Fruchart M, Quindroit P, Jacquemont C, Beuscart JB, Calafiore M, Lamer A. Transforming Primary Care Data Into the Observational Medical Outcomes Partnership Common Data Model: Development and Usability Study. JMIR Med Inform 2024; 12:e49542. [PMID: 39140273 DOI: 10.2196/49542] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2023] [Revised: 04/11/2024] [Accepted: 04/11/2024] [Indexed: 08/15/2024] Open
Abstract
Background Patient-monitoring software generates a large amount of data that can be reused for clinical audits and scientific research. The Observational Health Data Sciences and Informatics (OHDSI) consortium developed the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM) to standardize electronic health record data and promote large-scale observational and longitudinal research. Objective This study aimed to transform primary care data into the OMOP CDM format. Methods We extracted primary care data from electronic health records at a multidisciplinary health center in Wattrelos, France. We performed structural mapping between the design of our local primary care database and the OMOP CDM tables and fields. Local French vocabularies concepts were mapped to OHDSI standard vocabularies. To validate the implementation of primary care data into the OMOP CDM format, we applied a set of queries. A practical application was achieved through the development of a dashboard. Results Data from 18,395 patients were implemented into the OMOP CDM, corresponding to 592,226 consultations over a period of 20 years. A total of 18 OMOP CDM tables were implemented. A total of 17 local vocabularies were identified as being related to primary care and corresponded to patient characteristics (sex, location, year of birth, and race), units of measurement, biometric measures, laboratory test results, medical histories, and drug prescriptions. During semantic mapping, 10,221 primary care concepts were mapped to standard OHDSI concepts. Five queries were used to validate the OMOP CDM by comparing the results obtained after the completion of the transformations with the results obtained in the source software. Lastly, a prototype dashboard was developed to visualize the activity of the health center, the laboratory test results, and the drug prescription data. Conclusions Primary care data from a French health care facility have been implemented into the OMOP CDM format. Data concerning demographics, units, measurements, and primary care consultation steps were already available in OHDSI vocabularies. Laboratory test results and drug prescription data were mapped to available vocabularies and structured in the final model. A dashboard application provided health care professionals with feedback on their practice.
Collapse
Affiliation(s)
- Mathilde Fruchart
- Univ Lille, CHU Lille, ULR 2694 - METRICS: Évaluation des Technologies de santé et des, Pratiques médicales, 2 Place de Verdun, Lille, F-59000, France
| | - Paul Quindroit
- Univ Lille, CHU Lille, ULR 2694 - METRICS: Évaluation des Technologies de santé et des, Pratiques médicales, 2 Place de Verdun, Lille, F-59000, France
| | - Chloé Jacquemont
- Département de Médecine Générale, University of Lille, Lille, France
| | - Jean-Baptiste Beuscart
- Univ Lille, CHU Lille, ULR 2694 - METRICS: Évaluation des Technologies de santé et des, Pratiques médicales, 2 Place de Verdun, Lille, F-59000, France
| | - Matthieu Calafiore
- Univ Lille, CHU Lille, ULR 2694 - METRICS: Évaluation des Technologies de santé et des, Pratiques médicales, 2 Place de Verdun, Lille, F-59000, France
- Département de Médecine Générale, University of Lille, Lille, France
| | - Antoine Lamer
- Univ Lille, CHU Lille, ULR 2694 - METRICS: Évaluation des Technologies de santé et des, Pratiques médicales, 2 Place de Verdun, Lille, F-59000, France
- F2RSM Psy - Fédération régionale de recherche en psychiatrie et santé mentale Hauts-de-France, Saint-André-Lez-Lille, France
| |
Collapse
|
2
|
Kookal KK, Walji MF, Brandon R, Kivanc F, Mertz E, Kottek A, Mullins J, Liang S, Jenson LE, White JM. Systematically assessing the quality of dental electronic health record data for an investigation into oral health care disparities. J Public Health Dent 2024. [PMID: 38659337 DOI: 10.1111/jphd.12618] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2023] [Revised: 03/27/2024] [Accepted: 04/11/2024] [Indexed: 04/26/2024]
Abstract
OBJECTIVES This work describes the process by which the quality of electronic health care data for a public health study was determined. The objectives were to adapt, develop, and implement data quality assessments (DQAs) based on the National Institutes of Health Pragmatic Trials Collaboratory (NIHPTC) data quality framework within the three domains of completeness, accuracy, and consistency, for an investigation into oral health care disparities of a preventive care program. METHODS Electronic health record data for eligible children in a dental accountable care organization of 30 offices, in Oregon, were extracted iteratively from January 1, 2014, through March 31, 2022. Baseline eligibility criteria included: children ages 0-18 with a baseline examination, Oregon home address, and either Medicaid or commercial dental benefits at least once between 2014 and 2108. Using the NIHPTC framework as a guide, DQAs were conducted throughout data element identification, extraction, staging, profiling, review, and documentation. RESULTS The data set included 91,487 subjects, 11 data tables comprising 75 data variables (columns), with a total of 6,861,525 data elements. Data completeness was 97.2%, the accuracy of EHR data elements in extracts was 100%, and consistency between offices was strong; 29 of 30 offices within 2 standard deviations of the mean (s = 94%). CONCLUSIONS The NIHPTC framework proved to be a useful approach, to identify, document, and characterize the dataset. The concepts of completeness, accuracy, and consistency were adapted by the multidisciplinary research team and the overall quality of the data are demonstrated to be of high quality.
Collapse
Affiliation(s)
- Krishna Kumar Kookal
- Technology Services and Informatics, School of Dentistry, University of Texas Health Science Center at Houston, Houston, Texas, USA
| | - Muhammad F Walji
- Department of Clinical and Health Informatics, D. Bradley McWIlliams School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, Texas, USA
| | - Ryan Brandon
- Willamette Dental Group and Skourtes Institute, Hillsboro, Oregon, USA
| | - Ferit Kivanc
- Willamette Dental Group and Skourtes Institute, Hillsboro, Oregon, USA
| | - Elizabeth Mertz
- Department of Preventive and Restorative Dental Sciences, University of California, San Francisco, California, USA
| | - Aubri Kottek
- Department of Preventive and Restorative Dental Sciences, University of California, San Francisco, California, USA
| | - Joanna Mullins
- Willamette Dental Group and Skourtes Institute, Hillsboro, Oregon, USA
| | - Shuang Liang
- Department of Preventive and Restorative Dental Sciences, University of California, San Francisco, California, USA
| | - Larry E Jenson
- Department of Preventive and Restorative Dental Sciences, University of California, San Francisco, California, USA
| | - Joel M White
- Department of Preventive and Restorative Dental Sciences, University of California, San Francisco, California, USA
| |
Collapse
|
3
|
Thompson YT, Li Y, Silovsky J. From Scientific Research to Practical Implementations: Applications to Improve Data Quality in Child Welfare. J Behav Health Serv Res 2024; 51:289-301. [PMID: 38153681 DOI: 10.1007/s11414-023-09875-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/14/2023] [Indexed: 12/29/2023]
Abstract
Child welfare decisions have life-impacting consequences which, often times, are underpinned by limited or inadequate data and poor quality. Though research of data quality has gained popularity and made advancements in various practical areas, it has not made significant inroads for child welfare fields or data systems. Poor data quality can hinder service decision-making, impacting child behavioral health and well-being as well as increasing unnecessary expenditure of time and resources. Poor data quality can also undermine the validity of research and slow policymaking processes. The purpose of this commentary is to summarize the data quality research base in other fields, describe obstacles and uniqueness to improve data quality in child welfare, and propose necessary steps to scientific research and practical implementation that enables researchers and practitioners to improve the quality of child welfare services based on the enhanced quality of data.
Collapse
Affiliation(s)
- Yutian T Thompson
- University of Oklahoma Health Sciences Center, Oklahoma City, OK, USA.
| | - Yaqi Li
- University of Oklahoma Health Sciences Center, Oklahoma City, OK, USA.
| | - Jane Silovsky
- University of Oklahoma Health Sciences Center, Oklahoma City, OK, USA
| |
Collapse
|
4
|
Secor AM, Célestin K, Jasmin M, Honoré JG, Wagner AD, Beima-Sofie K, Pintye J, Puttkammer N. Electronic Medical Record Data Missingness and Interruption in Antiretroviral Therapy Among Adults and Children Living With HIV in Haiti: Retrospective Longitudinal Study. JMIR Pediatr Parent 2024; 7:e51574. [PMID: 38488632 PMCID: PMC10986334 DOI: 10.2196/51574] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Revised: 01/18/2024] [Accepted: 01/19/2024] [Indexed: 04/04/2024] Open
Abstract
Background Children (aged 0-14 years) living with HIV often experience lower rates of HIV diagnosis, treatment, and viral load suppression. In Haiti, only 63% of children living with HIV know their HIV status (compared to 85% overall), 63% are on treatment (compared to 85% overall), and 48% are virally suppressed (compared to 73% overall). Electronic medical records (EMRs) can improve HIV care and patient outcomes, but these benefits are largely dependent on providers having access to quality and nonmissing data. Objective We sought to understand the associations between EMR data missingness and interruption in antiretroviral therapy treatment by age group (pediatric vs adult). Methods We assessed associations between patient intake record data missingness and interruption in treatment (IIT) status at 6 and 12 months post antiretroviral therapy initiation using patient-level data drawn from iSanté, the most widely used EMR in Haiti. Missingness was assessed for tuberculosis diagnosis, World Health Organization HIV stage, and weight using a composite score indicator (ie, the number of indicators of interest missing). Risk ratios were estimated using marginal parameters from multilevel modified Poisson models with robust error variances and random intercepts for the facility to account for clustering. Results Data were drawn from 50 facilities and comprised 31,457 patient records from people living with HIV, of which 1306 (4.2%) were pediatric cases. Pediatric patients were more likely than adult patients to experience IIT (n=431, 33% vs n=7477, 23.4% at 6 months; P<.001). Additionally, pediatric patient records had higher data missingness, with 581 (44.5%) pediatric records missing at least 1 indicator of interest, compared to 7812 (25.9%) adult records (P<.001). Among pediatric patients, each additional indicator missing was associated with a 1.34 times greater likelihood of experiencing IIT at 6 months (95% CI 1.08-1.66; P=.008) and 1.24 times greater likelihood of experiencing IIT at 12 months (95% CI 1.05-1.46; P=.01). These relationships were not statistically significant for adult patients. Compared to pediatric patients with 0 missing indicators, pediatric patients with 1, 2, or 3 missing indicators were 1.59 (95% CI 1.26-2.01; P<.001), 1.74 (95% CI 1.02-2.97; P=.04), and 2.25 (95% CI 1.43-3.56; P=.001) times more likely to experience IIT at 6 months, respectively. Among adult patients, compared to patients with 0 indicators missing, having all 3 indicators missing was associated with being 1.32 times more likely to experience IIT at 6 months (95% CI 1.03-1.70; P=.03), while there was no association with IIT status for other levels of missingness. Conclusions These findings suggest that both EMR data quality and quality of care are lower for children living with HIV in Haiti. This underscores the need for further research into the mechanisms by which EMR data quality impacts the quality of care and patient outcomes among this population. Efforts to improve both EMR data quality and quality of care should consider prioritizing pediatric patients.
Collapse
Affiliation(s)
- Andrew M Secor
- Department of Global Health, University of Washington, Seattle, WA, United States
| | - Kemar Célestin
- Centre Haïtien pour le Renforcement du Système de Santé, Port-au-Prince, Haiti
| | - Margareth Jasmin
- Centre Haïtien pour le Renforcement du Système de Santé, Port-au-Prince, Haiti
| | - Jean Guy Honoré
- Centre Haïtien pour le Renforcement du Système de Santé, Port-au-Prince, Haiti
| | - Anjuli D Wagner
- Department of Global Health, University of Washington, Seattle, WA, United States
| | - Kristin Beima-Sofie
- Department of Global Health, University of Washington, Seattle, WA, United States
| | - Jillian Pintye
- Department of Global Health, University of Washington, Seattle, WA, United States
| | - Nancy Puttkammer
- International Training and Education Center for Health, Seattle, WA, United States
| |
Collapse
|
5
|
Black JE, Terry AL, Cejic S, Freeman T, Lizotte D, McKay S, Speechley M, Ryan B. Understanding data provenance when using electronic medical records for research: Lessons learned from the Deliver Primary Healthcare Information (DELPHI) database. Int J Popul Data Sci 2023; 8:2177. [PMID: 38425492 PMCID: PMC10900298 DOI: 10.23889/ijpds.v8i5.2177] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/02/2024] Open
Abstract
Introduction We set out to assess the impact of Choosing Wisely Canada recommendations (2014) on reducing unnecessary health investigations and interventions in primary care across Southwestern Ontario. Methods We used the Deliver Primary Healthcare Information (DELPHI) database, which stores deidentified electronic medical records (EMR) of nearly 65,000 primary care patients across Southwestern Ontario. When conducting research using EMR data, data provenance (i.e., how the data came to be) should first be established. We first considered DELPHI data provenance in relation to longitudinal analyses, flagging a change in EMR software that occurred during 2012 and 2013. We attempted to link records between EMR databases produced by different software using probabilistic linkage and inspected 10 years of data in the DELPHI database (2009 to 2019) for data quality issues, including comparability over time. Results We encountered several issues resulting from this change in EMR software. These included limited linkage of records between software without a common identifier; data migration issues that distorted procedure dates; and unusual changes in laboratory test and medication prescription volumes. Conclusion This study reinforces the necessity of assessing data provenance and quality for new research projects. By understanding data provenance, we can anticipate related data quality issues such as changes in EMR data over time-which represent a growing concern as longitudinal data analyses increase in feasibility and popularity.
Collapse
Affiliation(s)
- Jason Edward Black
- Department of Family Medicine, Schulich School of Medicine and Dentistry, Western University, London, ON, Canada
| | - Amanda L. Terry
- Department of Family Medicine, Schulich School of Medicine and Dentistry, Western University, London, ON, Canada
- Department of Epidemiology and Biostatistics, Schulich School of Medicine and Dentistry, Western University, London, ON, Canada
- Schulich Interfaculty Program in Public Health, Schulich School of Medicine and Dentistry, Western University, London, ON, Canada
| | - Sonny Cejic
- Department of Family Medicine, Schulich School of Medicine and Dentistry, Western University, London, ON, Canada
| | - Tom Freeman
- Department of Family Medicine, Schulich School of Medicine and Dentistry, Western University, London, ON, Canada
| | - Dan Lizotte
- Department of Epidemiology and Biostatistics, Schulich School of Medicine and Dentistry, Western University, London, ON, Canada
- Schulich Interfaculty Program in Public Health, Schulich School of Medicine and Dentistry, Western University, London, ON, Canada
- Department of Computer Science, Faculty of Science, Western University, London, ON, Canada
| | - Scott McKay
- Department of Family Medicine, Schulich School of Medicine and Dentistry, Western University, London, ON, Canada
| | - Mark Speechley
- Department of Epidemiology and Biostatistics, Schulich School of Medicine and Dentistry, Western University, London, ON, Canada
- Schulich Interfaculty Program in Public Health, Schulich School of Medicine and Dentistry, Western University, London, ON, Canada
| | - Bridget Ryan
- Department of Family Medicine, Schulich School of Medicine and Dentistry, Western University, London, ON, Canada
- Department of Epidemiology and Biostatistics, Schulich School of Medicine and Dentistry, Western University, London, ON, Canada
- Schulich Interfaculty Program in Public Health, Schulich School of Medicine and Dentistry, Western University, London, ON, Canada
| |
Collapse
|
6
|
Lewis AE, Weiskopf N, Abrams ZB, Foraker R, Lai AM, Payne PRO, Gupta A. Electronic health record data quality assessment and tools: a systematic review. J Am Med Inform Assoc 2023; 30:1730-1740. [PMID: 37390812 PMCID: PMC10531113 DOI: 10.1093/jamia/ocad120] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2023] [Revised: 05/16/2023] [Accepted: 06/23/2023] [Indexed: 07/02/2023] Open
Abstract
OBJECTIVE We extended a 2013 literature review on electronic health record (EHR) data quality assessment approaches and tools to determine recent improvements or changes in EHR data quality assessment methodologies. MATERIALS AND METHODS We completed a systematic review of PubMed articles from 2013 to April 2023 that discussed the quality assessment of EHR data. We screened and reviewed papers for the dimensions and methods defined in the original 2013 manuscript. We categorized papers as data quality outcomes of interest, tools, or opinion pieces. We abstracted and defined additional themes and methods though an iterative review process. RESULTS We included 103 papers in the review, of which 73 were data quality outcomes of interest papers, 22 were tools, and 8 were opinion pieces. The most common dimension of data quality assessed was completeness, followed by correctness, concordance, plausibility, and currency. We abstracted conformance and bias as 2 additional dimensions of data quality and structural agreement as an additional methodology. DISCUSSION There has been an increase in EHR data quality assessment publications since the original 2013 review. Consistent dimensions of EHR data quality continue to be assessed across applications. Despite consistent patterns of assessment, there still does not exist a standard approach for assessing EHR data quality. CONCLUSION Guidelines are needed for EHR data quality assessment to improve the efficiency, transparency, comparability, and interoperability of data quality assessment. These guidelines must be both scalable and flexible. Automation could be helpful in generalizing this process.
Collapse
Affiliation(s)
- Abigail E Lewis
- Division of Computational and Data Sciences, Washington University in St. Louis, St. Louis, Missouri, USA
- Institute for Informatics, Data Science and Biostatistics, Washington University in St. Louis, St. Louis, Missouri, USA
| | - Nicole Weiskopf
- Department of Medical Informatics and Clinical Epidemiology, Oregon Health & Science University, Portland, Oregon, USA
| | - Zachary B Abrams
- Institute for Informatics, Data Science and Biostatistics, Washington University in St. Louis, St. Louis, Missouri, USA
| | - Randi Foraker
- Institute for Informatics, Data Science and Biostatistics, Washington University in St. Louis, St. Louis, Missouri, USA
| | - Albert M Lai
- Institute for Informatics, Data Science and Biostatistics, Washington University in St. Louis, St. Louis, Missouri, USA
| | - Philip R O Payne
- Institute for Informatics, Data Science and Biostatistics, Washington University in St. Louis, St. Louis, Missouri, USA
| | - Aditi Gupta
- Institute for Informatics, Data Science and Biostatistics, Washington University in St. Louis, St. Louis, Missouri, USA
| |
Collapse
|
7
|
Ozonze O, Scott PJ, Hopgood AA. Automating Electronic Health Record Data Quality Assessment. J Med Syst 2023; 47:23. [PMID: 36781551 PMCID: PMC9925537 DOI: 10.1007/s10916-022-01892-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2021] [Accepted: 11/15/2022] [Indexed: 02/15/2023]
Abstract
Information systems such as Electronic Health Record (EHR) systems are susceptible to data quality (DQ) issues. Given the growing importance of EHR data, there is an increasing demand for strategies and tools to help ensure that available data are fit for use. However, developing reliable data quality assessment (DQA) tools necessary for guiding and evaluating improvement efforts has remained a fundamental challenge. This review examines the state of research on operationalising EHR DQA, mainly automated tooling, and highlights necessary considerations for future implementations. We reviewed 1841 articles from PubMed, Web of Science, and Scopus published between 2011 and 2021. 23 DQA programs deployed in real-world settings to assess EHR data quality (n = 14), and a few experimental prototypes (n = 9), were identified. Many of these programs investigate completeness (n = 15) and value conformance (n = 12) quality dimensions and are backed by knowledge items gathered from domain experts (n = 9), literature reviews and existing DQ measurements (n = 3). A few DQA programs also explore the feasibility of using data-driven techniques to assess EHR data quality automatically. Overall, the automation of EHR DQA is gaining traction, but current efforts are fragmented and not backed by relevant theory. Existing programs also vary in scope, type of data supported, and how measurements are sourced. There is a need to standardise programs for assessing EHR data quality, as current evidence suggests their quality may be unknown.
Collapse
Affiliation(s)
- Obinwa Ozonze
- School of Computing, University of Portsmouth, Buckingham Building, Lion Terrace, Portsmouth, PO1 3HE, UK
| | - Philip J Scott
- Institute of Management and Health, University of Wales Trinity Saint David, Lampeter, SA48 7ED, UK
| | - Adrian A Hopgood
- School of Computing, University of Portsmouth, Buckingham Building, Lion Terrace, Portsmouth, PO1 3HE, UK.
| |
Collapse
|
8
|
Soos B, Garies S, Cornect-Benoit A, Montgomery L, Sharpe H, Rittenbach K, Manca D, Duerksen K, Forst B, Drummond N. Documenting cannabis use in primary care: a descriptive cross-sectional study using electronic medical record data in Alberta, Canada. BMC Res Notes 2023; 16:9. [PMID: 36726135 PMCID: PMC9890680 DOI: 10.1186/s13104-023-06274-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2022] [Accepted: 01/16/2023] [Indexed: 02/03/2023] Open
Abstract
OBJECTIVE Documenting cannabis use is important for patient care, but no formal requirements for consistent reporting exist in primary care. The objective of this study was to understand how cannabis use is documented in primary care electronic medical record (EMR) data. RESULTS This was a cross-sectional study using de-identified EMR data from over 398,000 patients and 333 primary care providers in Alberta, Canada. An automated pattern-matching algorithm was developed to identify text and ICD-9 diagnostic codes indicating cannabis use in the EMR. There was a total of 11,724 records indicating cannabis use from 4652 patients, representing approximately 1.2% of the patient sample. Commonly used terms and ICD-9 codes included cannabis, marijuana/marihuana, THC, 304.3 and 305.2. Nabilone was the most frequently prescribed cannabinoid medication. Slightly more males and those with a chronic condition had cannabis use recorded more often. Overall, very few patients have cannabis use recorded in primary care EMR data and this is not captured in a systematic way. We propose several strategies to improve the documentation of cannabis use to facilitate more effective clinical care, research, and surveillance.
Collapse
Affiliation(s)
- Boglarka Soos
- grid.22072.350000 0004 1936 7697Department of Family Medicine, G012 Health Sciences Centre, University of Calgary, 3330 Hospital Drive NW, Calgary, AB T2N 4N1 Canada ,grid.22072.350000 0004 1936 7697Department of Community Health Sciences, University of Calgary, Calgary, AB Canada
| | - Stephanie Garies
- grid.22072.350000 0004 1936 7697Department of Family Medicine, G012 Health Sciences Centre, University of Calgary, 3330 Hospital Drive NW, Calgary, AB T2N 4N1 Canada
| | - Ashley Cornect-Benoit
- grid.22072.350000 0004 1936 7697Department of Community Health Sciences, University of Calgary, Calgary, AB Canada
| | - Lori Montgomery
- grid.22072.350000 0004 1936 7697Department of Family Medicine, G012 Health Sciences Centre, University of Calgary, 3330 Hospital Drive NW, Calgary, AB T2N 4N1 Canada ,grid.22072.350000 0004 1936 7697Department of Anesthesiology, Perioperative and Pain Medicine, University of Calgary, Calgary, AB Canada
| | - Heather Sharpe
- grid.17089.370000 0001 2190 316XDepartment of Medicine, University of Alberta, Edmonton, AB Canada ,grid.22072.350000 0004 1936 7697Department of Medicine, University of Calgary, Calgary, AB Canada
| | - Katherine Rittenbach
- grid.22072.350000 0004 1936 7697Department of Psychiatry, University of Calgary, Calgary, AB Canada ,grid.17089.370000 0001 2190 316XDepartment of Psychiatry, University of Alberta, Edmonton, AB Canada
| | - Donna Manca
- grid.17089.370000 0001 2190 316XDepartment of Family Medicine, University of Alberta, Edmonton, AB Canada
| | - Kimberley Duerksen
- grid.17089.370000 0001 2190 316XDepartment of Family Medicine, University of Alberta, Edmonton, AB Canada
| | - Brian Forst
- grid.17089.370000 0001 2190 316XDepartment of Family Medicine, University of Alberta, Edmonton, AB Canada
| | - Neil Drummond
- grid.22072.350000 0004 1936 7697Department of Family Medicine, G012 Health Sciences Centre, University of Calgary, 3330 Hospital Drive NW, Calgary, AB T2N 4N1 Canada ,grid.17089.370000 0001 2190 316XDepartment of Family Medicine, University of Alberta, Edmonton, AB Canada
| |
Collapse
|
9
|
Mashoufi M, Ayatollahi H, Khorasani-Zavareh D, Talebi Azad Boni T. Data Quality in Health Care: Main Concepts and Assessment Methodologies. Methods Inf Med 2023; 62:5-18. [PMID: 36716776 DOI: 10.1055/s-0043-1761500] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
INTRODUCTION In the health care environment, a huge volume of data is produced on a daily basis. However, the processes of collecting, storing, sharing, analyzing, and reporting health data usually face with numerous challenges that lead to producing incomplete, inaccurate, and untimely data. As a result, data quality issues have received more attention than before. OBJECTIVE The purpose of this article is to provide an insight into the data quality definitions, dimensions, and assessment methodologies. METHODS In this article, a scoping literature review approach was used to describe and summarize the main concepts related to data quality and data quality assessment methodologies. Search terms were selected to find the relevant articles published between January 1, 2012 and September 31, 2022. The retrieved articles were then reviewed and the results were reported narratively. RESULTS In total, 23 papers were included in the study. According to the results, data quality dimensions were various and different methodologies were used to assess them. Most studies used quantitative methods to measure data quality dimensions either in paper-based or computer-based medical records. Only two studies investigated respondents' opinions about data quality. CONCLUSION In health care, high-quality data not only are important for patient care, but also are vital for improving quality of health care services and better decision making. Therefore, using technical and nontechnical solutions as well as constant assessment and supervision is suggested to improve data quality.
Collapse
Affiliation(s)
- Mehrnaz Mashoufi
- Department of Health Information Management, School of Medicine, Ardabil University of Medical Sciences, Ardabil, Iran
| | - Haleh Ayatollahi
- Health Management and Economics Research Center, Health Management Research Institute, Iran University of Medical Sciences, Tehran, Iran.,Department of Health Information Management, School of Health Management and Information Sciences, Iran University of Medical Sciences, Tehran, Iran
| | - Davoud Khorasani-Zavareh
- Department of Health in Emergencies and Disasters, Safety Promotion and Injury Prevention Research Center, School of Public Health and Safety, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Tahere Talebi Azad Boni
- Department of Health Information Management, School of Health Management and Information Sciences, Iran University of Medical Sciences, Tehran, Iran.,Social Determinants of Health Research Center, Saveh University of Medical Sciences, Saveh, Iran
| |
Collapse
|
10
|
Kueper JK, Rayner J, Zwarenstein M, Lizotte DJ. Describing a complex primary health care population to support future decision support initiatives. Int J Popul Data Sci 2022; 7:1756. [PMID: 37670733 PMCID: PMC10476014 DOI: 10.23889/ijpds.v7i1.1756] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022] Open
Abstract
Introduction Developing decision support tools using data from a health care organization, to support care within that organization, is a promising paradigm to improve care delivery and population health. Descriptive epidemiology may be a valuable supplement to stakeholder input towards selection of potential initiatives and to inform methodological decisions throughout tool development. We additionally propose that to properly characterize complex populations in large-scale descriptive studies, both simple statistical and machine learning techniques can be useful. Objective To describe sociodemographic, clinical, and health care use characteristics of primary care clients served by the Alliance for Healthier Communities, which provides team-based primary health care through Community Health Centres (CHCs) across Ontario, Canada. Methods We used electronic health record data from adult ongoing primary care clients served by CHCs in 2009-2019. We performed traditional table-based summaries for each characteristic; and applied three unsupervised learning techniques to explore patterns of common condition co-occurrence, care provider teams, and care frequency. Results There were 221,047 eligible clients. Sociodemographics: We described 13 characteristics, stratified by CHC type and client multimorbidity status. Clinical characteristics: Eleven-year prevalence of 24 investigated conditions ranged from 1% (Hepatitis C) to 63% (chronic musculoskeletal problem) with non-uniform risk across the care history; multimorbidity was common (81%) with variable co-occurrence patterns. Health care use characteristics: Most care was provided by physician and nursing providers, with heterogeneous combinations of other provider types. A subset of clients had many issues addressed within single-visits and there was within- and between-client variability in care frequency. In addition to substantive findings, we discuss methodological considerations for future decision support initiatives. Conclusions We demonstrated the use of methods from statistics and machine learning, applied with an epidemiological lens, to provide an overview of a complex primary care population and lay a foundation for stakeholder engagement and decision support tool development.
Collapse
Affiliation(s)
- Jacqueline K. Kueper
- Department of Epidemiology and Biostatistics, Schulich School of Medicine & Dentistry, Western University, London, Ontario, Canada
- Department of Computer Science, Faculty of Science, Western University, London, Ontario, Canada
| | - Jennifer Rayner
- Department of Family Medicine, Schulich School of Medicine & Dentistry, Western University, London, Ontario, Canada
- Alliance for Healthier Communities, Toronto, Ontario, Canada
| | - Merrick Zwarenstein
- Department of Epidemiology and Biostatistics, Schulich School of Medicine & Dentistry, Western University, London, Ontario, Canada
- Department of Family Medicine, Schulich School of Medicine & Dentistry, Western University, London, Ontario, Canada
| | - Daniel J. Lizotte
- Department of Epidemiology and Biostatistics, Schulich School of Medicine & Dentistry, Western University, London, Ontario, Canada
- Department of Computer Science, Faculty of Science, Western University, London, Ontario, Canada
| |
Collapse
|
11
|
Maletzky A, Böck C, Tschoellitsch T, Roland T, Ludwig H, Thumfart S, Giretzlehner M, Hochreiter S, Meier J. Lifting Hospital Electronic Health Record Data Treasures: Challenges and Opportunities. JMIR Med Inform 2022; 10:e38557. [PMID: 36269654 PMCID: PMC9636533 DOI: 10.2196/38557] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2022] [Revised: 08/02/2022] [Accepted: 09/07/2022] [Indexed: 12/04/2022] Open
Abstract
Electronic health records (EHRs) have been successfully used in data science and machine learning projects. However, most of these data are collected for clinical use rather than for retrospective analysis. This means that researchers typically face many different issues when attempting to access and prepare the data for secondary use. We aimed to investigate how raw EHRs can be accessed and prepared in retrospective data science projects in a disciplined, effective, and efficient way. We report our experience and findings from a large-scale data science project analyzing routinely acquired retrospective data from the Kepler University Hospital in Linz, Austria. The project involved data collection from more than 150,000 patients over a period of 10 years. It included diverse data modalities, such as static demographic data, irregularly acquired laboratory test results, regularly sampled vital signs, and high-frequency physiological waveform signals. Raw medical data can be corrupted in many unexpected ways that demand thorough manual inspection and highly individualized data cleaning solutions. We present a general data preparation workflow, which was shaped in the course of our project and consists of the following 7 steps: obtain a rough overview of the available EHR data, define clinically meaningful labels for supervised learning, extract relevant data from the hospital’s data warehouses, match data extracted from different sources, deidentify them, detect errors and inconsistencies therein through a careful exploratory analysis, and implement a suitable data processing pipeline in actual code. Only few of the data preparation issues encountered in our project were addressed by generic medical data preprocessing tools that have been proposed recently. Instead, highly individualized solutions for the specific data used in one’s own research seem inevitable. We believe that the proposed workflow can serve as a guidance for practitioners, helping them to identify and address potential problems early and avoid some common pitfalls.
Collapse
Affiliation(s)
- Alexander Maletzky
- Research Department Medical Informatics, RISC Software GmbH, Hagenberg, Austria
| | - Carl Böck
- JKU LIT SAL eSPML Lab, Institute of Signal Processing, Johannes Kepler University, Linz, Austria
| | - Thomas Tschoellitsch
- Department of Anesthesiology and Critical Care Medicine, Kepler University Hospital GmbH, Johannes Kepler University, Linz, Austria
| | - Theresa Roland
- ELLIS Unit Linz, LIT AI Lab, Institute for Machine Learning, Johannes Kepler University, Linz, Austria
| | - Helga Ludwig
- ELLIS Unit Linz, LIT AI Lab, Institute for Machine Learning, Johannes Kepler University, Linz, Austria
| | - Stefan Thumfart
- Research Department Medical Informatics, RISC Software GmbH, Hagenberg, Austria
| | | | - Sepp Hochreiter
- ELLIS Unit Linz, LIT AI Lab, Institute for Machine Learning, Johannes Kepler University, Linz, Austria
| | - Jens Meier
- Department of Anesthesiology and Critical Care Medicine, Kepler University Hospital GmbH, Johannes Kepler University, Linz, Austria
| |
Collapse
|
12
|
McDonald N, Kriellaars D, Doupe M, Giesbrecht G, Pryce RT. Database quality assessment in research in paramedicine: a scoping review protocol. BMJ Open 2022; 12:e063372. [PMID: 35835522 PMCID: PMC9289022 DOI: 10.1136/bmjopen-2022-063372] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
Abstract
INTRODUCTION The paramedic practice environment presents unique challenges to data documentation and access, as well as linkage to other parts of the healthcare system. Variable or unknown data quality can influence the validity of research in paramedicine. A number of database quality assessment (DQA) frameworks have been developed and used to evaluate data quality in other areas of healthcare. The extent these or other DQA practices have been applied to paramedic research is not known. Accordingly, this scoping review aims to describe the range, extent and nature of DQA practices within research in paramedicine. METHODS AND ANALYSIS This scoping review will follow established methods for the conduct (Johanna Briggs Institute; Arksey and O'Malley) and reporting (Preferred Reporting Items in Systematic Reviews and Meta-Analyses extension for scoping reviews) of scoping reviews. In consultation with a professional librarian, a search strategy was developed representing the applicable population, concept and context. This strategy will be applied to MEDLINE (National Library of Medicine), Embase (Elsevier), Scopus (Elsevier) and CINAHL (EBSCO) to identify studies published from 2011 through 2021 that assess paramedic data quality as a stated goal. Studies will be included if they report quantitative results of DQA using data that relate primarily to the paramedic practice environment. Protocols, commentaries, case studies, interviews, simulations and experimental data-processing techniques will be excluded. No restrictions will be placed on language. Study selection will be performed by two reviewers, with a third available to resolve conflicts. Data will be extracted from included studies using a data-charting form piloted and iteratively revised based on studies known to be relevant. Results will be summarised in a chart of study characteristics, DQA-specific outcomes and key findings. ETHICS AND DISSEMINATION Ethical approval is not required. Results will be submitted to relevant conferences and peer-reviewed journals. TRIAL REGISTRATION 10.17605/OSF.IO/Z287T.
Collapse
Affiliation(s)
- Neil McDonald
- Applied Health Sciences, University of Manitoba, Winnipeg, Manitoba, Canada
- Winnipeg Fire Paramedic Service, Winnipeg, Manitoba, Canada
| | - Dean Kriellaars
- College of Rehabilitation Sciences, University of Manitoba, Winnipeg, Manitoba, Canada
| | - Malcolm Doupe
- Department of Community Health Sciences, University of Manitoba, Winnipeg, Manitoba, Canada
| | - Gordon Giesbrecht
- Faculty of Kinesiology and Recreation Management, University of Manitoba, Winnipeg, Manitoba, Canada
| | - Rob T Pryce
- Kinesiology and Applied Health, The University of Winnipeg, Winnipeg, Manitoba, Canada
| |
Collapse
|
13
|
Rahmani H, Maleki R, Ghanbari MK, Behzadifar M. Quality Assessment of Services in Primary Healthcare in Iran: A Systematic Review and Meta-analysis. Ethiop J Health Sci 2022; 32:453-462. [PMID: 35693585 PMCID: PMC9175234 DOI: 10.4314/ejhs.v32i2.26] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2021] [Accepted: 11/30/2021] [Indexed: 11/08/2022] Open
Abstract
Background Primary healthcare (PHC) plays an important role in achieving universal health coverage (UHC). The SERVQUAL instrument is the tool for evaluating the quality of services in the health sector. The main purpose of this study is to evaluate the quality of services provided in PHC in Iran using the SERVQUAL instrument. Materials and Method We searched eight databases from January 2000 to September 2021. We analyzed the mean of various SERVQUAL instrument items using the DerSimonian-Laird approach via a random model with 95% confidence interval. Also, we used I2 to evaluate the heterogeneity of the studies. Results Finally, 17 studies were chosen for analysis in the present study. There were 8,767 study participants, out of which 8,237 were female and 530 were male. The mean dimensions of perception were as follows: Tangibles = 3.71, reliability = 4, responsiveness = 3.79, assurance = 3.83, and empathy = 3.86. For the expectation, the mean dimension were: Tangibles = 4.46, reliability = 4.46, responsiveness = 4.36, assurance = 4.36, and empathy = 4.36 respectively. The total gap quality between perception and expectation was -0.53. Conclusion All dimensions of quality based on SERVQUAL were negative, and the quality of service in PHC is not satisfactory. Therefore, policymakers must adopt serious and effective programs to improve services in this area. We also recommend that quality management of services in PHC in Iran should move toward comprehensive optimization in all areas, and quality in this area should be a priority.
Collapse
Affiliation(s)
- Hojjat Rahmani
- Department of Health Management and Economics, School of Public Health, Tehran University of Medical Sciences, Tehran, Iran
| | - Raheleh Maleki
- Department of Health Management and Economics, School of Public Health, Tehran University of Medical Sciences, Tehran, Iran
| | - Mahboubeh Khaton Ghanbari
- PHD candidate of health management, Department of Health Management and Economics, School of Public Health, Tehran University of Medical Sciences, Tehran, Iran
| | - Masoud Behzadifar
- Social Determinants of Health Research Center, Lorestan University of Medical Sciences, Khorramabad, Iran
| |
Collapse
|
14
|
Razzaghi H, Greenberg J, Bailey LC. Developing a systematic approach to assessing data quality in secondary use of clinical data based on intended use. Learn Health Syst 2022; 6:e10264. [PMID: 35036548 PMCID: PMC8753309 DOI: 10.1002/lrh2.10264] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2020] [Revised: 02/24/2021] [Accepted: 03/01/2021] [Indexed: 11/10/2022] Open
Abstract
INTRODUCTION Secondary use of electronic health record (EHR) data for research requires that the data are fit for use. Data quality (DQ) frameworks have traditionally focused on structural conformance and completeness of clinical data extracted from source systems. In this paper, we propose a framework for evaluating semantic DQ that will allow researchers to evaluate fitness for use prior to analyses. METHODS We reviewed current DQ literature, as well as experience from recent multisite network studies, and identified gaps in the literature and current practice. Derived principles were used to construct the conceptual framework with attention to both analytic fitness and informatics practice. RESULTS We developed a systematic framework that guides researchers in assessing whether a data source is fit for use for their intended study or project. It combines tools for evaluating clinical context with DQ principles, as well as factoring in the characteristics of the data source, in order to develop semantic DQ checks. CONCLUSIONS Our framework provides a systematic process for DQ development. Further work is needed to codify practices and metadata around both structural and semantic data quality.
Collapse
Affiliation(s)
- Hanieh Razzaghi
- Department of Pediatrics and Biomedical and Health InformaticsChildren's Hospital of PhiladelphiaPhiladelphiaPennsylvaniaUSA
- Metadata Research CenterCollege of Computing and Informatics, Drexel UniversityPhiladelphiaPennsylvaniaUSA
| | - Jane Greenberg
- Metadata Research CenterCollege of Computing and Informatics, Drexel UniversityPhiladelphiaPennsylvaniaUSA
| | - L. Charles Bailey
- Department of Pediatrics and Biomedical and Health InformaticsChildren's Hospital of PhiladelphiaPhiladelphiaPennsylvaniaUSA
- Department of PediatricsPerelman School of Medicine, University of PennsylvaniaPhiladelphiaPennsylvaniaUSA
| |
Collapse
|
15
|
Laing S, Mercer J. Improved preventive care clinical decision-making efficiency: leveraging a point-of-care clinical decision support system. BMC Med Inform Decis Mak 2021; 21:315. [PMID: 34763691 PMCID: PMC8588582 DOI: 10.1186/s12911-021-01675-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2021] [Accepted: 10/25/2021] [Indexed: 01/01/2023] Open
Abstract
Background Electronic medical records are widely used in family practices across Canada and can improve health outcomes. However, recent reports indicate that physicians using electronic medical records work longer and have less direct patient contact which may contribute to burnout. Therefore, new and innovative digital tools are essential to reduce physician workloads and improve patient-physician interaction to address physician burnout. The objective of this study was to assess the efficiency and accuracy of clinical decision-making when using a new preventive care point-of-care clinical decision support system (CDSS). An estimate of the potential annual time savings was also determined. This study also assessed physician reported perceived usefulness and ease of use of the CDSS. Methods Quantitative and qualitative data were collected during this study. Each participant evaluated two simulated patient charts and identified which preventive care metrics were due. The participants recorded their decisions and the time required to assess each chart. Participants then completed a Technology Acceptance Model survey regarding the perceived usefulness and ease of use of the CDSS, which included qualitative feedback. The amount of time saved was determined and participants’ clinical decision-making accuracy was scored against current Canadian preventive care guidelines. The number of preventive care specific visits completed per year was determined using clinic billing data. Results The preventive care CDSS saved an average of 195.7 s of chart review time (249.5 s vs 445.2 s; P < 0.001). A total of 1520 preventive visits were performed at Primrose and Bruyère Family Medicine Centres. Extrapolated across the organization, implementation of the new tool could save 82.6 h per year. Decision-making accuracy was not affected by the new tool (78.4% vs 80.9%, P > 0.05). Participants rated the perceived ease of use and usefulness to be very high. Conclusions New digital tools may reduce providers’ workload without impacting clinical decision-making accuracy. Participants indicated that the preventive care CDSS was useful and easy to use. Further software development and clinical studies are required to further improve and characterize the effect this new CDSS has when implemented in clinical practice. Supplementary Information The online version contains supplementary material available at 10.1186/s12911-021-01675-8.
Collapse
Affiliation(s)
- Scott Laing
- Department of Family Medicine, Primrose Family Medicine Centre, University of Ottawa, Ottawa, Canada.
| | - Jay Mercer
- Department of Family Medicine, Bruyère Family Medicine Centre, University of Ottawa, Ottawa, Canada
| |
Collapse
|
16
|
Shi X, Prins C, Van Pottelbergh G, Mamouris P, Vaes B, De Moor B. An automated data cleaning method for Electronic Health Records by incorporating clinical knowledge. BMC Med Inform Decis Mak 2021; 21:267. [PMID: 34535146 PMCID: PMC8449435 DOI: 10.1186/s12911-021-01630-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2021] [Accepted: 09/09/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The use of Electronic Health Records (EHR) data in clinical research is incredibly increasing, but the abundancy of data resources raises the challenge of data cleaning. It can save time if the data cleaning can be done automatically. In addition, the automated data cleaning tools for data in other domains often process all variables uniformly, meaning that they cannot serve well for clinical data, as there is variable-specific information that needs to be considered. This paper proposes an automated data cleaning method for EHR data with clinical knowledge taken into consideration. METHODS We used EHR data collected from primary care in Flanders, Belgium during 1994-2015. We constructed a Clinical Knowledge Database to store all the variable-specific information that is necessary for data cleaning. We applied Fuzzy search to automatically detect and replace the wrongly spelled units, and performed the unit conversion following the variable-specific conversion formula. Then the numeric values were corrected and outliers were detected considering the clinical knowledge. In total, 52 clinical variables were cleaned, and the percentage of missing values (completeness) and percentage of values within the normal range (correctness) before and after the cleaning process were compared. RESULTS All variables were 100% complete before data cleaning. 42 variables had a drop of less than 1% in the percentage of missing values and 9 variables declined by 1-10%. Only 1 variable experienced large decline in completeness (13.36%). All variables had more than 50% values within the normal range after cleaning, of which 43 variables had a percentage higher than 70%. CONCLUSIONS We propose a general method for clinical variables, which achieves high automation and is capable to deal with large-scale data. This method largely improved the efficiency to clean the data and removed the technical barriers for non-technical people.
Collapse
Affiliation(s)
- Xi Shi
- Department of Electrical Engineering (ESAT), Stadius Centre for Dynamical Systems, Signal Processing and Data Analytics, KU Leuven, Kasteelpark Arenberg 10 - Box 2446, 3001, Leuven, Belgium.
| | - Charlotte Prins
- Leuven Statistics Research Center, KU Leuven, 3000, Leuven, Belgium
| | | | - Pavlos Mamouris
- Academic Center for General Practice, KU Leuven, 3000, Leuven, Belgium
| | - Bert Vaes
- Academic Center for General Practice, KU Leuven, 3000, Leuven, Belgium
| | - Bart De Moor
- Department of Electrical Engineering (ESAT), Stadius Centre for Dynamical Systems, Signal Processing and Data Analytics, KU Leuven, Kasteelpark Arenberg 10 - Box 2446, 3001, Leuven, Belgium
| |
Collapse
|
17
|
Scherer J, Nolden M, Kleesiek J, Metzger J, Kades K, Schneider V, Bach M, Sedlaczek O, Bucher AM, Vogl TJ, Grünwald F, Kühn JP, Hoffmann RT, Kotzerke J, Bethge O, Schimmöller L, Antoch G, Müller HW, Daul A, Nikolaou K, la Fougère C, Kunz WG, Ingrisch M, Schachtner B, Ricke J, Bartenstein P, Nensa F, Radbruch A, Umutlu L, Forsting M, Seifert R, Herrmann K, Mayer P, Kauczor HU, Penzkofer T, Hamm B, Brenner W, Kloeckner R, Düber C, Schreckenberger M, Braren R, Kaissis G, Makowski M, Eiber M, Gafita A, Trager R, Weber WA, Neubauer J, Reisert M, Bock M, Bamberg F, Hennig J, Meyer PT, Ruf J, Haberkorn U, Schoenberg SO, Kuder T, Neher P, Floca R, Schlemmer HP, Maier-Hein K. Joint Imaging Platform for Federated Clinical Data Analytics. JCO Clin Cancer Inform 2021; 4:1027-1038. [PMID: 33166197 PMCID: PMC7713526 DOI: 10.1200/cci.20.00045] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
PURPOSE Image analysis is one of the most promising applications of artificial intelligence (AI) in health care, potentially improving prediction, diagnosis, and treatment of diseases. Although scientific advances in this area critically depend on the accessibility of large-volume and high-quality data, sharing data between institutions faces various ethical and legal constraints as well as organizational and technical obstacles. METHODS The Joint Imaging Platform (JIP) of the German Cancer Consortium (DKTK) addresses these issues by providing federated data analysis technology in a secure and compliant way. Using the JIP, medical image data remain in the originator institutions, but analysis and AI algorithms are shared and jointly used. Common standards and interfaces to local systems ensure permanent data sovereignty of participating institutions. RESULTS The JIP is established in the radiology and nuclear medicine departments of 10 university hospitals in Germany (DKTK partner sites). In multiple complementary use cases, we show that the platform fulfills all relevant requirements to serve as a foundation for multicenter medical imaging trials and research on large cohorts, including the harmonization and integration of data, interactive analysis, automatic analysis, federated machine learning, and extensibility and maintenance processes, which are elementary for the sustainability of such a platform. CONCLUSION The results demonstrate the feasibility of using the JIP as a federated data analytics platform in heterogeneous clinical information technology and software landscapes, solving an important bottleneck for the application of AI to large-scale clinical imaging data.
Collapse
Affiliation(s)
- Jonas Scherer
- Division of Medical Image Computing, German Cancer Research Center, Heidelberg, Germany.,German Cancer Consortium, Heidelberg, Germany
| | - Marco Nolden
- Division of Medical Image Computing, German Cancer Research Center, Heidelberg, Germany.,German Cancer Consortium, Heidelberg, Germany.,Pattern Analysis and Learning Group, Radio-oncology and Clinical Radiotherapy, Heidelberg University Hospital, Heidelberg, Germany
| | - Jens Kleesiek
- German Cancer Consortium, Heidelberg, Germany.,Division of Radiology, German Cancer Research Center, Heidelberg, Germany
| | - Jasmin Metzger
- Division of Medical Image Computing, German Cancer Research Center, Heidelberg, Germany.,German Cancer Consortium, Heidelberg, Germany
| | - Klaus Kades
- Division of Medical Image Computing, German Cancer Research Center, Heidelberg, Germany.,German Cancer Consortium, Heidelberg, Germany
| | - Verena Schneider
- German Cancer Consortium, Heidelberg, Germany.,Division of Radiology, German Cancer Research Center, Heidelberg, Germany
| | - Michael Bach
- German Cancer Consortium, Heidelberg, Germany.,Division of Radiology, German Cancer Research Center, Heidelberg, Germany
| | - Oliver Sedlaczek
- German Cancer Consortium, Heidelberg, Germany.,Division of Radiology, German Cancer Research Center, Heidelberg, Germany.,Klinik Diagnostische und Interventionelle Radiologie der Universität Heidelberg, Heidelberg, Germany
| | - Andreas M Bucher
- German Cancer Consortium, Heidelberg, Germany.,Institut für Diagnostische und Interventionelle Radiologie, Universitätsklinikum Frankfurt, Frankfurt, Germany
| | - Thomas J Vogl
- German Cancer Consortium, Heidelberg, Germany.,Institut für Diagnostische und Interventionelle Radiologie, Universitätsklinikum Frankfurt, Frankfurt, Germany
| | - Frank Grünwald
- German Cancer Consortium, Heidelberg, Germany.,Klinik für Nuklearmedizin, Universitätsklinikum Frankfurt, Frankfurt, Germany
| | - Jens-Peter Kühn
- German Cancer Consortium, Heidelberg, Germany.,Institut und Poliklinik für Diagnostische und Interventionelle Radiologie, Universitätsklinikum Carl Gustav Carus Dresden, Dresden, Germany
| | - Ralf-Thorsten Hoffmann
- German Cancer Consortium, Heidelberg, Germany.,Institut und Poliklinik für Diagnostische und Interventionelle Radiologie, Universitätsklinikum Carl Gustav Carus Dresden, Dresden, Germany
| | - Jörg Kotzerke
- German Cancer Consortium, Heidelberg, Germany.,Klinik und Poliklinik für Nuklearmedizin, Universitätsklinikum Carl Gustav Carus Dresden, Dresden, Germany
| | - Oliver Bethge
- German Cancer Consortium, Heidelberg, Germany.,Medical Faculty, Department of Diagnostic and Interventional Radiology, University Düsseldorf, Düsseldorf, Germany
| | - Lars Schimmöller
- German Cancer Consortium, Heidelberg, Germany.,Medical Faculty, Department of Diagnostic and Interventional Radiology, University Düsseldorf, Düsseldorf, Germany
| | - Gerald Antoch
- German Cancer Consortium, Heidelberg, Germany.,Medical Faculty, Department of Diagnostic and Interventional Radiology, University Düsseldorf, Düsseldorf, Germany
| | - Hans-Wilhelm Müller
- German Cancer Consortium, Heidelberg, Germany.,Klinik für Nuklearmedizin, Universitätsklinikum Düsseldorf, Düsseldorf, Germany
| | - Andreas Daul
- German Cancer Consortium, Heidelberg, Germany.,Klinik für Diagnostische und Interventionelle Radiologie, Universitätsklinikum Tübingen, Tübingen, Germany
| | - Konstantin Nikolaou
- German Cancer Consortium, Heidelberg, Germany.,Klinik für Diagnostische und Interventionelle Radiologie, Universitätsklinikum Tübingen, Tübingen, Germany
| | - Christian la Fougère
- German Cancer Consortium, Heidelberg, Germany.,Klinik für Nuklearmedizin und Klinische Molekulare Bildgebung, Universitätsklinikum Tübingen, Tübingen, Germany
| | - Wolfgang G Kunz
- German Cancer Consortium, Heidelberg, Germany.,Department of Radiology, University Hospital, Ludwig Maximilian University Munich, Munich, Germany
| | - Michael Ingrisch
- German Cancer Consortium, Heidelberg, Germany.,Department of Radiology, University Hospital, Ludwig Maximilian University Munich, Munich, Germany
| | - Balthasar Schachtner
- German Cancer Consortium, Heidelberg, Germany.,Department of Radiology, University Hospital, Ludwig Maximilian University Munich, Munich, Germany.,German Center of Lung Research, Giessen, Germany
| | - Jens Ricke
- German Cancer Consortium, Heidelberg, Germany.,Department of Radiology, University Hospital, Ludwig Maximilian University Munich, Munich, Germany
| | - Peter Bartenstein
- German Cancer Consortium, Heidelberg, Germany.,Klinik und Poliklinik für Nuklearmedizin, Klinikum der Universität München, München, Germany
| | - Felix Nensa
- German Cancer Consortium, Heidelberg, Germany.,Institut für Diagnostische und Interventionelle Radiologie und Neuroradiologie, Universitätsklinikum Essen AöR, Essen, Germany
| | - Alexander Radbruch
- German Cancer Consortium, Heidelberg, Germany.,Institut für Diagnostische und Interventionelle Radiologie und Neuroradiologie, Universitätsklinikum Essen AöR, Essen, Germany
| | - Lale Umutlu
- German Cancer Consortium, Heidelberg, Germany.,Institut für Diagnostische und Interventionelle Radiologie und Neuroradiologie, Universitätsklinikum Essen AöR, Essen, Germany
| | - Michael Forsting
- German Cancer Consortium, Heidelberg, Germany.,Institut für Diagnostische und Interventionelle Radiologie und Neuroradiologie, Universitätsklinikum Essen AöR, Essen, Germany
| | - Robert Seifert
- German Cancer Consortium, Heidelberg, Germany.,Klinik für Nuklearmedizin, Universitätsklinikum Essen AöR, Essen, Germany
| | - Ken Herrmann
- German Cancer Consortium, Heidelberg, Germany.,Klinik für Nuklearmedizin, Universitätsklinikum Essen AöR, Essen, Germany
| | - Philipp Mayer
- German Cancer Consortium, Heidelberg, Germany.,Klinik Diagnostische und Interventionelle Radiologie der Universität Heidelberg, Heidelberg, Germany
| | - Hans-Ulrich Kauczor
- German Cancer Consortium, Heidelberg, Germany.,Klinik Diagnostische und Interventionelle Radiologie der Universität Heidelberg, Heidelberg, Germany.,German Center of Lung Research, Giessen, Germany
| | - Tobias Penzkofer
- German Cancer Consortium, Heidelberg, Germany.,Klinik für Radiologie (mit dem Bereich Kinderradiologie), Charité Universitätsmedizin Berlin, Berlin, Germany
| | - Bernd Hamm
- German Cancer Consortium, Heidelberg, Germany.,Klinik für Radiologie (mit dem Bereich Kinderradiologie), Charité Universitätsmedizin Berlin, Berlin, Germany
| | - Winfried Brenner
- German Cancer Consortium, Heidelberg, Germany.,Klinik für Nuklearmedizin, Charité-Universitätsmedizin Berlin, Berlin, Germany
| | - Roman Kloeckner
- German Cancer Consortium, Heidelberg, Germany.,Klinik und Poliklinik für Diagnostische und Interventionelle Radiologie, Universitätsmedizin Mainz, Mainz, Germany
| | - Christoph Düber
- German Cancer Consortium, Heidelberg, Germany.,Klinik und Poliklinik für Diagnostische und Interventionelle Radiologie, Universitätsmedizin Mainz, Mainz, Germany
| | - Mathias Schreckenberger
- German Cancer Consortium, Heidelberg, Germany.,Klinik und Poliklinik für Nuklearmedizin, Universitätsmedizin Mainz, Mainz, Germany
| | - Rickmer Braren
- German Cancer Consortium, Heidelberg, Germany.,Institut für Diagnostische und Interventionelle Radiologie, Klinikum Rechts der Isar, Technical University of Munich, Munich, Germany
| | - Georgios Kaissis
- German Cancer Consortium, Heidelberg, Germany.,Pattern Analysis and Learning Group, Radio-oncology and Clinical Radiotherapy, Heidelberg University Hospital, Heidelberg, Germany.,Institut für Diagnostische und Interventionelle Radiologie, Klinikum Rechts der Isar, Technical University of Munich, Munich, Germany.,Department of Computing, Imperial College London, London, United Kingdom
| | - Marcus Makowski
- German Cancer Consortium, Heidelberg, Germany.,Institut für Diagnostische und Interventionelle Radiologie, Klinikum Rechts der Isar, Technical University of Munich, Munich, Germany
| | - Matthias Eiber
- German Cancer Consortium, Heidelberg, Germany.,Klinik und Poliklinik für Nuklearmedizin, Klinikum Rechts der Isar, Technical University of Munich, Munich, Germany
| | - Andrei Gafita
- German Cancer Consortium, Heidelberg, Germany.,Klinik und Poliklinik für Nuklearmedizin, Klinikum Rechts der Isar, Technical University of Munich, Munich, Germany
| | - Rupert Trager
- German Cancer Consortium, Heidelberg, Germany.,Klinik und Poliklinik für Nuklearmedizin, Klinikum Rechts der Isar, Technical University of Munich, Munich, Germany
| | - Wolfgang A Weber
- German Cancer Consortium, Heidelberg, Germany.,Klinik und Poliklinik für Nuklearmedizin, Klinikum Rechts der Isar, Technical University of Munich, Munich, Germany
| | - Jakob Neubauer
- German Cancer Consortium, Heidelberg, Germany.,Klinik für Diagnostische und Interventionelle Radiologie, Universitätsklinikum Freiburg, Freiburg, Germany
| | - Marco Reisert
- German Cancer Consortium, Heidelberg, Germany.,Klinik für Diagnostische und Interventionelle Radiologie, Universitätsklinikum Freiburg, Freiburg, Germany
| | - Michael Bock
- German Cancer Consortium, Heidelberg, Germany.,Klinik für Diagnostische und Interventionelle Radiologie, Universitätsklinikum Freiburg, Freiburg, Germany
| | - Fabian Bamberg
- German Cancer Consortium, Heidelberg, Germany.,Klinik für Diagnostische und Interventionelle Radiologie, Universitätsklinikum Freiburg, Freiburg, Germany
| | - Jürgen Hennig
- German Cancer Consortium, Heidelberg, Germany.,Klinik für Diagnostische und Interventionelle Radiologie, Universitätsklinikum Freiburg, Freiburg, Germany
| | - Philipp Tobias Meyer
- German Cancer Consortium, Heidelberg, Germany.,Klinik für Nuklearmedizin, Universitätsklinikum Freiburg, Freiburg, Germany
| | - Juri Ruf
- German Cancer Consortium, Heidelberg, Germany.,Klinik für Nuklearmedizin, Universitätsklinikum Freiburg, Freiburg, Germany
| | - Uwe Haberkorn
- German Cancer Consortium, Heidelberg, Germany.,Klinische Kooperationseinheit Nuklearmedizin, Deutsches Krebsforschungszentrum Heidelberg, Heidelberg, Germany
| | - Stefan O Schoenberg
- German Cancer Consortium, Heidelberg, Germany.,Universitätsmedizin Mannheim, Medizinische Fakultät Mannheim der Universität Heidelberg, Heidelberg, Germany
| | - Tristan Kuder
- German Cancer Consortium, Heidelberg, Germany.,Medizinische Physik in der Radiologie, Deutsches Krebsforschungszentrum Heidelberg, Heidelberg, Germany
| | - Peter Neher
- Division of Medical Image Computing, German Cancer Research Center, Heidelberg, Germany.,German Cancer Consortium, Heidelberg, Germany
| | - Ralf Floca
- Division of Medical Image Computing, German Cancer Research Center, Heidelberg, Germany.,German Cancer Consortium, Heidelberg, Germany.,Pattern Analysis and Learning Group, Radio-oncology and Clinical Radiotherapy, Heidelberg University Hospital, Heidelberg, Germany
| | - Heinz-Peter Schlemmer
- Medical Faculty Heidelberg, University of Heidelberg, Heidelberg, Germany.,German Cancer Consortium, Heidelberg, Germany.,Division of Radiology, German Cancer Research Center, Heidelberg, Germany
| | - Klaus Maier-Hein
- Division of Medical Image Computing, German Cancer Research Center, Heidelberg, Germany.,Pattern Analysis and Learning Group, Radio-oncology and Clinical Radiotherapy, Heidelberg University Hospital, Heidelberg, Germany
| |
Collapse
|
18
|
Verma AA, Pasricha SV, Jung HY, Kushnir V, Mak DYF, Koppula R, Guo Y, Kwan JL, Lapointe-Shaw L, Rawal S, Tang T, Weinerman A, Razak F. Assessing the quality of clinical and administrative data extracted from hospitals: the General Medicine Inpatient Initiative (GEMINI) experience. J Am Med Inform Assoc 2021; 28:578-587. [PMID: 33164061 DOI: 10.1093/jamia/ocaa225] [Citation(s) in RCA: 45] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2020] [Revised: 08/27/2020] [Accepted: 09/14/2020] [Indexed: 12/20/2022] Open
Abstract
OBJECTIVE Large clinical databases are increasingly used for research and quality improvement. We describe an approach to data quality assessment from the General Medicine Inpatient Initiative (GEMINI), which collects and standardizes administrative and clinical data from hospitals. METHODS The GEMINI database contained 245 559 patient admissions at 7 hospitals in Ontario, Canada from 2010 to 2017. We performed 7 computational data quality checks and iteratively re-extracted data from hospitals to correct problems. Thereafter, GEMINI data were compared to data that were manually abstracted from the hospital's electronic medical record for 23 419 selected data points on a sample of 7488 patients. RESULTS Computational checks flagged 103 potential data quality issues, which were either corrected or documented to inform future analysis. For example, we identified the inclusion of canceled radiology tests, a time shift of transfusion data, and mistakenly processing the chemical symbol for sodium ("Na") as a missing value. Manual validation identified 1 important data quality issue that was not detected by computational checks: transfusion dates and times at 1 site were unreliable. Apart from that single issue, across all data tables, GEMINI data had high overall accuracy (ranging from 98%-100%), sensitivity (95%-100%), specificity (99%-100%), positive predictive value (93%-100%), and negative predictive value (99%-100%) compared to the gold standard. DISCUSSION AND CONCLUSION Computational data quality checks with iterative re-extraction facilitated reliable data collection from hospitals but missed 1 critical quality issue. Combining computational and manual approaches may be optimal for assessing the quality of large multisite clinical databases.
Collapse
Affiliation(s)
- Amol A Verma
- Li Ka Shing Knowledge Institute, St. Michael's Hospital, Toronto, Ontario, Canada.,Department of Medicine, University of Toronto, Toronto, Ontario, Canada.,Institute for Health Policy, Management and Evaluation, University of Toronto, Toronto, Ontario, Canada
| | - Sachin V Pasricha
- Li Ka Shing Knowledge Institute, St. Michael's Hospital, Toronto, Ontario, Canada.,School of Medicine, Faculty of Health Sciences, Queen's University, Kingston, Ontario, Canada
| | - Hae Young Jung
- Li Ka Shing Knowledge Institute, St. Michael's Hospital, Toronto, Ontario, Canada
| | - Vladyslav Kushnir
- Li Ka Shing Knowledge Institute, St. Michael's Hospital, Toronto, Ontario, Canada
| | - Denise Y F Mak
- Li Ka Shing Knowledge Institute, St. Michael's Hospital, Toronto, Ontario, Canada
| | - Radha Koppula
- Li Ka Shing Knowledge Institute, St. Michael's Hospital, Toronto, Ontario, Canada
| | - Yishan Guo
- Li Ka Shing Knowledge Institute, St. Michael's Hospital, Toronto, Ontario, Canada
| | - Janice L Kwan
- Department of Medicine, University of Toronto, Toronto, Ontario, Canada.,Department of Medicine, Mount Sinai Hospital, Toronto, Ontario, Canada
| | - Lauren Lapointe-Shaw
- Department of Medicine, University of Toronto, Toronto, Ontario, Canada.,Division of General Internal Medicine, University Health Network, Toronto, Ontario, Canada.,Institute for Clinical and Evaluative Sciences, Toronto, Ontario, Canada
| | - Shail Rawal
- Department of Medicine, University of Toronto, Toronto, Ontario, Canada.,Division of General Internal Medicine, University Health Network, Toronto, Ontario, Canada
| | - Terence Tang
- Department of Medicine, University of Toronto, Toronto, Ontario, Canada.,Institute for Better Health, Trillium Health Partners, Toronto, Ontario, Canada
| | - Adina Weinerman
- Department of Medicine, University of Toronto, Toronto, Ontario, Canada.,Sunnybrook Health Sciences Centre, Toronto, Ontario, Canada
| | - Fahad Razak
- Li Ka Shing Knowledge Institute, St. Michael's Hospital, Toronto, Ontario, Canada.,Department of Medicine, University of Toronto, Toronto, Ontario, Canada.,Institute for Health Policy, Management and Evaluation, University of Toronto, Toronto, Ontario, Canada
| |
Collapse
|
19
|
Tubaishat A. The Adoption of Electronic Health Records in Primary Healthcare Settings. Comput Inform Nurs 2021; 39:883-889. [PMID: 34101659 DOI: 10.1097/cin.0000000000000755] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Little is known about the adoption rate of electronic health records in primary healthcare settings. This study aimed to estimate this adoption rate in Jordan, using a national survey with a descriptive cross-sectional design. The first step was to review the annual report of the Jordanian Ministry of Health as a basis for identifying the sample, which are primary healthcare settings. Then, Electronic Health Solutions, the company that vends electronic health record systems in Jordan, was used to determine which primary healthcare settings were using these electronic record systems. The final task was to determine which functionalities of the system were being used in these settings, and for this, a telephone survey of key personnel was conducted. It was found that 21.6% of the primary healthcare settings were using electronic health records, while the other 78.4% were still relying on paper records. The results also showed that the adoption rate of electronic health records was significantly associated with the type of the setting (P < .001), its size (P < .001), location (P < .001), and region (P = .04). As a conclusion, the adoption rate in Jordan is still in its infancy compared with those in developed countries. Policy and decision makers should therefore be focused on minimizing any challenges or obstacles to such adoption.
Collapse
Affiliation(s)
- Ahmad Tubaishat
- Author Affiliation: Department of Adult Health Nursing, School of Nursing, Al Al-Bayt University, Mafraq, Jordan
| |
Collapse
|
20
|
Bian J, Lyu T, Loiacono A, Viramontes TM, Lipori G, Guo Y, Wu Y, Prosperi M, George TJ, Harle CA, Shenkman EA, Hogan W. Assessing the practice of data quality evaluation in a national clinical data research network through a systematic scoping review in the era of real-world data. J Am Med Inform Assoc 2021; 27:1999-2010. [PMID: 33166397 PMCID: PMC7727392 DOI: 10.1093/jamia/ocaa245] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2020] [Revised: 09/13/2020] [Accepted: 09/18/2020] [Indexed: 11/13/2022] Open
Abstract
Objective To synthesize data quality (DQ) dimensions and assessment methods of real-world data, especially electronic health records, through a systematic scoping review and to assess the practice of DQ assessment in the national Patient-centered Clinical Research Network (PCORnet). Materials and Methods We started with 3 widely cited DQ literature—2 reviews from Chan et al (2010) and Weiskopf et al (2013a) and 1 DQ framework from Kahn et al (2016)—and expanded our review systematically to cover relevant articles published up to February 2020. We extracted DQ dimensions and assessment methods from these studies, mapped their relationships, and organized a synthesized summarization of existing DQ dimensions and assessment methods. We reviewed the data checks employed by the PCORnet and mapped them to the synthesized DQ dimensions and methods. Results We analyzed a total of 3 reviews, 20 DQ frameworks, and 226 DQ studies and extracted 14 DQ dimensions and 10 assessment methods. We found that completeness, concordance, and correctness/accuracy were commonly assessed. Element presence, validity check, and conformance were commonly used DQ assessment methods and were the main focuses of the PCORnet data checks. Discussion Definitions of DQ dimensions and methods were not consistent in the literature, and the DQ assessment practice was not evenly distributed (eg, usability and ease-of-use were rarely discussed). Challenges in DQ assessments, given the complex and heterogeneous nature of real-world data, exist. Conclusion The practice of DQ assessment is still limited in scope. Future work is warranted to generate understandable, executable, and reusable DQ measures.
Collapse
Affiliation(s)
- Jiang Bian
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, Florida, USA.,Cancer Informatics Shared Resource, University of Florida Health Cancer Center, Gainesville, Florida, USA
| | - Tianchen Lyu
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, Florida, USA
| | - Alexander Loiacono
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, Florida, USA
| | - Tonatiuh Mendoza Viramontes
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, Florida, USA
| | - Gloria Lipori
- Clinical and Translational Institute, University of Florida, Gainesville, Florida, USA
| | - Yi Guo
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, Florida, USA
| | - Yonghui Wu
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, Florida, USA
| | - Mattia Prosperi
- Department of Epidemiology, College of Public Health and Health Professions & College of Medicine, University of Florida, Gainesville, Florida, USA
| | - Thomas J George
- Hematology & Oncology, Department of Medicine, College of Medicine, University of Florida, Gainesville, Florida, USA
| | - Christopher A Harle
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, Florida, USA
| | - Elizabeth A Shenkman
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, Florida, USA
| | - William Hogan
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, Florida, USA
| |
Collapse
|
21
|
Garies S, McBrien K, Quan H, Manca D, Drummond N, Williamson T. A data quality assessment to inform hypertension surveillance using primary care electronic medical record data from Alberta, Canada. BMC Public Health 2021; 21:264. [PMID: 33530975 PMCID: PMC7852125 DOI: 10.1186/s12889-021-10295-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2019] [Accepted: 01/20/2021] [Indexed: 11/16/2022] Open
Abstract
BACKGROUND Hypertension is a common chronic condition affecting nearly a quarter of Canadians. Hypertension surveillance in Canada typically relies on administrative data and/or national surveys. Routinely-captured data from primary care electronic medical records (EMRs) are a complementary source for chronic disease surveillance, with longitudinal patient-level details such as sociodemographics, blood pressure, weight, prescribed medications, and behavioural risk factors. As EMR data are generated from patient care and administrative tasks, assessing data quality is essential before using for secondary purposes. This study evaluated the quality of primary care EMR data from one province in Canada within the context of hypertension surveillance. METHODS We conducted a cross-sectional, descriptive study using primary care EMR data collected by two practice-based research networks in Alberta, Canada. There were 48,377 adults identified with hypertension from 53 clinics as of June 2018. Summary statistics were used to examine the quality of data elements considered relevant for hypertension surveillance. RESULTS Patient year of birth and sex were complete, but other sociodemographic information (ethnicity, occupation, education) was largely incomplete and highly variable. Height, weight, body mass index and blood pressure were complete for most patients (over 90%), but a small proportion of outlying values indicate data inaccuracies were present. Most patients had a relevant laboratory test present (e.g. blood glucose/glycated hemoglobin, lipid profile), though a very small proportion of values were outside a biologically plausible range. Details of prescribed antihypertensive medication, such as start date, strength, dose, frequency, were mostly complete. Nearly 80% of patients had a smoking status recorded, though only 66% had useful information (i.e. categorized as current, past, or never), and less than half had their alcohol use described; information related to amount, frequency or duration was not available. CONCLUSIONS Blood pressure and prescribed medications in primary care EMR data demonstrated good completeness and plausibility, and contribute valuable information for hypertension epidemiology and surveillance. The use of other clinical, laboratory, and sociodemographic variables should be used carefully due to variable completeness and suspected data errors. Additional strategies to improve these data at the point of entry and after data extraction (e.g. statistical methods) are required.
Collapse
Affiliation(s)
- Stephanie Garies
- Department of Family Medicine, University of Calgary, G012 Health Sciences Centre, 3330 Hospital Drive NW, Calgary, Alberta, T2N 4N1, Canada.
| | - Kerry McBrien
- Department of Family Medicine, University of Calgary, G012 Health Sciences Centre, 3330 Hospital Drive NW, Calgary, Alberta, T2N 4N1, Canada
- Department of Community Health Sciences, University of Calgary, 3280 Hospital Drive NW, Calgary, Alberta, T2N 4Z6, Canada
| | - Hude Quan
- Department of Community Health Sciences, University of Calgary, 3280 Hospital Drive NW, Calgary, Alberta, T2N 4Z6, Canada
| | - Donna Manca
- Department of Family Medicine, University of Alberta, 6-10 University Terrace, Edmonton, Alberta, T6G 2T4, Canada
| | - Neil Drummond
- Department of Family Medicine, University of Calgary, G012 Health Sciences Centre, 3330 Hospital Drive NW, Calgary, Alberta, T2N 4N1, Canada
- Department of Community Health Sciences, University of Calgary, 3280 Hospital Drive NW, Calgary, Alberta, T2N 4Z6, Canada
- Department of Family Medicine, University of Alberta, 6-10 University Terrace, Edmonton, Alberta, T6G 2T4, Canada
- School of Public Health, University of Alberta, 3-300 Edmonton Clinic Health Academy, 11405-87 Ave, Edmonton, Alberta, T6G 1C9, Canada
| | - Tyler Williamson
- Department of Community Health Sciences, University of Calgary, 3280 Hospital Drive NW, Calgary, Alberta, T2N 4Z6, Canada
| |
Collapse
|
22
|
Mohmad N, Hassan NH, Samy GN, Abd. Aziz NS, Maarop N, Abu Bakar NA. A Review of Factors Influencing Patient Readmission Based on Data Quality Dimension Model. 2020 8TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND MULTIMEDIA (ICIMU) 2020. [DOI: 10.1109/icimu49871.2020.9243500] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/02/2023]
|
23
|
Alsabbagh MW, Kueper JK, Wong ST, Burge F, Johnston S, Peterson S, Lawson B, Chung H, Bennett M, Blackman S, McGrail K, Campbell J, Hogg W, Glazier R. Development of comparable algorithms to measure primary care indicators using administrative health data across three Canadian provinces. Int J Popul Data Sci 2020; 5:1340. [PMID: 33644408 PMCID: PMC7893851 DOI: 10.23889/ijpds.v5i1.1340] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022] Open
Abstract
INTRODUCTION Performance measurement has been recognized as key to transforming primary care (PC). Yet, performance reporting in PC lags behind even though high-performing PC is foundational to an effective and efficient health care system. OBJECTIVES We used administrative data from three Canadian provinces, British Columbia, Ontario and Nova Scotia, to: 1) identify and develop a core set of PC performance indicators using administrative data and 2) examine their ability to capture PC performance. METHODS Administrative data used included Physician Billings, Discharge Abstract Database, the National Ambulatory Care and Reporting System database, Census and Vital Statistics. Indicators were compiled based on a literature review of PC indicators previously developed with administrative data available in Canada (n=158). We engaged in iterative discussions to assess data conformity, completeness, and plausibility of results in all jurisdictions. Challenges to creating comparable algorithms were examined through content analysis and research team discussions, which included clinicians, analysts, and health services researchers familiar with PC. RESULTS Our final list included 21 PC performance indicators pertaining to 1) technical care (n=4), 2) continuity of care (n=6), and 3) health services utilization (n=11). Establishing comparable algorithms across provinces was possible though time intensive. A major challenge was inconsistent data elements. Ease of data access, and a deep understanding of the data and practice context, was essential for selecting the most appropriate data elements. CONCLUSIONS This project is unique in creating algorithms to measure PC performance across provinces. It was essential to balance internal validity of the indicators within a province and external validity across provinces. The intuitive desire of having the exact same coding across provinces was infeasible due to lack of standardized PC data. Rather, a context-tailored definition was developed for each jurisdiction. This work serves as an example for developing comparable PC performance indicators across different provincial/territorial jurisdictions.
Collapse
Affiliation(s)
| | | | - ST Wong
- University of British Columbia
| | | | - S Johnston
- Bruyère Research Institute, University of Ottawa
| | | | | | | | | | | | | | | | - W Hogg
- University of Ottawa, Montfort Hospital Research Institute
| | | |
Collapse
|
24
|
Black JE, Terry AL, Lizotte DJ. Development and evaluation of an osteoarthritis risk model for integration into primary care health information technology. Int J Med Inform 2020; 141:104160. [PMID: 32593009 DOI: 10.1016/j.ijmedinf.2020.104160] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2019] [Revised: 02/28/2020] [Accepted: 04/24/2020] [Indexed: 11/15/2022]
Abstract
BACKGROUND We developed and evaluated a prognostic prediction model that estimates osteoarthritis risk for use by patients and practitioners that is designed to be appropriate for integration into primary care health information technology systems. Osteoarthritis, a joint disorder characterized by pain and stiffness, causes significant morbidity among older Canadians. Because our prognostic prediction model for osteoarthritis risk uses data that are readily available in primary care settings, it supports targeting of interventions delivered as part of clinical practice that are aimed at risk reduction. METHODS We used the CPCSSN (Canadian Primary Sentinel Surveillance Network) database, which contains aggregated electronic health information from a cohort of primary care practices, to develop and evaluate a prognostic prediction model to estimate 5-year osteoarthritis risk, addressing contextual challenges of data availability and missingness. We constructed a retrospective cohort of 383,117 eligible primary care patients who were included in the cohort if they had an encounter with their primary care practitioner between 1 January 2009 and 31 December 2010. Patients were excluded if they had a diagnosis of osteoarthritis prior to their first visit in this time period. Incident cases of osteoarthritis were observed. The model was constructed to predict incident osteoarthritis based on age, sex, BMI, previous leg injury, and osteoporosis. Evaluation of the model used internal 10-fold cross-validation; we argue that internal validation is particularly appropriate for a model that is to be integrated into the same context from which the data were derived. RESULTS The resulting prediction model for 5-year risk of osteoarthritis diagnosis demonstrated state-of-the-art discrimination (estimated AUROC 0.84) and good calibration (assessed visually.) The model relies only on information that is readily available in Canadian primary care settings, and hence is appropriate for integration into Canadian primary care health information technology. CONCLUSIONS If the contextual challenges arising when using primary care electronic medical record data are appropriately addressed, highly discriminative models for osteoarthritis risk may be constructed using only data commonly available in primary care. Because the models are constructed from data in the same setting where the model is to be applied, internal validation provides strong evidence that the resulting model will perform well in its intended application.
Collapse
Affiliation(s)
- Jason E Black
- Graduate Program in Epidemiology & Biostatistics, Western University, 1151 Richmond Street, London, Ontario, N6A 5C1, Canada.
| | - Amanda L Terry
- Department of Family Medicine, Department of Epidemiology & Biostatistics, Schulich Interfaculty Program in Public Health, Western University, 1151 Richmond Street, London, Ontario, N6A 3K7, Canada.
| | - Daniel J Lizotte
- Department of Computer Science, Department of Epidemiology & Biostatistics, Schulich Interfaculty Program in Public Health, Department of Statistical and Actuarial Sciences, 1151 Richmond Street, Western University, London, Ontario, N6A 3K7, Canada.
| |
Collapse
|