Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Afshar AS, Li Y, Chen Z, Chen Y, Lee JH, Irani D, Crank A, Singh D, Kanter M, Faraday N, Kharrazi H. An exploratory data quality analysis of time series physiologic signals using a large-scale intensive care unit database. JAMIA Open 2021;4:ooab057. [PMID: 34350392 PMCID: PMC8327372 DOI: 10.1093/jamiaopen/ooab057] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2021] [Revised: 06/04/2021] [Accepted: 07/02/2021] [Indexed: 11/14/2022] Open

For:	Afshar AS, Li Y, Chen Z, Chen Y, Lee JH, Irani D, Crank A, Singh D, Kanter M, Faraday N, Kharrazi H. An exploratory data quality analysis of time series physiologic signals using a large-scale intensive care unit database. JAMIA Open 2021;4:ooab057. [PMID: 34350392 PMCID: PMC8327372 DOI: 10.1093/jamiaopen/ooab057] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2021] [Revised: 06/04/2021] [Accepted: 07/02/2021] [Indexed: 11/14/2022] Open

Number

Cited by Other Article(s)

Deschepper M, Colpaert K. Creating awareness of the heterogeneity of the intensive care unit population and its impact on generalizability of results and transportability of models. Intensive Crit Care Nurs 2024;80:103565. [PMID: 37875048 DOI: 10.1016/j.iccn.2023.103565] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2023]

Searle T, Ibrahim Z, Teo J, Dobson RJB. Discharge summary hospital course summarisation of in patient Electronic Health Record text with clinical concept guided deep pre-trained Transformer models. J Biomed Inform 2023;141:104358. [PMID: 37023846 DOI: 10.1016/j.jbi.2023.104358] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2022] [Revised: 03/29/2023] [Accepted: 04/02/2023] [Indexed: 04/08/2023]

Syed R, Eden R, Makasi T, Chukwudi I, Mamudu A, Kamalpour M, Kapugama Geeganage D, Sadeghianasl S, Leemans SJJ, Goel K, Andrews R, Wynn MT, Ter Hofstede A, Myers T. Digital Health Data Quality Issues: Systematic Review. J Med Internet Res 2023;25:e42615. [PMID: 37000497 PMCID: PMC10131725 DOI: 10.2196/42615] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2022] [Revised: 12/07/2022] [Accepted: 12/31/2022] [Indexed: 04/01/2023] Open

Abstract

BACKGROUND

The promise of digital health is principally dependent on the ability to electronically capture data that can be analyzed to improve decision-making. However, the ability to effectively harness data has proven elusive, largely because of the quality of the data captured. Despite the importance of data quality (DQ), an agreed-upon DQ taxonomy evades literature. When consolidated frameworks are developed, the dimensions are often fragmented, without consideration of the interrelationships among the dimensions or their resultant impact.

OBJECTIVE

The aim of this study was to develop a consolidated digital health DQ dimension and outcome (DQ-DO) framework to provide insights into 3 research questions: What are the dimensions of digital health DQ? How are the dimensions of digital health DQ related? and What are the impacts of digital health DQ?

METHODS

Following the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines, a developmental systematic literature review was conducted of peer-reviewed literature focusing on digital health DQ in predominately hospital settings. A total of 227 relevant articles were retrieved and inductively analyzed to identify digital health DQ dimensions and outcomes. The inductive analysis was performed through open coding, constant comparison, and card sorting with subject matter experts to identify digital health DQ dimensions and digital health DQ outcomes. Subsequently, a computer-assisted analysis was performed and verified by DQ experts to identify the interrelationships among the DQ dimensions and relationships between DQ dimensions and outcomes. The analysis resulted in the development of the DQ-DO framework.

RESULTS

The digital health DQ-DO framework consists of 6 dimensions of DQ, namely accessibility, accuracy, completeness, consistency, contextual validity, and currency; interrelationships among the dimensions of digital health DQ, with consistency being the most influential dimension impacting all other digital health DQ dimensions; 5 digital health DQ outcomes, namely clinical, clinician, research-related, business process, and organizational outcomes; and relationships between the digital health DQ dimensions and DQ outcomes, with the consistency and accessibility dimensions impacting all DQ outcomes.

CONCLUSIONS

The DQ-DO framework developed in this study demonstrates the complexity of digital health DQ and the necessity for reducing digital health DQ issues. The framework further provides health care executives with holistic insights into DQ issues and resultant outcomes, which can help them prioritize which DQ-related problems to tackle first.

Collapse

Chen JS, Lin WC, Yang S, Chiang MF, Hribar MR. Development of an Open-Source Annotated Glaucoma Medication Dataset From Clinical Notes in the Electronic Health Record. Transl Vis Sci Technol 2022;11:20. [PMID: 36441131 PMCID: PMC9710490 DOI: 10.1167/tvst.11.11.20] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2022] [Accepted: 10/21/2022] [Indexed: 11/30/2022] Open

Abstract

Purpose

To describe the methods involved in processing and characteristics of an open dataset of annotated clinical notes from the electronic health record (EHR) annotated for glaucoma medications.

Methods

In this study, 480 clinical notes from office visits, medical record numbers (MRNs), visit identification numbers, provider names, and billing codes were extracted for 480 patients seen for glaucoma by a comprehensive or glaucoma ophthalmologist from January 1, 2019, to August 31, 2020. MRNs and all visit data were de-identified using a hash function with salt from the deidentifyr package. All progress notes were annotated for glaucoma medication name, route, frequency, dosage, and drug use using an open-source annotation tool, Doccano. Annotations were saved separately. All protected health information (PHI) in progress notes and annotated files were de-identified using the published de-identifying algorithm Philter. All progress notes and annotations were manually validated by two ophthalmologists to ensure complete de-identification.

Results

The final dataset contained 5520 annotated sentences, including those with and without medications, for 480 clinical notes. Manual validation revealed 10 instances of remaining PHI which were manually corrected.

Conclusions

Annotated free-text clinical notes can be de-identified for upload as an open dataset. As data availability increases with the adoption of EHRs, free-text open datasets will become increasingly valuable for "big data" research and artificial intelligence development. This dataset is published online and publicly available at https://github.com/jche253/Glaucoma_Med_Dataset.

Translational Relevance

This open access medication dataset may be a source of raw data for future research involving big data and artificial intelligence research using free-text.

Collapse

Kharrazi H, Chang HY, Weiner JP, Gudzune KA. Assessing the Added Value of Blood Pressure Information Derived from Electronic Health Records in Predicting Health Care Cost and Utilization. Popul Health Manag 2021;25:323-334. [PMID: 34847729 DOI: 10.1089/pop.2021.0250] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open

Abstract

Health care providers are increasingly using clinical measures derived from electronic health records (EHRs) for risk stratification and predictive modeling. EHR-specific data elements such as prescriptions, laboratory results, and vital signs have been shown to improve risk prediction models. In this study, the value of EHR-based blood pressure (BP) values was assessed in predicting health care costs (ie, total, medical, and pharmacy) and key utilization end points (ie, hospitalization, emergency department use, and being among the highest utilizers). The study population included 37,451 patients of a large integrated delivery system in the mid-western United States with complete EHR data files, who were 18-64 years old, had continuous insurance at an affiliated health plan, and had eligible BP records. Both EHRs and insurance claims of the study population were used to extract the predictors (ie, demographics, diagnosis, and BP values) and outcomes (ie, costs and utilizations). Predictors were extracted from 2012 data, whereas concurrent and prospective outcomes were extracted from 2012 to 2013 data. Three base models (BMs) were constructed to predict each of the outcomes. The first BM no. 1 used demographics. The second BM no. 2 added the Charlson comorbidity index to BM no. 1, whereas the third BM no. 3 added the Adjusted Clinical Group Dx-PM case-mix score to BM no. 1. BP was specified as means, ranges, and classes. Adding BP ranges to BM no. 1 and BM no. 2 showed the greatest improvements when predicting costs and utilization. More specifically, adjusted R² and area under the curve of BM no. 2 improved by 32.9% and 14.1% when BP ranges were added to predict concurrent total cost and hospitalization, respectively. The effect of BP measures on improving the risk stratification models was diminished when predicting prospective outcomes after adding the measures to BM no. 3 (ie, the more comprehensive diagnostic model), specifically when represented as BP means. Given the increasing availability of BP information, this research suggests that these data should be integrated into provider-based population health analytic activities. Future research should focus on subpopulations that benefit the most from incorporating vital signs such as BP measures in risk stratification models.

Collapse