Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

Total Articles: 1473 (from Reference Citation Analysis)
Article PDFs: 481
Cited by > 0: 1039
Searched Name: Electronic health records

Göğebakan K, Ulu R, Abiyev R, Şah M. A drug prescription recommendation system based on novel DIAKID ontology and extensive semantic rules. Health Inf Sci Syst 2024;12:27. [PMID: 38524804 PMCID: PMC10960787 DOI: 10.1007/s13755-024-00286-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/10/2023] [Accepted: 02/28/2024] [Indexed: 03/26/2024]

Chen JS, Copado IA, Vallejos C, Kalaw FGP, Soe P, Cai CX, Toy BC, Borkar D, Sun CQ, Shantha JG, Baxter SL. Variations in Electronic Health Record-Based Definitions of Diabetic Retinopathy Cohorts: A Literature Review and Quantitative Analysis. Ophthalmology Science 2024;4:100468. [PMID: 38560278 PMCID: PMC10973665 DOI: 10.1016/j.xops.2024.100468] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/11/2023] [Revised: 01/04/2024] [Accepted: 01/11/2024] [Indexed: 04/04/2024]

Abstract

Purpose

Use of the electronic health record (EHR) has motivated the need for data standardization. A gap in knowledge exists regarding variations in existing terminologies for defining diabetic retinopathy (DR) cohorts. This study aimed to review the literature and analyze variations regarding codified definitions of DR.

Design

Literature review and quantitative analysis.

Subjects

Published manuscripts.

Methods

Four graders reviewed PubMed and Google Scholar for peer-reviewed studies. Studies were included if they used codified definitions of DR (e.g., billing codes). Data elements such as author names, publication year, purpose, data set type, and DR definitions were manually extracted. Each study was reviewed by ≥ 2 authors to validate inclusion eligibility. Quantitative analyses of the codified definitions were then performed to characterize the variation between DR cohort definitions.

Main Outcome Measures

Number of studies included and numeric counts of billing codes used to define codified cohorts.

Results

In total, 43 studies met the inclusion criteria. Half of the included studies used datasets based on structured EHR data (i.e., data registries, institutional EHR review), and half used claims data. All but 1 of the studies used billing codes such as the International Classification of Diseases 9th or 10th edition (ICD-9 or ICD-10), either alone or in addition to another terminology for defining disease. Of the 27 included studies that used ICD-9 and the 20 studies that used ICD-10 codes, the most common codes used pertained to the full spectrum of DR severity. Diabetic retinopathy complications (e.g., vitreous hemorrhage) were also used to define some DR cohorts.

Conclusions

Substantial variations exist among codified definitions for DR cohorts within retrospective studies. Variable definitions may limit generalizability and reproducibility of retrospective studies. More work is needed to standardize disease cohorts.

Financial Disclosures

Proprietary or commercial disclosure may be found in the Footnotes and Disclosures at the end of this article.

Affiliation(s)

Jimmy S Chen: Division of Ophthalmology Informatics and Data Science, Viterbi Family Department of Ophthalmology and Shiley Eye Institute, University of California San Diego, La Jolla, California; UCSD Health Department of Biomedical Informatics, University of California San Diego, La Jolla, California
Ivan A Copado: Division of Ophthalmology Informatics and Data Science, Viterbi Family Department of Ophthalmology and Shiley Eye Institute, University of California San Diego, La Jolla, California; UCSD Health Department of Biomedical Informatics, University of California San Diego, La Jolla, California
Cecilia Vallejos: Division of Ophthalmology Informatics and Data Science, Viterbi Family Department of Ophthalmology and Shiley Eye Institute, University of California San Diego, La Jolla, California; UCSD Health Department of Biomedical Informatics, University of California San Diego, La Jolla, California
Fritz Gerald P Kalaw: Division of Ophthalmology Informatics and Data Science, Viterbi Family Department of Ophthalmology and Shiley Eye Institute, University of California San Diego, La Jolla, California; UCSD Health Department of Biomedical Informatics, University of California San Diego, La Jolla, California
Priyanka Soe: Division of Ophthalmology Informatics and Data Science, Viterbi Family Department of Ophthalmology and Shiley Eye Institute, University of California San Diego, La Jolla, California; UCSD Health Department of Biomedical Informatics, University of California San Diego, La Jolla, California
Cindy X Cai: Wilmer Eye Institute, Johns Hopkins School of Medicine, Baltimore, Maryland
Brian C Toy: Department of Ophthalmology, Roski Eye Institute, Keck School of Medicine, University of Southern California, Los Angeles, California
Durga Borkar: Department of Ophthalmology, Duke Eye Center, Duke University, Durham, North Carolina
Catherine Q Sun: F.I. Proctor Foundation, University of California San Francisco, San Francisco, California; Department of Ophthalmology, University of California San Francisco, San Francisco, California
Jessica G Shantha: F.I. Proctor Foundation, University of California San Francisco, San Francisco, California; Department of Ophthalmology, University of California San Francisco, San Francisco, California
Sally L Baxter: Division of Ophthalmology Informatics and Data Science, Viterbi Family Department of Ophthalmology and Shiley Eye Institute, University of California San Diego, La Jolla, California; UCSD Health Department of Biomedical Informatics, University of California San Diego, La Jolla, California

Röchner P, Rothlauf F. Using machine learning to link electronic health records in cancer registries: On the tradeoff between linkage quality and manual effort. Int J Med Inform 2024;185:105387. [PMID: 38428200 DOI: 10.1016/j.ijmedinf.2024.105387] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/24/2023] [Revised: 10/05/2023] [Accepted: 02/20/2024] [Indexed: 03/03/2024]

Abstract

BACKGROUND

Cancer registries link a large number of electronic health records reported by medical institutions to already registered records of the matching individual and tumor. Records are automatically linked using deterministic and probabilistic approaches; machine learning is rarely used. Records that cannot be matched automatically with sufficient accuracy are typically processed manually. For application, it is important to know how well record linkage approaches match real-world records and how much manual effort is required to achieve the desired linkage quality. We study the task of linking reported records to the matching registered tumor in cancer registries.

METHODS

We compare the tradeoff between linkage quality and manual effort of five machine learning methods (logistic regression, random forest, gradient boosting, neural network, and a stacked method) to a deterministic baseline. The record linkage methods are compared in a two-class setting (no-match/match) and a three-class setting (no-match/undecided/match). A cancer registry collected and linked the dataset consisting of categorical variables matching 145,755 reported records with 33,289 registered tumors.

RESULTS

In the two-class setting, the gradient boosting, neural network, and stacked models have higher accuracy and F1 score (accuracy: 0.968-0.978, F1 score: 0.983-0.988) than the deterministic baseline (accuracy: 0.964, F1 score: 0.980) when the same records are manually processed (0.89% of all records). In the three-class setting, these three machine learning methods can automatically process all reported records and still have higher accuracy and F1 score than the deterministic baseline. The linkage quality of the machine learning methods studied, except for the neural network, increases as the number of manually processed records increases.

CONCLUSION

Machine learning methods can significantly improve linkage quality and reduce the manual effort required by medical coders to match tumor records in cancer registries compared to a deterministic baseline. Our results help cancer registries estimate how linkage quality increases as more records are manually processed.
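The three-class setting described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the thresholds `low` and `high`, the function names, and the toy scores are all hypothetical, and the sketch assumes that every "undecided" pair is resolved correctly by manual review. It only shows how widening the undecided band trades manual effort for linkage quality.

```python
from typing import List, Tuple


def classify(prob: float, low: float, high: float) -> str:
    """Map a match probability to no-match / undecided / match."""
    if prob >= high:
        return "match"
    if prob < low:
        return "no-match"
    return "undecided"


def evaluate(probs: List[float], labels: List[int],
             low: float, high: float) -> Tuple[float, float, float]:
    """Return (manual_fraction, accuracy, f1).

    Assumption: undecided pairs go to a human who always decides correctly.
    Setting low == high reproduces the two-class setting (no manual work).
    """
    tp = fp = tn = fn = manual = 0
    for prob, label in zip(probs, labels):
        decision = classify(prob, low, high)
        if decision == "undecided":
            manual += 1
            predicted = label  # hypothetical perfect manual review
        else:
            predicted = 1 if decision == "match" else 0
        if predicted == 1 and label == 1:
            tp += 1
        elif predicted == 1 and label == 0:
            fp += 1
        elif predicted == 0 and label == 0:
            tn += 1
        else:
            fn += 1
    n = len(labels)
    accuracy = (tp + tn) / n
    denom = 2 * tp + fp + fn
    f1 = 2 * tp / denom if denom else 0.0
    return manual / n, accuracy, f1


# Toy scores from some hypothetical classifier; 1 = true match.
probs = [0.95, 0.80, 0.55, 0.40, 0.10, 0.05]
labels = [1, 1, 0, 1, 0, 0]

two_class = evaluate(probs, labels, low=0.5, high=0.5)
three_class = evaluate(probs, labels, low=0.3, high=0.7)
```

On this toy data the two-class call sends nothing to manual review, while the three-class call routes the two borderline pairs (0.55 and 0.40) to a reviewer and scores higher as a result.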

Vessels T, Strayer N, Lee H, Choi KW, Zhang S, Han L, Morley TJ, Smoller JW, Xu Y, Ruderfer DM. Integrating Electronic Health Records and Polygenic Risk to Identify Genetically Unrelated Comorbidities of Schizophrenia That May Be Modifiable. Biol Psychiatry Glob Open Sci 2024;4:100297. [PMID: 38645405 PMCID: PMC11033077 DOI: 10.1016/j.bpsgos.2024.100297] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/22/2023] [Revised: 02/07/2024] [Accepted: 02/11/2024] [Indexed: 04/23/2024]

Abstract

Background

Patients with schizophrenia have substantial comorbidity that contributes to reduced life expectancy of 10 to 20 years. Identifying modifiable comorbidities could improve rates of premature mortality. Conditions that frequently co-occur but lack shared genetic risk with schizophrenia are more likely to be products of treatment, behavior, or environmental factors and therefore are enriched for potentially modifiable associations.

Methods

Phenome-wide comorbidity was calculated from electronic health records of 250,000 patients across 2 independent health care institutions (Vanderbilt University Medical Center and Mass General Brigham); associations with schizophrenia polygenic risk scores were calculated across the same phenotypes in linked biobanks.

Results

Schizophrenia comorbidity was significantly correlated across institutions (r = 0.85), and the 77 identified comorbidities were consistent with prior literature. Overall, comorbidity and polygenic risk score associations were significantly correlated (r = 0.55, p = 1.29 × 10⁻¹¹⁸). However, directly testing for the absence of genetic effects identified 36 comorbidities that had significantly equivalent schizophrenia polygenic risk score distributions between cases and controls. This set included phenotypes known to be consequences of antipsychotic medications (e.g., movement disorders) or of the disease such as reduced hygiene (e.g., diseases of the nail), thereby validating the approach. It also highlighted phenotypes with less clear causal relationships and minimal genetic effects such as tobacco use disorder and diabetes.

Conclusions

This work demonstrates the consistency and robustness of electronic health record-based schizophrenia comorbidities across independent institutions and with the existing literature. It identifies known and novel comorbidities with an absence of shared genetic risk, indicating other causes that may be modifiable and where further study of causal pathways could improve outcomes for patients.

Affiliation(s)

Tess Vessels: Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, Tennessee; Center for Digital Genomic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, Tennessee; Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, Tennessee
Nicholas Strayer: Department of Biostatistics, Vanderbilt University Medical Center, Nashville, Tennessee
Hyunjoon Lee: Psychiatric & Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, Massachusetts; Center for Precision Psychiatry, Department of Psychiatry, Massachusetts General Hospital, Boston, Massachusetts
Karmel W. Choi: Psychiatric & Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, Massachusetts; Center for Precision Psychiatry, Department of Psychiatry, Massachusetts General Hospital, Boston, Massachusetts
Siwei Zhang: Department of Biostatistics, Vanderbilt University Medical Center, Nashville, Tennessee
Lide Han: Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, Tennessee; Center for Digital Genomic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, Tennessee; Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, Tennessee
Theodore J. Morley: Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, Tennessee; Center for Digital Genomic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, Tennessee; Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, Tennessee
Jordan W. Smoller: Psychiatric & Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, Massachusetts; Center for Precision Psychiatry, Department of Psychiatry, Massachusetts General Hospital, Boston, Massachusetts
Yaomin Xu: Department of Biostatistics, Vanderbilt University Medical Center, Nashville, Tennessee; Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, Tennessee
Douglas M. Ruderfer: Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, Tennessee; Center for Digital Genomic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, Tennessee; Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, Tennessee; Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, Tennessee; Department of Psychiatry and Behavioral Sciences, Vanderbilt University Medical Center, Nashville, Tennessee

Lee CT, Zhang K, Li W, Tang K, Ling Y, Walji MF, Jiang X. Identifying predictors of the tooth loss phenotype in a large periodontitis patient cohort using a machine learning approach. J Dent 2024;144:104921. [PMID: 38437976 DOI: 10.1016/j.jdent.2024.104921] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/14/2023] [Revised: 02/17/2024] [Accepted: 03/01/2024] [Indexed: 03/06/2024]

Tavabi N, Pruneski J, Golchin S, Singh M, Sanborn R, Heyworth B, Landschaft A, Kimia A, Kiapour A. Building large-scale registries from unstructured clinical notes using a low-resource natural language processing pipeline. Artif Intell Med 2024;151:102847. [PMID: 38658131 DOI: 10.1016/j.artmed.2024.102847] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/13/2023] [Revised: 02/06/2024] [Accepted: 03/19/2024] [Indexed: 04/26/2024]

Abstract

Building clinical registries is an important step in clinical research and improvement of patient care quality. Natural Language Processing (NLP) methods have shown promising results in extracting valuable information from unstructured clinical notes. However, the structure and nature of clinical notes are very different from the regular text that state-of-the-art NLP models are trained and tested on, and they have their own set of challenges. In this study, we propose Sentence Extractor with Keywords (SE-K), an efficient and interpretable classification approach for extracting information from clinical notes, and show that it outperforms more computationally expensive methods in text classification. Following Institutional Review Board (IRB) approval, we used SE-K and two embedding-based NLP approaches (Sentence Extractor with Embeddings (SE-E) and Bidirectional Encoder Representations from Transformers (BERT)) to develop a comprehensive registry of anterior cruciate ligament surgeries from 20 years of unstructured clinical data at a multi-site tertiary-care regional children's hospital. The low-resource approach (SE-K) had better performance (average AUROC of 0.94 ± 0.04) than the embedding-based approaches (SE-E: 0.93 ± 0.04 and BERT: 0.87 ± 0.09) for out-of-sample validation, in addition to the smallest performance drop between test and out-of-sample validation. Moreover, the SE-K approach was at least six times faster (on CPU) than SE-E (on CPU) and BERT (on GPU) and provides interpretability. Our proposed approach, SE-K, can be effectively used to extract relevant variables from clinical notes to build large-scale registries, with consistently better performance compared to the more resource-intensive approaches (e.g., BERT). Such approaches can facilitate information extraction from unstructured notes for registry building, quality improvement and adverse event monitoring.

Al Teneiji AS, Abu Salim TY, Riaz Z. Factors impacting the adoption of big data in healthcare: A systematic literature review. Int J Med Inform 2024;187:105460. [PMID: 38653062 DOI: 10.1016/j.ijmedinf.2024.105460] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/23/2023] [Revised: 03/21/2024] [Accepted: 04/15/2024] [Indexed: 04/25/2024]

Abstract

BACKGROUND

The term "big data" refers to the vast volume, variety, and velocity of data generated from various sources (e.g., sensors, social media, and online platforms). Big data adoption within healthcare poses an intriguing possibility for improving patients' health, increasing operational efficiency, and enabling data-driven decision-making. Despite considerable interest in the adoption of big data in healthcare, empirical research assessing the factors impacting the adoption process is lacking. Therefore, this review aimed to investigate the literature using a systematic approach to explore the factors that affect big data adoption in healthcare.

METHODS

A systematic literature review was conducted. The methodical and thorough process of discovering, assessing, and synthesizing relevant studies provided a full review of the available data. Several databases were used for the information search. Most of the articles retrieved from the search came from popular medical research databases, such as Scopus, Taylor & Francis, ScienceDirect, Emerald Insights, PubMed, Springer, IEEE, MDPI, Google Scholar, ProQuest Central, ProQuest Public Health Database, and MEDLINE.

RESULTS AND CONCLUSION

The results of the systematic literature review indicated that several theoretical frameworks (including the technology acceptance model; the technology, organization, and environment framework; the interactive communication technology adoption model; diffusion of innovation theory; dynamic capabilities theory; and the absorptive capability framework) can be used to analyze and understand technology acceptance in healthcare. It is vital to consider the safety of electronic health records during the use of big data. Furthermore, several elements were found to determine technological acceptance, including environmental, technological, organizational, political, and regulatory factors.

Sánchez-Valle J, Correia RB, Camacho-Artacho M, Lepore R, Mattos MM, Rocha LM, Valencia A. Prevalence and differences in the co-administration of drugs known to interact: an analysis of three distinct and large populations. BMC Med 2024;22:166. [PMID: 38637816 PMCID: PMC11027217 DOI: 10.1186/s12916-024-03384-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/01/2023] [Accepted: 04/08/2024] [Indexed: 04/20/2024]

Abstract

BACKGROUND

The co-administration of drugs known to interact greatly impacts morbidity, mortality, and health economics. This study aims to examine the drug-drug interaction (DDI) phenomenon with a large-scale longitudinal analysis of age and gender differences found in drug administration data from three distinct healthcare systems.

METHODS

This study analyzes drug administrations from population-wide electronic health records in Blumenau (Brazil; 133 K individuals), Catalonia (Spain; 5.5 M individuals), and Indianapolis (USA; 264 K individuals). The stratified prevalences of DDI for multiple severity levels per patient gender and age at the time of administration are computed, and null models are used to estimate the expected impact of polypharmacy on DDI prevalence. Finally, to study actionable strategies to reduce DDI prevalence, alternative polypharmacy regimens using drugs with fewer known interactions are simulated.

RESULTS

A large prevalence of co-administration of drugs known to interact is found in all populations, affecting 12.51%, 12.12%, and 10.06% of individuals in Blumenau, Indianapolis, and Catalonia, respectively. Despite very different healthcare systems and drug availability, the increasing prevalence of DDI as patients age is very similar across all three populations and is not explained solely by higher co-administration rates in the elderly. In general, the prevalence of DDI is significantly higher in women - with the exception of men over 50 years old in Indianapolis. Finally, we show that using proton pump inhibitor alternatives to omeprazole (the drug involved in more co-administrations in Catalonia and Blumenau), the proportion of patients that are administered known DDI can be reduced by up to 21% in both Blumenau and Catalonia and 2% in Indianapolis.

CONCLUSIONS

DDI administration has a high incidence in society, regardless of geographic, population, and healthcare management differences. Although DDI prevalence increases with age, our analysis points to a complex phenomenon that is much more prevalent than expected, suggesting comorbidities as key drivers of the increase. Furthermore, the gender differences observed in most age groups across populations are concerning in regard to gender equity in healthcare. Finally, our study exemplifies how electronic health records' analysis can lead to actionable interventions that significantly reduce the administration of known DDI and its associated human and economic costs.

Daley MF, Reifler LM, Shoup JA, Glanz JM, Lewin BJ, Klein NP, Kharbanda EO, McLean HQ, Hambidge SJ, Nelson JC, Naleway AL, Weintraub ES, McNeil MM, Razzaghi H, Singleton JA. Influenza vaccination accuracy among adults: Self-report compared with electronic health record data. Vaccine 2024;42:2740-2746. [PMID: 38531726 DOI: 10.1016/j.vaccine.2024.03.052] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/22/2023] [Revised: 03/09/2024] [Accepted: 03/19/2024] [Indexed: 03/28/2024]

Baldridge D, Kaster L, Sancimino C, Srivastava S, Molholm S, Gupta A, Oh I, Lanzotti V, Grewal D, Riggs ER, Savatt JM, Hauck R, Sveden A, Constantino JN, Piven J, Gurnett CA, Chopra M, Hazlett H, Payne PRO. The Brain Gene Registry: a data snapshot. J Neurodev Disord 2024;16:17. [PMID: 38632549 PMCID: PMC11022437 DOI: 10.1186/s11689-024-09530-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/17/2023] [Accepted: 03/27/2024] [Indexed: 04/19/2024]

Abstract

Monogenic disorders account for a large proportion of population-attributable risk for neurodevelopmental disabilities. However, the data necessary to infer a causal relationship between a given genetic variant and a particular neurodevelopmental disorder is often lacking. Recognizing this scientific roadblock, 13 Intellectual and Developmental Disabilities Research Centers (IDDRCs) formed a consortium to create the Brain Gene Registry (BGR), a repository pairing clinical genetic data with phenotypic data from participants with variants in putative brain genes. Phenotypic profiles are assembled from the electronic health record (EHR) and a battery of remotely administered standardized assessments collectively referred to as the Rapid Neurobehavioral Assessment Protocol (RNAP), which include cognitive, neurologic, and neuropsychiatric assessments, as well as assessments for attention deficit hyperactivity disorder (ADHD) and autism spectrum disorder (ASD). Co-enrollment of BGR participants in the Clinical Genome Resource's (ClinGen's) GenomeConnect enables display of variant information in ClinVar. The BGR currently contains data on 479 participants who are 55% male, 6% Asian, 6% Black or African American, 76% white, and 12% Hispanic/Latine. Over 200 genes are represented in the BGR, with 12 or more participants harboring variants in each of these genes: CACNA1A, DNMT3A, SLC6A1, SETD5, and MYT1L. More than 30% of variants are de novo and 43% are classified as variants of uncertain significance (VUSs). Mean standard scores on cognitive or developmental screens are below average for the BGR cohort. EHR data reveal developmental delay as the earliest and most common diagnosis in this sample, followed by speech and language disorders, ASD, and ADHD. BGR data has already been used to accelerate gene-disease validity curation of 36 genes evaluated by ClinGen's BGR Intellectual Disability (ID)-Autism (ASD) Gene Curation Expert Panel. 
In summary, the BGR is a resource for use by stakeholders interested in advancing translational research for brain genes and continues to recruit participants with clinically reported variants to establish a rich and well-characterized national resource to promote research on neurodevelopmental disorders.

Affiliation(s)

Dustin Baldridge: Department of Pediatrics, Washington University School of Medicine in St. Louis, St. Louis, MO, USA
Levi Kaster: Institute for Informatics, Data Science and Biostatistics, Washington University School of Medicine in St. Louis, St. Louis, MO, USA
Catherine Sancimino: Department of Pediatrics, Albert Einstein College of Medicine, Bronx, NY, USA
Siddharth Srivastava: Department of Neurology, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA; Rosamund Stone Zander Translational Neuroscience Center, Boston Children's Hospital, Boston, MA, USA
Sophie Molholm: Departments of Pediatrics and Neuroscience, Albert Einstein College of Medicine, Bronx, NY, USA
Aditi Gupta: Institute for Informatics, Data Science and Biostatistics, Washington University School of Medicine in St. Louis, St. Louis, MO, USA
Inez Oh: Institute for Informatics, Data Science and Biostatistics, Washington University School of Medicine in St. Louis, St. Louis, MO, USA
Virginia Lanzotti: Department of Psychiatry, Washington University School of Medicine in St. Louis, St. Louis, MO, USA
Daleep Grewal: Institute for Informatics, Data Science and Biostatistics, Washington University School of Medicine in St. Louis, St. Louis, MO, USA
Erin Rooney Riggs: Autism and Developmental Medicine Institute, Geisinger, Danville, PA, USA
Juliann M Savatt: Department of Genomic Health, Geisinger, Danville, PA, USA
Rachel Hauck: Institute for Informatics, Data Science and Biostatistics, Washington University School of Medicine in St. Louis, St. Louis, MO, USA
Abigail Sveden: Rosamund Stone Zander Translational Neuroscience Center, Boston Children's Hospital, Boston, MA, USA
John N Constantino: Division of Behavioral and Mental Health, Departments of Psychiatry and Pediatrics, Children's Healthcare of Atlanta, Emory University, Atlanta, GA, USA
Joseph Piven: The Carolina Institute for Developmental Disabilities, University of North Carolina, Chapel Hill, NC, USA
Christina A Gurnett: Department of Neurology, Washington University School of Medicine in St. Louis, St. Louis, MO, USA
Maya Chopra: Department of Neurology, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA; Rosamund Stone Zander Translational Neuroscience Center, Boston Children's Hospital, Boston, MA, USA
Heather Hazlett: The Carolina Institute for Developmental Disabilities, University of North Carolina, Chapel Hill, NC, USA
Philip R O Payne: Institute for Informatics, Data Science and Biostatistics, Washington University School of Medicine in St. Louis, St. Louis, MO, USA

Li Y, Yang AY, Marelli A, Li Y. MixEHR-SurG: A joint proportional hazard and guided topic model for inferring mortality-associated topics from electronic health records. J Biomed Inform 2024;153:104638. [PMID: 38631461 DOI: 10.1016/j.jbi.2024.104638] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/20/2023] [Revised: 03/07/2024] [Accepted: 04/03/2024] [Indexed: 04/19/2024]

Abstract

Survival models can help medical practitioners to evaluate the prognostic importance of clinical variables to patient outcomes such as mortality or hospital readmission and subsequently design personalized treatment regimes. Electronic Health Records (EHRs) hold the promise for large-scale survival analysis based on systematically recorded clinical features for each patient. However, existing survival models either do not scale to high dimensional and multi-modal EHR data or are difficult to interpret. In this study, we present a supervised topic model called MixEHR-SurG to simultaneously integrate heterogeneous EHR data and model survival hazard. Our contributions are threefold: (1) integrating EHR topic inference with Cox proportional hazards likelihood; (2) integrating patient-specific topic hyperparameters using the PheCode concepts such that each topic can be identified with exactly one PheCode-associated phenotype; (3) multi-modal survival topic inference. This leads to a highly interpretable survival topic model that can infer PheCode-specific phenotype topics associated with patient mortality. We evaluated MixEHR-SurG using a simulated dataset and two real-world EHR datasets: the Quebec Congenital Heart Disease (CHD) data consisting of 8211 subjects with 75,187 outpatient claim records of 1767 unique ICD codes; the MIMIC-III consisting of 1458 subjects with multi-modal EHR records. Compared to the baselines, MixEHR-SurG achieved a superior dynamic AUROC for mortality prediction, with a mean AUROC score of 0.89 in the simulation dataset and a mean AUROC of 0.645 on the CHD dataset. Qualitatively, MixEHR-SurG associates severe cardiac conditions with high mortality risk among the CHD patients after the first heart failure hospitalization and critical brain injuries with increased mortality among the MIMIC-III patients after their ICU discharge.
Together, the integration of the Cox proportional hazards model and EHR topic inference in MixEHR-SurG not only leads to competitive mortality prediction but also meaningful phenotype topics for in-depth survival analysis. The software is available at GitHub: https://github.com/li-lab-mcgill/MixEHR-SurG.
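
For readers less familiar with the survival component, the textbook Cox proportional hazards model and its partial likelihood are written out below. This is the standard form only. Reading the covariate vector $x_i$ as a patient's inferred topic proportions is an assumption made here for illustration; the exact likelihood used by MixEHR-SurG is the one given in the paper, not this sketch.

```latex
% Standard Cox model: baseline hazard h_0(t) scaled by a log-linear covariate effect
h(t \mid x_i) = h_0(t)\,\exp\!\left(\beta^{\top} x_i\right)

% Partial likelihood over observed events (\delta_i = 1),
% where R(t_i) is the set of patients still at risk at time t_i
L(\beta) = \prod_{i:\,\delta_i = 1}
  \frac{\exp\!\left(\beta^{\top} x_i\right)}
       {\sum_{j \in R(t_i)} \exp\!\left(\beta^{\top} x_j\right)}
```

Because the baseline hazard $h_0(t)$ cancels in each ratio, $\beta$ can be estimated without specifying it, which is what makes the model convenient to combine with other likelihood terms.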

Alageel NA, Hughes CM, Alwhaibi M, Alkeridy W, Barry HE. Potentially inappropriate prescribing for people with dementia in ambulatory care: a cross-sectional observational study. BMC Geriatr 2024;24:328. [PMID: 38600444 PMCID: PMC11008018 DOI: 10.1186/s12877-024-04949-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/25/2023] [Accepted: 04/04/2024] [Indexed: 04/12/2024] Open

Abstract

BACKGROUND

Studies have shown that potentially inappropriate prescribing (PIP) is highly prevalent among people with dementia (PwD) and linked to negative outcomes, such as hospitalisation and mortality. However, there are limited data on prescribing appropriateness for PwD in Saudi Arabia. Therefore, we aimed to estimate the prevalence of PIP and investigate associations between PIP and other patient characteristics among PwD in an ambulatory care setting.

METHODS

A cross-sectional, retrospective analysis was conducted at a tertiary hospital in Saudi Arabia. Patients who were ≥ 65 years old, had dementia, and visited ambulatory care clinics between 01/01/2019 and 31/12/2021 were included. Prescribing appropriateness was evaluated by applying the Screening Tool of Older Persons Potentially Inappropriate Prescriptions (STOPP) criteria. Descriptive analyses were used to describe the study population. The prevalence of PIP and the prevalence for each STOPP criterion were calculated as a percentage of all eligible patients. Logistic regression analysis was used to investigate associations between PIP, polypharmacy, age and sex; odds ratios (ORs) and 95% confidence intervals (CIs) were calculated. Analyses were conducted using SPSS v27.

RESULTS

A total of 287 PwD were identified; 56.0% (n = 161) were female. The mean number of medications prescribed was 9.0 [standard deviation (SD) ± 4.2]. The prevalence of PIP was 61.0% (n = 175). Common instances of PIP were drugs prescribed beyond the recommended duration (n = 90, 31.4%), drugs prescribed without an evidence-based clinical indication (n = 78, 27.2%), proton pump inhibitors (PPIs) for > 8 weeks (n = 75, 26.0%), and acetylcholinesterase inhibitors with concurrent drugs that reduce heart rate (n = 60, 21.0%). Polypharmacy was observed in 82.6% (n = 237) of patients and was strongly associated with PIP (adjusted OR 24.1, 95% CI 9.0-64.5).

CONCLUSIONS

Findings have revealed a high prevalence of PIP among PwD in Saudi Arabia that is strongly associated with polypharmacy. Future research should aim to explore key stakeholders' experiences and perspectives of medicines management to optimise medication use for this vulnerable patient population.
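
The adjusted odds ratio reported above (24.1, 95% CI 9.0-64.5) has an interval that is symmetric on the log scale, as is usual for logistic regression output. The following minimal sketch, which assumes a Wald-type interval with z = 1.96 (the abstract does not state how the interval was computed), recovers the point estimate and the standard error of the log odds ratio from the two bounds. The function name is illustrative.

```python
import math

def log_or_summary(lower, upper, z=1.96):
    """Recover the point estimate and SE(log OR) from a Wald-type CI.

    Assumes the interval is exp(log(OR) +/- z * SE), which is an
    assumption here, not something stated in the abstract.
    """
    log_lo, log_hi = math.log(lower), math.log(upper)
    point = math.exp((log_lo + log_hi) / 2)   # geometric midpoint of the bounds
    se = (log_hi - log_lo) / (2 * z)          # half-width on the log scale / z
    return point, se

point, se = log_or_summary(9.0, 64.5)
print(f"OR ~ {point:.1f}, SE(log OR) ~ {se:.2f}")
```

The geometric midpoint of the bounds lands close to the reported 24.1, and the wide interval reflects a large standard error on the log scale, which is common when few patients fall in one of the comparison cells.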

Raycheva R, Kostadinov K, Mitova E, Iskrov G, Stefanov G, Vakevainen M, Elomaa K, Man YS, Gross E, Zschüntzsch J, Röttger R, Stefanov R. Landscape analysis of available European data sources amenable for machine learning and recommendations on usability for rare diseases screening. Orphanet J Rare Dis 2024;19:147. [PMID: 38582900 PMCID: PMC10998425 DOI: 10.1186/s13023-024-03162-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/17/2023] [Accepted: 03/30/2024] [Indexed: 04/08/2024] Open

Abstract

BACKGROUND

Patient registries and databases are essential tools for advancing clinical research in the area of rare diseases, as well as for enhancing patient care and healthcare planning. The primary aim of this study is a landscape analysis of available European data sources amenable to machine learning (ML) and their usability for rare disease screening, in terms of findable, accessible, interoperable, reusable (FAIR), legal, and business considerations. Second, recommendations will be proposed to provide a better understanding of the health data ecosystem.

METHODS

From March 2022 to December 2022, a cross-sectional study using a semi-structured questionnaire was conducted among potential respondents, identified as the main contact persons of health-related databases. The design of the self-completed questionnaire was based on information drawn from relevant scientific publications, quantitative and qualitative research, and a scoping review on challenges in mapping European rare disease (RD) databases. To determine database characteristics associated with adherence to the FAIR principles and with legal and business aspects of database management, Bayesian models were fitted.

RESULTS

In total, 330 unique replies were processed and analyzed, reflecting the same number of distinct databases (no duplicates included). In terms of geographical scope, we observed 24.2% (n = 80) national, 10.0% (n = 33) regional, 8.8% (n = 29) European, and 5.5% (n = 18) international registries coordinated in Europe. Over 80.0% (n = 269) of the databases were still active, with approximately 60.0% (n = 191) established after the year 2000 and 71.0% last collected new data in 2022. Regarding their geographical scope, European registries were associated with the highest overall FAIR adherence, while registries with regional and "other" geographical scope were ranked at the bottom of the list with the lowest proportion. Responders' willingness to share data as a contribution to the goals of the Screen4Care project was evaluated at the end of the survey. This question was completed by 108 respondents; however, only 18 of them (16.7%) expressed a direct willingness to contribute to the project by sharing their databases. Among them, an equal split between pro-bono and paid services was observed.

CONCLUSIONS

The most important results of our study demonstrate insufficient adherence to the FAIR principles and low willingness of the EU health databases to share patient information, combined with some legislative limitations, resulting in barriers to the secondary use of data.

Rodríguez-Ramallo H, Báez-Gutiérrez N, Jaramillo-Ruiz D, Sanfélix-Gimeno G, Villegas-Portero R, Jiménez-Murillo JL, Hernández-Quiles C, Santos-Ramos B. Therapeutic management, adherence, and clinical outcomes of heart failure in Andalucía. ANDALIC Protocol. Farm Hosp 2024:S1130-6343(24)00037-0. [PMID: 38582665 DOI: 10.1016/j.farma.2024.03.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/04/2023] [Revised: 02/29/2024] [Accepted: 03/01/2024] [Indexed: 04/08/2024] Open

Correcher-Martínez E, López-Lacort M, Muñoz-Quiles C, Díez-Domingo J, Orrico-Sánchez A. Risk of herpes zoster in adults with SARS-CoV-2 infection in Spain: A population-based, retrospective cohort study. Int J Infect Dis 2024;143:107037. [PMID: 38575055 DOI: 10.1016/j.ijid.2024.107037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/19/2024] [Revised: 03/14/2024] [Accepted: 03/31/2024] [Indexed: 04/06/2024] Open

Fu S, Jia H, Vassilaki M, Keloth VK, Dang Y, Zhou Y, Garg M, Petersen RC, St Sauver J, Moon S, Wang L, Wen A, Li F, Xu H, Tao C, Fan J, Liu H, Sohn S. FedFSA: Hybrid and federated framework for functional status ascertainment across institutions. J Biomed Inform 2024;152:104623. [PMID: 38458578 PMCID: PMC11005095 DOI: 10.1016/j.jbi.2024.104623] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/12/2023] [Revised: 01/12/2024] [Accepted: 03/04/2024] [Indexed: 03/10/2024]

Abstract

INTRODUCTION

Patients' functional status describes their independence in performing activities of daily living (ADLs), including basic ADLs (bADL) and more complex instrumental activities (iADL). Existing studies have found that patients' functional status is a strong predictor of health outcomes, particularly in older adults. Despite its usefulness, much of the functional status information is stored in electronic health records (EHRs) in either semi-structured or free-text formats. This indicates the pressing need to leverage computational approaches such as natural language processing (NLP) to accelerate the curation of functional status information. In this study, we introduced FedFSA, a hybrid and federated NLP framework designed to extract functional status information from EHRs across multiple healthcare institutions.

METHODS

FedFSA consists of four major components: 1) individual sites (clients) with their private local data, 2) a rule-based information extraction (IE) framework for ADL extraction, 3) a BERT model for functional status impairment classification, and 4) a concept normalizer. The framework was implemented using the OHNLP Backbone for rule-based IE and the open-source Flower and PyTorch libraries for the federated BERT components. For gold standard data generation, we carried out corpus annotation to identify functional status-related expressions based on ICF definitions. Four healthcare institutions were included in the study. To assess FedFSA, we evaluated the performance of category- and institution-specific ADL extraction across different experimental designs.

RESULTS

ADL extraction performance ranges from an F1-score of 0.907 to 0.986 for bADL and 0.825 to 0.951 for iADL across the four healthcare sites. The performance for ADL extraction with impairment ranges from an F1-score of 0.722 to 0.954 for bADL and 0.674 to 0.813 for iADL across four healthcare sites. For category-specific ADL extraction, laundry and transferring yielded relatively high performance, while dressing, medication, bathing, and continence achieved moderate-high performance. Conversely, food preparation and toileting showed low performance.

CONCLUSION

NLP performance varied across ADL categories and healthcare sites. Federated learning using the FedFSA framework performed better than non-federated learning for impaired ADL extraction at all healthcare sites. Our study demonstrated the potential of the federated learning framework in functional status extraction and impairment classification in EHRs, exemplifying the importance of a large-scale, multi-institutional collaborative development effort.
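
To make the idea of federated training concrete, the sketch below shows federated averaging (FedAvg), a common aggregation rule in which each site trains locally and only model parameters are combined, weighted by local sample count. Whether FedFSA uses this exact rule is not stated in the abstract, so treat it as an assumption; the parameter vectors and site sizes are hypothetical.

```python
def federated_average(site_weights, site_sizes):
    """Combine per-site parameter vectors, weighting each by its sample count.

    Only parameters leave each site; the raw clinical notes stay local.
    """
    total = sum(site_sizes)
    n_params = len(site_weights[0])
    return [
        sum(w[k] * n for w, n in zip(site_weights, site_sizes)) / total
        for k in range(n_params)
    ]

# Hypothetical two-parameter models from three sites (illustrative values only)
weights = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
sizes = [100, 100, 200]
global_weights = federated_average(weights, sizes)
print(global_weights)
```

In a real system this averaging step would be repeated over many rounds, with the aggregated parameters sent back to the sites between rounds.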

Carson NJ, Yang X, Mullin B, Stettenbauer E, Waddington M, Zhang A, Williams P, Rios Perez GE, Cook BL. Predicting adolescent suicidal behavior following inpatient discharge using structured and unstructured data. J Affect Disord 2024;350:382-387. [PMID: 38158050 PMCID: PMC10923087 DOI: 10.1016/j.jad.2023.12.059] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/31/2023] [Revised: 11/30/2023] [Accepted: 12/24/2023] [Indexed: 01/03/2024]

Sim JA, Huang X, Horan MR, Baker JN, Huang IC. Using natural language processing to analyze unstructured patient-reported outcomes data derived from electronic health records for cancer populations: a systematic review. Expert Rev Pharmacoecon Outcomes Res 2024;24:467-475. [PMID: 38383308 PMCID: PMC11001514 DOI: 10.1080/14737167.2024.2322664] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/02/2023] [Accepted: 02/20/2024] [Indexed: 02/23/2024]

Loftus J, Levy HP, Stevenson JM. Documentation of results and medication prescribing after combinatorial psychiatric pharmacogenetic testing: A case for discrete results. Genet Med 2024;26:101056. [PMID: 38153010 DOI: 10.1016/j.gim.2023.101056] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/28/2023] [Revised: 12/18/2023] [Accepted: 12/20/2023] [Indexed: 12/29/2023] Open

Soe NN, Latt PM, Yu Z, Lee D, Kim CM, Tran D, Ong JJ, Ge Z, Fairley CK, Zhang L. Clinical features-based machine learning models to separate sexually transmitted infections from other skin diagnoses. J Infect 2024;88:106128. [PMID: 38452934 DOI: 10.1016/j.jinf.2024.106128] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/27/2023] [Revised: 01/22/2024] [Accepted: 02/13/2024] [Indexed: 03/09/2024]

Abstract

INTRODUCTION

Many sexual health services are overwhelmed and cannot cater for all the individuals who present with sexually transmitted infections (STIs). Digital health software that separates STIs from non-STIs could improve the efficiency of clinical services. We developed and evaluated a machine learning model that predicts whether patients have an STI based on their clinical features.

METHODS

We manually extracted 25 demographic and clinical features from 1315 clinical records in the electronic health record system at Melbourne Sexual Health Centre. We examined 16 machine learning models to predict a binary outcome of an STI or a non-STI diagnosis. We evaluated the models' performance with the area under the ROC curve (AUC), accuracy and F1-scores.

RESULTS

Our study included 1315 consultations, of which 36.8% (484/1315) were diagnosed with STIs and 63.2% (831/1315) had non-STI conditions. The study population predominantly consisted of heterosexual men (49.5%, 651/1315), followed by gay, bisexual and other men who have sex with men (GBMSM) (25.7%), women (21.6%) and unknown gender (3.2%). The median age was 31 years (interquartile range (IQR) 26-39). The top 5 performing models were CatBoost (AUC 0.912), Random Forest (AUC 0.917), LightGBM (AUC 0.907), Gradient Boosting (AUC 0.905) and XGBoost (AUC 0.900). The best model, CatBoost, achieved an accuracy of 0.837, sensitivity of 0.776, specificity of 0.831, precision of 0.782 and F1-score of 0.778. The most important features were lesion duration, type of skin lesions, age, gender, history of skin disorders, number of lesions, dysuria duration, anorectal pain and itchiness.

CONCLUSIONS

Our best model demonstrates reasonable performance in distinguishing STIs from non-STIs. However, to be clinically useful, more detailed information, such as clinical images, may be required to reach sufficient accuracy.
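
The reported F1-score can be cross-checked against the reported precision and sensitivity, since F1 is their harmonic mean. The short sketch below does that with the rounded figures from the abstract; a small discrepancy in the third decimal place is expected because the inputs are themselves rounded.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall (recall is the same as sensitivity)."""
    return 2 * precision * recall / (precision + recall)

# Rounded values reported for the CatBoost model in the abstract
f1 = f1_score(precision=0.782, recall=0.776)
print(f"F1 ~ {f1:.3f}")
```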

Affiliation(s)

Nyi Nyi Soe Melbourne Sexual Health Centre, Alfred Health, Melbourne, Australia; Central Clinical School, Faculty of Medicine, Nursing and Health Sciences, Monash University, Melbourne, Australia
Phyu Mon Latt Melbourne Sexual Health Centre, Alfred Health, Melbourne, Australia; Central Clinical School, Faculty of Medicine, Nursing and Health Sciences, Monash University, Melbourne, Australia
Zhen Yu Central Clinical School, Faculty of Medicine, Nursing and Health Sciences, Monash University, Melbourne, Australia; Monash e-Research Centre, Faculty of Engineering, Airdoc Research, Nvidia AI Technology Research Centre, Monash University, Melbourne, Australia
David Lee Melbourne Sexual Health Centre, Alfred Health, Melbourne, Australia
Cham-Mill Kim Melbourne Medical School, Faculty of Medicine, Dentistry and Health Sciences, The University of Melbourne, Melbourne, Australia
Daniel Tran Melbourne Medical School, Faculty of Medicine, Dentistry and Health Sciences, The University of Melbourne, Melbourne, Australia
Jason J Ong Melbourne Sexual Health Centre, Alfred Health, Melbourne, Australia; Central Clinical School, Faculty of Medicine, Nursing and Health Sciences, Monash University, Melbourne, Australia
Zongyuan Ge Monash e-Research Centre, Faculty of Engineering, Airdoc Research, Nvidia AI Technology Research Centre, Monash University, Melbourne, Australia
Christopher K Fairley Melbourne Sexual Health Centre, Alfred Health, Melbourne, Australia; Central Clinical School, Faculty of Medicine, Nursing and Health Sciences, Monash University, Melbourne, Australia
Lei Zhang Clinical Medical Research Center, Children's Hospital of Nanjing Medical University, Nanjing, China; Melbourne Sexual Health Centre, Alfred Health, Melbourne, Australia; Central Clinical School, Faculty of Medicine, Nursing and Health Sciences, Monash University, Melbourne, Australia.

Laukvik LB, Lyngstad M, Rotegård AK, Fossum M. Utilizing nursing standards in electronic health records: A descriptive qualitative study. Int J Med Inform 2024;184:105350. [PMID: 38306850 DOI: 10.1016/j.ijmedinf.2024.105350] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/21/2023] [Revised: 01/15/2024] [Accepted: 01/24/2024] [Indexed: 02/04/2024]

Abstract

BACKGROUND

The electronic health record (EHR), including standardized structures and languages, represents an important data source that nurses use to continually update their individual and shared perceptual understanding of clinical situations. Registered nurses' utilization of nursing standards, such as standardized nursing care plans and language in EHRs, has received little attention in the literature. Further research is needed to understand nurses' care planning and documentation practice.

AIMS

This study aimed to describe the experiences and perceptions of nurses' EHR documentation practices utilizing standardized nursing care plans including standardized nursing language, in the daily documentation of nursing care for patients living in special dementia-care units in nursing homes in Norway.

METHODS

A descriptive qualitative study was conducted between April and November 2021 among registered nurses working in special dementia care units in Norwegian nursing homes. In-depth interviews were conducted, and data were analyzed utilizing reflexive thematic analysis with a deductive orientation.

FINDINGS

Four themes were generated from the analysis. First, the knowledge, skills, and attitude of system users were perceived to influence daily documentation practice. Second, management and organization of documentation work, internally and externally, influenced motivation and engagement in daily documentation processes. Third, usability issues of the EHR were perceived to limit the daily workflow and the nurses' information needs. Last, nursing standards in the EHR were perceived to contribute to the development of documentation practices, supporting and stimulating ethical awareness, cognitive processes, and knowledge development.

CONCLUSION

Nurses and nursing leaders need to be continuously involved and engaged in EHR documentation to safeguard development and implementation of relevant nursing standards.

Li Z, Lan L, Zhou Y, Li R, Chavin KD, Xu H, Li L, Shih DJH, Jim Zheng W. Developing deep learning-based strategies to predict the risk of hepatocellular carcinoma among patients with nonalcoholic fatty liver disease from electronic health records. J Biomed Inform 2024;152:104626. [PMID: 38521180 DOI: 10.1016/j.jbi.2024.104626] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/20/2023] [Revised: 02/23/2024] [Accepted: 03/20/2024] [Indexed: 03/25/2024]

Abstract

OBJECTIVE

The accuracy of deep learning models for many disease prediction problems is affected by time-varying covariates, rare incidence, covariate imbalance and delayed diagnosis when using structured electronic health records data. The situation is further exacerbated when predicting the risk of one disease conditional on another, such as hepatocellular carcinoma risk among patients with nonalcoholic fatty liver disease, owing to slow, chronic progression, the scarcity of data with both disease conditions and the sex bias of the diseases. The goal of this study is to investigate the extent to which these issues influence deep learning performance, and then to devise strategies to tackle these challenges. These strategies were applied to improve hepatocellular carcinoma risk prediction among patients with nonalcoholic fatty liver disease.

METHODS

We evaluated two representative deep learning models in the task of predicting the occurrence of hepatocellular carcinoma in a cohort of patients with nonalcoholic fatty liver disease (n = 220,838) from a national EHR database. The disease prediction task was carefully formulated as a classification problem while taking censorship and the length of follow-up into consideration.

RESULTS

We developed a novel backward masking scheme to deal with the issue of delayed diagnosis, which is very common in EHR data analysis, and evaluated how the length of longitudinal information after the index date affects disease prediction. We observed that modeling time-varying covariates improved the performance of the algorithms and that transfer learning mitigated the reduced performance caused by the lack of data. In addition, covariate imbalance, such as sex bias in the data, impaired performance. Deep learning models trained on one sex and evaluated in the other sex showed reduced performance, indicating the importance of assessing covariate imbalance while preparing data for model training.

CONCLUSIONS

The strategies developed in this work can significantly improve the performance of hepatocellular carcinoma risk prediction among patients with nonalcoholic fatty liver disease. Furthermore, our novel strategies can be generalized to other disease risk predictions using structured electronic health records, especially for the risk of one disease conditional on another.

Essay P, Rajasekharan A. Robust diagnosis recommendation system for Primary Care Telemedicine using long short-term memory multi-class sequence classification. Heliyon 2024;10:e26770. [PMID: 38510056 PMCID: PMC10950495 DOI: 10.1016/j.heliyon.2024.e26770] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/11/2024] [Revised: 02/12/2024] [Accepted: 02/20/2024] [Indexed: 03/22/2024] Open

Abstract

Background

Telemedicine offers an opportunity for robust diagnosis recommendations that support healthcare providers during consultations in a way that does not limit providers' ability to explore diagnostic codes and make the most appropriate selection for each consultation.

Objective

The objective of this work was to develop a recommendation system for ICD-10 coding using multi-class sequence classification and deep learning. The recommendations are intended to support telemedicine clinicians in making timely and appropriate diagnosis selections. The recommendations allow clinicians to find and select the best diagnosis code much more quickly and without leaving the telemedicine platform to search codes and code descriptions.

Methods

We developed an LSTM model for multi-class text sequence classification to make diagnosis recommendations. The LSTM recommender used text-based symptoms, complaints, and consultation request reasons as model inputs. Data were extracted from a live telemedicine platform which spans general medicine, dermatology, and mental health clinical specialties. A popularity-based model was used for baseline comparison.

Results

Using over 2.8 million telemedicine consultations during 2021 and 2022, the LSTM recommender's average accuracy was 31.7%. The LSTM recommender's average coverage in the top 20 recommended diagnoses was 85.8%, with an average personalization score of 0.87.

Conclusions

LSTM multi-class sequence classification recommends diagnoses specific to individual consultations, is retrainable at regular intervals, and could improve diagnosis recommendations such that providers require less time and fewer resources searching for diagnosis codes. In addition, the LSTM recommender is robust enough to make recommendations across clinical specialties such as general medicine, dermatology, and mental health.
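
The gap between a 31.7% top-1 accuracy and an 85.8% top-20 figure is easier to read with a concrete metric in hand. The sketch below computes a top-k hit rate: the share of consultations whose recorded diagnosis appears among the first k recommendations. Interpreting the abstract's "coverage in the top 20" this way is an assumption, and the function name, ICD-10 codes and rankings are illustrative values, not data from the study.

```python
def top_k_hit_rate(ranked_recommendations, true_codes, k=20):
    """Fraction of cases whose true code is among the first k recommendations."""
    hits = sum(
        1
        for ranked, truth in zip(ranked_recommendations, true_codes)
        if truth in ranked[:k]
    )
    return hits / len(true_codes)

# Hypothetical ranked recommendations for three consultations
recs = [
    ["J06.9", "J02.9", "R05"],
    ["L70.0", "L20.9", "B35.4"],
    ["F41.1", "F32.9", "G47.0"],
]
truth = ["J02.9", "L30.9", "F41.1"]

rate = top_k_hit_rate(recs, truth, k=2)
print(f"top-2 hit rate: {rate:.2f}")
```

With k = 1 the same function reduces to ordinary accuracy, which is why the two figures in the abstract can differ so widely for a model with many possible diagnosis codes.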

Grigoroglou C, Walshe K, Kontopantelis E, Ferguson J, Stringer G, Ashcroft DM, Allen T. Comparing the clinical practice and prescribing safety of locum and permanent doctors: observational study of primary care consultations in England. BMC Med 2024;22:126. [PMID: 38532468 DOI: 10.1186/s12916-024-03332-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/22/2023] [Accepted: 02/29/2024] [Indexed: 03/28/2024] Open

Abstract

BACKGROUND

Temporary doctors, known as locums, are a key component of the medical workforce in the NHS, but evidence on differences in quality and safety between locum and permanent doctors is limited. We aimed to examine differences in clinical practice and prescribing safety between locum and permanent doctors working in primary care in England.

METHODS

We accessed electronic health care records (EHRs) for 3.5 million patients from the CPRD GOLD database with linkage to Hospital Episode Statistics from 1st April 2010 to 31st March 2022. We used multi-level mixed effects logistic regression to compare consultations with locum and permanent GPs for several patient outcomes including general practice revisits; prescribing of antibiotics; strong opioids; hypnotics; A&E visits; emergency hospital admissions; admissions for ambulatory care sensitive conditions; test ordering; referrals; and prescribing safety indicators while controlling for patient and practice characteristics.

RESULTS

Consultations with locum GPs were 22% more likely to involve a prescription for an antibiotic (OR = 1.22 (1.21 to 1.22)), 8% more likely to involve a prescription for a strong opioid (OR = 1.08 (1.06 to 1.09)), 4% more likely to be followed by an A&E visit on the same day (OR = 1.04 (1.01 to 1.08)) and 5% more likely to be followed by an A&E visit within 1 to 7 days (OR = 1.05 (1.02 to 1.08)). Consultations with a locum were 12% less likely to lead to a practice revisit within 7 days (OR = 0.88 (0.87 to 0.88)), 4% less likely to involve a prescription for a hypnotic (OR = 0.96 (0.94 to 0.98)), 15% less likely to involve a referral (OR = 0.85 (0.84 to 0.86)) and 19% less likely to involve a test (OR = 0.81 (0.80 to 0.82)). We found no evidence that emergency admissions, ACSC admissions and eight out of the eleven prescribing safety indicators were different if patients were seen by a locum or a permanent GP.

CONCLUSIONS

Despite existing concerns, the clinical practice and performance of locum GPs did not appear to be systematically different from that of permanent GPs. The practice and performance of both locum and permanent GPs is likely shaped by the organisational setting and systems within which they work.
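
The percentages quoted in the results follow directly from the odds ratios. The minimal sketch below shows the conversion; the helper name is illustrative. Strictly, it gives the change in odds, which approximates the change in probability only when the outcome is uncommon.

```python
def percent_change_in_odds(odds_ratio):
    """Express an odds ratio as a rounded percentage change in odds."""
    return round((odds_ratio - 1.0) * 100)

# Odds ratios taken from the abstract's results
print(percent_change_in_odds(1.22))  # antibiotic prescription
print(percent_change_in_odds(0.88))  # practice revisit within 7 days
print(percent_change_in_odds(0.81))  # test ordering
```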

Vezyridis P. 'Kindling the fire' of NHS patient data exploitations: The care.data controversy in news media discourses. Soc Sci Med 2024;348:116824. [PMID: 38598987 DOI: 10.1016/j.socscimed.2024.116824] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/10/2023] [Revised: 03/14/2024] [Accepted: 03/21/2024] [Indexed: 04/12/2024]

De Lillo A, Pathak GA, Low A, De Angelis F, Abou Alaiwi S, Miller EJ, Fuciarelli M, Polimanti R. Clinical spectrum of Transthyretin amyloidogenic mutations among diverse population origins. Hum Genomics 2024;18:31. [PMID: 38523305 PMCID: PMC10962184 DOI: 10.1186/s40246-024-00596-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/06/2023] [Accepted: 03/08/2024] [Indexed: 03/26/2024] Open

Abstract

PURPOSE

Coding mutations in the Transthyretin (TTR) gene cause a hereditary form of amyloidosis characterized by a complex genotype-phenotype correlation with limited information regarding differences among worldwide populations.

METHODS

We compared 676 diverse individuals carrying TTR amyloidogenic mutations (rs138065384, Phe44Leu; rs730881165, Ala81Thr; rs121918074, His90Asn; rs76992529, Val122Ile) to 12,430 non-carriers matched by age, sex, and genetically inferred ancestry to assess their clinical presentations across 1,693 outcomes derived from electronic health records in the UK Biobank.

RESULTS

In individuals of African descent (AFR), the Val122Ile mutation was linked to multiple outcomes related to the circulatory system (fold-enrichment = 2.96, p = 0.002), with the strongest associations being cardiac congenital anomalies (phecode 747.1, p = 0.003), endocarditis (phecode 420.3, p = 0.006), and cardiomyopathy (phecode 425, p = 0.007). In individuals of Central-South Asian descent (CSA), the His90Asn mutation was associated with dermatologic outcomes (fold-enrichment = 28, p = 0.001). The same TTR mutation was linked to neoplasms in European-descent individuals (EUR, fold-enrichment = 3.09, p = 0.003). In EUR, Ala81Thr showed multiple associations with respiratory-related outcomes (fold-enrichment = 3.61, p = 0.002), but the strongest association was with atrioventricular block (phecode 426.2, p = 2.81 × 10⁻⁴). Additionally, the same mutation in East Asians (EAS) showed associations with endocrine-metabolic traits (fold-enrichment = 4.47, p = 0.003). In the cross-ancestry meta-analysis, the Val122Ile mutation was associated with peripheral nerve disorders (phecode 351, p = 0.004) in addition to cardiac congenital anomalies (fold-enrichment = 6.94, p = 0.003).

CONCLUSIONS

Overall, these findings highlight that TTR amyloidogenic mutations present ancestry-specific and ancestry-convergent associations related to a range of health domains. This supports the need to increase awareness regarding the range of outcomes associated with TTR mutations across worldwide populations to reduce misdiagnosis and delayed diagnosis of TTR-related amyloidosis.

Vivekrabinson K, Ragavan K, Jothi Thilaga P, Bharath Singh J. Secure Cloud-Based Electronic Health Records: Cross-Patient Block-Level Deduplication with Blockchain Auditing. J Med Syst 2024;48:33. [PMID: 38526807 DOI: 10.1007/s10916-024-02053-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/07/2023] [Accepted: 03/12/2024] [Indexed: 03/27/2024]

Jahandideh S, Hutchinson AF, Bucknall TK, Considine J, Driscoll A, Manias E, Phillips NM, Rasmussen B, Vos N, Hutchinson AM. Using machine learning models to predict falls in hospitalised adults. Int J Med Inform 2024;187:105436. [PMID: 38583216 DOI: 10.1016/j.ijmedinf.2024.105436] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/29/2023] [Revised: 02/09/2024] [Accepted: 03/22/2024] [Indexed: 04/09/2024]

Abstract

BACKGROUND

Identifying patients at high risk of falling is crucial in implementing effective fall prevention programs. While the integration of information systems is becoming more widespread in the healthcare industry, it poses a significant challenge in analysing vast amounts of data to identify factors that could enhance patient safety.

OBJECTIVE

To determine fall-associated factors and develop high-performance prediction tools for at-risk patients in acute and sub-acute care services in Australia.

METHODS

A retrospective study of 672,400 patients admitted to acute and sub-acute care services within a large metropolitan tertiary health service in Victoria, Australia, between January 1, 2019, and December 31, 2021. Data were obtained from four sources: the Department of Health Victorian Admitted Episodes Dataset, RiskMan™, electronic health records, and the health workforce dataset. Machine learning techniques, including Random Forest and Deep Neural Network models, were used to analyse the data, predict patient falls, and identify the most important risk factors for falls in this population. Model performance was evaluated using accuracy, F1-score, precision, recall, specificity, Matthews correlation coefficient, and the area under the receiver operating characteristic curve (AUC).
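As an illustration of the evaluation metrics listed above, the following minimal sketch computes them from a binary confusion matrix. The function name and the counts are hypothetical and are not taken from the study; AUC is omitted because it needs ranked prediction scores rather than counts.

```python
import math

def binary_metrics(tp, fp, fn, tn):
    """Return common classification metrics from confusion-matrix counts."""
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    # F1 is the harmonic mean of precision and recall.
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    # Matthews correlation coefficient; defined as 0 when a margin is empty.
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "specificity": specificity,
        "f1": f1,
        "mcc": mcc,
    }

# Hypothetical example: 1,000 admissions, of which 20 involve a fall.
metrics = binary_metrics(tp=10, fp=1, fn=10, tn=979)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

When the outcome is rare, accuracy and specificity can sit close to 1 even if recall is modest, which is one reason to report the full set of metrics together.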

RESULTS

The deep neural network and random forest models were highly accurate in predicting hospital patient falls. The deep neural network model achieved an accuracy of 0.988 and a specificity of 0.999, while the random forest model achieved an accuracy of 0.989 and a specificity of 1.000. The top 20 variables impacting falls were compared across both models, and 12 common factors were identified. These factors can be broadly classified into three categories: patient-related factors, staffing-related factors, and admission-related factors. Although not all factors are modifiable, they must be considered when planning fall prevention interventions.

CONCLUSION

The study demonstrated machine learning's potential to predict falls and identify key risk factors. Further validation across diverse populations and settings is essential for broader applicability.

Affiliation(s)

S Jahandideh School of Nursing and Midwifery, Centre for Quality and Patient Safety Research in the Institute for Health Transformation, Deakin University, Geelong, Victoria, Australia
A F Hutchinson School of Nursing and Midwifery, Centre for Quality and Patient Safety Research in the Institute for Health Transformation, Deakin University, Geelong, Victoria, Australia; Epworth HealthCare, Richmond, Victoria, Australia
T K Bucknall School of Nursing and Midwifery, Centre for Quality and Patient Safety Research in the Institute for Health Transformation, Deakin University, Geelong, Victoria, Australia; Alfred Health, Prahran, Victoria, Australia
J Considine School of Nursing and Midwifery, Centre for Quality and Patient Safety Research in the Institute for Health Transformation, Deakin University, Geelong, Victoria, Australia; Eastern Health, Box Hill, Victoria, Australia
A Driscoll School of Nursing and Midwifery, Centre for Quality and Patient Safety Research in the Institute for Health Transformation, Deakin University, Geelong, Victoria, Australia
E Manias School of Nursing and Midwifery, Centre for Quality and Patient Safety Research in the Institute for Health Transformation, Deakin University, Geelong, Victoria, Australia
N M Phillips School of Nursing and Midwifery, Centre for Quality and Patient Safety Research in the Institute for Health Transformation, Deakin University, Geelong, Victoria, Australia
B Rasmussen School of Nursing and Midwifery, Centre for Quality and Patient Safety Research in the Institute for Health Transformation, Deakin University, Geelong, Victoria, Australia; Western Health, Sunshine, Victoria, Australia
N Vos Monash Health, Clayton, Victoria, Australia
A M Hutchinson School of Nursing and Midwifery, Centre for Quality and Patient Safety Research in the Institute for Health Transformation, Deakin University, Geelong, Victoria, Australia; Barwon Health, Geelong, Victoria, Australia.

Harrison H, Ip S, Renzi C, Li Y, Barclay M, Usher-Smith J, Lyratzopoulos G, Wood A, Antoniou AC. Implementation and external validation of the Cambridge Multimorbidity Score in the UK Biobank cohort. BMC Med Res Methodol 2024;24:71. [PMID: 38509467 PMCID: PMC10953059 DOI: 10.1186/s12874-024-02175-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/18/2023] [Accepted: 02/06/2024] [Indexed: 03/22/2024] Open

Abstract

BACKGROUND

Patients with multiple conditions present a growing challenge for healthcare provision. Measures of multimorbidity may support clinical management, healthcare resource allocation and accounting for the health of participants in purpose-designed cohorts. The recently developed Cambridge Multimorbidity scores (CMS) have the potential to achieve these aims using primary care records; however, they have not yet been validated outside of their development cohort.

METHODS

The CMS, developed in the Clinical Practice Research Datalink (CPRD), were validated in UK Biobank participants with available primary care records whose data are not held in CPRD, the cohort used for CMS development (n = 111,898). This required mapping of the 37 pre-existing conditions used in the CMS to the coding frameworks used by UK Biobank data providers. We used calibration plots and measures of discrimination to validate the CMS for two of the three outcomes used in the development study (death and primary care consultation rate) and explored variation by age and sex. We also examined the predictive ability of the CMS for the outcome of cancer diagnosis. The results were compared to an unweighted count score of the 37 pre-existing conditions.

RESULTS

For all three outcomes considered, the CMS were poorly calibrated in UK Biobank. We observed a similar discriminative ability for the outcome of primary care consultation rate to that reported in the development study (C-index: 0.67 (95% CI: 0.66-0.68) for both, 5-year follow-up); however, we report lower discrimination for the outcome of death than the development study (0.69 (0.68-0.70) and 0.89 (0.88-0.90) respectively). Discrimination for cancer diagnosis was adequate (0.64 (0.63-0.65)). The CMS compare favourably with the unweighted count score for death, but not for the outcomes of primary care consultation rate or cancer diagnosis.
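To make the discrimination measure concrete, here is a minimal sketch of a concordance index for a binary outcome: the share of event/non-event pairs in which the person with the event has the higher score. It is a simplification that ignores follow-up time and censoring, and the scores and outcomes are made up for illustration; it is not the authors' implementation.

```python
def c_index(scores, outcomes):
    """Concordance for a binary outcome (ties in score count as 0.5)."""
    concordant = 0.0
    pairs = 0
    for i in range(len(scores)):
        for j in range(i + 1, len(scores)):
            if outcomes[i] == outcomes[j]:
                continue  # only event/non-event pairs are comparable
            pairs += 1
            # Identify which member of the pair had the event.
            event, non_event = (i, j) if outcomes[i] > outcomes[j] else (j, i)
            if scores[event] > scores[non_event]:
                concordant += 1.0
            elif scores[event] == scores[non_event]:
                concordant += 0.5
    return concordant / pairs if pairs else float("nan")

# Hypothetical multimorbidity scores and outcomes (1 = event occurred).
example_scores = [0.2, 1.5, 0.9, 2.1, 0.4]
example_outcomes = [0, 1, 0, 1, 1]
print(round(c_index(example_scores, example_outcomes), 3))  # 0.833
```

A value of 0.5 corresponds to chance-level ranking and 1.0 to perfect ranking. Discrimination says nothing about calibration, so a score can rank people reasonably well and still give inaccurate absolute risks.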

CONCLUSIONS

In the UK Biobank, CMS discriminates reasonably for the outcomes of death, primary care consultation rate and cancer diagnosis and may be a valuable resource for clinicians, public health professionals and data scientists. However, recalibration will be required to make accurate predictions when cohort composition and risk levels differ substantially from the development cohort. The generated resources (including codelists for the conditions and code for CMS implementation in UK Biobank) are available online.

Kim MK, Rouphael C, McMichael J, Welch N, Dasarathy S. Challenges in and Opportunities for Electronic Health Record-Based Data Analysis and Interpretation. Gut Liver 2024;18:201-208. [PMID: 37905424 PMCID: PMC10938158 DOI: 10.5009/gnl230272] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/14/2023] [Accepted: 08/15/2023] [Indexed: 11/02/2023] Open

Jeffery AD, Fabbri D, Reeves RM, Matheny ME. Use of noisy labels as weak learners to identify incompletely ascertainable outcomes: A Feasibility study with opioid-induced respiratory depression. Heliyon 2024;10:e26434. [PMID: 38444495 PMCID: PMC10912240 DOI: 10.1016/j.heliyon.2024.e26434] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/09/2023] [Revised: 02/09/2024] [Accepted: 02/13/2024] [Indexed: 03/07/2024] Open

Abstract

Objective

Assigning outcome labels to large observational data sets in a timely and accurate manner, particularly when outcomes are rare or not directly ascertainable, remains a significant challenge within biomedical informatics. We examined whether noisy labels generated from subject matter experts' heuristics using heterogeneous data types within a data programming paradigm could provide outcome labels to a large, observational data set. We chose the clinical condition of opioid-induced respiratory depression for our use case because it is rare, has no administrative codes to easily identify the condition, and typically requires at least some unstructured text to ascertain its presence.

Materials and methods

Using de-identified electronic health records of 52,861 post-operative encounters, we applied a data programming paradigm (implemented in the Snorkel software) for the development of a machine learning classifier for opioid-induced respiratory depression. Our approach included subject matter experts creating 14 labeling functions that served as noisy labels for developing a probabilistic Generative model. We used probabilistic labels from the Generative model as outcome labels for training a Discriminative model on the source data. We evaluated performance of the Discriminative model with a hold-out test set of 599 independently-reviewed patient records.
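The idea of labeling functions producing noisy labels can be sketched as follows. Everything here is an assumption made for illustration: the three heuristics, the field names, and the respiratory-rate threshold are hypothetical, and the combination step is a plain average of non-abstaining votes rather than the probabilistic Generative model fitted by Snorkel in the study.

```python
ABSTAIN, NEGATIVE, POSITIVE = -1, 0, 1

def lf_naloxone(encounter):
    """Hypothetical heuristic: a reversal agent was administered."""
    return POSITIVE if encounter.get("naloxone_given") else ABSTAIN

def lf_resp_rate(encounter):
    """Hypothetical heuristic: lowest recorded respiratory rate (breaths/min)."""
    rate = encounter.get("min_resp_rate")
    if rate is None:
        return ABSTAIN
    return POSITIVE if rate < 8 else NEGATIVE  # threshold is illustrative only

def lf_note_mention(encounter):
    """Hypothetical heuristic: the condition is named in free text."""
    text = encounter.get("note", "").lower()
    return POSITIVE if "respiratory depression" in text else ABSTAIN

LABELING_FUNCTIONS = [lf_naloxone, lf_resp_rate, lf_note_mention]

def soft_label(encounter):
    """Average the non-abstaining votes; None if every function abstains."""
    votes = [lf(encounter) for lf in LABELING_FUNCTIONS]
    votes = [v for v in votes if v != ABSTAIN]
    if not votes:
        return None
    return sum(votes) / len(votes)

print(soft_label({"naloxone_given": True, "min_resp_rate": 14}))  # 0.5
```

The resulting soft labels play the role of training targets for a downstream classifier. A fitted label model would additionally weight each function by its estimated accuracy instead of treating all votes equally.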

Results

The final Discriminative classification model achieved an accuracy of 0.977, an F1 score of 0.417, a sensitivity of 1.0, and an AUC of 0.988 in the hold-out test set with a prevalence of 0.83% (5/599).
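As a consistency check on the figures above, the confusion matrix can be approximately reconstructed from the reported sensitivity, F1 score, and prevalence. The false-positive and true-negative counts below are inferred by this arithmetic and are not stated in the abstract.

```python
# Reported in the abstract.
n_records = 599
n_cases = 5
sensitivity = 1.0
f1_reported = 0.417

# Sensitivity = TP / (TP + FN), and TP + FN equals the number of cases.
tp = round(sensitivity * n_cases)
fn = n_cases - tp

# F1 = 2TP / (2TP + FP + FN)  =>  FP = 2TP / F1 - 2TP - FN
fp = round(2 * tp / f1_reported - 2 * tp - fn)
tn = n_records - tp - fn - fp

accuracy = (tp + tn) / n_records
precision = tp / (tp + fp)
f1_check = 2 * tp / (2 * tp + fp + fn)

print(tp, fn, fp, tn)            # 5 0 14 580
print(round(accuracy, 3))        # 0.977
print(round(precision, 3))       # 0.263
print(round(f1_check, 3))        # 0.417
```

Under this reconstruction, roughly 19 of 599 records would be flagged for manual review, which is consistent with the high accuracy and modest F1 score reported together.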

Discussion

All of the confirmed Cases were identified by the classifier. For rare outcomes, this finding is encouraging because it reduces the number of manual reviews needed by excluding visits/patients with low probabilities.

Conclusion

Application of a data programming paradigm with expert-informed labeling functions might have utility for phenotyping clinical phenomena that are not easily ascertainable from highly-structured data.

Wang Y, Yin C, Zhang P. Multimodal risk prediction with physiological signals, medical images and clinical notes. Heliyon 2024;10:e26772. [PMID: 38455585 PMCID: PMC10918115 DOI: 10.1016/j.heliyon.2024.e26772] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/06/2023] [Revised: 02/17/2024] [Accepted: 02/20/2024] [Indexed: 03/09/2024] Open

Ponjoan A, Blanch J, Fages-Masmiquel E, Martí-Lluch R, Alves-Cabratosa L, Garcia-Gil MDM, Domínguez-Armengol G, Ribas-Aulinas F, Zacarías-Pons L, Ramos R. Sex matters in the association between cardiovascular health and incident dementia: evidence from real world data. Alzheimers Res Ther 2024;16:58. [PMID: 38481343 PMCID: PMC10938682 DOI: 10.1186/s13195-024-01406-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/10/2023] [Accepted: 01/31/2024] [Indexed: 03/17/2024]

Abstract

BACKGROUND

Cardiovascular health has been associated with dementia onset, but little is known about the variation of such association by sex and age considering dementia subtypes. We assessed the role of sex and age in the association between cardiovascular risk and the onset of all-cause dementia, Alzheimer's disease, and vascular dementia in people aged 50-74 years.

METHODS

This is a retrospective cohort study covering 922,973 Catalans who attended the primary care services of the Catalan Health Institute (Spain). Data were obtained from the System for the Development of Research in Primary Care (SIDIAP database). Exposure was the cardiovascular risk (CVR) at baseline categorized into four levels of Framingham-REGICOR score (FRS): low (FRS < 5%), low-intermediate (5% ≤ FRS < 7.5%), high-intermediate (7.5% ≤ FRS < 10%), high (FRS ≥ 10%), and one group with previous vascular disease. Cases of all-cause dementia and Alzheimer's disease were identified using validated algorithms, and cases of vascular dementia were identified by diagnostic codes. We fitted stratified Cox models with age parametrized as a B-spline.
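The exposure categories described above can be expressed as a small function. This is a sketch only: the function and parameter names are hypothetical, and the score is assumed to be supplied as a percentage.

```python
def cvr_category(frs_percent, previous_vascular_disease=False):
    """Map a Framingham-REGICOR score (in %) to the exposure group.

    Thresholds follow the abstract: <5, 5 to <7.5, 7.5 to <10, >=10.
    People with previous vascular disease form a separate group.
    """
    if previous_vascular_disease:
        return "previous vascular disease"
    if frs_percent < 5:
        return "low"
    if frs_percent < 7.5:
        return "low-intermediate"
    if frs_percent < 10:
        return "high-intermediate"
    return "high"

for score in (3.2, 5.0, 7.5, 10.0):
    print(score, cvr_category(score))
```

The boundary values 5, 7.5, and 10 fall into the higher group in each case, matching the inequalities given in the abstract.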

RESULTS

A total of 51,454 incident cases of all-cause dementia were recorded over a mean follow-up of 12.7 years. The hazard ratios in the low-intermediate and high FRS groups were 1.12 (95% confidence interval: 1.08-1.15) and 1.55 (1.50-1.60) for all-cause dementia; 1.07 (1.03-1.11) and 1.17 (1.11-1.24) for Alzheimer's disease; and 1.34 (1.21-1.50) and 1.90 (1.67-2.16) for vascular dementia. These associations were stronger in women and in midlife compared to later life in all dementia types. Women with a high Framingham-REGICOR score presented a similar risk of developing dementia of any type to women who had previous vascular disease, and at age 50-55, they showed a three times higher risk of developing dementia compared with the lowest Framingham-REGICOR group.

CONCLUSIONS

We found a dose‒response association between the Framingham-REGICOR score and the onset of all dementia types. Poor cardiovascular health in midlife increased the onset of all dementia types later in life, especially in women.

Affiliation(s)

Anna Ponjoan Vascular Health Research Group (ISV-Girona), Fundació Institut Universitari per a la Recerca a l'Atenció Primària de Salut Jordi Gol i Gurina (IDIAPJGol), C/Maluquer Salvador nº11, Girona, Catalonia, 17002, Spain. Girona Biomedical Research Institute (IDIBGI), Dr. Trueta University Hospital. Parc Hospitalari Martí I Julià, (Ed. M2), C/Dr. Castany S/N, Salt (Girona), Catalonia, 17190, Spain. Network for Research On Chronicity, Primary Care, and Health Promotion (RICAPPS), C/ Maluquer Salvador nº11, Girona, Catalonia, 17002, Spain.
Jordi Blanch Vascular Health Research Group (ISV-Girona), Fundació Institut Universitari per a la Recerca a l'Atenció Primària de Salut Jordi Gol i Gurina (IDIAPJGol), C/Maluquer Salvador nº11, Girona, Catalonia, 17002, Spain
Ester Fages-Masmiquel Atenció Primària, Gerència Territorial de Girona, Institut Català de la Salut. C/Mossèn Joan Pons S/N, Girona, 17001, Spain
Ruth Martí-Lluch Vascular Health Research Group (ISV-Girona), Fundació Institut Universitari per a la Recerca a l'Atenció Primària de Salut Jordi Gol i Gurina (IDIAPJGol), C/Maluquer Salvador nº11, Girona, Catalonia, 17002, Spain Girona Biomedical Research Institute (IDIBGI), Dr. Trueta University Hospital. Parc Hospitalari Martí I Julià, (Ed. M2), C/Dr. Castany S/N, Salt (Girona), Catalonia, 17190, Spain Network for Research On Chronicity, Primary Care, and Health Promotion (RICAPPS), C/ Maluquer Salvador nº11, Girona, Catalonia, 17002, Spain
Lia Alves-Cabratosa Vascular Health Research Group (ISV-Girona), Fundació Institut Universitari per a la Recerca a l'Atenció Primària de Salut Jordi Gol i Gurina (IDIAPJGol), C/Maluquer Salvador nº11, Girona, Catalonia, 17002, Spain
María Del Mar Garcia-Gil Vascular Health Research Group (ISV-Girona), Fundació Institut Universitari per a la Recerca a l'Atenció Primària de Salut Jordi Gol i Gurina (IDIAPJGol), C/Maluquer Salvador nº11, Girona, Catalonia, 17002, Spain
Gina Domínguez-Armengol Vascular Health Research Group (ISV-Girona), Fundació Institut Universitari per a la Recerca a l'Atenció Primària de Salut Jordi Gol i Gurina (IDIAPJGol), C/Maluquer Salvador nº11, Girona, Catalonia, 17002, Spain Network for Research On Chronicity, Primary Care, and Health Promotion (RICAPPS), C/ Maluquer Salvador nº11, Girona, Catalonia, 17002, Spain
Francesc Ribas-Aulinas Vascular Health Research Group (ISV-Girona), Fundació Institut Universitari per a la Recerca a l'Atenció Primària de Salut Jordi Gol i Gurina (IDIAPJGol), C/Maluquer Salvador nº11, Girona, Catalonia, 17002, Spain Network for Research On Chronicity, Primary Care, and Health Promotion (RICAPPS), C/ Maluquer Salvador nº11, Girona, Catalonia, 17002, Spain
Lluís Zacarías-Pons Vascular Health Research Group (ISV-Girona), Fundació Institut Universitari per a la Recerca a l'Atenció Primària de Salut Jordi Gol i Gurina (IDIAPJGol), C/Maluquer Salvador nº11, Girona, Catalonia, 17002, Spain Network for Research On Chronicity, Primary Care, and Health Promotion (RICAPPS), C/ Maluquer Salvador nº11, Girona, Catalonia, 17002, Spain
Rafel Ramos Vascular Health Research Group (ISV-Girona), Fundació Institut Universitari per a la Recerca a l'Atenció Primària de Salut Jordi Gol i Gurina (IDIAPJGol), C/Maluquer Salvador nº11, Girona, Catalonia, 17002, Spain. Girona Biomedical Research Institute (IDIBGI), Dr. Trueta University Hospital. Parc Hospitalari Martí I Julià, (Ed. M2), C/Dr. Castany S/N, Salt (Girona), Catalonia, 17190, Spain. Network for Research On Chronicity, Primary Care, and Health Promotion (RICAPPS), C/ Maluquer Salvador nº11, Girona, Catalonia, 17002, Spain. Atenció Primària, Gerència Territorial de Girona, Institut Català de la Salut. C/Mossèn Joan Pons S/N, Girona, 17001, Spain. Translab Research Group, Department of Medical Sciences, University of Girona, C/Emili Grahit, 77, Girona, Catalonia, 17071, Spain.

Deng Y, Pacheco JA, Ghosh A, Chung A, Mao C, Smith JC, Zhao J, Wei WQ, Barnado A, Dorn C, Weng C, Liu C, Cordon A, Yu J, Tedla Y, Kho A, Ramsey-Goldman R, Walunas T, Luo Y. Natural language processing to identify lupus nephritis phenotype in electronic health records. BMC Med Inform Decis Mak 2024;22:348. [PMID: 38433189 PMCID: PMC10910523 DOI: 10.1186/s12911-024-02420-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/09/2021] [Accepted: 01/09/2024] [Indexed: 03/05/2024] Open

Abstract

BACKGROUND

Systemic lupus erythematosus (SLE) is a rare autoimmune disorder characterized by an unpredictable course of flares and remission with diverse manifestations. Lupus nephritis, one of the major disease manifestations of SLE for organ damage and mortality, is a key component of lupus classification criteria. Accurately identifying lupus nephritis in electronic health records (EHRs) would therefore benefit large cohort observational studies and clinical trials where characterization of the patient population is critical for recruitment, study design, and analysis. Lupus nephritis can be recognized through procedure codes and structured data, such as laboratory tests. However, other critical information documenting lupus nephritis, such as histologic reports from kidney biopsies and prior medical history narratives, requires sophisticated text processing to mine information from pathology reports and clinical notes. In this study, we developed algorithms to identify lupus nephritis with and without natural language processing (NLP) using EHR data from the Northwestern Medicine Enterprise Data Warehouse (NMEDW).

METHODS

We developed five algorithms: a rule-based algorithm using only structured data (baseline algorithm) and four algorithms using different NLP models. The first NLP model applied simple regular expressions for keyword search combined with structured data. The other three NLP models were based on regularized logistic regression and used different sets of features including positive mention of concept unique identifiers (CUIs), number of appearances of CUIs, and a mixture of three components (i.e. a curated list of CUIs, regular expression concepts, structured data) respectively. The baseline algorithm and the best performing NLP algorithm were externally validated on a dataset from Vanderbilt University Medical Center (VUMC).
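A keyword search with regular expressions combined with structured data, as in the first NLP model above, might look roughly like the sketch below. The keyword pattern, the naive negation check, the structured field names, and the combination rule are all assumptions made for illustration and are not the rules used in the study.

```python
import re

# Hypothetical keyword and negation patterns.
KEYWORDS = re.compile(r"\blupus\s+nephritis\b", re.IGNORECASE)
NEGATION = re.compile(r"\b(no|not|without|negative for)\b", re.IGNORECASE)

def has_positive_mention(note, window=40):
    """True if the keyword appears without a negation cue just before it."""
    for match in KEYWORDS.finditer(note):
        preceding = note[max(0, match.start() - window):match.start()]
        if not NEGATION.search(preceding):
            return True
    return False

def flag_patient(structured, notes):
    """Combine structured data with text evidence (illustrative rule)."""
    text_hit = any(has_positive_mention(note) for note in notes)
    has_sle_code = bool(structured.get("sle_diagnosis_code"))
    has_biopsy = bool(structured.get("kidney_biopsy_procedure"))
    return has_sle_code and (text_hit or has_biopsy)

notes = ["Biopsy consistent with class IV lupus nephritis."]
print(flag_patient({"sle_diagnosis_code": True}, notes))  # True
```

A fixed-width negation window is crude and will miss many phrasings, which is part of the motivation for the concept-based features and learned models described in the rest of the methods.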

RESULTS

Our best performing NLP model incorporated features from structured data, regular expression concepts, and mapped concept unique identifiers (CUIs), and showed an improved F-measure over the baseline lupus nephritis algorithm in both the NMEDW (0.79 vs 0.41) and VUMC (0.93 vs 0.52) datasets.

CONCLUSION

Our NLP MetaMap mixed model improved the F-measure greatly compared to the structured data only algorithm in both internal and external validation datasets. The NLP algorithms can serve as powerful tools to accurately identify lupus nephritis phenotype in EHR for clinical research and better targeted therapies.

Affiliation(s)

Yu Deng Center for Health Information Partnerships, Feinberg School of Medicine, Northwestern University, Chicago, USA
Jennifer A Pacheco Center for Genetic Medicine, Feinberg School of Medicine, Northwestern University, Chicago, USA
Anika Ghosh Center for Health Information Partnerships, Feinberg School of Medicine, Northwestern University, Chicago, USA
Anh Chung Center for Health Information Partnerships, Feinberg School of Medicine, Northwestern University, Chicago, USA Department of Medicine/Rheumatology, Feinberg School of Medicine, Northwestern University, Chicago, USA
Chengsheng Mao Center for Health Information Partnerships, Feinberg School of Medicine, Northwestern University, Chicago, USA
Joshua C Smith Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, USA
Juan Zhao Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, USA
Wei-Qi Wei Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, USA
April Barnado Department of Medicine, Vanderbilt University Medical Center, Nashville, USA
Chad Dorn Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, USA
Chunhua Weng Department of Biomedical Informatics, Columbia University, New York City, USA
Cong Liu Department of Biomedical Informatics, Columbia University, New York City, USA
Adam Cordon Center for Genetic Medicine, Feinberg School of Medicine, Northwestern University, Chicago, USA
Jingzhi Yu Center for Health Information Partnerships, Feinberg School of Medicine, Northwestern University, Chicago, USA
Yacob Tedla Center for Health Information Partnerships, Feinberg School of Medicine, Northwestern University, Chicago, USA
Abel Kho Center for Health Information Partnerships, Feinberg School of Medicine, Northwestern University, Chicago, USA
Rosalind Ramsey-Goldman Department of Medicine/Rheumatology, Feinberg School of Medicine, Northwestern University, Chicago, USA
Theresa Walunas Center for Health Information Partnerships, Feinberg School of Medicine, Northwestern University, Chicago, USA.
Yuan Luo Center for Health Information Partnerships, Feinberg School of Medicine, Northwestern University, Chicago, USA.

McCaffery K, Carey KA, Campbell V, Gifford S, Smith K, Edelson D, Churpek MM, Mayampurath A. Predicting transfers to intensive care in children using CEWT and other early warning systems. Resusc Plus 2024;17:100540. [PMID: 38260119 PMCID: PMC10801303 DOI: 10.1016/j.resplu.2023.100540] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/21/2023] [Revised: 11/15/2023] [Accepted: 12/13/2023] [Indexed: 01/24/2024] Open

Bernstein IA, Koornwinder A, Hwang HH, Wang SY. Automated Recognition of Visual Acuity Measurements in Ophthalmology Clinical Notes Using Deep Learning. Ophthalmol Sci 2024;4:100371. [PMID: 37868799 PMCID: PMC10587603 DOI: 10.1016/j.xops.2023.100371] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/24/2023] [Revised: 06/20/2023] [Accepted: 07/13/2023] [Indexed: 10/24/2023]

Abstract

Purpose

Visual acuity (VA) is a critical component of the eye examination but is often only documented in electronic health records (EHRs) as unstructured free-text notes, making it challenging to use in research. This study aimed to improve on existing rule-based algorithms by developing and evaluating deep learning models to perform named entity recognition of different types of VA measurements and their lateralities from free-text ophthalmology notes: VA for each of the right and left eyes, with and without glasses correction, and with and without pinhole.

Design

Cross-sectional study.

Subjects

A total of 319 756 clinical notes with documented VA measurements from approximately 90 000 patients were included.

Methods

The notes were split into train, validation, and test sets. Bidirectional Encoder Representations from Transformers (BERT) models were fine-tuned to identify VA measurements from the progress notes and included BERT models pretrained on biomedical literature (BioBERT), critical care EHR notes (ClinicalBERT), both (BlueBERT), and a lighter version of BERT with 40% fewer parameters (DistilBERT). A baseline rule-based algorithm was created to recognize the same VA entities to compare against BERT models.

Main Outcome Measures

Model performance was evaluated on a held-out test set using microaveraged precision, recall, and F1 score for all entities.
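Microaveraging pools the true positives, false positives, and false negatives across all entity types before computing the metrics, so frequent entity types carry more weight. The sketch below shows the calculation; the entity names and counts are hypothetical and are not results from the study.

```python
def micro_prf(counts):
    """Micro-averaged precision, recall and F1 over per-entity counts."""
    tp = sum(c["tp"] for c in counts.values())
    fp = sum(c["fp"] for c in counts.values())
    fn = sum(c["fn"] for c in counts.values())
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = 2 * tp / (2 * tp + fp + fn) if (2 * tp + fp + fn) else 0.0
    return precision, recall, f1

# Hypothetical per-entity counts for two visual acuity entity types.
entity_counts = {
    "va_right_uncorrected": {"tp": 90, "fp": 5, "fn": 10},
    "va_left_pinhole": {"tp": 8, "fp": 4, "fn": 2},
}
precision, recall, f1 = micro_prf(entity_counts)
print(f"precision={precision:.3f} recall={recall:.3f} f1={f1:.3f}")
```

A macro average would instead compute the metrics per entity type and take their unweighted mean, which gives rare entity types the same influence as common ones.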

Results

On the human-annotated subset, BlueBERT achieved the best microaveraged F1 score (F1 = 0.92), followed by ClinicalBERT (F1 = 0.91), DistilBERT (F1 = 0.90), BioBERT (F1 = 0.84), and the baseline model (F1 = 0.83). Common errors included labeling VA in sections outside of the examination portion of the note, difficulties labeling current VA alongside a series of past VAs, and missing nonnumeric VAs.

Conclusions

This study demonstrates that deep learning models are capable of identifying VA measurements from free-text ophthalmology notes with high precision and recall, achieving significant performance improvements over a rule-based algorithm. The ability to recognize VA from free-text notes would enable a more detailed characterization of ophthalmology patient cohorts and enhance the development of models to predict ophthalmology outcomes.

Financial Disclosures

Proprietary or commercial disclosure may be found in the Footnotes and Disclosures at the end of this article.

Oss Boll H, Amirahmadi A, Ghazani MM, Morais WOD, Freitas EPD, Soliman A, Etminani F, Byttner S, Recamonde-Mendoza M. Graph neural networks for clinical risk prediction based on electronic health records: A survey. J Biomed Inform 2024;151:104616. [PMID: 38423267 DOI: 10.1016/j.jbi.2024.104616] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/22/2023] [Revised: 02/21/2024] [Accepted: 02/23/2024] [Indexed: 03/02/2024]

Barcelona V, Scharp D, Moen H, Davoudi A, Idnay BR, Cato K, Topaz M. Using Natural Language Processing to Identify Stigmatizing Language in Labor and Birth Clinical Notes. Matern Child Health J 2024;28:578-586. [PMID: 38147277 DOI: 10.1007/s10995-023-03857-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 12/10/2023] [Indexed: 12/27/2023]

Abstract

INTRODUCTION

Stigma and bias related to race and other minoritized statuses may underlie disparities in pregnancy and birth outcomes. One emerging method to identify bias is the study of stigmatizing language in the electronic health record. The objective of our study was to develop automated natural language processing (NLP) methods that accurately identify two types of stigmatizing language in labor and birth notes: marginalizing language and its complement, power/privilege language.

METHODS

We analyzed notes for all birthing people > 20 weeks' gestation admitted for labor and birth at two hospitals during 2017. We then employed text preprocessing techniques, specifically using TF-IDF values as inputs, and tested machine learning classification algorithms to identify stigmatizing and power/privilege language in clinical notes. The algorithms assessed included Decision Trees, Random Forest, and Support Vector Machines. Additionally, we applied a feature importance evaluation method (InfoGain) to discern words that are highly correlated with these language categories.
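For readers unfamiliar with TF-IDF, the sketch below computes one common variant (term frequency normalised by note length, multiplied by the log inverse document frequency) in plain Python. The example sentences are invented, and libraries typically apply smoothed or normalised variants, so the values here will not match those of any particular toolkit or of the study.

```python
import math
from collections import Counter

def tfidf(documents):
    """Return one {term: weight} dict per document using tf * log(N / df)."""
    tokenized = [doc.lower().split() for doc in documents]
    n_docs = len(tokenized)
    # Document frequency: number of documents containing each term.
    doc_freq = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        term_counts = Counter(tokens)
        vectors.append({
            term: (count / len(tokens)) * math.log(n_docs / doc_freq[term])
            for term, count in term_counts.items()
        })
    return vectors

# Invented example notes.
example_notes = [
    "patient refused exam",
    "patient tolerated exam well",
    "patient insisted on early discharge",
]
vectors = tfidf(example_notes)
print(vectors[0])
```

Terms that occur in every note, such as "patient" here, receive a weight of zero, while terms concentrated in a few notes are weighted up. These vectors are the kind of input a classifier such as a decision tree or support vector machine would then be trained on.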

RESULTS

For marginalizing language, Decision Trees yielded the best classification with an F-score of 0.73. For power/privilege language, Support Vector Machines performed optimally, achieving an F-score of 0.91. These results demonstrate the effectiveness of the selected machine learning methods in classifying language categories in clinical notes.

CONCLUSION

We identified well-performing machine learning methods to automatically detect stigmatizing language in clinical notes. To our knowledge, this is the first study to use NLP performance metrics to evaluate the performance of machine learning methods in discerning stigmatizing language. Future studies should delve deeper into refining and evaluating NLP methods, incorporating the latest algorithms rooted in deep learning.

Sushil M, Butte AJ, Schuit E, van Smeden M, Leeuwenberg AM. Cross-institution natural language processing for reliable clinical association studies: a methodological exploration. J Clin Epidemiol 2024;167:111258. [PMID: 38219811 DOI: 10.1016/j.jclinepi.2024.111258] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/20/2023] [Revised: 12/21/2023] [Accepted: 01/08/2024] [Indexed: 01/16/2024]

Abstract

OBJECTIVES

Natural language processing (NLP) of clinical notes in electronic medical records is increasingly used to extract otherwise sparsely available patient characteristics, to assess their association with relevant health outcomes. Manual data curation is resource intensive and NLP methods make these studies more feasible. However, the methodology of using NLP methods reliably in clinical research is understudied. The objective of this study is to investigate how NLP models could be used to extract study variables (specifically exposures) to reliably conduct exposure-outcome association studies.

STUDY DESIGN AND SETTING

In a convenience sample of patients admitted to the intensive care unit of a US academic health system, multiple association studies were conducted, comparing the association estimates based on NLP-extracted vs. manually extracted exposure variables. The association studies varied in NLP model architecture (Bidirectional Encoder Representations from Transformers, Long Short-Term Memory), training paradigm (training a new model, fine-tuning an existing external model), extracted exposures (employment status, living status, and substance use), health outcomes (having a do-not-resuscitate/intubate code, length of stay, and in-hospital mortality), missing data handling (multiple imputation vs. complete case analysis), and the application of measurement error correction (via regression calibration).

RESULTS

The study was conducted on 1,174 participants (median [interquartile range] age, 61 [50, 73] years; 60.6% male). Additionally, up to 500 discharge reports of participants from the same health system and 2,528 reports of participants from an external health system were used to train the NLP models. Substantial differences were found between the associations based on NLP-extracted and manually extracted exposures under all settings. The error in association was only weakly correlated with the overall F1 score of the NLP models.

CONCLUSION

Associations estimated using NLP-extracted exposures should be interpreted with caution. Further research is needed to set conditions for reliable use of NLP in medical association studies.
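One way an extracted exposure can distort an association is through misclassification. The following is a minimal sketch, not the study's analysis: it assumes a binary exposure, nondifferential misclassification summarized by a sensitivity and a specificity, and hypothetical values for exposure prevalence and outcome risks. The function name `observed_risk_difference` is illustrative only.

```python
def observed_risk_difference(prevalence, risk_exposed, risk_unexposed,
                             sensitivity, specificity):
    """Expected risk difference when exposure is read from an imperfect extractor.

    Assumes misclassification is independent of the outcome (nondifferential).
    """
    # Population fractions by (true exposure, extracted exposure).
    true_pos = prevalence * sensitivity
    false_neg = prevalence * (1 - sensitivity)
    false_pos = (1 - prevalence) * (1 - specificity)
    true_neg = (1 - prevalence) * specificity

    # Outcome risk in each extracted group is a mixture of the two true risks.
    risk_labeled_exposed = (
        true_pos * risk_exposed + false_pos * risk_unexposed
    ) / (true_pos + false_pos)
    risk_labeled_unexposed = (
        false_neg * risk_exposed + true_neg * risk_unexposed
    ) / (false_neg + true_neg)
    return risk_labeled_exposed - risk_labeled_unexposed


# Hypothetical inputs: 30% exposed, outcome risk 0.4 vs. 0.2.
true_rd = observed_risk_difference(0.3, 0.4, 0.2, sensitivity=1.0, specificity=1.0)
nlp_rd = observed_risk_difference(0.3, 0.4, 0.2, sensitivity=0.8, specificity=0.9)
print(f"risk difference, perfect extraction:   {true_rd:.3f}")
print(f"risk difference, imperfect extraction: {nlp_rd:.3f}")
```

Under these assumptions the estimate shrinks toward zero. Real extraction errors need not be nondifferential, so the direction and size of the error in practice can differ from this sketch.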

Vithanage D, Yu P, Wang L, Deng C. Contextual Word Embedding for Biomedical Knowledge Extraction: a Rapid Review and Case Study. J Healthc Inform Res 2024;8:158-179. [PMID: 38273979 PMCID: PMC10805696 DOI: 10.1007/s41666-023-00157-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/30/2022] [Revised: 11/27/2023] [Accepted: 12/09/2023] [Indexed: 01/27/2024]

Afraz A, Montazeri M, Shahrbabaki ME, Ahmadian L, Jahani Y. The viewpoints of parents of children with mental disorders regarding the confidentiality and security of their children's information in the Iranian national electronic health record system. Int J Med Inform 2024;183:105334. [PMID: 38218129 DOI: 10.1016/j.ijmedinf.2023.105334] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/30/2023] [Revised: 12/18/2023] [Accepted: 12/28/2023] [Indexed: 01/15/2024]

Abstract

INTRODUCTION

Electronic health records help collect and communicate patient information among healthcare providers. The confidentiality of information, especially for patients with mental disorders, is paramount because of its profound impact on the social and personal aspects of individuals' lives. This study aimed to investigate the viewpoints and concerns of parents of children with mental disorders regarding the confidentiality and security of their children's information in the Iranian National Electronic Health Record System (IEHRS).

METHODS

This is a survey study on parents or guardians of children with mental disorders who visited Kerman's specialised child psychiatry treatment centres. The data collection tool was a researcher-made questionnaire with 28 questions organised in seven sections, including demographic information of parents, children's medical history, Internet use, knowledge about IEHRS, the necessity of data collection, IEHRS security concerns, and privacy concerns. The data were analysed in SPSS 24 software using descriptive statistics and logistic and ordinal regressions to assess the relationship between parents' demographic characteristics and their viewpoints regarding information security and confidentiality concerns.

RESULTS

The results showed that more than 85 % of the parents believed that the security of their children's information in IEHRS was moderate to high. More than two-thirds (71 %) of the parents also believed that IEHRS should tighten its privacy policies. Most participants (87 %) were concerned about their children's information security in IEHRS. In this study, the parents' concerns about the privacy and security of information in IEHRS were not significantly associated with their age, gender, or knowledge about IEHRS.

CONCLUSIONS

Most parents of children with mental disorders were concerned about the security and confidentiality of their children's information in IEHRS. Thus, health policymakers should maintain a high level of security and establish appropriate privacy and confidentiality rules in IEHRS. In addition, they should be transparent about the system's security mechanisms and confidentiality regulations to win public trust.

Tripathi S, Fritz BA, Abdelhack M, Avidan MS, Chen Y, King CR. Multi-view representation learning for tabular data integration using inter-feature relationships. J Biomed Inform 2024;151:104602. [PMID: 38346530 DOI: 10.1016/j.jbi.2024.104602] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/24/2023] [Revised: 01/31/2024] [Accepted: 02/01/2024] [Indexed: 02/16/2024]

Abstract

OBJECTIVE

An applied problem facing all areas of data science is harmonizing data sources. Joining data from multiple origins with unmapped and only partially overlapping features is a prerequisite to developing and testing robust, generalizable algorithms, especially in healthcare. This integration is usually resolved using meta-data such as feature names, which may be unavailable or ambiguous. Our goal is to design methods that create a mapping between structured tabular datasets derived from electronic health records independent of meta-data.

METHODS

We evaluate methods in the challenging case of numeric features without reliable and distinctive univariate summaries, such as nearly Gaussian and binary features. We assume that a small set of features are a priori mapped between two datasets, which share unknown identical features and possibly many unrelated features. We expect inter-feature relationships to be the main source of identification. We compare the performance of contrastive learning methods for feature representations, novel partial auto-encoders, mutual-information graph optimizers, and simple statistical baselines on simulated data, public datasets, the MIMIC-III medical-record changeover, and perioperative records from before and after a medical-record system change. Performance was evaluated using both mapping of identical features and reconstruction accuracy of examples in the format of the other dataset.

RESULTS

Contrastive learning-based methods overall performed the best, often substantially beating the literature baseline in matching and reconstruction, especially in the more challenging real data experiments. Partial auto-encoder methods showed on-par matching with contrastive methods in all synthetic and some real datasets, along with good reconstruction. However, the statistical method we created performed reasonably well in many cases, with much less dependence on hyperparameter tuning. When validating feature match output in the EHR dataset, we found that some apparent mistakes were actually surrogate or related features, as reviewed by two subject matter experts.

CONCLUSION

In simulation studies and real-world examples, we find that inter-feature relationships are effective at identifying matching or closely related features across tabular datasets when meta-data is not available. Decoder architectures are also reasonably effective at imputing features without an exact match.
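The idea of identifying features through their relationships with already-mapped features can be illustrated with a simple correlation-based matcher. This is a minimal sketch under stated assumptions, not the statistical method or any of the learned models from the paper: each unmapped feature gets a "fingerprint" of Pearson correlations with the a priori mapped features, and features are paired by nearest fingerprint. The simulated data, the function names, and the use of NumPy are all illustrative choices.

```python
import numpy as np


def correlation_fingerprints(known, unknown):
    """Pearson correlation of each unmapped feature with each mapped feature."""
    k = (known - known.mean(axis=0)) / known.std(axis=0)
    u = (unknown - unknown.mean(axis=0)) / unknown.std(axis=0)
    return (u.T @ k) / known.shape[0]  # shape: (n_unknown, n_known)


def match_features(known_a, unknown_a, known_b, unknown_b):
    """For each unmapped feature in A, index of the nearest-fingerprint feature in B."""
    fp_a = correlation_fingerprints(known_a, unknown_a)
    fp_b = correlation_fingerprints(known_b, unknown_b)
    dist = np.linalg.norm(fp_a[:, None, :] - fp_b[None, :, :], axis=2)
    # Independent nearest-neighbour choice; one-to-one matching is not enforced.
    return dist.argmin(axis=1)


def simulate(n, load_known, load_unknown, rng):
    """Draw n rows whose features are noisy linear mixes of shared latent factors."""
    z = rng.normal(size=(n, load_known.shape[0]))
    known = z @ load_known + 0.3 * rng.normal(size=(n, load_known.shape[1]))
    unknown = z @ load_unknown + 0.3 * rng.normal(size=(n, load_unknown.shape[1]))
    return known, unknown


rng = np.random.default_rng(0)
load_k = rng.normal(size=(4, 8))   # 8 a priori mapped features
load_u = rng.normal(size=(4, 6))   # 6 shared but unmapped features
perm = rng.permutation(6)          # hidden column order in dataset B

# Two datasets with different patients and shuffled unmapped columns.
known_a, unknown_a = simulate(5000, load_k, load_u, rng)
known_b, unknown_b = simulate(5000, load_k, load_u[:, perm], rng)

matches = match_features(known_a, unknown_a, known_b, unknown_b)
print("recovered:", matches)
print("truth:    ", np.argsort(perm))
```

Because the two datasets contain different rows, the features themselves cannot be compared directly; only their correlation structure with the shared features carries over. That is also why nearly Gaussian features with uninformative univariate summaries can still be matched.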

Starolis MW, Zaydman MA, Liesman RM. Working with the Electronic Health Record and Laboratory Information System to Maximize Ordering and Reporting of Molecular Microbiology Results. Clin Lab Med 2024;44:95-107. [PMID: 38280801 DOI: 10.1016/j.cll.2023.10.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 01/29/2024]

Dong Z, Leveille S, Lewis D, Walker J. People with diabetes who read their clinicians' visit notes: Behaviors and attitudes. Chronic Illn 2024;20:173-183. [PMID: 37151042 DOI: 10.1177/17423953231171890] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/09/2023]

Pozzar RA, Tulsky JA, Berry DL, Batista J, Yackel HD, Phan H, Wright AA. Developing a Collaborative Agenda-Setting Intervention (CASI) to promote patient-centered communication in ovarian cancer care: A design thinking approach. Patient Educ Couns 2024;120:108099. [PMID: 38086227 DOI: 10.1016/j.pec.2023.108099] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/20/2023] [Revised: 12/01/2023] [Accepted: 12/05/2023] [Indexed: 01/29/2024]

Calleja-Panero JL, Esteban Mur R, Jarque I, Romero-Gómez M, Group SR, García Labrador L, González Calvo J. Chronic liver disease-associated severe thrombocytopenia in Spain: Results from a retrospective study using machine learning and natural language processing. Gastroenterol Hepatol 2024;47:236-245. [PMID: 37236305 DOI: 10.1016/j.gastrohep.2023.05.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/09/2023] [Revised: 05/02/2023] [Accepted: 05/19/2023] [Indexed: 05/28/2023]

Abstract

BACKGROUND

Patients with chronic liver disease (CLD) often develop thrombocytopenia (TCP) as a complication. Severe TCP (platelet count <50×10⁹/L) can increase morbidity and complicate CLD management, increasing bleeding risk during invasive procedures.

OBJECTIVES

To describe the real-world clinical characteristics of patients with CLD-associated severe TCP. To evaluate the association between invasive procedures, prophylactic treatments, and bleeding events in this group of patients. To describe their use of medical resources in Spain.

METHODS

This is a retrospective, multicenter study including patients who had a confirmed diagnosis of CLD and severe TCP in four hospitals within the Spanish National Healthcare Network from January 2014 to December 2018. We analyzed the free-text information from Electronic Health Records (EHRs) of patients using Natural Language Processing (NLP), machine learning techniques, and SNOMED-CT terminology. Demographics, comorbidities, analytical parameters, and characteristics of CLD were extracted at baseline; the need for invasive procedures, prophylactic treatments, bleeding events, and medical resources used were extracted for the follow-up period. Frequency tables were generated for categorical variables, whereas continuous variables were described in summary tables as mean (SD) and median (Q1-Q3).

RESULTS

Out of 1,765,675 patients, 1787 had CLD and severe TCP; 65.2% were male, with a mean age of 54.7 years. Cirrhosis was detected in 46% (n=820) of patients and 9.1% (n=163) had hepatocellular carcinoma. Invasive procedures were needed in 85.6% of patients during the follow-up period. Patients undergoing procedures, compared with those without invasive procedures, presented higher rates of bleeding events (33% vs 8%, p<0.0001) and a higher number of bleedings. While prophylactic platelet transfusions were given to 25.6% of patients undergoing procedures, TPO receptor agonist use was only detected in 3.1% of them. Most patients (60.9%) required at least one hospital admission during the follow-up and 14.4% of admissions were due to bleeding events with a hospital length of stay of 6 (3, 9) days.

CONCLUSIONS

NLP and machine learning are useful tools to describe real-world data in patients with CLD and severe TCP in Spain. Bleeding events are frequent in patients who need invasive procedures, even when they receive prophylactic platelet transfusions, which further increases the use of medical resources. Therefore, new prophylactic treatments that are not yet in widespread use are needed.

AlSaad R, Malluhi Q, Abd-Alrazaq A, Boughorbel S. Temporal self-attention for risk prediction from electronic health records using non-stationary kernel approximation. Artif Intell Med 2024;149:102802. [PMID: 38462292 DOI: 10.1016/j.artmed.2024.102802] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/03/2022] [Revised: 09/27/2023] [Accepted: 02/03/2024] [Indexed: 03/12/2024]

Abstract

Effective modeling of patient representation from electronic health records (EHRs) is increasingly becoming a vital research topic. Yet, modeling the non-stationarity in EHR data has received less attention. Most existing studies follow a strong assumption of stationarity in patient representation from EHRs. However, in practice, a patient's visits are irregularly spaced over a relatively long period of time, and disease progression patterns exhibit non-stationarity. Furthermore, the time gaps between patient visits often encapsulate significant domain knowledge, potentially revealing undiscovered patterns that characterize specific medical conditions. To address these challenges, we introduce a new method which combines the self-attention mechanism with non-stationary kernel approximation to capture both contextual information and temporal relationships between patient visits in EHRs. To assess the effectiveness of our proposed approach, we use two real-world EHR datasets, comprising a total of 76,925 patients, for the task of predicting the next diagnosis code for a patient, given their EHR history. The first dataset is a general EHR cohort and consists of 11,451 patients with a total of 3,485 unique diagnosis codes. The second dataset is a disease-specific cohort that includes 65,474 pregnant patients and encompasses a total of 9,782 unique diagnosis codes. Our experimental evaluation involved nine prediction models, categorized into three distinct groups. Group 1 comprises the baselines: original self-attention with positional encoding model, RETAIN model, and LSTM model. Group 2 includes models employing self-attention with stationary kernel approximations, specifically incorporating three variations of Bochner's feature maps. Lastly, Group 3 consists of models utilizing self-attention with non-stationary kernel approximations, including quadratic, cubic, and bi-quadratic polynomials. 
The experimental results demonstrate that non-stationary kernels significantly outperformed baseline methods for NDCG@10 and Hit@10 metrics in both datasets. The performance boost was more substantial in dataset 1 for the NDCG@10 metric. On the other hand, stationary kernels showed significant but smaller gains over baselines and were nearly as effective as non-stationary kernels for Hit@10 in dataset 2. These findings robustly validate the efficacy of employing non-stationary kernels for temporal modeling of EHR data, and emphasize the importance of modeling non-stationary temporal information in healthcare prediction tasks.
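For readers unfamiliar with the two ranking metrics, the sketch below shows one common way to compute Hit@k and NDCG@k for a single patient, assuming binary relevance (a code is either in the next visit or not). The abstract does not give the exact definitions used, so these formulas, the function names, and the example codes are assumptions for illustration.

```python
import math


def hit_at_k(ranked_codes, true_codes, k=10):
    """1.0 if any true next-visit code appears in the top-k predictions, else 0.0."""
    return 1.0 if any(code in true_codes for code in ranked_codes[:k]) else 0.0


def ndcg_at_k(ranked_codes, true_codes, k=10):
    """Normalized discounted cumulative gain with binary relevance."""
    # Each correct code at 0-based rank i contributes 1 / log2(i + 2).
    dcg = sum(
        1.0 / math.log2(i + 2)
        for i, code in enumerate(ranked_codes[:k])
        if code in true_codes
    )
    # Best achievable DCG: every true code ranked at the top.
    ideal = sum(1.0 / math.log2(i + 2) for i in range(min(k, len(true_codes))))
    return dcg / ideal if ideal > 0 else 0.0


# Hypothetical model ranking and true next-visit diagnosis codes.
ranking = ["E11", "I10", "O24", "J45"]
hit = hit_at_k(ranking, {"I10"})
ndcg_second = ndcg_at_k(ranking, {"I10"})  # true code ranked second
ndcg_first = ndcg_at_k(ranking, {"E11"})   # true code ranked first
print(hit, ndcg_second, ndcg_first)
```

Hit@k only records whether a correct code made the top k, while NDCG@k also rewards placing it higher in the list, which is one reason the two metrics can show gains of different sizes.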

Ayden MA, Yuksel ME, Yuksel Erdem SE. A two-stream deep model for automated ICD-9 code prediction in an intensive care unit. Heliyon 2024;10:e25960. [PMID: 38375292 PMCID: PMC10875443 DOI: 10.1016/j.heliyon.2024.e25960] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/09/2024] [Revised: 01/30/2024] [Accepted: 02/05/2024] [Indexed: 02/21/2024] Open

Bernburg M, Tell A, Groneberg DA, Mache S. Digital stressors and resources perceived by emergency physicians and associations to their digital stress perception, mental health, job satisfaction and work engagement. BMC Emerg Med 2024;24:31. [PMID: 38413900 PMCID: PMC10900642 DOI: 10.1186/s12873-024-00950-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/01/2023] [Accepted: 02/08/2024] [Indexed: 02/29/2024] Open

Abstract

BACKGROUND

Digital technologies are increasingly being integrated into healthcare settings, including emergency departments, with the potential to improve efficiency and patient care. Although digitalisation promises many benefits, the use of digital technologies can also introduce new stressors and challenges among medical staff, which may result in the development of various negative work and health outcomes. Therefore, this study aims to identify existing digital stressors and resources among emergency physicians, examine associations with various work- and health-related parameters, and finally identify the potential need for preventive measures.

METHODS

In this quantitative cross-sectional study, an online questionnaire was used to examine the relationship between digital stressors (technostress creators), digital resources (technostress inhibitors), technostress perception as well as mental health, job satisfaction and work engagement among 204 physicians working in German emergency medicine departments. Data collection lasted from December 2022 to April 2023. Validated scales were used for the questionnaire (e.g. the "Technostress" scale and the Copenhagen Psychosocial Questionnaire (COPSOQ)). Descriptive and multiple regression analyses were run to test explorative assumptions.

RESULTS

The study found medium levels of technostress perception among the participating emergency physicians as well as low levels of persisting technostress inhibitors. The queried physicians on average reported medium levels of exhaustion symptoms, high levels of work engagement and job satisfaction. Significant associations between digital stressors and work- as well as health-related outcomes were analyzed.

CONCLUSION

This study provides a preliminary assessment of the persistence of digital stressors, digital resources and technostress levels, and their potential impact on relevant health and work-related outcomes, among physicians working in German emergency departments. Understanding and mitigating these stressors is essential to promote the well-being of physicians and ensure optimal patient care. As digitisation processes will continue to increase, the need for preventive support measures in dealing with technology stressors is obvious and should be expanded accordingly in the clinics. By integrating such support into everyday hospital life, medical staff in emergency departments can better focus on patient care and mitigate potential stress factors associated with digital technologies.

Yan C, Ong HH, Grabowska ME, Krantz MS, Su WC, Dickson AL, Peterson JF, Feng Q, Roden DM, Stein CM, Kerchberger VE, Malin BA, Wei WQ. Large Language Models Facilitate the Generation of Electronic Health Record Phenotyping Algorithms. medRxiv 2024:2023.12.19.23300230. [PMID: 38196578 PMCID: PMC10775330 DOI: 10.1101/2023.12.19.23300230] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 01/11/2024]

Abstract

Objectives

Phenotyping is a core task in observational health research utilizing electronic health records (EHRs). Developing an accurate algorithm demands substantial input from domain experts, involving extensive literature review and evidence synthesis. This burdensome process limits scalability and delays knowledge discovery. We investigate the potential for leveraging large language models (LLMs) to enhance the efficiency of EHR phenotyping by generating high-quality algorithm drafts.

Materials and Methods

We prompted four LLMs (GPT-4 and GPT-3.5 of ChatGPT, Claude 2, and Bard) in October 2023, asking them to generate executable phenotyping algorithms in the form of SQL queries adhering to a common data model (CDM) for three phenotypes (i.e., type 2 diabetes mellitus, dementia, and hypothyroidism). Three phenotyping experts evaluated the returned algorithms across several critical metrics. We further implemented the top-rated algorithms and compared them against clinician-validated phenotyping algorithms from the Electronic Medical Records and Genomics (eMERGE) network.

Results

GPT-4 and GPT-3.5 exhibited significantly higher overall expert evaluation scores in instruction following, algorithmic logic, and SQL executability, when compared to Claude 2 and Bard. Although GPT-4 and GPT-3.5 effectively identified relevant clinical concepts, they exhibited immature capability in organizing phenotyping criteria with the proper logic, leading to phenotyping algorithms that were either excessively restrictive (with low recall) or overly broad (with low positive predictive values).

Conclusion

GPT versions 3.5 and 4 are capable of drafting phenotyping algorithms by identifying relevant clinical criteria aligned with a CDM. However, expertise in informatics and clinical experience is still required to assess and further refine generated algorithms.
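The trade-off described in the results, where criteria combined too strictly lose recall and criteria combined too loosely lose positive predictive value, can be shown with a toy example. This is a minimal sketch with hypothetical patient IDs and two made-up criteria (a diagnosis code and a medication record); it is not one of the evaluated algorithms and does not use the eMERGE definitions.

```python
def recall_and_ppv(selected, reference_cases):
    """Recall and positive predictive value of a selected cohort vs. reference cases."""
    true_positives = len(selected & reference_cases)
    recall = true_positives / len(reference_cases) if reference_cases else 0.0
    ppv = true_positives / len(selected) if selected else 0.0
    return recall, ppv


# Hypothetical patient IDs meeting each criterion.
has_diagnosis_code = {1, 2, 3, 4}
has_medication = {3, 4, 5, 6}
reference_cases = {1, 2, 3, 4, 5}  # stand-in for a clinician-validated cohort

# Requiring both criteria is restrictive; accepting either is broad.
restrictive = has_diagnosis_code & has_medication
broad = has_diagnosis_code | has_medication

restrictive_recall, restrictive_ppv = recall_and_ppv(restrictive, reference_cases)
broad_recall, broad_ppv = recall_and_ppv(broad, reference_cases)
print(f"AND logic: recall={restrictive_recall:.2f}, PPV={restrictive_ppv:.2f}")
print(f"OR logic:  recall={broad_recall:.2f}, PPV={broad_ppv:.2f}")
```

The same criteria give very different cohorts depending only on how they are combined, which is consistent with the finding that identifying the right clinical concepts is not sufficient without the proper logic.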
