1
|
Thayer DS, Mumtaz S, Elmessary MA, Scanlon I, Zinnurov A, Coldea AI, Scanlon J, Chapman M, Curcin V, John A, DelPozo-Banos M, Davies H, Karwath A, Gkoutos GV, Fitzpatrick NK, Quint JK, Varma S, Milner C, Oliveira C, Parkinson H, Denaxas S, Hemingway H, Jefferson E. Creating a next-generation phenotype library: the health data research UK Phenotype Library. JAMIA Open 2024; 7:ooae049. [PMID: 38895652 PMCID: PMC11182945 DOI: 10.1093/jamiaopen/ooae049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2023] [Revised: 02/12/2024] [Accepted: 05/20/2024] [Indexed: 06/21/2024] Open
Abstract
Objective To enable reproducible research at scale by creating a platform that enables health data users to find, access, curate, and re-use electronic health record phenotyping algorithms. Materials and Methods We undertook a structured approach to identifying requirements for a phenotype algorithm platform by engaging with key stakeholders. User experience analysis was used to inform the design, which we implemented as a web application featuring a novel metadata standard for defining phenotyping algorithms, access via Application Programming Interface (API), support for computable data flows, and version control. The application has creation and editing functionality, enabling researchers to submit phenotypes directly. Results We created and launched the Phenotype Library in October 2021. The platform currently hosts 1049 phenotype definitions defined against 40 health data sources and >200K terms across 16 medical ontologies. We present several case studies demonstrating its utility for supporting and enabling research: the library hosts curated phenotype collections for the BREATHE respiratory health research hub and the Adolescent Mental Health Data Platform, and it is supporting the development of an informatics tool to generate clinical evidence for clinical guideline development groups. Discussion This platform makes an impact by being open to all health data users and accepting all appropriate content, as well as implementing key features that have not been widely available, including managing structured metadata, access via an API, and support for computable phenotypes. Conclusions We have created the first openly available, programmatically accessible resource enabling the global health research community to store and manage phenotyping algorithms. Removing barriers to describing, sharing, and computing phenotypes will help unleash the potential benefit of health data for patients and the public.
Collapse
Affiliation(s)
- Daniel S Thayer
- SAIL Databank, Medical School, Swansea University, Swansea, SA2 8PP, United Kingdom
| | - Shahzad Mumtaz
- Health Informatics Centre, School of Medicine, University of Dundee, Dundee, DD1 9SY, United Kingdom
- School of Natural and Computing Sciences, University of Aberdeen, Aberdeen, AB24 3UE, United Kingdom
| | - Muhammad A Elmessary
- SAIL Databank, Medical School, Swansea University, Swansea, SA2 8PP, United Kingdom
| | - Ieuan Scanlon
- SAIL Databank, Medical School, Swansea University, Swansea, SA2 8PP, United Kingdom
| | - Artur Zinnurov
- SAIL Databank, Medical School, Swansea University, Swansea, SA2 8PP, United Kingdom
| | - Alex-Ioan Coldea
- SAIL Databank, Medical School, Swansea University, Swansea, SA2 8PP, United Kingdom
| | - Jack Scanlon
- SAIL Databank, Medical School, Swansea University, Swansea, SA2 8PP, United Kingdom
| | - Martin Chapman
- Department of Population Health Sciences, King’s College London, London, SE1 1UL, United Kingdom
| | - Vasa Curcin
- Department of Population Health Sciences, King’s College London, London, SE1 1UL, United Kingdom
| | - Ann John
- Adolescent Mental Health Data Platform and DATAMIND, Swansea University, Swansea, SA2 8PP, United Kingdom
| | - Marcos DelPozo-Banos
- Adolescent Mental Health Data Platform and DATAMIND, Swansea University, Swansea, SA2 8PP, United Kingdom
| | - Hannah Davies
- SAIL Databank, Medical School, Swansea University, Swansea, SA2 8PP, United Kingdom
| | - Andreas Karwath
- Institute of Cancer and Genomic Sciences, University of Birmingham, Birmingham, B15 2TT, United Kingdom
| | - Georgios V Gkoutos
- Institute of Cancer and Genomic Sciences, University of Birmingham, Birmingham, B15 2TT, United Kingdom
| | - Natalie K Fitzpatrick
- Institute of Health Informatics, University College London, London, NW1 2DA, United Kingdom
| | - Jennifer K Quint
- School of Public Health and National Heart and Lung Institute, Imperial College London, London, W12 0BZ, United Kingdom
| | - Susheel Varma
- Health Data Research United Kingdom, London, NW1 2BE, United Kingdom
| | - Chris Milner
- Health Data Research United Kingdom, London, NW1 2BE, United Kingdom
| | - Carla Oliveira
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Welcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, United Kingdom
| | - Helen Parkinson
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Welcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, United Kingdom
| | - Spiros Denaxas
- Institute of Health Informatics, University College London, London, NW1 2DA, United Kingdom
- University College London Hospitals National Institute of Health Research Biomedical Research Centre, London, NW1 2BU, United Kingdom
- British Heart Foundation Data Science Center, Health Data Research United Kingdom, London, NW1 2BE, United Kingdom
| | - Harry Hemingway
- Institute of Health Informatics, University College London, London, NW1 2DA, United Kingdom
- University College London Hospitals National Institute of Health Research Biomedical Research Centre, London, NW1 2BU, United Kingdom
| | - Emily Jefferson
- Health Informatics Centre, School of Medicine, University of Dundee, Dundee, DD1 9SY, United Kingdom
- Health Data Research United Kingdom, London, NW1 2BE, United Kingdom
| |
Collapse
|
2
|
Mobley AR, Subramanian A, Champsi A, Wang X, Myles P, McGreavy P, Bunting KV, Shukla D, Nirantharakumar K, Kotecha D. Thromboembolic events and vascular dementia in patients with atrial fibrillation and low apparent stroke risk. Nat Med 2024:10.1038/s41591-024-03049-9. [PMID: 38839900 DOI: 10.1038/s41591-024-03049-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2023] [Accepted: 05/07/2024] [Indexed: 06/07/2024]
Abstract
The prevention of thromboembolism in atrial fibrillation (AF) is typically restricted to patients with specific risk factors and ignores outcomes such as vascular dementia. This population-based cohort study used electronic healthcare records from 5,199,994 primary care patients (UK; 2005-2020). A total of 290,525 (5.6%) had a diagnosis of AF and were aged 40-75 years, of which 36,340 had no history of stroke, a low perceived risk of stroke based on clinical risk factors and no oral anticoagulant prescription. Matching was performed for age, sex and region to 117,298 controls without AF. During 5 years median follow-up (831,005 person-years), incident stroke occurred in 3.8% with AF versus 1.5% control (adjusted hazard ratio (HR) 2.06, 95% confidence interval (CI) 1.91-2.21; P < 0.001), arterial thromboembolism 0.3% versus 0.1% (HR 2.39, 95% CI 1.83-3.11; P < 0.001), and all-cause mortality 8.9% versus 5.0% (HR 1.44, 95% CI 1.38-1.50; P < 0.001). AF was associated with all-cause dementia (HR 1.17, 95% CI 1.04-1.32; P = 0.010), driven by vascular dementia (HR 1.68, 95% CI 1.33-2.12; P < 0.001) rather than Alzheimer's disease (HR 0.85, 95% CI 0.70-1.03; P = 0.09). Death and thromboembolic outcomes, including vascular dementia, are substantially increased in patients with AF despite a lack of conventional stroke risk factors.
Collapse
Affiliation(s)
- Alastair R Mobley
- Institute of Cardiovascular Sciences, University of Birmingham, Birmingham, UK
- NIHR Birmingham Biomedical Research Centre, University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
- West Midlands NHS Secure Data Environment, University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
| | | | - Asgher Champsi
- Institute of Cardiovascular Sciences, University of Birmingham, Birmingham, UK
- NIHR Birmingham Biomedical Research Centre, University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
| | - Xiaoxia Wang
- Institute of Cardiovascular Sciences, University of Birmingham, Birmingham, UK
- NIHR Birmingham Biomedical Research Centre, University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
- West Midlands NHS Secure Data Environment, University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
| | - Puja Myles
- Clinical Practice Research Datalink, Medicines and Healthcare products Regulatory Agency, London, UK
| | | | - Karina V Bunting
- Institute of Cardiovascular Sciences, University of Birmingham, Birmingham, UK
- NIHR Birmingham Biomedical Research Centre, University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
| | - David Shukla
- West Midlands NHS Secure Data Environment, University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
- Institute of Applied Health Research, University of Birmingham, Birmingham, UK
- Primary Care Clinical Research, NIHR Clinical Research Network West Midlands, Birmingham, UK
| | - Krishnarajah Nirantharakumar
- NIHR Birmingham Biomedical Research Centre, University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
- West Midlands NHS Secure Data Environment, University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
- Institute of Applied Health Research, University of Birmingham, Birmingham, UK
| | - Dipak Kotecha
- Institute of Cardiovascular Sciences, University of Birmingham, Birmingham, UK.
- NIHR Birmingham Biomedical Research Centre, University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK.
- West Midlands NHS Secure Data Environment, University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK.
| |
Collapse
|
3
|
Henderson AD, Butler-Cole BFC, Tazare J, Tomlinson LA, Marks M, Jit M, Briggs A, Lin LY, Carlile O, Bates C, Parry J, Bacon SCJ, Dillingham I, Dennison WA, Costello RE, Wei Y, Walker AJ, Hulme W, Goldacre B, Mehrkar A, MacKenna B, Herrett E, Eggo RM. Clinical coding of long COVID in primary care 2020-2023 in a cohort of 19 million adults: an OpenSAFELY analysis. EClinicalMedicine 2024; 72:102638. [PMID: 38800803 PMCID: PMC11127160 DOI: 10.1016/j.eclinm.2024.102638] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/20/2023] [Revised: 04/10/2024] [Accepted: 04/29/2024] [Indexed: 05/29/2024] Open
Abstract
Background Long COVID is the patient-coined term for the persistent symptoms of COVID-19 illness for weeks, months or years following the acute infection. There is a large burden of long COVID globally from self-reported data, but the epidemiology, causes and treatments remain poorly understood. Primary care is used to help identify and treat patients with long COVID and therefore Electronic Health Records (EHRs) of past COVID-19 patients could be used to help fill these knowledge gaps. We aimed to describe the incidence and differences in demographic and clinical characteristics in recorded long COVID in primary care records in England. Methods With the approval of NHS England we used routine clinical data from over 19 million adults in England linked to SARS-COV-2 test result, hospitalisation and vaccination data to describe trends in the recording of 16 clinical codes related to long COVID between November 2020 and January 2023. Using OpenSAFELY, we calculated rates per 100,000 person-years and plotted how these changed over time. We compared crude and adjusted (for age, sex, 9 NHS regions of England, and the dominant variant circulating) rates of recorded long COVID in patient records between different key demographic and vaccination characteristics using negative binomial models. Findings We identified a total of 55,465 people recorded to have long COVID over the study period, which included 20,025 diagnoses codes and 35,440 codes for further assessment. The incidence of new long COVID records increased steadily over 2021, and declined over 2022. The overall rate per 100,000 person-years was 177.5 cases in women (95% CI: 175.5-179) and 100.5 in men (99.5-102). The majority of those with a long COVID record did not have a recorded positive SARS-COV-2 test 12 or more weeks before the long COVID record. Interpretation In this descriptive study, EHR recorded long COVID was very low between 2020 and 2023, and incident records of long COVID declined over 2022. Using EHR diagnostic or referral codes unfortunately has major limitations in identifying and ascertaining true cases and timing of long COVID. Funding This research was supported by the National Institute for Health and Care Research (NIHR) (OpenPROMPT: COV-LT2-0073).
Collapse
Affiliation(s)
| | - Ben FC. Butler-Cole
- Bennett Institute for Applied Data Science, Nuffield Department of Primary Care Health Sciences, University of Oxford, OX2 6GG, UK
| | - John Tazare
- London School of Hygiene and Tropical Medicine, Keppel Street, London WC1E 7HT, UK
| | - Laurie A. Tomlinson
- London School of Hygiene and Tropical Medicine, Keppel Street, London WC1E 7HT, UK
| | - Michael Marks
- London School of Hygiene and Tropical Medicine, Keppel Street, London WC1E 7HT, UK
| | - Mark Jit
- London School of Hygiene and Tropical Medicine, Keppel Street, London WC1E 7HT, UK
| | - Andrew Briggs
- London School of Hygiene and Tropical Medicine, Keppel Street, London WC1E 7HT, UK
| | - Liang-Yu Lin
- London School of Hygiene and Tropical Medicine, Keppel Street, London WC1E 7HT, UK
| | - Oliver Carlile
- London School of Hygiene and Tropical Medicine, Keppel Street, London WC1E 7HT, UK
| | - Chris Bates
- TPP, TPP House, 129 Low Lane, Horsforth, Leeds LS18 5PX, UK
| | - John Parry
- TPP, TPP House, 129 Low Lane, Horsforth, Leeds LS18 5PX, UK
| | - Sebastian CJ. Bacon
- Bennett Institute for Applied Data Science, Nuffield Department of Primary Care Health Sciences, University of Oxford, OX2 6GG, UK
| | - Iain Dillingham
- Bennett Institute for Applied Data Science, Nuffield Department of Primary Care Health Sciences, University of Oxford, OX2 6GG, UK
| | | | - Ruth E. Costello
- London School of Hygiene and Tropical Medicine, Keppel Street, London WC1E 7HT, UK
| | - Yinghui Wei
- Centre for Mathematical Sciences, School of Engineering, Computing and Mathematics, University of Plymouth, Plymouth, UK
| | - Alex J. Walker
- Bennett Institute for Applied Data Science, Nuffield Department of Primary Care Health Sciences, University of Oxford, OX2 6GG, UK
| | - William Hulme
- Bennett Institute for Applied Data Science, Nuffield Department of Primary Care Health Sciences, University of Oxford, OX2 6GG, UK
| | - Ben Goldacre
- Bennett Institute for Applied Data Science, Nuffield Department of Primary Care Health Sciences, University of Oxford, OX2 6GG, UK
| | - Amir Mehrkar
- Bennett Institute for Applied Data Science, Nuffield Department of Primary Care Health Sciences, University of Oxford, OX2 6GG, UK
| | - Brian MacKenna
- Bennett Institute for Applied Data Science, Nuffield Department of Primary Care Health Sciences, University of Oxford, OX2 6GG, UK
| | - Emily Herrett
- London School of Hygiene and Tropical Medicine, Keppel Street, London WC1E 7HT, UK
| | - Rosalind M. Eggo
- London School of Hygiene and Tropical Medicine, Keppel Street, London WC1E 7HT, UK
| |
Collapse
|
4
|
Oikonomou EK, Aminorroaya A, Dhingra LS, Partridge C, Velazquez EJ, Desai NR, Krumholz HM, Miller EJ, Khera R. Real-world evaluation of an algorithmic machine-learning-guided testing approach in stable chest pain: a multinational, multicohort study. EUROPEAN HEART JOURNAL. DIGITAL HEALTH 2024; 5:303-313. [PMID: 38774380 PMCID: PMC11104476 DOI: 10.1093/ehjdh/ztae023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/03/2024] [Revised: 02/13/2024] [Accepted: 03/20/2024] [Indexed: 05/24/2024]
Abstract
Aims An algorithmic strategy for anatomical vs. functional testing in suspected coronary artery disease (CAD) (Anatomical vs. Stress teSting decIsion Support Tool; ASSIST) is associated with better outcomes than random selection. However, in the real world, this decision is rarely random. We explored the agreement between a provider-driven vs. simulated algorithmic approach to cardiac testing and its association with outcomes across multinational cohorts. Methods and results In two cohorts of functional vs. anatomical testing in a US hospital health system [Yale; 2013-2023; n = 130 196 (97.0%) vs. n = 4020 (3.0%), respectively], and the UK Biobank [n = 3320 (85.1%) vs. n = 581 (14.9%), respectively], we examined outcomes stratified by agreement between the real-world and ASSIST-recommended strategies. Younger age, female sex, Black race, and diabetes history were independently associated with lower odds of ASSIST-aligned testing. Over a median of 4.9 (interquartile range [IQR]: 2.4-7.1) and 5.4 (IQR: 2.6-8.8) years, referral to the ASSIST-recommended strategy was associated with a lower risk of acute myocardial infarction or death (hazard ratioadjusted: 0.81, 95% confidence interval [CI] 0.77-0.85, P < 0.001 and 0.74 [95% CI 0.60-0.90], P = 0.003, respectively), an effect that remained significant across years, test types, and risk profiles. In post hoc analyses of anatomical-first testing in the Prospective Multicentre Imaging Study for Evaluation of Chest Pain (PROMISE) trial, alignment with ASSIST was independently associated with a 17% and 30% higher risk of detecting CAD in any vessel or the left main artery/proximal left anterior descending coronary artery, respectively. Conclusion In cohorts where historical practices largely favour functional testing, alignment with an algorithmic approach to cardiac testing defined by ASSIST was associated with a lower risk of adverse outcomes. This highlights the potential utility of a data-driven approach in the diagnostic management of CAD.
Collapse
Affiliation(s)
- Evangelos K Oikonomou
- Section of Cardiovascular Medicine, Department of Internal Medicine, Yale School of Medicine, 333 Cedar Street, PO Box 208017, New Haven, 06520-8017 CT, USA
| | - Arya Aminorroaya
- Section of Cardiovascular Medicine, Department of Internal Medicine, Yale School of Medicine, 333 Cedar Street, PO Box 208017, New Haven, 06520-8017 CT, USA
| | - Lovedeep S Dhingra
- Section of Cardiovascular Medicine, Department of Internal Medicine, Yale School of Medicine, 333 Cedar Street, PO Box 208017, New Haven, 06520-8017 CT, USA
| | - Caitlin Partridge
- Yale Center for Clinical Investigation, 2 Church Street South, New Haven, 06519 CT, USA
| | - Eric J Velazquez
- Section of Cardiovascular Medicine, Department of Internal Medicine, Yale School of Medicine, 333 Cedar Street, PO Box 208017, New Haven, 06520-8017 CT, USA
| | - Nihar R Desai
- Section of Cardiovascular Medicine, Department of Internal Medicine, Yale School of Medicine, 333 Cedar Street, PO Box 208017, New Haven, 06520-8017 CT, USA
| | - Harlan M Krumholz
- Section of Cardiovascular Medicine, Department of Internal Medicine, Yale School of Medicine, 333 Cedar Street, PO Box 208017, New Haven, 06520-8017 CT, USA
- Center for Outcomes Research and Evaluation, Yale-New Haven Hospital, 195 Church Street 5th Floor, New Haven, 06510 CT, USA
| | - Edward J Miller
- Section of Cardiovascular Medicine, Department of Internal Medicine, Yale School of Medicine, 333 Cedar Street, PO Box 208017, New Haven, 06520-8017 CT, USA
| | - Rohan Khera
- Section of Cardiovascular Medicine, Department of Internal Medicine, Yale School of Medicine, 333 Cedar Street, PO Box 208017, New Haven, 06520-8017 CT, USA
- Center for Outcomes Research and Evaluation, Yale-New Haven Hospital, 195 Church Street 5th Floor, New Haven, 06510 CT, USA
- Section of Biomedical Informatics and Data Science, Yale School of Medicine, 100 College Street, New Haven, 06511 CT, USA
- Section of Health Informatics, Department of Biostatistics, Yale School of Public Health, 60 College Street, New Haven, 06510 CT, USA
| |
Collapse
|
5
|
Kervezee L, Dashti HS, Pilz LK, Skarke C, Ruben MD. Using routinely collected clinical data for circadian medicine: A review of opportunities and challenges. PLOS DIGITAL HEALTH 2024; 3:e0000511. [PMID: 38781189 PMCID: PMC11115276 DOI: 10.1371/journal.pdig.0000511] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/25/2024]
Abstract
A wealth of data is available from electronic health records (EHR) that are collected as part of routine clinical care in hospitals worldwide. These rich, longitudinal data offer an attractive object of study for the field of circadian medicine, which aims to translate knowledge of circadian rhythms to improve patient health. This narrative review aims to discuss opportunities for EHR in studies of circadian medicine, highlight the methodological challenges, and provide recommendations for using these data to advance the field. In the existing literature, we find that data collected in real-world clinical settings have the potential to shed light on key questions in circadian medicine, including how 24-hour rhythms in clinical features are associated with-or even predictive of-health outcomes, whether the effect of medication or other clinical activities depend on time of day, and how circadian rhythms in physiology may influence clinical reference ranges or sampling protocols. However, optimal use of EHR to advance circadian medicine requires careful consideration of the limitations and sources of bias that are inherent to these data sources. In particular, time of day influences almost every interaction between a patient and the healthcare system, creating operational 24-hour patterns in the data that have little or nothing to do with biology. Addressing these challenges could help to expand the evidence base for the use of EHR in the field of circadian medicine.
Collapse
Affiliation(s)
- Laura Kervezee
- Group of Circadian Medicine, Department of Cell and Chemical Biology, Leiden University Medical Center, Leiden, the Netherlands
| | - Hassan S. Dashti
- Department of Anesthesia, Critical Care and Pain Medicine, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts, United States of America
| | - Luísa K. Pilz
- Department of Anesthesiology and Intensive Care Medicine CCM / CVK, Charité–Universitätsmedizin Berlin, corporate member of Freie Universität Berlin and Humboldt Universität zu Berlin, Berlin, Germany
- ECRC Experimental and Clinical Research Center, Charité–Universitätsmedizin Berlin, corporate member of Freie Universität Berlin and Humboldt Universität zu Berlin, Berlin, Germany
| | - Carsten Skarke
- Institute for Translational Medicine and Therapeutics (ITMAT), University of Pennsylvania Perelman School of Medicine, Philadelphia, Pennsylvania, United States of America
- Chronobiology and Sleep Institute (CSI), University of Pennsylvania Perelman School of Medicine, Philadelphia, Pennsylvania, United States of America
- Department of Medicine, University of Pennsylvania Perelman School of Medicine, Philadelphia, Pennsylvania, United States of America
| | - Marc D. Ruben
- Divisions of Pulmonary and Sleep Medicine and Biomedical Informatics, Cincinnati Children’s Hospital Medical Center, Cincinnati, Ohio, United States of America
| |
Collapse
|
6
|
Massen GM, Stone PW, Kwok HHY, Jenkins G, Allen RJ, Wain LV, Stewart I, Quint JK. Review of codelists used to define hypertension in electronic health records and development of a codelist for research. Open Heart 2024; 11:e002640. [PMID: 38626934 PMCID: PMC11029375 DOI: 10.1136/openhrt-2024-002640] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/15/2024] [Accepted: 04/02/2024] [Indexed: 04/19/2024] Open
Abstract
BACKGROUND AND AIMS Hypertension is a leading risk factor for cardiovascular disease. Electronic health records (EHRs) are routinely collected throughout a person's care, recording all aspects of health status, including current and past conditions, prescriptions and test results. EHRs can be used for epidemiological research. However, there are nuances in the way conditions are recorded using clinical coding; it is important to understand the methods which have been applied to define exposures, covariates and outcomes to enable interpretation of study findings. This study aimed to identify codelists used to define hypertension in studies that use EHRs and generate recommended codelists to support reproducibility and consistency. ELIGIBILITY CRITERIA Studies included populations with hypertension defined within an EHR between January 2010 and August 2023 and were systematically identified using MEDLINE and Embase. A summary of the most frequently used sources and codes is described. Due to an absence of Systematized Nomenclature of Medicine Clinical Terms (SNOMED CT) codelists in the literature, a recommended SNOMED CT codelist was developed to aid consistency and standardisation of hypertension research using EHRs. FINDINGS 375 manuscripts met the study criteria and were eligible for inclusion, and 112 (29.9%) reported codelists. The International Classification of Diseases (ICD) was the most frequently used clinical terminology, 59 manuscripts provided ICD 9 codelists (53%) and 58 included ICD 10 codelists (52%). Informed by commonly used ICD and Read codes, usage recommendations were made. We derived SNOMED CT codelists informed by National Institute for Health and Care Excellence guidelines for hypertension management. It is recommended that these codelists be used to identify hypertension in EHRs using SNOMED CT codes. CONCLUSIONS Less than one-third of hypertension studies using EHRs included their codelists. Transparent methodology for codelist creation is essential for replication and will aid interpretation of study findings. We created SNOMED CT codelists to support and standardise hypertension definitions in EHR studies.
Collapse
Affiliation(s)
| | - Philip W Stone
- School of Public Health, Imperial College London, London, UK
| | - Harley H Y Kwok
- School of Public Health, Imperial College London, London, UK
| | - Gisli Jenkins
- National Heart and Lung Institute, Imperial College London, London, UK
| | - Richard J Allen
- Department of Population Health Sciences, University of Leicester, Leicester, UK
- NIHR Biomedical Research Centre, University of Leicester, Leicester, UK
| | - Louise V Wain
- Department of Population Health Sciences, University of Leicester, Leicester, UK
- NIHR Biomedical Research Centre, University of Leicester, Leicester, UK
| | - Iain Stewart
- National Heart and Lung Institute, Imperial College London, London, UK
| | | |
Collapse
|
7
|
Higgins BE, Leonard-Hawkhead B, Azuara-Blanco A. Quality of Reporting Electronic Health Record Data in Glaucoma: A Systematic Literature Review. Ophthalmol Glaucoma 2024:S2589-4196(24)00064-4. [PMID: 38599318 DOI: 10.1016/j.ogla.2024.04.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2023] [Revised: 03/05/2024] [Accepted: 04/02/2024] [Indexed: 04/12/2024]
Abstract
TOPIC Assessing reporting standards in glaucoma studies utilizing electronic health records (EHR). CLINICAL RELEVANCE Glaucoma's significance, underscored by its status as a leading cause of irreversible blindness worldwide, necessitates reliable research findings. This study evaluates adherence to the CODE-EHR best-practice framework in glaucoma studies using EHR, aiming to improve clinical care and patient outcomes. METHODS A systematic review, following Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines (PROSPERO CRD42023430025), identified relevant studies (January 2022-May 2023) in MEDLINE, EMBASE, CINAHL, and Web of Science. Eligible studies, using EHR data from clinical institutions for glaucoma research, were assessed for study design, participant characteristics, EHR data, and sources. Quality appraisal used the CODE-EHR best-practice framework, focusing on data construction, linkage, fitness for purpose, disease and outcome definitions, analysis, and ethics and governance. RESULTS Of 31 identified studies, predominant EHR sources were hospitals and clinical warehouses. Commonly reported elements included age, gender, glaucoma diagnosis, and intraocular pressure. Only 16% fully adhered to CODE-EHR best-practice framework's minimum standards, with none meeting preferred standards. While statistical analysis and ethical considerations were relatively well-addressed, areas such as EHR data management and study design showed room for improvement. Patient and public involvement, and acknowledgment of data linkage processes, data security, and storage reporting were often missed. CONCLUSION Adherence to CODE-EHR best-practice framework's standards in EHR-based studies of glaucoma can be improved upon. Standardized reporting of EHR data are essential to ensure the reliability of research, facilitating its translation into clinical practice and improving healthcare decision-making for better patient outcomes. FINANCIAL DISCLOSURE(S) Proprietary or commercial disclosure may be found in the Footnotes and Disclosures at the end of this article.
Collapse
Affiliation(s)
- Bethany E Higgins
- Centre for Public Health, Institute of Clinical Science Block A, Royal Victoria Hospital, Belfast, United Kingdom; Optometry and Visual Sciences, School of Health & Psychological Sciences, City, University of London, London, United Kingdom.
| | - Benedict Leonard-Hawkhead
- Centre for Public Health, Institute of Clinical Science Block A, Royal Victoria Hospital, Belfast, United Kingdom.
| | - Augusto Azuara-Blanco
- Centre for Public Health, Institute of Clinical Science Block A, Royal Victoria Hospital, Belfast, United Kingdom.
| |
Collapse
|
8
|
Armoundas AA, Narayan SM, Arnett DK, Spector-Bagdady K, Bennett DA, Celi LA, Friedman PA, Gollob MH, Hall JL, Kwitek AE, Lett E, Menon BK, Sheehan KA, Al-Zaiti SS. Use of Artificial Intelligence in Improving Outcomes in Heart Disease: A Scientific Statement From the American Heart Association. Circulation 2024; 149:e1028-e1050. [PMID: 38415358 PMCID: PMC11042786 DOI: 10.1161/cir.0000000000001201] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/29/2024]
Abstract
A major focus of academia, industry, and global governmental agencies is to develop and apply artificial intelligence and other advanced analytical tools to transform health care delivery. The American Heart Association supports the creation of tools and services that would further the science and practice of precision medicine by enabling more precise approaches to cardiovascular and stroke research, prevention, and care of individuals and populations. Nevertheless, several challenges exist, and few artificial intelligence tools have been shown to improve cardiovascular and stroke care sufficiently to be widely adopted. This scientific statement outlines the current state of the art on the use of artificial intelligence algorithms and data science in the diagnosis, classification, and treatment of cardiovascular disease. It also sets out to advance this mission, focusing on how digital tools and, in particular, artificial intelligence may provide clinical and mechanistic insights, address bias in clinical studies, and facilitate education and implementation science to improve cardiovascular and stroke outcomes. Last, a key objective of this scientific statement is to further the field by identifying best practices, gaps, and challenges for interested stakeholders.
Collapse
|
9
|
Boßelmann CM, Ivaniuk A, St John M, Taylor SC, Krishnaswamy G, Milinovich A, Leu C, Gupta A, Pestana-Knight EM, Najm I, Lal D. Healthcare utilization and clinical characteristics of genetic epilepsy in electronic health records. Brain Commun 2024; 6:fcae090. [PMID: 38524155 PMCID: PMC10959483 DOI: 10.1093/braincomms/fcae090] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2023] [Revised: 02/05/2024] [Accepted: 03/12/2024] [Indexed: 03/26/2024] Open
Abstract
Understanding the clinical characteristics and medical treatment of individuals affected by genetic epilepsies is instrumental in guiding selection for genetic testing, defining the phenotype range of these rare disorders, optimizing patient care pathways and pinpointing unaddressed medical need by quantifying healthcare resource utilization. To date, a matched longitudinal cohort study encompassing the entire spectrum of clinical characteristics and medical treatment from childhood through adolescence has not been performed. We identified individuals with genetic and non-genetic epilepsies and onset at ages 0-5 years by linkage across the Cleveland Clinic Health System. We used natural language processing to extract medical terms and procedures from longitudinal electronic health records and tested for cross-sectional and temporal associations with genetic epilepsy. We implemented a two-stage design: in the discovery cohort, individuals were stratified as being 'likely genetic' or 'non-genetic' by a natural language processing algorithm, and controls did not receive genetic testing. The validation cohort consisted of cases with genetic epilepsy confirmed by manual chart review and an independent set of controls who received negative genetic testing. The discovery and validation cohorts consisted of 503 and 344 individuals with genetic epilepsy and matched controls, respectively. The median age at the first encounter was 0.1 years and 7.9 years at the last encounter, and the mean duration of follow-up was 8.2 years. We extracted 188,295 Unified Medical Language System annotations for statistical analysis across 9659 encounters. Individuals with genetic epilepsy received an earlier epilepsy diagnosis and had more frequent and complex encounters with the healthcare system. Notably, the highest enrichment of encounters compared with the non-genetic groups was found during the transition from paediatric to adult care. Our computational approach could validate established comorbidities of genetic epilepsies, such as behavioural abnormality and intellectual disability. We also revealed novel associations for genitourinary abnormalities (odds ratio 1.91, 95% confidence interval: 1.66-2.20, P = 6.16 × 10-19) linked to a spectrum of underrecognized epilepsy-associated genetic disorders. This case-control study leveraged real-world data to identify novel features associated with the likelihood of a genetic aetiology and quantified the healthcare utilization of genetic epilepsies compared with matched controls. Our results strongly recommend early genetic testing to stratify individuals into specialized care paths, thus improving the clinical management of people with genetic epilepsies.
Collapse
Affiliation(s)
- Christian M Boßelmann
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44195, USA
- Epilepsy Center, Neurological Institute, Cleveland Clinic, Cleveland, OH 44195, USA
| | - Alina Ivaniuk
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44195, USA
- Epilepsy Center, Neurological Institute, Cleveland Clinic, Cleveland, OH 44195, USA
| | - Mark St John
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44195, USA
- Epilepsy Center, Neurological Institute, Cleveland Clinic, Cleveland, OH 44195, USA
| | - Sara C Taylor
- Neurological Institute, Cleveland Clinic, Cleveland, OH 44195, USA
| | | | - Alex Milinovich
- Department of Quantitative Health Sciences, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44195, USA
| | - Costin Leu
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44195, USA
- Department of Clinical and Experimental Epilepsy, Institute of Neurology, University College London, London, WC1N 3BG, UK
- Department of Neurology, The University of Texas Health Science Center at Houston, Houston, TX 77030, USA
- Center for Neurogenetics, The University of Texas Health Science Center at Houston, Houston, TX 77030, USA
| | - Ajay Gupta
- Epilepsy Center, Neurological Institute, Cleveland Clinic, Cleveland, OH 44195, USA
| | | | - Imad Najm
- Epilepsy Center, Neurological Institute, Cleveland Clinic, Cleveland, OH 44195, USA
| | - Dennis Lal
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44195, USA
- Epilepsy Center, Neurological Institute, Cleveland Clinic, Cleveland, OH 44195, USA
- Department of Neurology, The University of Texas Health Science Center at Houston, Houston, TX 77030, USA
- Center for Neurogenetics, The University of Texas Health Science Center at Houston, Houston, TX 77030, USA
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and M.I.T., Cambridge, MA 02142, USA
- Cologne Center for Genomics (CCG), University of Cologne, 50931 Cologne, Germany
| |
Collapse
|
10
|
Sulonen S, Leinonen S, Lehtonen E, Hujanen P, Vaajanen A, Syvänen U, Hemelings R, Stalmans I, Tuulonen A, Uusitalo-Jarvinen H. A prototype protocol for evaluating the real-world data set using a structured electronic health record in glaucoma. Acta Ophthalmol 2024; 102:216-227. [PMID: 37753831 DOI: 10.1111/aos.15763] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2023] [Revised: 08/09/2023] [Accepted: 08/29/2023] [Indexed: 09/28/2023]
Abstract
PURPOSE As the first step in monitoring and evaluating day-to-day glaucoma care, this study reports all real-world data recorded during the first full year after the implementation of a prototype for glaucoma-specific structured electronic healthcare record (EHR). METHODS In 2019, 4618 patients visited Tays Medical Glaucoma Clinic at Tays Eye Centre, Tampere University Hospital, Finland, that serves a population of 0.53 M. Patient data were entered into a glaucoma-specific EHR by trained nurses to be checked by glaucoma specialists. Tays Eye Centre follows the Finnish Current Care Guideline for glaucoma in which glaucoma is defined using a '2 out of 3' rule, that is, ≥2 findings evaluated as glaucomatous in optic nerve head (ONH), retinal nerve fibre layer (RNFL) and visual field (VF). RESULTS The clinical evaluations of ONH, RNFL and VF were recorded in 95%-100% of all eyes. ONH was evaluated as glaucomatous more often (44%) than RNFL (33%) and VF tests (30%). Progressive changes in any of the three tests were recorded in 35% of the '≥2/3 glaucoma group' compared to 2%-9% in the other groups. The mean IOP at visit was 15 mmHg. The mean target IOP was 17 mmHg, and it was recorded in 94% of eyes. CONCLUSION The developed structured data presentation enables comparisons between different population-based real-world glaucoma data sets and glaucoma clinics. Compared to a data set from the UK, the proportion of glaucoma suspicion-related visits was smaller in Tays Eye Centre and test intervals were longer.
Collapse
Affiliation(s)
- Sakari Sulonen
- Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland
| | - Sanna Leinonen
- Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland
- Tays Eye Centre, Tampere University Hospital, Tampere, Finland
| | - Eemil Lehtonen
- Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland
| | - Pekko Hujanen
- Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland
- Tays Eye Centre, Tampere University Hospital, Tampere, Finland
| | - Anu Vaajanen
- Eye and Vision Research SILK, Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland
| | - Ulla Syvänen
- Tays Eye Centre, Tampere University Hospital, Tampere, Finland
| | - Ruben Hemelings
- Department of Neurosciences, Research Group Ophthalmology, KU Leuven, Leuven, Belgium
- Singapore Eye Research Institute, Singapore National Eye Centre, Singapore, Singapore
- SERI-NTU Advanced Ocular Engineering (STANCE) programme, Singapore, Singapore
| | - Ingeborg Stalmans
- Department of Neurosciences, Research Group Ophthalmology, KU Leuven, Leuven, Belgium
- Ophthalmology Department, UZ Leuven, Leuven, Belgium
| | - Anja Tuulonen
- Tays Eye Centre, Tampere University Hospital, Tampere, Finland
| | - Hannele Uusitalo-Jarvinen
- Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland
- Tays Eye Centre, Tampere University Hospital, Tampere, Finland
| |
Collapse
|
11
|
Nakao YM, Nadarajah R, Shuweihdi F, Nakao K, Fuat A, Moore J, Bates C, Wu J, Gale C. Predicting incident heart failure from population-based nationwide electronic health records: protocol for a model development and validation study. BMJ Open 2024; 14:e073455. [PMID: 38253453 PMCID: PMC10806764 DOI: 10.1136/bmjopen-2023-073455] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Accepted: 06/29/2023] [Indexed: 01/24/2024] Open
Abstract
INTRODUCTION Heart failure (HF) is increasingly common and associated with excess morbidity, mortality, and healthcare costs. Treatment of HF can alter the disease trajectory and reduce clinical events in HF. However, many cases of HF remain undetected until presentation with more advanced symptoms, often requiring hospitalisation. Predicting incident HF is challenging and statistical models are limited by performance and scalability in routine clinical practice. An HF prediction model implementable in nationwide electronic health records (EHRs) could enable targeted diagnostics to enable earlier identification of HF. METHODS AND ANALYSIS We will investigate a range of development techniques (including logistic regression and supervised machine learning methods) on routinely collected primary care EHRs to predict risk of new-onset HF over 1, 5 and 10 years prediction horizons. The Clinical Practice Research Datalink (CPRD)-GOLD dataset will be used for derivation (training and testing) and the CPRD-AURUM dataset for external validation. Both comprise large cohorts of patients, representative of the population of England in terms of age, sex and ethnicity. Primary care records are linked at patient level to secondary care and mortality data. The performance of the prediction model will be assessed by discrimination, calibration and clinical utility. We will only use variables routinely accessible in primary care. ETHICS AND DISSEMINATION Permissions for CPRD-GOLD and CPRD-AURUM datasets were obtained from CPRD (ref no: 21_000324). The CPRD ethical approval committee approved the study. The results will be submitted as a research paper for publication to a peer-reviewed journal and presented at peer-reviewed conferences. TRIAL REGISTRATION DETAILS The study was registered on Clinical Trials.gov (NCT05756127). A systematic review for the project was registered on PROSPERO (registration number: CRD42022380892).
Collapse
Affiliation(s)
- Yoko M Nakao
- Leeds Institute of Cardiovascular and Metabolic Medicine, University of Leeds, Leeds, UK
- Leeds Institute for Data Analytics, University of Leeds, Leeds, UK
- Department of Pharmacoepidemiology, Graduate School of Medicine and Public Health, Kyoto University, Kyoto, Japan
| | - Ramesh Nadarajah
- Leeds Institute of Cardiovascular and Metabolic Medicine, University of Leeds, Leeds, UK
- Leeds Institute for Data Analytics, University of Leeds, Leeds, UK
- Department of Cardiology, Leeds Teaching Hospital NHS Trust, Leeds, UK
| | - Farag Shuweihdi
- Leeds Institute of Cardiovascular and Metabolic Medicine, University of Leeds, Leeds, UK
| | - Kazuhiro Nakao
- Leeds Institute of Cardiovascular and Metabolic Medicine, University of Leeds, Leeds, UK
- Leeds Institute for Data Analytics, University of Leeds, Leeds, UK
- Department of Cardiovascular Medicine, National Cerebral and Cardiovascular Center, Suita, Japan
| | - Ahmet Fuat
- Carmel Medical Practice, Darlington & School of Medicine, Pharmacy and Health, Durham University, Darham, UK
| | - Jim Moore
- Stroke Road Surgery, Bishop's Cleeve, Cheltenham, UK
| | | | - Jianhua Wu
- Department of Biostatistics and Health Data Science, Queen Mary University of London, London, UK
| | - Chris Gale
- Leeds Institute of Cardiovascular and Metabolic Medicine, University of Leeds, Leeds, UK
- Leeds Institute for Data Analytics, University of Leeds, Leeds, UK
- Department of Cardiology, Leeds Teaching Hospital NHS Trust, Leeds, UK
| |
Collapse
|
12
|
Hwang TS, Thomas M, Hribar M, Chen A, White E. The Impact of Documentation Workflow on the Accuracy of the Coded Diagnoses in the Electronic Health Record. OPHTHALMOLOGY SCIENCE 2024; 4:100409. [PMID: 38054107 PMCID: PMC10694743 DOI: 10.1016/j.xops.2023.100409] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/01/2023] [Revised: 09/15/2023] [Accepted: 09/29/2023] [Indexed: 12/07/2023]
Abstract
Objective To determine the impact of documentation workflow on the accuracy of coded diagnoses in electronic health records (EHRs). Design Cross-sectional study. Participants All patients who completed visits at the Casey Eye Institute Retina Division faculty clinic between April 7, 2022 and April 13, 2022. Main Outcome Measures Agreement between coded diagnoses and clinical notes. Methods We assessed the rate of agreement between the diagnoses in the clinical notes and the coded diagnosis in the EHR using manual review and examined the impact of the documentation workflow on the rate of agreement in an academic retina practice. Results In 202 visits by 8 physicians, 78% (range, 22%-100%) had an agreement between the coded diagnoses and the clinical notes. When physicians integrated the diagnosis code entry and note composition, the rate of agreement was 87.9% (range, 62%-100%). For those who entered the diagnosis codes separately from writing notes, the agreement was 44.4% (22%-50%, P < 0.0001). Conclusion The visit-specific agreement between the coded diagnosis and the progress note can vary widely by workflow. The workflow and EHR design may be an important part of understanding and improving the quality of EHR data. Financial Disclosures Proprietary or commercial disclosure may be found in the Footnotes and Disclosures at the end of this article.
Collapse
Affiliation(s)
- Thomas S. Hwang
- Casey Eye Institute, Oregon Health and Science University, Portland, OR
| | - Merina Thomas
- Casey Eye Institute, Oregon Health and Science University, Portland, OR
| | - Michelle Hribar
- Casey Eye Institute, Oregon Health and Science University, Portland, OR
- Department of Medical Informatics and Clinical Epidemiology, Oregon Health and Science University, Portland, OR
| | - Aiyin Chen
- Casey Eye Institute, Oregon Health and Science University, Portland, OR
| | - Elizabeth White
- Casey Eye Institute, Oregon Health and Science University, Portland, OR
| |
Collapse
|
13
|
Jiang Y, Wang C, Zhou S. Artificial intelligence-based risk stratification, accurate diagnosis and treatment prediction in gynecologic oncology. Semin Cancer Biol 2023; 96:82-99. [PMID: 37783319 DOI: 10.1016/j.semcancer.2023.09.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2022] [Revised: 08/27/2023] [Accepted: 09/25/2023] [Indexed: 10/04/2023]
Abstract
As data-driven science, artificial intelligence (AI) has paved a promising path toward an evolving health system teeming with thrilling opportunities for precision oncology. Notwithstanding the tremendous success of oncological AI in such fields as lung carcinoma, breast tumor and brain malignancy, less attention has been devoted to investigating the influence of AI on gynecologic oncology. Hereby, this review sheds light on the ever-increasing contribution of state-of-the-art AI techniques to the refined risk stratification and whole-course management of patients with gynecologic tumors, in particular, cervical, ovarian and endometrial cancer, centering on information and features extracted from clinical data (electronic health records), cancer imaging including radiological imaging, colposcopic images, cytological and histopathological digital images, and molecular profiling (genomics, transcriptomics, metabolomics and so forth). However, there are still noteworthy challenges beyond performance validation. Thus, this work further describes the limitations and challenges faced in the real-word implementation of AI models, as well as potential solutions to address these issues.
Collapse
Affiliation(s)
- Yuting Jiang
- Department of Obstetrics and Gynecology, Key Laboratory of Birth Defects and Related Diseases of Women and Children of MOE and State Key Laboratory of Biotherapy, West China Second Hospital, Sichuan University and Collaborative Innovation Center, Chengdu, Sichuan 610041, China; Department of Pulmonary and Critical Care Medicine, State Key Laboratory of Respiratory Health and Multimorbidity, Frontiers Science Center for Disease-related Molecular Network, West China Hospital, Sichuan University, Chengdu, Sichuan 610041, China
| | - Chengdi Wang
- Department of Obstetrics and Gynecology, Key Laboratory of Birth Defects and Related Diseases of Women and Children of MOE and State Key Laboratory of Biotherapy, West China Second Hospital, Sichuan University and Collaborative Innovation Center, Chengdu, Sichuan 610041, China; Department of Pulmonary and Critical Care Medicine, State Key Laboratory of Respiratory Health and Multimorbidity, Frontiers Science Center for Disease-related Molecular Network, West China Hospital, Sichuan University, Chengdu, Sichuan 610041, China
| | - Shengtao Zhou
- Department of Obstetrics and Gynecology, Key Laboratory of Birth Defects and Related Diseases of Women and Children of MOE and State Key Laboratory of Biotherapy, West China Second Hospital, Sichuan University and Collaborative Innovation Center, Chengdu, Sichuan 610041, China; Department of Pulmonary and Critical Care Medicine, State Key Laboratory of Respiratory Health and Multimorbidity, Frontiers Science Center for Disease-related Molecular Network, West China Hospital, Sichuan University, Chengdu, Sichuan 610041, China.
| |
Collapse
|
14
|
Asselbergs FW, Kotecha D. The CODE-EHR global framework: lifting the veil on health record data. Eur Heart J 2023; 44:3398-3400. [PMID: 37408471 DOI: 10.1093/eurheartj/ehad424] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 07/07/2023] Open
Affiliation(s)
- Folkert W Asselbergs
- Amsterdam University Medical Center, Department of Cardiology, University of Amsterdam, 9 Meibergdreef, Amsterdam, 1105 AZ, The Netherlands
- Health Data Research UK and Institute of Health Informatics, University College London, 222 Euston Rd., London NW1 2DA, UK
| | - Dipak Kotecha
- Institute of Cardiovascular Sciences, University of Birmingham, Birmingham, UK
- Health Data Research UK Midlands, University Hospitals Birmingham NHS Foundation Trust, Mindelsohn Way, Edgbaston, Birmingham B15 2TH, UK
| |
Collapse
|
15
|
Dhingra LS, Shen M, Mangla A, Khera R. Cardiovascular Care Innovation through Data-Driven Discoveries in the Electronic Health Record. Am J Cardiol 2023; 203:136-148. [PMID: 37499593 PMCID: PMC10865722 DOI: 10.1016/j.amjcard.2023.06.104] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Revised: 05/24/2023] [Accepted: 06/29/2023] [Indexed: 07/29/2023]
Abstract
The electronic health record (EHR) represents a rich source of patient information, increasingly being leveraged for cardiovascular research. Although its primary use remains the seamless delivery of health care, the various longitudinally aggregated structured and unstructured data elements for each patient within the EHR can define the computational phenotypes of disease and care signatures and their association with outcomes. Although structured data elements, such as demographic characteristics, laboratory measurements, problem lists, and medications, are easily extracted, unstructured data are underused. The latter include free text in clinical narratives, documentation of procedures, and reports of imaging and pathology. Rapid scaling up of data storage and rapid innovation in natural language processing and computer vision can power insights from unstructured data streams. However, despite an array of opportunities for research using the EHR, specific expertise is necessary to adequately address confidentiality, accuracy, completeness, and heterogeneity challenges in EHR-based research. These often require methodological innovation and best practices to design and conduct successful research studies. Our review discusses these challenges and their proposed solutions. In addition, we highlight the ongoing innovations in federated learning in the EHR through a greater focus on common data models and discuss ongoing work that defines such an approach to large-scale, multicenter, federated studies. Such parallel improvements in technology and research methods enable innovative care and optimization of patient outcomes.
Collapse
Affiliation(s)
| | - Miles Shen
- Section of Cardiovascular Medicine, Department of Internal Medicine; Department of Internal Medicine
| | - Anjali Mangla
- Section of Cardiovascular Medicine, Department of Internal Medicine; Department of Neuroscience, Yale School of Medicine, New Haven, Connecticut
| | - Rohan Khera
- Section of Cardiovascular Medicine, Department of Internal Medicine; Center for Outcomes Research and Evaluation (CORE), Yale New Haven Hospital, New Haven, Connecticut; Section of Health Informatics, Department of Biostatistics, Yale School of Public Health, New Haven, Connecticut.; Section of Biomedical Informatics and Data Science, Yale School of Medicine, New Haven, Connecticut.
| |
Collapse
|
16
|
Nakao YM, Nakao K, Nadarajah R, Banerjee A, Fonarow GC, Petrie MC, Rahimi K, Wu J, Gale CP. Prognosis, characteristics, and provision of care for patients with the unspecified heart failure electronic health record phenotype: a population-based linked cohort study of 95262 individuals. EClinicalMedicine 2023; 63:102164. [PMID: 37662516 PMCID: PMC10474358 DOI: 10.1016/j.eclinm.2023.102164] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/16/2023] [Revised: 07/26/2023] [Accepted: 07/31/2023] [Indexed: 09/05/2023] Open
Abstract
Background Whether the accuracy of the phenotype ascribed to patients in electronic health records (EHRs) is associated with variation in prognosis and care provision is unknown. We investigated this for heart failure (HF, characterised as HF with preserved ejection fraction [HFpEF], HF with reduced ejection fraction [HFrEF] and unspecified HF). Methods We included individuals aged 16 years and older with a new diagnosis of HF between January 2, 1998 and February 28, 2022 from linked primary and secondary care records in the Clinical Practice Research Datalink in England. We investigated the provision of guideline-recommended diagnostic investigations and pharmacological treatments. The primary outcome was a composite of HF hospitalisation or all-cause death, and secondary outcomes were time to HF hospitalisation, all-cause death and death from cardiovascular causes. We used Kaplan-Meier curves and log rank tests to compare survival across HF phenotypes and adjusted for potential confounders in Cox proportional hazards regression analyses. Findings Of a cohort of 95,262 individuals, 1271 (1.3%) were recorded as having HFpEF, 10,793 (11.3%) as HFrEF and 83,198 (87.3%) as unspecified HF. Individuals recorded as unspecified HF were older with a higher prevalence of dementia. Unspecified HF, compared to patients with a recorded HF phenotype, were less likely to receive specialist assessment, echocardiography or natriuretic peptide testing in the peri-diagnostic period, or receive angiotensin-converting enzyme inhibitors, beta blockers or mineralocorticoid receptor antagonists up to 12 months after diagnosis (risk ratios compared to HFrEF, 0.64, 95% CI 0.63-0.64; 0.59, 0.58-0.60; 0.57, 0.55-0.59; respectively) and had significantly worse outcomes (adjusted hazard ratios compared to HFrEF, HF hospitalisation and death 1.66, 95% CI 1.59-1.74; all-cause mortality 2.00, 1.90-2.10; cardiovascular death 1.77, 1.65-1.90). Interpretation Our findings suggested that absence of specification of HF phenotype in routine EHRs is inversely associated with clinical investigations, treatments and survival, representing an actionable target to mitigate prognostic and health resource burden. Funding Japan Research Foundation for Healthy Aging and British Heart Foundation.
Collapse
Affiliation(s)
- Yoko M. Nakao
- Leeds Institute of Cardiovascular and Metabolic Medicine, University of Leeds, Leeds, UK
- Leeds Institute for Data Analytics, University of Leeds, Leeds, UK
- Department of Pharmacoepidemiology, Graduate School of Medicine and Public Health, Kyoto University, Kyoto, Japan
| | - Kazuhiro Nakao
- Leeds Institute of Cardiovascular and Metabolic Medicine, University of Leeds, Leeds, UK
- Leeds Institute for Data Analytics, University of Leeds, Leeds, UK
- Department of Cardiovascular Medicine, National Cerebral and Cardiovascular Center, Suita, Japan
| | - Ramesh Nadarajah
- Leeds Institute of Cardiovascular and Metabolic Medicine, University of Leeds, Leeds, UK
- Leeds Institute for Data Analytics, University of Leeds, Leeds, UK
- Department of Cardiology, Leeds Teaching Hospitals NHS Trust, Leeds, UK
| | - Amitava Banerjee
- Institute of Health Informatics, University College London, London, UK
- Department of Cardiology, Barts Health NHS Trust, London, UK
| | - Gregg C. Fonarow
- Division of Cardiology, Department of Medicine, University of California at Los Angeles, Los Angeles, USA
| | - Mark C. Petrie
- Institute of Cardiovascular and Medical Sciences, University of Glasgow, Glasgow, UK
| | - Kazem Rahimi
- Nuffield Department of Women's and Reproductive Health, University of Oxford, Oxford, UK
- National Institute of Health Research Oxford Biomedical Research Centre, Oxford University Hospitals NHS Foundation Trust, Oxford, UK
- Deep Medicine, Oxford Martin School, University of Oxford, Oxford, UK
- Oxford University Hospitals NHS Foundation Trust, Oxford, UK
| | - Jianhua Wu
- Wolfson Institute of Population Health, Queen Mary University of London, London, UK
| | - Chris P. Gale
- Leeds Institute of Cardiovascular and Metabolic Medicine, University of Leeds, Leeds, UK
- Leeds Institute for Data Analytics, University of Leeds, Leeds, UK
- Department of Cardiology, Leeds Teaching Hospitals NHS Trust, Leeds, UK
| |
Collapse
|
17
|
Klement W, El Emam K. Consolidated Reporting Guidelines for Prognostic and Diagnostic Machine Learning Modeling Studies: Development and Validation. J Med Internet Res 2023; 25:e48763. [PMID: 37651179 PMCID: PMC10502599 DOI: 10.2196/48763] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2023] [Revised: 07/11/2023] [Accepted: 07/31/2023] [Indexed: 09/01/2023] Open
Abstract
BACKGROUND The reporting of machine learning (ML) prognostic and diagnostic modeling studies is often inadequate, making it difficult to understand and replicate such studies. To address this issue, multiple consensus and expert reporting guidelines for ML studies have been published. However, these guidelines cover different parts of the analytics lifecycle, and individually, none of them provide a complete set of reporting requirements. OBJECTIVE We aimed to consolidate the ML reporting guidelines and checklists in the literature to provide reporting items for prognostic and diagnostic ML in in-silico and shadow mode studies. METHODS We conducted a literature search that identified 192 unique peer-reviewed English articles that provide guidance and checklists for reporting ML studies. The articles were screened by their title and abstract against a set of 9 inclusion and exclusion criteria. Articles that were filtered through had their quality evaluated by 2 raters using a 9-point checklist constructed from guideline development good practices. The average κ was 0.71 across all quality criteria. The resulting 17 high-quality source papers were defined as having a quality score equal to or higher than the median. The reporting items in these 17 articles were consolidated and screened against a set of 6 inclusion and exclusion criteria. The resulting reporting items were sent to an external group of 11 ML experts for review and updated accordingly. The updated checklist was used to assess the reporting in 6 recent modeling papers in JMIR AI. Feedback from the external review and initial validation efforts was used to improve the reporting items. RESULTS In total, 37 reporting items were identified and grouped into 5 categories based on the stage of the ML project: defining the study details, defining and collecting the data, modeling methodology, model evaluation, and explainability. None of the 17 source articles covered all the reporting items. The study details and data description reporting items were the most common in the source literature, with explainability and methodology guidance (ie, data preparation and model training) having the least coverage. For instance, a median of 75% of the data description reporting items appeared in each of the 17 high-quality source guidelines, but only a median of 33% of the data explainability reporting items appeared. The highest-quality source articles tended to have more items on reporting study details. Other categories of reporting items were not related to the source article quality. We converted the reporting items into a checklist to support more complete reporting. CONCLUSIONS Our findings supported the need for a set of consolidated reporting items, given that existing high-quality guidelines and checklists do not individually provide complete coverage. The consolidated set of reporting items is expected to improve the quality and reproducibility of ML modeling studies.
Collapse
Affiliation(s)
- William Klement
- University of Ottawa, Ottawa, ON, Canada
- CHEO Research Institute, Ottawa, ON, Canada
| | - Khaled El Emam
- University of Ottawa, Ottawa, ON, Canada
- CHEO Research Institute, Ottawa, ON, Canada
| |
Collapse
|
18
|
Hujanen P, Ruha H, Lehtonen E, Pirinen I, Huhtala H, Vaajanen A, Syvänen U, Tuulonen A, Uusitalo-Järvinen H. Ten-year real-world outcomes of antivascular endothelial growth factor therapy in neovascular age-related macular degeneration using pro re nata regimen. BMJ Open Ophthalmol 2023; 8:e001328. [PMID: 37586826 PMCID: PMC10432647 DOI: 10.1136/bmjophth-2023-001328] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2023] [Accepted: 08/01/2023] [Indexed: 08/18/2023] Open
Abstract
BACKGROUND/AIMS To analyse long-term outcomes of antivascular endothelial growth factor (anti-VEGF) therapy for the treatment of neovascular age-related macular degeneration (nAMD) using pro re nata (PRN) regimen in a single-centre clinical practice. METHODS All patients receiving intravitreal injection (IVI) for nAMD between 1 January 2008 and 31 December 2020 were searched from electronic medical records. All 3844 treatment-naïve eyes of 3008 patients were included with a total of 50 146 IVIs (87% bevacizumab) administered. Main outcome measures were mean change in visual acuity (VA) from baseline, proportion of eyes within 15 letters of baseline, proportion of eyes with VA ≥20/40 Snellen and ≤20/200 Snellen, number of annual visits and number of annual IVIs. RESULTS The mean baseline VA was 55 Early Treatment Diabetic Retinopathy Study (ETDRS) letters and the mean change in VA from baseline was +2, +2, ±0, -2, -2 and -4 ETDRS letters at year 1, 2, 3, 5, 7 and 10, respectively. Proportions of eyes within 15 letters of baseline were 88%, 87%, 82%, 80%, 76% and 72% at the end of years 1, 2, 3, 5, 7 and 10, respectively. The median number of annual IVI was 6 at years 1-7 and 5 at year 10. The median number of annual total visits was 10 at year 1, 9 at years 2-7 and 8 at year 10, respectively. CONCLUSIONS VA was maintained short-term and long-term with anti-VEGF therapy using PRN treatment regimen.
Collapse
Affiliation(s)
- Pekko Hujanen
- Tays Eye Centre, Tampere University Hospital, Tampere, Finland
- Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland
| | - Heikki Ruha
- Tays Eye Centre, Tampere University Hospital, Tampere, Finland
- Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland
| | - Eemil Lehtonen
- Tays Eye Centre, Tampere University Hospital, Tampere, Finland
- Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland
| | - Inka Pirinen
- Tays Eye Centre, Tampere University Hospital, Tampere, Finland
- Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland
| | - Heini Huhtala
- Faculty of Social Sciences, Tampere University, Tampere, Finland
| | - Anu Vaajanen
- Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland
| | - Ulla Syvänen
- Tays Eye Centre, Tampere University Hospital, Tampere, Finland
| | - Anja Tuulonen
- Tays Eye Centre, Tampere University Hospital, Tampere, Finland
| | - Hannele Uusitalo-Järvinen
- Tays Eye Centre, Tampere University Hospital, Tampere, Finland
- Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland
| |
Collapse
|
19
|
Breger A, Selby I, Roberts M, Babar J, Gkrania-Klotsas E, Preller J, Escudero Sánchez L, Rudd JHF, Aston JAD, Weir-McCall JR, Sala E, Schönlieb CB. A pipeline to further enhance quality, integrity and reusability of the NCCID clinical data. Sci Data 2023; 10:493. [PMID: 37500661 PMCID: PMC10374610 DOI: 10.1038/s41597-023-02340-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Accepted: 06/27/2023] [Indexed: 07/29/2023] Open
Abstract
The National COVID-19 Chest Imaging Database (NCCID) is a centralized UK database of thoracic imaging and corresponding clinical data. It is made available by the National Health Service Artificial Intelligence (NHS AI) Lab to support the development of machine learning tools focused on Coronavirus Disease 2019 (COVID-19). A bespoke cleaning pipeline for NCCID, developed by the NHSx, was introduced in 2021. We present an extension to the original cleaning pipeline for the clinical data of the database. It has been adjusted to correct additional systematic inconsistencies in the raw data such as patient sex, oxygen levels and date values. The most important changes will be discussed in this paper, whilst the code and further explanations are made publicly available on GitLab. The suggested cleaning will allow global users to work with more consistent data for the development of machine learning tools without being an expert. In addition, it highlights some of the challenges when working with clinical multi-center data and includes recommendations for similar future initiatives.
Collapse
Affiliation(s)
- Anna Breger
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, UK.
- Center of Medical Physics and Biomedical Engineering, Medical University of Vienna, Vienna, Austria.
| | - Ian Selby
- Department of Radiology, University of Cambridge, Cambridge, UK.
- Cambridge University Hospitals NHS Trust, Cambridge, UK.
| | - Michael Roberts
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, UK
- Department of Medicine, University of Cambridge, Cambridge, UK
| | - Judith Babar
- Department of Radiology, University of Cambridge, Cambridge, UK
- Cambridge University Hospitals NHS Trust, Cambridge, UK
| | - Effrossyni Gkrania-Klotsas
- Cambridge University Hospitals NHS Trust, Cambridge, UK
- School of Clinical Medicine, University of Cambridge, Cambridge, UK
| | - Jacobus Preller
- Cambridge University Hospitals NHS Trust, Cambridge, UK
- School of Clinical Medicine, University of Cambridge, Cambridge, UK
| | - Lorena Escudero Sánchez
- Department of Radiology, University of Cambridge, Cambridge, UK
- Cancer Research UK (CRUK) Cambridge Centre, Cambridge, UK
| | - James H F Rudd
- Department of Medicine, University of Cambridge, Cambridge, UK
| | - John A D Aston
- Department of Pure Mathematics and Mathematical Statistics, University of Cambridge, Cambridge, UK
| | - Jonathan R Weir-McCall
- Department of Radiology, University of Cambridge, Cambridge, UK
- Cambridge University Hospitals NHS Trust, Cambridge, UK
- Department of Radiology, Royal Papworth Hospital, Cambridge, UK
| | - Evis Sala
- Advanced Radiodiagnostics Centre, Fondazione Policlinico Universitario Agostino Gemelli, Rome, Italy
- Università Cattolica del Sacro Cuore, Rome, Italy
| | - Carola-Bibiane Schönlieb
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, UK
| |
Collapse
|