1
|
Martinson AK, Chin AT, Butte MJ, Rider NL. Artificial Intelligence and Machine Learning for Inborn Errors of Immunity: Current State and Future Promise. THE JOURNAL OF ALLERGY AND CLINICAL IMMUNOLOGY. IN PRACTICE 2024:S2213-2198(24)00828-6. [PMID: 39127104 DOI: 10.1016/j.jaip.2024.08.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/02/2024] [Revised: 07/10/2024] [Accepted: 08/01/2024] [Indexed: 08/12/2024]
Abstract
Artificial intelligence (AI) and machine learning (ML) research within medicine has exponentially increased over the last decade, with studies showcasing the potential of AI/ML algorithms to improve clinical practice and outcomes. Ongoing research and efforts to develop AI-based models have expanded to aid in the identification of inborn errors of immunity (IEI). The use of larger electronic health record data sets, coupled with advances in phenotyping precision and enhancements in ML techniques, has the potential to significantly improve the early recognition of IEI, thereby increasing access to equitable care. In this review, we provide a comprehensive examination of AI/ML for IEI, covering the spectrum from data preprocessing for AI/ML analysis to current applications within immunology, and address the challenges associated with implementing clinical decision support systems to refine the diagnosis and management of IEI.
Collapse
Affiliation(s)
| | - Aaron T Chin
- Department of Pediatrics, Division of Immunology, Allergy and Rheumatology, University of California, Los Angeles, Los Angeles, Calif
| | - Manish J Butte
- Department of Pediatrics, Division of Immunology, Allergy and Rheumatology, University of California, Los Angeles, Los Angeles, Calif
| | - Nicholas L Rider
- Department of Health Systems & Implementation Science, Virginia Tech Carilion School of Medicine, Roanoke, Va; Department of Medicine, Division of Allergy-Immunology, Carilion Clinic, Roanoke, Va.
| |
Collapse
|
2
|
Rider NL, Truxton A, Ohrt T, Margolin-Katz I, Horan M, Shin H, Davila R, Tenembaum V, Quinn J, Modell V, Modell F, Orange JS, Branner A, Senerchia C. Validating inborn error of immunity prevalence and risk with nationally representative electronic health record data. J Allergy Clin Immunol 2024; 153:1704-1710. [PMID: 38278184 DOI: 10.1016/j.jaci.2024.01.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2023] [Revised: 01/17/2024] [Accepted: 01/18/2024] [Indexed: 01/28/2024]
Abstract
BACKGROUND The 10 Warning Signs of Primary Immunodeficiency were created 30 years ago to advance recognition of inborn errors of immunity (IEI). However, no population-level assessment of their utility applied to electronic health record (EHR) data has been conducted. OBJECTIVE We sought to quantify the value of having ≥2 warning signs (WS) toward diagnosing IEI using a highly representative real-world US cohort. A secondary goal was estimating the US prevalence of IEI. METHODS In this cohort study, we accessed normalized and de-identified EHR data on 152 million US patients. An IEI cohort (n = 41,080), in which patients were defined by having at least 1 verifiable IEI diagnosis placed ≥2 times in their record, was compared with a matched set of controls (n = 250,262). WS were encoded along with relevant diagnoses, relative weights were calculated, and the proportion of IEI cases versus controls with ≥2 WS was compared. RESULTS The proportion of IEI cases with ≥2 WS significantly differed from controls (0.33 vs 0.031; P < .0005, χ2 test). We also estimated a US IEI prevalence of 6 per 10,000 individuals (41,080/73,165,655; 0.056%). WS 9 (≥2 deep-seated infections), 7 (fungal infections), 5 (failure to thrive) and 4 (≥2 pneumonias in 1 year) were the most heavily weighted among the IEI cohort. CONCLUSIONS This nationally representative US-based cohort study demonstrates that presence of WS and associated clinical diagnoses can facilitate identification of patients with IEI from EHR data. In addition, we estimate that 6 in 10,000, or approximately 150,000 to 200,000 individuals are affected by IEI across the United States.
Collapse
Affiliation(s)
- Nicholas L Rider
- Department of Health Systems and Implementation Science, Virginia Tech Carilion School of Medicine, Roanoke, Va.
| | - Ahuva Truxton
- Optum Clinical Trial Solutions, Optum Life Sciences, Eden Prairie, Minn
| | - Tracy Ohrt
- Optum Clinical Trial Solutions, Optum Life Sciences, Eden Prairie, Minn
| | | | - Mary Horan
- Optum Clinical Trial Solutions, Optum Life Sciences, Eden Prairie, Minn
| | - Harold Shin
- Division of Clinical Informatics, Liberty University College of Osteopathic Medicine, Lynchburg, Va
| | | | | | | | | | | | - Jordan S Orange
- Department of Pediatrics, Columbia University Vagelos College of Physicians and Surgeons, New York, NY
| | - Almut Branner
- Optum Clinical Trial Solutions, Optum Life Sciences, Eden Prairie, Minn
| | - Cynthia Senerchia
- Optum Clinical Trial Solutions, Optum Life Sciences, Eden Prairie, Minn
| |
Collapse
|
3
|
Faviez C, Chen X, Garcelon N, Zaidan M, Billot K, Petzold F, Faour H, Douillet M, Rozet JM, Cormier-Daire V, Attié-Bitach T, Lyonnet S, Saunier S, Burgun A. Objectivizing issues in the diagnosis of complex rare diseases: lessons learned from testing existing diagnosis support systems on ciliopathies. BMC Med Inform Decis Mak 2024; 24:134. [PMID: 38789985 PMCID: PMC11127295 DOI: 10.1186/s12911-024-02538-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2024] [Accepted: 05/17/2024] [Indexed: 05/26/2024] Open
Abstract
BACKGROUND There are approximately 8,000 different rare diseases that affect roughly 400 million people worldwide. Many of them suffer from delayed diagnosis. Ciliopathies are rare monogenic disorders characterized by a significant phenotypic and genetic heterogeneity that raises an important challenge for clinical diagnosis. Diagnosis support systems (DSS) applied to electronic health record (EHR) data may help identify undiagnosed patients, which is of paramount importance to improve patients' care. Our objective was to evaluate three online-accessible rare disease DSSs using phenotypes derived from EHRs for the diagnosis of ciliopathies. METHODS Two datasets of ciliopathy cases, either proven or suspected, and two datasets of controls were used to evaluate the DSSs. Patient phenotypes were automatically extracted from their EHRs and converted to Human Phenotype Ontology terms. We tested the ability of the DSSs to diagnose cases in contrast to controls based on Orphanet ontology. RESULTS A total of 79 cases and 38 controls were selected. Performances of the DSSs on ciliopathy real world data (best DSS with area under the ROC curve = 0.72) were not as good as published performances on the test set used in the DSS development phase. None of these systems obtained results which could be described as "expert-level". Patients with multisystemic symptoms were generally easier to diagnose than patients with isolated symptoms. Diseases easily confused with ciliopathy generally affected multiple organs and had overlapping phenotypes. Four challenges need to be considered to improve the performances: to make the DSSs interoperable with EHR systems, to validate the performances in real-life settings, to deal with data quality, and to leverage methods and resources for rare and complex diseases. CONCLUSION Our study provides insights into the complexities of diagnosing highly heterogenous rare diseases and offers lessons derived from evaluation existing DSSs in real-world settings. These insights are not only beneficial for ciliopathy diagnosis but also hold relevance for the enhancement of DSS for various complex rare disorders, by guiding the development of more clinically relevant rare disease DSSs, that could support early diagnosis and finally make more patients eligible for treatment.
Collapse
Affiliation(s)
- Carole Faviez
- Centre de Recherche des Cordeliers, Sorbonne Université, INSERM, Université Paris Cité, Paris, F-75006, France.
- HeKA, Inria Paris, Paris, F-75012, France.
- Universite Paris Cite, Paris, France.
| | - Xiaoyi Chen
- Centre de Recherche des Cordeliers, Sorbonne Université, INSERM, Université Paris Cité, Paris, F-75006, France
- HeKA, Inria Paris, Paris, F-75012, France
- Data Science Platform, Université Paris Cité, Imagine Institute, INSERM UMR 1163, Paris, F-75015, France
| | - Nicolas Garcelon
- Centre de Recherche des Cordeliers, Sorbonne Université, INSERM, Université Paris Cité, Paris, F-75006, France
- HeKA, Inria Paris, Paris, F-75012, France
- Data Science Platform, Université Paris Cité, Imagine Institute, INSERM UMR 1163, Paris, F-75015, France
| | - Mohamad Zaidan
- Service de Néphrologie, Dialyse et Transplantation, Hôpital Universitaire Bicêtre, Assistance Publique-Hôpitaux de Paris (AP-HP), Kremlin Bicêtre, F-94270, France
| | - Katy Billot
- Laboratory of Renal Hereditary Diseases, Imagine Institute, INSERM UMR 1163, Université Paris Cité, Paris, F-75015, France
| | - Friederike Petzold
- Laboratory of Renal Hereditary Diseases, Imagine Institute, INSERM UMR 1163, Université Paris Cité, Paris, F-75015, France
- Division of Nephrology, University of Leipzig Medical Center, Leipzig, Germany
| | - Hassan Faour
- Data Science Platform, Université Paris Cité, Imagine Institute, INSERM UMR 1163, Paris, F-75015, France
| | - Maxime Douillet
- Data Science Platform, Université Paris Cité, Imagine Institute, INSERM UMR 1163, Paris, F-75015, France
| | - Jean-Michel Rozet
- Laboratory of Genetics in Ophthalmology, Imagine Institute, INSERM UMR 1163, Université Paris Cité, Paris, F-75015, France
| | - Valérie Cormier-Daire
- Reference Centre for Constitutional Bone Diseases, laboratory of Osteochondrodysplasia, Imagine Institute, INSERM UMR 1163, Université Paris Cité, Paris, F-75015, France
- Service de médecine génomique des maladies rares, Hôpital Necker-Enfants Malades, AP-HP, Paris, F-75015, France
| | - Tania Attié-Bitach
- Service d'Histologie-Embryologie-Cytogénétique, Hôpital Necker-Enfants Malades, AP-HP, Paris, F-75015, France
| | - Stanislas Lyonnet
- Service de médecine génomique des maladies rares, Hôpital Necker-Enfants Malades, AP-HP, Paris, F-75015, France
- Laboratory of Embryology and Genetics of Congenital Malformations, INSERM UMR 1163, Imagine Institute, Paris Cité, Paris, F-75015, France
| | - Sophie Saunier
- Laboratory of Renal Hereditary Diseases, Imagine Institute, INSERM UMR 1163, Université Paris Cité, Paris, F-75015, France
| | - Anita Burgun
- Centre de Recherche des Cordeliers, Sorbonne Université, INSERM, Université Paris Cité, Paris, F-75006, France
- HeKA, Inria Paris, Paris, F-75012, France
- Department of Medical Informatics, Hôpital Necker-Enfants Malades, AP-HP, Paris, F-75015, France
| |
Collapse
|
4
|
Roberts K, Chin AT, Loewy K, Pompeii L, Shin H, Rider NL. Natural language processing of clinical notes enables early inborn error of immunity risk ascertainment. THE JOURNAL OF ALLERGY AND CLINICAL IMMUNOLOGY. GLOBAL 2024; 3:100224. [PMID: 38439946 PMCID: PMC10910118 DOI: 10.1016/j.jacig.2024.100224] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/06/2023] [Revised: 12/24/2023] [Accepted: 01/21/2024] [Indexed: 03/06/2024]
Abstract
Background There are now approximately 450 discrete inborn errors of immunity (IEI) described; however, diagnostic rates remain suboptimal. Use of structured health record data has proven useful for patient detection but may be augmented by natural language processing (NLP). Here we present a machine learning model that can distinguish patients from controls significantly in advance of ultimate diagnosis date. Objective We sought to create an NLP machine learning algorithm that could identify IEI patients early during the disease course and shorten the diagnostic odyssey. Methods Our approach involved extracting a large corpus of IEI patient clinical-note text from a major referral center's electronic health record (EHR) system and a matched control corpus for comparison. We built text classifiers with simple machine learning methods and trained them on progressively longer time epochs before date of diagnosis. Results The top performing NLP algorithm effectively distinguished cases from controls robustly 36 months before ultimate clinical diagnosis (area under precision recall curve > 0.95). Corpus analysis demonstrated that statistically enriched, IEI-relevant terms were evident 24+ months before diagnosis, validating that clinical notes can provide a signal for early prediction of IEI. Conclusion Mining EHR notes with NLP holds promise for improving early IEI patient detection.
Collapse
Affiliation(s)
- Kirk Roberts
- McWilliams School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, Tex
| | - Aaron T. Chin
- Division of Immunology, Allergy, and Rheumatology, University of California, Los Angeles, Calif
| | | | - Lisa Pompeii
- Department of Patient Services, Cincinnati Children’s Hospital Medical Center, Cincinnati, Ohio
| | - Harold Shin
- College of Osteopathic Medicine, Liberty University, Lynchburg, Va
| | - Nicholas L. Rider
- Division of Health System & Implementation Science, Virginia Tech Carilion School of Medicine, Roanoke, Va
- Section of Allergy and Immunology, Carilion Clinic, Roanoke, Va
| |
Collapse
|
5
|
Rivière JG, Soler Palacín P, Butte MJ. Proceedings from the inaugural Artificial Intelligence in Primary Immune Deficiencies (AIPID) conference. J Allergy Clin Immunol 2024; 153:637-642. [PMID: 38224784 PMCID: PMC11402388 DOI: 10.1016/j.jaci.2024.01.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2023] [Revised: 01/09/2024] [Accepted: 01/11/2024] [Indexed: 01/17/2024]
Abstract
Here, we summarize the proceedings of the inaugural Artificial Intelligence in Primary Immune Deficiencies conference, during which experts and advocates gathered to advance research into the applications of artificial intelligence (AI), machine learning, and other computational tools in the diagnosis and management of inborn errors of immunity (IEIs). The conference focused on the key themes of expediting IEI diagnoses, challenges in data collection, roles of natural language processing and large language models in interpreting electronic health records, and ethical considerations in implementation. Innovative AI-based tools trained on electronic health records and claims databases have discovered new patterns of warning signs for IEIs, facilitating faster diagnoses and enhancing patient outcomes. Challenges in training AIs persist on account of data limitations, especially in cases of rare diseases, overlapping phenotypes, and biases inherent in current data sets. Furthermore, experts highlighted the significance of ethical considerations, data protection, and the necessity for open science principles. The conference delved into regulatory frameworks, equity in access, and the imperative for collaborative efforts to overcome these obstacles and harness the transformative potential of AI. Concerted efforts to successfully integrate AI into daily clinical immunology practice are still needed.
Collapse
Affiliation(s)
- Jacques G Rivière
- Infection and Immunity in Pediatric Patients Research Group, Vall d'Hebron Barcelona Hospital Campus, Barcelona, Spain; Pediatric Infectious Diseases and Immunodeficiencies Unit, Hospital Infantil i de la Dona, Vall d'Hebron Barcelona Hospital Campus, Barcelona, Spain; Jeffrey Modell Diagnostic and Research Center for Primary Immunodeficiencies, Barcelona, Spain; Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Pere Soler Palacín
- Infection and Immunity in Pediatric Patients Research Group, Vall d'Hebron Barcelona Hospital Campus, Barcelona, Spain; Pediatric Infectious Diseases and Immunodeficiencies Unit, Hospital Infantil i de la Dona, Vall d'Hebron Barcelona Hospital Campus, Barcelona, Spain; Jeffrey Modell Diagnostic and Research Center for Primary Immunodeficiencies, Barcelona, Spain; Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Manish J Butte
- Division of Immunology, Allergy, and Rheumatology, Department of Pediatrics, University of California Los Angeles, Los Angeles, Calif; Department of Microbiology, Immunology, and Molecular Genetics, University of California Los Angeles, Los Angeles, Calif; Department of Human Genetics, University of California Los Angeles, Los Angeles, Calif.
| |
Collapse
|
6
|
Faviez C, Vincent M, Garcelon N, Boyer O, Knebelmann B, Heidet L, Saunier S, Chen X, Burgun A. Performance and clinical utility of a new supervised machine-learning pipeline in detecting rare ciliopathy patients based on deep phenotyping from electronic health records and semantic similarity. Orphanet J Rare Dis 2024; 19:55. [PMID: 38336713 PMCID: PMC10858490 DOI: 10.1186/s13023-024-03063-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2023] [Accepted: 02/03/2024] [Indexed: 02/12/2024] Open
Abstract
BACKGROUND Rare diseases affect approximately 400 million people worldwide. Many of them suffer from delayed diagnosis. Among them, NPHP1-related renal ciliopathies need to be diagnosed as early as possible as potential treatments have been recently investigated with promising results. Our objective was to develop a supervised machine learning pipeline for the detection of NPHP1 ciliopathy patients from a large number of nephrology patients using electronic health records (EHRs). METHODS AND RESULTS We designed a pipeline combining a phenotyping module re-using unstructured EHR data, a semantic similarity module to address the phenotype dependence, a feature selection step to deal with high dimensionality, an undersampling step to address the class imbalance, and a classification step with multiple train-test split for the small number of rare cases. The pipeline was applied to thirty NPHP1 patients and 7231 controls and achieved good performances (sensitivity 86% with specificity 90%). A qualitative review of the EHRs of 40 misclassified controls showed that 25% had phenotypes belonging to the ciliopathy spectrum, which demonstrates the ability of our system to detect patients with similar conditions. CONCLUSIONS Our pipeline reached very encouraging performance scores for pre-diagnosing ciliopathy patients. The identified patients could then undergo genetic testing. The same data-driven approach can be adapted to other rare diseases facing underdiagnosis challenges.
Collapse
Affiliation(s)
- Carole Faviez
- Centre de Recherche des Cordeliers, Université Paris Cité, Sorbonne Université, INSERM UMR 1138, 75006, Paris, France.
- Inria, 75012, Paris, France.
| | - Marc Vincent
- Université Paris Cité, Imagine Institute, Data Science Platform, INSERM UMR 1163, 75015, Paris, France
| | - Nicolas Garcelon
- Centre de Recherche des Cordeliers, Université Paris Cité, Sorbonne Université, INSERM UMR 1138, 75006, Paris, France
- Inria, 75012, Paris, France
- Université Paris Cité, Imagine Institute, Data Science Platform, INSERM UMR 1163, 75015, Paris, France
| | - Olivia Boyer
- Department of Pediatric Nephrology, APHP-Centre, Reference Center for Inherited Renal Diseases (MARHEA), Imagine Institute, Hôpital Necker-Enfants Malades, Université Paris Cité, 75015, Paris, France
- Laboratory of Renal Hereditary Diseases, INSERM UMR 1163, Imagine Institute, Université Paris Cité, 75015, Paris, France
| | - Bertrand Knebelmann
- Nephrology and Transplantation Department, MARHEA, Hôpital Necker-Enfants Malades, AP-HP, Université Paris Cité, 75015, Paris, France
| | - Laurence Heidet
- Department of Pediatric Nephrology, APHP-Centre, Reference Center for Inherited Renal Diseases (MARHEA), Imagine Institute, Hôpital Necker-Enfants Malades, Université Paris Cité, 75015, Paris, France
| | - Sophie Saunier
- Laboratory of Renal Hereditary Diseases, INSERM UMR 1163, Imagine Institute, Université Paris Cité, 75015, Paris, France
| | - Xiaoyi Chen
- Centre de Recherche des Cordeliers, Université Paris Cité, Sorbonne Université, INSERM UMR 1138, 75006, Paris, France
- Inria, 75012, Paris, France
- Université Paris Cité, Imagine Institute, Data Science Platform, INSERM UMR 1163, 75015, Paris, France
| | - Anita Burgun
- Centre de Recherche des Cordeliers, Université Paris Cité, Sorbonne Université, INSERM UMR 1138, 75006, Paris, France
- Inria, 75012, Paris, France
- Département d'informatique Médicale, Hôpital Necker-Enfants Malades, AP-HP, 75015, Paris, France
| |
Collapse
|
7
|
Papanastasiou G, Yang G, Fotiadis DI, Dikaios N, Wang C, Huda A, Sobolevsky L, Raasch J, Perez E, Sidhu G, Palumbo D. Large-scale deep learning analysis to identify adult patients at risk for combined and common variable immunodeficiencies. COMMUNICATIONS MEDICINE 2023; 3:189. [PMID: 38123736 PMCID: PMC10733406 DOI: 10.1038/s43856-023-00412-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2023] [Accepted: 11/21/2023] [Indexed: 12/23/2023] Open
Abstract
BACKGROUND Primary immunodeficiency (PI) is a group of heterogeneous disorders resulting from immune system defects. Over 70% of PI is undiagnosed, leading to increased mortality, co-morbidity and healthcare costs. Among PI disorders, combined immunodeficiencies (CID) are characterized by complex immune defects. Common variable immunodeficiency (CVID) is among the most common types of PI. In light of available treatments, it is critical to identify adult patients at risk for CID and CVID, before the development of serious morbidity and mortality. METHODS We developed a deep learning-based method (named "TabMLPNet") to analyze clinical history from nationally representative medical claims from electronic health records (Optum® data, covering all US), evaluated in the setting of identifying CID/CVID in adults. Further, we revealed the most important CID/CVID-associated antecedent phenotype combinations. Four large cohorts were generated: a total of 47,660 PI cases and (1:1 matched) controls. RESULTS The sensitivity/specificity of TabMLPNet modeling ranges from 0.82-0.88/0.82-0.85 across cohorts. Distinctive combinations of antecedent phenotypes associated with CID/CVID are identified, consisting of respiratory infections/conditions, genetic anomalies, cardiac defects, autoimmune diseases, blood disorders and malignancies, which can possibly be useful to systematize the identification of CID and CVID. CONCLUSIONS We demonstrated an accurate method in terms of CID and CVID detection evaluated on large-scale medical claims data. Our predictive scheme can potentially lead to the development of new clinical insights and expanded guidelines for identification of adult patients at risk for CID and CVID as well as be used to improve patient outcomes on population level.
Collapse
Affiliation(s)
| | - Guang Yang
- National Heart and Lung Institute, Imperial College London, London, UK
- Cardiovascular Research Centre, Royal Brompton Hospital, London, UK
- School of Biomedical Engineering & Imaging Sciences, King's College London, London, UK
| | - Dimitris I Fotiadis
- Department of Biomedical Research, Institute of Molecular Biology and Biotechnology, FORTH, Ioannina, Greece
- Unit of Medical Technology and Intelligent Information Systems, University of Ioannina, Ioannina, Greece
| | | | - Chengjia Wang
- School of Mathematical and Computer Sciences, Heriot Watt, Edinburgh, UK
- Edinburgh Centre for Robotics, Edinburgh, UK
| | | | | | | | - Elena Perez
- Allergy Associates of the Palm Beaches, North Palm Beach, FL, USA
| | | | | |
Collapse
|
8
|
Messelink MA, Welsing PMJ, Devercelli G, Marsden JWN, Leavis HL. Clinical Validation of a Primary Antibody Deficiency Screening Algorithm for Primary Care. J Clin Immunol 2023; 43:2022-2032. [PMID: 37715890 PMCID: PMC10660978 DOI: 10.1007/s10875-023-01575-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Accepted: 08/27/2023] [Indexed: 09/18/2023]
Abstract
PURPOSE The diagnostic delay of primary antibody deficiencies (PADs) is associated with increased morbidity, mortality, and healthcare costs. Therefore, a screening algorithm was previously developed for the early detection of patients at risk of PAD in primary care. We aimed to clinically validate and optimize the PAD screening algorithm by applying it to a primary care database in the Netherlands. METHODS The algorithm was applied to a data set of 61,172 electronic health records (EHRs). Four hundred high-scoring EHRs were screened for exclusion criteria, and remaining patients were invited for serum immunoglobulin analysis and referred if clinically necessary. RESULTS Of the 104 patients eligible for inclusion, 16 were referred by their general practitioner for suspected PAD, of whom 10 had a PAD diagnosis. In patients selected by the screening algorithm and included for laboratory analysis, prevalence of PAD was ~ 1:10 versus 1:1700-1:25,000 in the general population. To optimize efficiency of the screening process, we refitted the algorithm with the subset of high-risk patients, which improved the area under the curve-receiver operating characteristics curve value to 0.80 (95% confidence interval 0.63-0.97). We propose a two-step screening process, first applying the original algorithm to distinguish high-risk from low-risk patients, then applying the optimized algorithm to select high-risk patients for serum immunoglobulin analysis. CONCLUSION Using the screening algorithm, we were able to identify 10 new PAD patients from a primary care population, thus reducing diagnostic delay. Future studies should address further validation in other populations and full cost-effectiveness analyses. REGISTRATION Clinicaltrials.gov record number NCT05310604, first submitted 25 March 2022.
Collapse
Affiliation(s)
- Marianne A Messelink
- Department of Rheumatology & Clinical Immunology, University Medical Center Utrecht, Utrecht, Netherlands.
| | - Paco M J Welsing
- Department of Rheumatology & Clinical Immunology, University Medical Center Utrecht, Utrecht, Netherlands
| | | | - Jan Willem N Marsden
- Department of Rheumatology & Clinical Immunology, University Medical Center Utrecht, Utrecht, Netherlands
| | - Helen L Leavis
- Department of Rheumatology & Clinical Immunology, University Medical Center Utrecht, Utrecht, Netherlands
| |
Collapse
|
9
|
Messelink MA, Berbers RM, van Montfrans JM, Ellerbroek PM, Gladiator A, Welsing PMJ, Leavis H. Development of a primary care screening algorithm for the early detection of patients at risk of primary antibody deficiency. ALLERGY, ASTHMA, AND CLINICAL IMMUNOLOGY : OFFICIAL JOURNAL OF THE CANADIAN SOCIETY OF ALLERGY AND CLINICAL IMMUNOLOGY 2023; 19:44. [PMID: 37245042 DOI: 10.1186/s13223-023-00790-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/01/2022] [Accepted: 04/09/2023] [Indexed: 05/29/2023]
Abstract
BACKGROUND Primary antibody deficiencies (PAD) are characterized by a heterogeneous clinical presentation and low prevalence, contributing to a median diagnostic delay of 3-10 years. This increases the risk of morbidity and mortality from undiagnosed PAD, which may be prevented with adequate therapy. To reduce the diagnostic delay of PAD, we developed a screening algorithm using primary care electronic health record (EHR) data to identify patients at risk of PAD. This screening algorithm can be used as an aid to notify general practitioners when further laboratory evaluation of immunoglobulins should be considered, thereby facilitating a timely diagnosis of PAD. METHODS Candidate components for the algorithm were based on a broad range of presenting signs and symptoms of PAD that are available in primary care EHRs. The decision on inclusion and weight of the components in the algorithm was based on the prevalence of these components among PAD patients and control groups, as well as clinical rationale. RESULTS We analyzed the primary care EHRs of 30 PAD patients, 26 primary care immunodeficiency patients and 58,223 control patients. The median diagnostic delay of PAD patients was 9.5 years. Several candidate components showed a clear difference in prevalence between PAD patients and controls, most notably the mean number of antibiotic prescriptions in the 4 years prior to diagnosis (5.14 vs. 0.48). The final algorithm included antibiotic prescriptions, diagnostic codes for respiratory tract and other infections, gastro-intestinal complaints, auto-immune symptoms, malignancies and lymphoproliferative symptoms, as well as laboratory values and visits to the general practitioner. CONCLUSIONS In this study, we developed a screening algorithm based on a broad range of presenting signs and symptoms of PAD, which is suitable to implement in primary care. It has the potential to considerably reduce diagnostic delay in PAD, and will be validated in a prospective study. Trial registration The consecutive prospective study is registered at clinicaltrials.gov under NCT05310604.
Collapse
Affiliation(s)
- Marianne A Messelink
- Department of Rheumatology & Clinical Immunology, University Medical Center Utrecht, Heidelberglaan 100, 3508 GA, Utrecht, The Netherlands.
| | - Roos M Berbers
- Department of Rheumatology & Clinical Immunology, University Medical Center Utrecht, Heidelberglaan 100, 3508 GA, Utrecht, The Netherlands
| | - Joris M van Montfrans
- Department of Rheumatology & Clinical Immunology, University Medical Center Utrecht, Heidelberglaan 100, 3508 GA, Utrecht, The Netherlands
| | - Pauline M Ellerbroek
- Department of Rheumatology & Clinical Immunology, University Medical Center Utrecht, Heidelberglaan 100, 3508 GA, Utrecht, The Netherlands
| | - André Gladiator
- Takeda Pharmaceuticals International AG, Thurgauerstrasse 130, 8152, Glattpark-Opfikon, Zurich, Switzerland
| | - Paco M J Welsing
- Department of Rheumatology & Clinical Immunology, University Medical Center Utrecht, Heidelberglaan 100, 3508 GA, Utrecht, The Netherlands
| | - Helen Leavis
- Department of Rheumatology & Clinical Immunology, University Medical Center Utrecht, Heidelberglaan 100, 3508 GA, Utrecht, The Netherlands
| |
Collapse
|
10
|
A validated artificial intelligence-based pipeline for population-wide primary immunodeficiency screening. J Allergy Clin Immunol 2023; 151:272-279. [PMID: 36243223 DOI: 10.1016/j.jaci.2022.10.005] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2022] [Revised: 09/21/2022] [Accepted: 10/05/2022] [Indexed: 11/07/2022]
Abstract
BACKGROUND Identification of patients with underlying inborn errors of immunity and inherent susceptibility to infection remains challenging. The ensuing protracted diagnostic odyssey for such patients often results in greater morbidity and suboptimal outcomes, underscoring a need to develop systematic methods for improving diagnostic rates. OBJECTIVE The principal aim of this study is to build and validate a generalizable analytical pipeline for population-wide detection of infection susceptibility and risk of primary immunodeficiency. METHODS This prospective, longitudinal cohort study coupled weighted rules with a machine learning classifier for risk stratification. Claims data were analyzed from a diverse population (n = 427,110) iteratively over 30 months. Cohort outcomes were enumerated for new diagnoses, hospitalizations, and acute care visits. This study followed TRIPOD (Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis) standards. RESULTS Cohort members initially identified as high risk were proportionally more likely to receive a diagnosis of primary immunodeficiency compared to those at low-medium risk or those without claims of interest respectively (9% vs 1.5% vs 0.2%; P < .001, chi-square test). Subsequent machine learning stratification enabled an annualized individual snapshot of complexity for triaging referrals. This study's top-performing machine learning model for visit-level prediction used a single dense layer neural network architecture (area under the receiver-operator characteristic curve = 0.98; F1 score = 0.98). CONCLUSIONS A 2-step analytical pipeline can facilitate identification of individuals with primary immunodeficiency and accurately quantify clinical risk.
Collapse
|
11
|
Mayampurath A, Ajith A, Anderson-Smits C, Chang SC, Brouwer E, Johnson J, Baltasi M, Volchenboum S, Devercelli G, Ciaccio CE. Early Diagnosis of Primary Immunodeficiency Disease Using Clinical Data and Machine Learning. THE JOURNAL OF ALLERGY AND CLINICAL IMMUNOLOGY. IN PRACTICE 2022; 10:3002-3007.e5. [PMID: 36108921 DOI: 10.1016/j.jaip.2022.08.041] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/16/2022] [Revised: 08/19/2022] [Accepted: 08/22/2022] [Indexed: 06/15/2023]
Abstract
BACKGROUND Primary immunodeficiency diseases (PIDD) are a group of immune-related disorders that have a current median delay of diagnosis between 6 and 9 years. Early diagnosis and treatment of PIDD has been associated with improved patient outcomes. OBJECTIVE To develop a machine learning model using elements within the electronic health record data that are related to prior symptomatic treatment to predict PIDD. METHODS We conducted a retrospective study of patients with PIDD identified using inclusion criteria of PIDD-related diagnoses, immunodeficiency-specific medications, and low immunoglobulin levels. We constructed a control group of age-, sex-, and race-matched patients with asthma. The primary outcome was the diagnosis of PIDD. We considered comorbidities, laboratory tests, medications, and radiological orders as features, all before diagnosis and indicative of symptom-related treatment. Features were presented sequentially to logistic regression, elastic net, and random forest classifiers, which were trained using a nested cross-validation approach. RESULTS Our cohort consisted of 6422 patients, of whom 247 (4%) were diagnosed with PIDD. Our logistic regression model with comorbidities demonstrated good discrimination between patients with PIDD and those with asthma (c-statistic: 0.62 [0.58-0.65]). Adding laboratory results, medications, and radiological orders improved discrimination (c-statistic: 0.70 vs 0.62, P < .001), sensitivity, and specificity. Extending to the advanced machine learning models did not improve performance. CONCLUSIONS We developed a prediction model for early diagnosis of PIDD using historical data that are related to symptomatic care, which has potential to fill an important need in reducing the time to diagnose PIDD, leading to better outcomes for immunodeficient patients.
Collapse
Affiliation(s)
| | - Aswathy Ajith
- Center for Research Informatics, University of Chicago, Chicago, Ill
| | | | | | - Emily Brouwer
- Takeda Development Center Americas, Inc., Cambridge, Mass
| | - Julie Johnson
- Center for Research Informatics, University of Chicago, Chicago, Ill
| | - Michael Baltasi
- Center for Research Informatics, University of Chicago, Chicago, Ill
| | | | | | | |
Collapse
|
12
|
"Common variable immunodeficiency: Challenges for diagnosis". J Immunol Methods 2022; 509:113342. [PMID: 36027932 DOI: 10.1016/j.jim.2022.113342] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2022] [Revised: 07/21/2022] [Accepted: 08/18/2022] [Indexed: 11/22/2022]
Abstract
Common variable immunodeficiency is a heterogeneous condition characterized by B cell dysfunction with reduced serum immunoglobulin levels and a highly variable spectrum of clinical manifestations ranging from recurrent infections to autoimmune disease. The diagnosis of CVID is often challenging due to the diverse clinical presentation of patients and the existence of multiple diagnostic criteria without a universally adopted consensus. Laboratory evaluation to assist with diagnosis currently includes serum immunoglobulin testing, immunophenotyping, assessment of vaccine response, and genetic testing. Additional emerging techniques include investigation of the B cell repertoire and the use of machine learning algorithms. Advances in our understanding of common variable immunodeficiency will ultimately contribute to earlier diagnosis and novel interventions with the goal of improving prognosis for these patients.
Collapse
|
13
|
Lehman HK, Yu KOA, Towe CT, Risma KA. Respiratory Infections in Patients with Primary Immunodeficiency. THE JOURNAL OF ALLERGY AND CLINICAL IMMUNOLOGY. IN PRACTICE 2022; 10:683-691.e1. [PMID: 34890826 DOI: 10.1016/j.jaip.2021.10.073] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/22/2021] [Accepted: 10/18/2021] [Indexed: 06/13/2023]
Abstract
Recurrent and life-threatening respiratory infections are nearly universal in patients with primary immunodeficiency diseases (PIDD). Early recognition, aggressive treatment, and prophylaxis with antimicrobials and immunoglobulin replacement have been the mainstays of management and will be reviewed here with an emphasis on respiratory infections. Genetic discoveries have allowed direct translation of research to clinical practice, improving our understanding of clinical patterns of pathogen susceptibilities and guiding prophylaxis. The recent identification of inborn errors in type I interferon signaling as a basis for life-threatening viral infections in otherwise healthy individuals suggests another targetable pathway for treatment and/or prophylaxis. The future of PIDD diagnosis will certainly involve early genetic identification by newborn screening before onset of infections, with early treatment offering the potential of preventing disease complications such as chronic lung changes. Gene editing approaches offer tremendous therapeutic potential, with rapidly emerging delivery systems. Antiviral therapies are desperately needed, and specific cellular therapies show promise in patients requiring hematopoietic stem cell transplantation. The introduction of approved therapies for clinical use in PIDD is limited by the difficulty of studying outcomes in rare patients/conditions with conventional clinical trials.
Collapse
Affiliation(s)
- Heather K Lehman
- Division of Allergy, Immunology, and Rheumatology, Department of Pediatrics, Jacobs School of Medicine and Biomedical Sciences, University at Buffalo, the State University of New York, and John R. Oishei Children's Hospital, Buffalo, NY.
| | - Karl O A Yu
- Division of Infectious Diseases, Department of Pediatrics, Jacobs School of Medicine and Biomedical Sciences, University at Buffalo, the State University of New York, and John R. Oishei Children's Hospital, Buffalo, NY
| | - Christopher T Towe
- Division of Pulmonary Medicine, Department of Pediatrics, University of Cincinnati College of Medicine, University of Cincinnati, and Cincinnati Children's Hospital Medical Center, Cincinnati, Ohio
| | - Kimberly A Risma
- Division of Allergy and Immunology, Department of Pediatrics, University of Cincinnati College of Medicine, University of Cincinnati, and Cincinnati Children's Hospital Medical Center, Cincinnati, Ohio
| |
Collapse
|
14
|
Artificial Intelligence in Clinical Immunology. Artif Intell Med 2022. [DOI: 10.1007/978-3-030-64573-1_83] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
15
|
Seeing the Forest for the Trees: Evaluating Population Data in Allergy-Immunology. THE JOURNAL OF ALLERGY AND CLINICAL IMMUNOLOGY-IN PRACTICE 2021; 9:4193-4199. [PMID: 34571199 DOI: 10.1016/j.jaip.2021.09.018] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/04/2021] [Revised: 09/15/2021] [Accepted: 09/15/2021] [Indexed: 01/04/2023]
Abstract
A population-level study is essential for understanding treatment effects, epidemiologic phenomena, and health care best practices. Evaluating large populations and associated data requires an analytic framework, which is commonly used by statisticians, epidemiologists, and data scientists. This document will serve to provide an overview of these commonly employed methods in allergy and immunology research. We will draw upon recent examples from the allergy-immunology literature to contextualize discrete principles of relevance to population-level analysis that include statistical features of a study population, elements of statistical inference, regression analysis, and an overview of machine learning practices. Our intent is to guide the reader through a practical description of this important quantitative discipline and facilitate greater understanding about data and result display in the medical literature.
Collapse
|
16
|
Racial, Ethnic, and Socioeconomic Disparities in the Diagnosis and Management of Primary Immunodeficiencies. THE JOURNAL OF ALLERGY AND CLINICAL IMMUNOLOGY-IN PRACTICE 2021; 9:2926-2927. [PMID: 34246439 DOI: 10.1016/j.jaip.2021.04.037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Subscribe] [Scholar Register] [Received: 04/20/2021] [Accepted: 04/21/2021] [Indexed: 11/22/2022]
|
17
|
Artificial Intelligence in Clinical Immunology. Artif Intell Med 2021. [DOI: 10.1007/978-3-030-58080-3_83-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|