1
|
Jiang M, Hong X, Gao Y, Kho AT, Tantisira KG, Li J. piRNA associates with immune diseases. Cell Commun Signal 2024; 22:347. [PMID: 38943141 PMCID: PMC11214247 DOI: 10.1186/s12964-024-01724-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2024] [Accepted: 06/23/2024] [Indexed: 07/01/2024] Open
Abstract
PIWI-interacting RNA (piRNA) is the most abundant small non-coding RNA in animal cells, typically 26-31 nucleotides in length and it binds with PIWI proteins, a subfamily of Argonaute proteins. Initially discovered in germ cells, piRNA is well known for its role in silencing transposons and maintaining genome integrity. However, piRNA is also present in somatic cells as well as in extracellular vesicles and exosomes. While piRNA has been extensively studied in various diseases, particular cancer, its function in immune diseases remains unclear. In this review, we summarize current research on piRNA in immune diseases. We first introduce the basic characteristics, biogenesis and functions of piRNA. Then, we review the association of piRNA with different types of immune diseases, including autoimmune diseases, immunodeficiency diseases, infectious diseases, and other immune-related diseases. piRNA is considered a promising biomarker for diseases, highlighting the need for further research into its potential mechanisms in disease pathogenesis.
Collapse
Affiliation(s)
- Mingye Jiang
- Clinical Big Data Research Center, The Seventh Affiliated Hospital, Sun Yat-Sen University, Shenzhen, Guangdong, China
| | - Xiaoning Hong
- Clinical Big Data Research Center, The Seventh Affiliated Hospital, Sun Yat-Sen University, Shenzhen, Guangdong, China
| | - Yunfei Gao
- Department of Otolaryngology, The Seventh Affiliated Hospital, Sun Yat-sen University, Shenzhen, Guangdong, China
| | - Alvin T Kho
- Channing Division of Network Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
- Computational Health Informatics Program, Boston Children's Hospital, Boston, MA, USA
| | - Kelan G Tantisira
- Channing Division of Network Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
- Department of Pediatrics, Division of Respiratory Medicine, University of California San Diego, La Jolla, CA, USA
| | - Jiang Li
- Clinical Big Data Research Center, The Seventh Affiliated Hospital, Sun Yat-Sen University, Shenzhen, Guangdong, China.
- Channing Division of Network Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA.
- Shenzhen Key Laboratory of Chinese Medicine Active Substance Screening and Translational Research, Guangdong, Shenzhen, China.
| |
Collapse
|
2
|
Roberts K, Chin AT, Loewy K, Pompeii L, Shin H, Rider NL. Natural language processing of clinical notes enables early inborn error of immunity risk ascertainment. THE JOURNAL OF ALLERGY AND CLINICAL IMMUNOLOGY. GLOBAL 2024; 3:100224. [PMID: 38439946 PMCID: PMC10910118 DOI: 10.1016/j.jacig.2024.100224] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/06/2023] [Revised: 12/24/2023] [Accepted: 01/21/2024] [Indexed: 03/06/2024]
Abstract
Background There are now approximately 450 discrete inborn errors of immunity (IEI) described; however, diagnostic rates remain suboptimal. Use of structured health record data has proven useful for patient detection but may be augmented by natural language processing (NLP). Here we present a machine learning model that can distinguish patients from controls significantly in advance of ultimate diagnosis date. Objective We sought to create an NLP machine learning algorithm that could identify IEI patients early during the disease course and shorten the diagnostic odyssey. Methods Our approach involved extracting a large corpus of IEI patient clinical-note text from a major referral center's electronic health record (EHR) system and a matched control corpus for comparison. We built text classifiers with simple machine learning methods and trained them on progressively longer time epochs before date of diagnosis. Results The top performing NLP algorithm effectively distinguished cases from controls robustly 36 months before ultimate clinical diagnosis (area under precision recall curve > 0.95). Corpus analysis demonstrated that statistically enriched, IEI-relevant terms were evident 24+ months before diagnosis, validating that clinical notes can provide a signal for early prediction of IEI. Conclusion Mining EHR notes with NLP holds promise for improving early IEI patient detection.
Collapse
Affiliation(s)
- Kirk Roberts
- McWilliams School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, Tex
| | - Aaron T. Chin
- Division of Immunology, Allergy, and Rheumatology, University of California, Los Angeles, Calif
| | | | - Lisa Pompeii
- Department of Patient Services, Cincinnati Children’s Hospital Medical Center, Cincinnati, Ohio
| | - Harold Shin
- College of Osteopathic Medicine, Liberty University, Lynchburg, Va
| | - Nicholas L. Rider
- Division of Health System & Implementation Science, Virginia Tech Carilion School of Medicine, Roanoke, Va
- Section of Allergy and Immunology, Carilion Clinic, Roanoke, Va
| |
Collapse
|
3
|
Sanchez DA, Lee ASE, Rotella K, Eng A, Cunningham-Rundles C. Social Determinants of Health Impacting Diagnosis and Management of Primary Immunodeficiencies: A Case Series. THE JOURNAL OF ALLERGY AND CLINICAL IMMUNOLOGY. IN PRACTICE 2024; 12:491-494. [PMID: 38061547 DOI: 10.1016/j.jaip.2023.11.043] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/06/2023] [Revised: 11/16/2023] [Accepted: 11/21/2023] [Indexed: 01/12/2024]
Affiliation(s)
- David A Sanchez
- Division of Allergy and Immunology, Icahn School of Medicine at Mount Sinai, New York, NY.
| | - Ashley Sang Eun Lee
- Division of Allergy and Immunology, Icahn School of Medicine at Mount Sinai, New York, NY
| | - Karina Rotella
- Division of Allergy and Immunology, Icahn School of Medicine at Mount Sinai, New York, NY
| | - Andrew Eng
- Division of Allergy and Immunology, Icahn School of Medicine at Mount Sinai, New York, NY
| | | |
Collapse
|
4
|
Coppola E, Sgrulletti M, Cortesi M, Romano R, Cirillo E, Giardino G, Dotta L, Cancrini C, Bruzzese D, Badolato R, Moschese V, Pignata C. The Inborn Errors of Immunity-Virtual Consultation System Platform in Service for the Italian Primary Immunodeficiency Network: Results from the Validation Phase. J Clin Immunol 2024; 44:47. [PMID: 38231401 PMCID: PMC10794402 DOI: 10.1007/s10875-023-01644-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Accepted: 12/17/2023] [Indexed: 01/18/2024]
Abstract
PURPOSE Inborn errors of immunity (IEI) represent a heterogeneous group of rare genetically determined diseases. In some cases, patients present with complex or atypical phenotypes, not fulfilling the accepted diagnostic criteria for IEI and, thus, at high risk of misdiagnosis or diagnostic delay. This study aimed to validate a platform that, through the opinion of immunologist experts, improves the diagnostic process and the level of care of patients with atypical/complex IEI. METHODS Here, we describe the functioning of the IEI-Virtual Consultation System (VCS), an innovative platform created by the Italian Immunodeficiency Network (IPINet). RESULTS In the validation phase, from January 2020 to June 2021, 68 cases were entered on the IEI-VCS platform. A final diagnosis was achieved in 35/68 cases (51%, 95% CI 38.7 to 64.2). In 22 out of 35 solved cases, the diagnosis was confirmed by genetic analysis. In 3/35 cases, a diagnosis of secondary immunodeficiency was made. In the remaining 10 cases, an unequivocal clinical and immunological diagnosis was obtained, even though not substantiated by genetic analysis. CONCLUSION From our preliminary study, the VCS represents an innovative and useful system to improve the diagnostic process of patients with complex unsolved IEI disorders, with benefits both in terms of reduction of time of diagnosis and access to the required therapies. These results may help the functioning of other international platforms for the management of complex cases.
Collapse
Affiliation(s)
- Emma Coppola
- Section of Pediatrics, Department of Translational Medical Sciences, Federico II University, Via S. Pansini, 5-80131, Naples, Italy
| | - Mayla Sgrulletti
- Pediatric Immunopathology and Allergology Unit, Policlinico Tor Vergata, University of Tor Vergata, Rome, Italy
- PhD Program in Immunology, Molecular Medicine and Applied Biotechnology, University of Rome Tor Vergata, Rome, Italy
| | - Manuela Cortesi
- Pediatrics Clinic and Institute for Molecular Medicine A. Nocivelli, Department of Clinical and Experimental Sciences, ASST- Spedali Civili of Brescia, University of Brescia, Brescia, Italy
| | - Roberta Romano
- Section of Pediatrics, Department of Translational Medical Sciences, Federico II University, Via S. Pansini, 5-80131, Naples, Italy
| | - Emilia Cirillo
- Section of Pediatrics, Department of Translational Medical Sciences, Federico II University, Via S. Pansini, 5-80131, Naples, Italy
| | - Giuliana Giardino
- Section of Pediatrics, Department of Translational Medical Sciences, Federico II University, Via S. Pansini, 5-80131, Naples, Italy
| | - Laura Dotta
- Pediatrics Clinic and Institute for Molecular Medicine A. Nocivelli, Department of Clinical and Experimental Sciences, ASST- Spedali Civili of Brescia, University of Brescia, Brescia, Italy
| | - Caterina Cancrini
- Research Unit of Primary Immunodeficiency, IRCCS Bambin Gesù Children Hospital, Rome, Italy
| | - Dario Bruzzese
- Department of Public Health, Federico II University, Naples, Italy
| | - Raffaele Badolato
- Pediatrics Clinic and Institute for Molecular Medicine A. Nocivelli, Department of Clinical and Experimental Sciences, ASST- Spedali Civili of Brescia, University of Brescia, Brescia, Italy
| | - Viviana Moschese
- Pediatric Immunopathology and Allergology Unit, Policlinico Tor Vergata, University of Tor Vergata, Rome, Italy
| | - Claudio Pignata
- Section of Pediatrics, Department of Translational Medical Sciences, Federico II University, Via S. Pansini, 5-80131, Naples, Italy.
| |
Collapse
|
5
|
Papanastasiou G, Yang G, Fotiadis DI, Dikaios N, Wang C, Huda A, Sobolevsky L, Raasch J, Perez E, Sidhu G, Palumbo D. Large-scale deep learning analysis to identify adult patients at risk for combined and common variable immunodeficiencies. COMMUNICATIONS MEDICINE 2023; 3:189. [PMID: 38123736 PMCID: PMC10733406 DOI: 10.1038/s43856-023-00412-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2023] [Accepted: 11/21/2023] [Indexed: 12/23/2023] Open
Abstract
BACKGROUND Primary immunodeficiency (PI) is a group of heterogeneous disorders resulting from immune system defects. Over 70% of PI is undiagnosed, leading to increased mortality, co-morbidity and healthcare costs. Among PI disorders, combined immunodeficiencies (CID) are characterized by complex immune defects. Common variable immunodeficiency (CVID) is among the most common types of PI. In light of available treatments, it is critical to identify adult patients at risk for CID and CVID, before the development of serious morbidity and mortality. METHODS We developed a deep learning-based method (named "TabMLPNet") to analyze clinical history from nationally representative medical claims from electronic health records (Optum® data, covering all US), evaluated in the setting of identifying CID/CVID in adults. Further, we revealed the most important CID/CVID-associated antecedent phenotype combinations. Four large cohorts were generated: a total of 47,660 PI cases and (1:1 matched) controls. RESULTS The sensitivity/specificity of TabMLPNet modeling ranges from 0.82-0.88/0.82-0.85 across cohorts. Distinctive combinations of antecedent phenotypes associated with CID/CVID are identified, consisting of respiratory infections/conditions, genetic anomalies, cardiac defects, autoimmune diseases, blood disorders and malignancies, which can possibly be useful to systematize the identification of CID and CVID. CONCLUSIONS We demonstrated an accurate method in terms of CID and CVID detection evaluated on large-scale medical claims data. Our predictive scheme can potentially lead to the development of new clinical insights and expanded guidelines for identification of adult patients at risk for CID and CVID as well as be used to improve patient outcomes on population level.
Collapse
Affiliation(s)
| | - Guang Yang
- National Heart and Lung Institute, Imperial College London, London, UK
- Cardiovascular Research Centre, Royal Brompton Hospital, London, UK
- School of Biomedical Engineering & Imaging Sciences, King's College London, London, UK
| | - Dimitris I Fotiadis
- Department of Biomedical Research, Institute of Molecular Biology and Biotechnology, FORTH, Ioannina, Greece
- Unit of Medical Technology and Intelligent Information Systems, University of Ioannina, Ioannina, Greece
| | | | - Chengjia Wang
- School of Mathematical and Computer Sciences, Heriot Watt, Edinburgh, UK
- Edinburgh Centre for Robotics, Edinburgh, UK
| | | | | | | | - Elena Perez
- Allergy Associates of the Palm Beaches, North Palm Beach, FL, USA
| | | | | |
Collapse
|
6
|
Messelink MA, Welsing PMJ, Devercelli G, Marsden JWN, Leavis HL. Clinical Validation of a Primary Antibody Deficiency Screening Algorithm for Primary Care. J Clin Immunol 2023; 43:2022-2032. [PMID: 37715890 PMCID: PMC10660978 DOI: 10.1007/s10875-023-01575-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Accepted: 08/27/2023] [Indexed: 09/18/2023]
Abstract
PURPOSE The diagnostic delay of primary antibody deficiencies (PADs) is associated with increased morbidity, mortality, and healthcare costs. Therefore, a screening algorithm was previously developed for the early detection of patients at risk of PAD in primary care. We aimed to clinically validate and optimize the PAD screening algorithm by applying it to a primary care database in the Netherlands. METHODS The algorithm was applied to a data set of 61,172 electronic health records (EHRs). Four hundred high-scoring EHRs were screened for exclusion criteria, and remaining patients were invited for serum immunoglobulin analysis and referred if clinically necessary. RESULTS Of the 104 patients eligible for inclusion, 16 were referred by their general practitioner for suspected PAD, of whom 10 had a PAD diagnosis. In patients selected by the screening algorithm and included for laboratory analysis, prevalence of PAD was ~ 1:10 versus 1:1700-1:25,000 in the general population. To optimize efficiency of the screening process, we refitted the algorithm with the subset of high-risk patients, which improved the area under the curve-receiver operating characteristics curve value to 0.80 (95% confidence interval 0.63-0.97). We propose a two-step screening process, first applying the original algorithm to distinguish high-risk from low-risk patients, then applying the optimized algorithm to select high-risk patients for serum immunoglobulin analysis. CONCLUSION Using the screening algorithm, we were able to identify 10 new PAD patients from a primary care population, thus reducing diagnostic delay. Future studies should address further validation in other populations and full cost-effectiveness analyses. REGISTRATION Clinicaltrials.gov record number NCT05310604, first submitted 25 March 2022.
Collapse
Affiliation(s)
- Marianne A Messelink
- Department of Rheumatology & Clinical Immunology, University Medical Center Utrecht, Utrecht, Netherlands.
| | - Paco M J Welsing
- Department of Rheumatology & Clinical Immunology, University Medical Center Utrecht, Utrecht, Netherlands
| | | | - Jan Willem N Marsden
- Department of Rheumatology & Clinical Immunology, University Medical Center Utrecht, Utrecht, Netherlands
| | - Helen L Leavis
- Department of Rheumatology & Clinical Immunology, University Medical Center Utrecht, Utrecht, Netherlands
| |
Collapse
|
7
|
Méndez Barrera JA, Rocha Guzmán S, Hierro Cascajares E, Garabedian EK, Fuleihan RL, Sullivan KE, Lugo Reyes SO. Who's your data? Primary immune deficiency differential diagnosis prediction via machine learning and data mining of the USIDNET registry. Clin Immunol 2023; 255:109759. [PMID: 37678719 DOI: 10.1016/j.clim.2023.109759] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2023] [Revised: 07/31/2023] [Accepted: 09/02/2023] [Indexed: 09/09/2023]
Abstract
PURPOSE There are currently more than 480 primary immune deficiency (PID) diseases and about 7000 rare diseases that together afflict around 1 in every 17 humans. Computational aids based on data mining and machine learning might facilitate the diagnostic task by extracting rules from large datasets and making predictions when faced with new problem cases. In a proof-of-concept data mining study, we aimed to predict PID diagnoses using a supervised machine learning algorithm based on classification tree boosting. METHODS Through a data query at the USIDNET registry we obtained a database of 2396 patients with common diagnoses of PID, including their clinical and laboratory features. We kept 286 features and all 12 diagnoses to include in the model. We used the XGBoost package with parallel tree boosting for the supervised classification model, and SHAP for variable importance interpretation, on Python v3.7. The patient database was split into training and testing subsets, and after boosting through gradient descent, the predictive model provides measures of diagnostic prediction accuracy and individual feature importance. After a baseline performance test, we used the Class Weighting Hyperparameter, or scale_pos_weight to correct for imbalanced classification. RESULTS The twelve PID diagnoses were CVID (1098 patients), DiGeorge syndrome, Chronic granulomatous disease, Congenital agammaglobulinemia, PID not otherwise classified, Specific antibody deficiency, Complement deficiency, Hyper-IgM, Leukocyte adhesion deficiency, ectodermal dysplasia with immune deficiency, Severe combined immune deficiency, and Wiskott-Aldrich syndrome. For CVID, the model found an accuracy on the train sample of 0.80, with an area under the ROC curve (AUC) of 0.80, and a Gini coefficient of 0.60. In the test subset, accuracy was 0.76, AUC 0.75, and Gini 0.51. The positive feature value to predict CVID was highest for upper respiratory infections, asthma, autoimmunity and hypogammaglobulinemia. Features with the highest negative predictive value were high IgE, growth delay, abscess, lymphopenia, and congenital heart disease. For the rest of the diagnoses, accuracy stayed between 0.75 and 0.99, AUC 0.46-0.87, Gini 0.07-0.75, and LogLoss 0.09-8.55. DISCUSSION Clinicians should remember to consider the negative predictive features together with the positives. We are calling this a proof-of-concept study to continue with our explorations. A good performance is encouraging, and feature importance might aid feature selection for future endeavors. In the meantime, we can learn from the rules derived by the model and build a user-friendly decision tree to generate differential diagnoses.
Collapse
Affiliation(s)
| | - Samuel Rocha Guzmán
- Data Science Department, Autonomous Technological Institute of Mexico, Mexico City, Mexico
| | - Elisa Hierro Cascajares
- Immune deficiencies Lab, National Institute of Pediatrics, Secretariat of Health, Mexico City, Mexico
| | | | - Ramsay L Fuleihan
- Division of Pediatric Allergy, Immunology and Rheumatology at Columbia University, New York City, NY, USA
| | | | - Saul O Lugo Reyes
- Immune deficiencies Lab, National Institute of Pediatrics, Secretariat of Health, Mexico City, Mexico.
| |
Collapse
|
8
|
Dong H, Zhu B, Kong X, Zhang X. Efficient clinical data analysis for prediction of coal workers' pneumoconiosis using machine learning algorithms. THE CLINICAL RESPIRATORY JOURNAL 2023. [PMID: 37380332 PMCID: PMC10363790 DOI: 10.1111/crj.13657] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/16/2023] [Accepted: 06/14/2023] [Indexed: 06/30/2023]
Abstract
PURPOSE The purpose of this study is to propose an efficient coal workers' pneumoconiosis (CWP) clinical prediction system and put it into clinical use for clinical diagnosis of pneumoconiosis. METHODS Patients with CWP and dust-exposed workers who were enrolled from August 2021 to December 2021 were included in this study. Firstly, we chose the embedded method through using three feature selection approaches to perform the prediction analysis. Then, we performed the machine learning algorithms as the model backbone and combined them with three feature selection methods, respectively, to determine the optimal predictive model for CWP. RESULTS Through applying three feature selection approaches based on machine learning algorithms, it was found that AaDO2 and some pulmonary function indicators played an important role in prediction for identifying CWP of early stage. The support vector machine (SVM) algorithm was proved as the optimal machine learning model for predicting CWP, with the ROC curves obtained from three feature selection methods using SVM algorithm whose AUC values of 97.78%, 93.7%, and 95.56%, respectively. CONCLUSION We developed the optimal model (SVM algorithm) through comparisons and analyses among the performances of different models for the prediction of CWP as a clinical application.
Collapse
Affiliation(s)
- Hantian Dong
- Department of Geriatric Diseases, First Hospital of Shanxi Medical University, Taiyuan, Shanxi, People's Republic of China
- National Health Commission Key Laboratory of Pneumoconiosis, Shanxi Province Key Laboratory of Respiratory Diseases, Department of Pulmonary and Critical Care Medicine, First Hospital of Shanxi Medical University, Taiyuan, Shanxi, People's Republic of China
| | - Biaokai Zhu
- Network Security Department, Shanxi Police College, Taiyuan, Shanxi, People's Republic of China
| | - Xiaomei Kong
- National Health Commission Key Laboratory of Pneumoconiosis, Shanxi Province Key Laboratory of Respiratory Diseases, Department of Pulmonary and Critical Care Medicine, First Hospital of Shanxi Medical University, Taiyuan, Shanxi, People's Republic of China
| | - Xinri Zhang
- National Health Commission Key Laboratory of Pneumoconiosis, Shanxi Province Key Laboratory of Respiratory Diseases, Department of Pulmonary and Critical Care Medicine, First Hospital of Shanxi Medical University, Taiyuan, Shanxi, People's Republic of China
| |
Collapse
|
9
|
Xuan S, Zhang J, Guo Q, Zhao L, Yao X. A Diagnostic Classifier Based on Circulating miRNA Pairs for COPD Using a Machine Learning Approach. Diagnostics (Basel) 2023; 13:diagnostics13081440. [PMID: 37189541 DOI: 10.3390/diagnostics13081440] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2023] [Revised: 03/29/2023] [Accepted: 04/14/2023] [Indexed: 05/17/2023] Open
Abstract
Chronic obstructive pulmonary disease (COPD) is highly underdiagnosed, and early detection is urgent to prevent advanced progression. Circulating microRNAs (miRNAs) have been diagnostic candidates for multiple diseases. However, their diagnostic value has not yet been fully established in COPD. The purpose of this study was to develop an effective model for the diagnosis of COPD based on circulating miRNAs. We included circulating miRNA expression profiles of two independent cohorts consisting of 63 COPD and 110 normal samples, and then we constructed a miRNA pair-based matrix. Diagnostic models were developed using several machine learning algorithms. The predictive performance of the optimal model was validated in our external cohort. In this study, the diagnostic values of miRNAs based on the expression levels were unsatisfactory. We identified five key miRNA pairs and further developed seven machine learning models. The classifier based on LightGBM was selected as the final model with the area under the curve (AUC) values of 0.883 and 0.794 in test and validation datasets, respectively. We also built a web tool to assist diagnosis for clinicians. Enriched signaling pathways indicated the potential biological functions of the model. Collectively, we developed a robust machine learning model based on circulating miRNAs for COPD screening.
Collapse
Affiliation(s)
- Shurui Xuan
- Department of Respiratory & Critical Care Medicine, The First Affiliated Hospital of Nanjing Medical University, 300 Guangzhou Road, Nanjing 210029, China
| | - Jiayue Zhang
- Department of Neurosurgery, The First Affiliated Hospital of Nanjing Medical University, 300 Guangzhou Road, Nanjing 210029, China
| | - Qinxing Guo
- Department of Respiratory & Critical Care Medicine, The First Affiliated Hospital of Nanjing Medical University, 300 Guangzhou Road, Nanjing 210029, China
| | - Liang Zhao
- Department of Neurosurgery, The Affiliated Brain Hospital of Nanjing Medical University, 264 Guangzhou Road, Nanjing 210029, China
| | - Xin Yao
- Department of Respiratory & Critical Care Medicine, The First Affiliated Hospital of Nanjing Medical University, 300 Guangzhou Road, Nanjing 210029, China
| |
Collapse
|