1
|
Li P, Alnoor FNU, Xie W, Williams M, Feusier J, Ding Y, Zhao X, Zheng G, Zhao C, Zieske AW, Zu Y, Raess PW, Tantravahi S, Osman A, Patel AB, Tashi T, Patel JL, Matynia AP, Menon MP, Miles RR, Jacobsen JR, George TI, Sborov DW, Szankasi P, Rindler P, Close D, Ohgami RS. Rapid growth of acquired UBA1 mutations predisposes male patients to low-risk MDS. Leukemia 2025; 39:248-256. [PMID: 39516371 DOI: 10.1038/s41375-024-02397-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2024] [Revised: 08/21/2024] [Accepted: 08/22/2024] [Indexed: 11/16/2024]
Affiliation(s)
- Peng Li
- Department of Pathology, University of Utah Health, Salt Lake City, UT, USA.
- ARUP Laboratories, Salt Lake City, UT, USA.
| | - F N U Alnoor
- Department of Pathology, University of Utah Health, Salt Lake City, UT, USA
- Department of Pathology & Immunology, Washington University, St. Louis, MO, USA
| | - Wei Xie
- Department of Pathology and Laboratory Medicine, School of Medicine, Oregon Health & Science University, Portland, OR, USA
| | - Margaret Williams
- Department of Pathology, University of Utah Health, Salt Lake City, UT, USA
- ARUP Laboratories, Salt Lake City, UT, USA
| | - Julie Feusier
- Department of Pathology, University of Utah Health, Salt Lake City, UT, USA
- ARUP Laboratories, Salt Lake City, UT, USA
| | - Yi Ding
- Laboratory Medicine, Geisinger Medical Center, Danville, PA, USA
| | - Xiangrong Zhao
- Kaiser Permanente Sacramento Medical Center, Sacramento, CA, USA
| | - Gang Zheng
- Mayo Clinic Laboratories, Rochester, MN, USA
| | - Chen Zhao
- Department of Pathology and Laboratory Medicine, Case Western Reserve University, Cleveland, OH, USA
| | - Arthur W Zieske
- Pathology and Genomic Medicine, Houston Methodist Hospital, Houston, TX, USA
| | - Youli Zu
- Pathology and Genomic Medicine, Houston Methodist Hospital, Houston, TX, USA
| | - Philipp W Raess
- Department of Pathology and Laboratory Medicine, School of Medicine, Oregon Health & Science University, Portland, OR, USA
| | - Srinivas Tantravahi
- Department of Internal Medicine, University of Utah Health, Salt Lake City, UT, USA
- Huntsman Cancer Institute, Salt Lake City, UT, USA
| | - Afaf Osman
- Department of Internal Medicine, University of Utah Health, Salt Lake City, UT, USA
- Huntsman Cancer Institute, Salt Lake City, UT, USA
| | - Ami B Patel
- Department of Internal Medicine, University of Utah Health, Salt Lake City, UT, USA
- Huntsman Cancer Institute, Salt Lake City, UT, USA
| | - Tsewang Tashi
- Department of Internal Medicine, University of Utah Health, Salt Lake City, UT, USA
- Huntsman Cancer Institute, Salt Lake City, UT, USA
| | - Jay L Patel
- Department of Pathology, University of Utah Health, Salt Lake City, UT, USA
- ARUP Laboratories, Salt Lake City, UT, USA
| | - Anna P Matynia
- Department of Pathology, University of Utah Health, Salt Lake City, UT, USA
- ARUP Laboratories, Salt Lake City, UT, USA
| | - Madhu P Menon
- Department of Pathology, University of Utah Health, Salt Lake City, UT, USA
- ARUP Laboratories, Salt Lake City, UT, USA
| | - Rodney R Miles
- Department of Pathology, University of Utah Health, Salt Lake City, UT, USA
- ARUP Laboratories, Salt Lake City, UT, USA
- Huntsman Cancer Institute, Salt Lake City, UT, USA
| | - Jeffrey R Jacobsen
- Department of Pathology, University of Utah Health, Salt Lake City, UT, USA
- ARUP Laboratories, Salt Lake City, UT, USA
| | - Tracy I George
- Department of Pathology, University of Utah Health, Salt Lake City, UT, USA
- ARUP Laboratories, Salt Lake City, UT, USA
- Huntsman Cancer Institute, Salt Lake City, UT, USA
| | - Douglas W Sborov
- Department of Internal Medicine, University of Utah Health, Salt Lake City, UT, USA
- Huntsman Cancer Institute, Salt Lake City, UT, USA
| | | | | | | | - Robert S Ohgami
- Department of Pathology, University of Utah Health, Salt Lake City, UT, USA
- ARUP Laboratories, Salt Lake City, UT, USA
| |
Collapse
|
2
|
Ghete T, Kock F, Pontones M, Pfrang D, Westphal M, Höfener H, Metzler M. Models for the marrow: A comprehensive review of AI-based cell classification methods and malignancy detection in bone marrow aspirate smears. Hemasphere 2024; 8:e70048. [PMID: 39629240 PMCID: PMC11612571 DOI: 10.1002/hem3.70048] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2024] [Revised: 09/25/2024] [Accepted: 10/26/2024] [Indexed: 12/07/2024] Open
Abstract
Given the high prevalence of artificial intelligence (AI) research in medicine, the development of deep learning (DL) algorithms based on image recognition, such as the analysis of bone marrow aspirate (BMA) smears, is rapidly increasing in the field of hematology and oncology. The models are trained to identify the optimal regions of the BMA smear for differential cell count and subsequently detect and classify a number of cell types, which can ultimately be utilized for diagnostic purposes. Moreover, AI is capable of identifying genetic mutations phenotypically. This pipeline has the potential to offer an accurate and rapid preliminary analysis of the bone marrow in the clinical routine. However, the intrinsic complexity of hematological diseases presents several challenges for the automatic morphological assessment. To ensure general applicability across multiple medical centers and to deliver high accuracy on prospective clinical data, AI models would require highly heterogeneous training datasets. This review presents a systematic analysis of models for cell classification and detection of hematological malignancies published in the last 5 years (2019-2024). It provides insight into the challenges and opportunities of these DL-assisted tasks.
Collapse
Affiliation(s)
- Tabita Ghete
- Department of Pediatrics and Adolescent MedicineUniversity Hospital ErlangenErlangenGermany
- Bavarian Cancer Research Center (BZKF)ErlangenGermany
| | - Farina Kock
- Computational PathologyFraunhofer Institute for Digital Medicine (MEVIS)BremenGermany
| | - Martina Pontones
- Department of Pediatrics and Adolescent MedicineUniversity Hospital ErlangenErlangenGermany
- Bavarian Cancer Research Center (BZKF)ErlangenGermany
| | - David Pfrang
- Computational PathologyFraunhofer Institute for Digital Medicine (MEVIS)BremenGermany
| | - Max Westphal
- Computational PathologyFraunhofer Institute for Digital Medicine (MEVIS)BremenGermany
| | - Henning Höfener
- Computational PathologyFraunhofer Institute for Digital Medicine (MEVIS)BremenGermany
| | - Markus Metzler
- Department of Pediatrics and Adolescent MedicineUniversity Hospital ErlangenErlangenGermany
- Bavarian Cancer Research Center (BZKF)ErlangenGermany
- Comprehensive Cancer Center Erlangen‐EMN (CCC ER‐EMN)ErlangenGermany
| |
Collapse
|
3
|
Seo J, Lee C, Koh Y, Sun CH, Lee JM, An HY, Kim M. Machine-learning-based predictive classifier for bone marrow failure syndrome using complete blood count data. iScience 2024; 27:111082. [PMID: 39502286 PMCID: PMC11535363 DOI: 10.1016/j.isci.2024.111082] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2023] [Revised: 04/23/2024] [Accepted: 09/27/2024] [Indexed: 11/08/2024] Open
Abstract
Accurate risk assessment of bone marrow failure syndrome (BMFS) is crucial for early diagnosis and intervention. Interpreting complete blood count (CBC) data is challenging without hematological expertise. To support primary physicians, we developed a predictive model using basic demographics and CBC data collected retrospectively from two major hospitals in South Korea. Binary classifiers for aplastic anemia and myelodysplastic syndrome were created and combined to form a BMFS classifier. The model demonstrated high performance in distinguishing BMFS, with consistent results across different CBC feature sets, confirmed by external validation. This algorithm provides a practical guide for primary physicians to identify BMFS based on initial CBC data, aiding in effective triage, timely referrals, and improved patient care.
Collapse
Affiliation(s)
- Jeongmin Seo
- Department of Internal Medicine, Seoul National University Hospital, Seoul, Republic of Korea
- Department of Internal Medicine, Seoul National University Bundang Hospital, Seongnam-si, Gyeonggi-do, Republic of Korea
| | - Chansub Lee
- NOBO Medicine Inc., Seoul, Republic of Korea
| | - Youngil Koh
- Department of Internal Medicine, Seoul National University Hospital, Seoul, Republic of Korea
- NOBO Medicine Inc., Seoul, Republic of Korea
- Center for Precision Medicine, Seoul National University Hospital, Seoul, Republic of Korea
| | | | - Jong-Mi Lee
- Department of Laboratory Medicine, College of Medicine, The Catholic University of Korea, Seoul, Republic of Korea
- Catholic Genetic Laboratory Centre, Seoul St. Mary’s Hospital, College of Medicine, The Catholic University of Korea, Seoul, Republic of Korea
| | - Hong Yul An
- NOBO Medicine Inc., Seoul, Republic of Korea
| | - Myungshin Kim
- Department of Laboratory Medicine, College of Medicine, The Catholic University of Korea, Seoul, Republic of Korea
- Catholic Genetic Laboratory Centre, Seoul St. Mary’s Hospital, College of Medicine, The Catholic University of Korea, Seoul, Republic of Korea
| |
Collapse
|
4
|
Castro GA, Almeida JM, Machado-Neto JA, Almeida TA. A decision support system to recommend appropriate therapy protocol for AML patients. Front Artif Intell 2024; 7:1343447. [PMID: 38510471 PMCID: PMC10950921 DOI: 10.3389/frai.2024.1343447] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2023] [Accepted: 02/19/2024] [Indexed: 03/22/2024] Open
Abstract
Introduction Acute Myeloid Leukemia (AML) is one of the most aggressive hematological neoplasms, emphasizing the critical need for early detection and strategic treatment planning. The association between prompt intervention and enhanced patient survival rates underscores the pivotal role of therapy decisions. To determine the treatment protocol, specialists heavily rely on prognostic predictions that consider the response to treatment and clinical outcomes. The existing risk classification system categorizes patients into favorable, intermediate, and adverse groups, forming the basis for personalized therapeutic choices. However, accurately assessing the intermediate-risk group poses significant challenges, potentially resulting in treatment delays and deterioration of patient conditions. Methods This study introduces a decision support system leveraging cutting-edge machine learning techniques to address these issues. The system automatically recommends tailored oncology therapy protocols based on outcome predictions. Results The proposed approach achieved a high performance close to 0.9 in F1-Score and AUC. The model generated with gene expression data exhibited superior performance. Discussion Our system can effectively support specialists in making well-informed decisions regarding the most suitable and safe therapy for individual patients. The proposed decision support system has the potential to not only streamline treatment initiation but also contribute to prolonged survival and improved quality of life for individuals diagnosed with AML. This marks a significant stride toward optimizing therapeutic interventions and patient outcomes.
Collapse
Affiliation(s)
- Giovanna A. Castro
- Department of Computer Science, Federal University of São Carlos (UFSCar) Sorocaba, São Paulo, Brazil
| | - Jade M. Almeida
- Department of Computer Science, Federal University of São Carlos (UFSCar) Sorocaba, São Paulo, Brazil
| | - João A. Machado-Neto
- Institute of Biomedical Sciences, The University of São Paulo (USP), São Paulo, Brazil
| | - Tiago A. Almeida
- Department of Computer Science, Federal University of São Carlos (UFSCar) Sorocaba, São Paulo, Brazil
| |
Collapse
|
5
|
Liu L, Li Y, Liu N, Luo J, Deng J, Peng W, Bai Y, Zhang G, Zhao G, Yang N, Li C, Long X. Establishment of machine learning-based tool for early detection of pulmonary embolism. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2024; 244:107977. [PMID: 38113803 DOI: 10.1016/j.cmpb.2023.107977] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/07/2023] [Revised: 09/11/2023] [Accepted: 12/11/2023] [Indexed: 12/21/2023]
Abstract
BACKGROUND AND OBJECTIVES Pulmonary embolism (PE) is a complex disease with high mortality and morbidity rate, leading to increasing society burden. However, current diagnosis is solely based on symptoms and laboratory data despite its complex pathology, which easily leads to misdiagnosis and missed diagnosis by inexperienced doctors. Especially, CT pulmonary angiography, the gold standard method, is not widely available. In this study, we aim to establish a rapid and accurate screening model for pulmonary embolism using machine learning technology. Importantly, data required for disease prediction are easily accessed, including routine laboratory data and medical record information of patients. METHODS We extracted features from patients' routine laboratory results and medical records, including blood routine, biochemical group, blood coagulation routine and other test results, as well as symptoms and medical history information. Samples with a feature loss rate greater than 0.8 were deleted from the original database. Data from 4723 cases were retained, 231 of which were positive for pulmonary embolism. 50 features were retained through the positive and negative statistical hypothesis testing which was used to build the predictive model. In order to avoid identification as majority-class samples caused by the imbalance of sample proportion, we used the method of Synthetic Minority Oversampling Technique (SMOTE) to increase the amount of information on minority samples. Five typical machine learning algorithms were used to model the screening of pulmonary embolism, including Support Vector Machines, Logistic Regression, Random Forest, XGBoost, and Back Propagation Neural Networks. To evaluate model performance, sensitivity, specificity and AUC curve were analyzed as the main evaluation indicators. Furthermore, a baseline model was established using the characteristics of the pulmonary embolism guidelines as a comparison model. RESULTS We found that XGBoost showed better performance compared to other models, with the highest sensitivity and specificity (0.99 and 0.99, respectively). Moreover, it showed significant improvement in performance compared to the baseline model (sensitivity and specificity were 0.76 and 0.76 respectively). More important, our model showed low missed diagnosis rate (0.46) and high AUC value (0.992). Finally, the calculation time of our model is only about 0.05 s to obtain the possibility of pulmonary embolism. CONCLUSIONS In this study, five machine learning classification models were established to assess the likelihood of patients suffering from pulmonary embolism, and the XGBoost model most significantly improved the precision, sensitivity, and AUC for pulmonary embolism screening. Collectively, we have established an AI-based model to accurately predict pulmonary embolism at early stage.
Collapse
Affiliation(s)
- Lijue Liu
- School of Automation, Central South University, Changsha, Hunan 410083, China; Xiangjiang Laboratory, Changsha 410205, China; Hunan Zixing Intelligent Medical Technology Co., Ltd, Changsha, Hunan 410000, China
| | - Yaming Li
- School of Automation, Central South University, Changsha, Hunan 410083, China
| | - Na Liu
- Xiangya Hospital, Central South University, Xiangya Road 87#, Changsha 410008, China
| | - Jingmin Luo
- Xiangya Hospital, Central South University, Xiangya Road 87#, Changsha 410008, China
| | - Jinhai Deng
- Hunan Zixing Intelligent Medical Technology Co., Ltd, Changsha, Hunan 410000, China; Richard Dimbleby Laboratory of Cancer Research, School of Cancer & Pharmaceutical Sciences, King's College London, London SE1 1UL, UK
| | - Weixiong Peng
- Hunan Zixing Intelligent Medical Technology Co., Ltd, Changsha, Hunan 410000, China; Department of Electrical and Electronic Engineering, College of Engineering, Southern University of Science and Technology (SUSTech), Shenzhen, Guangdong 518055, China
| | - Yongping Bai
- Xiangya Hospital, Central South University, Xiangya Road 87#, Changsha 410008, China
| | - Guogang Zhang
- Department of Cardiovascular Medicine, The Third Xiangya Hospital, Central South University, Tongzipo Road 138#, Changsha 410008,China.
| | - Guihu Zhao
- Xiangya Hospital, Central South University, Xiangya Road 87#, Changsha 410008, China
| | - Ning Yang
- Xiangya Hospital, Central South University, Xiangya Road 87#, Changsha 410008, China
| | - Chuanchang Li
- Xiangya Hospital, Central South University, Xiangya Road 87#, Changsha 410008, China
| | - Xueying Long
- Xiangya Hospital, Central South University, Xiangya Road 87#, Changsha 410008, China
| |
Collapse
|
6
|
Kewan T, Durmaz A, Bahaj W, Gurnari C, Terkawi L, Awada H, Ogbue OD, Ahmed R, Pagliuca S, Awada H, Kubota Y, Mori M, Ponvilawan B, Al-Share B, Patel BJ, Carraway HE, Scott J, Balasubramanian SK, Bat T, Madanat Y, Sekeres MA, Haferlach T, Visconte V, Maciejewski JP. Molecular patterns identify distinct subclasses of myeloid neoplasia. Nat Commun 2023; 14:3136. [PMID: 37253784 PMCID: PMC10229666 DOI: 10.1038/s41467-023-38515-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2022] [Accepted: 05/03/2023] [Indexed: 06/01/2023] Open
Abstract
Genomic mutations drive the pathogenesis of myelodysplastic syndromes and acute myeloid leukemia. While morphological and clinical features have dominated the classical criteria for diagnosis and classification, incorporation of molecular data can illuminate functional pathobiology. Here we show that unsupervised machine learning can identify functional objective molecular clusters, irrespective of anamnestic clinico-morphological features, despite the complexity of the molecular alterations in myeloid neoplasia. Our approach reflects disease evolution, informed classification, prognostication, and molecular interactions. We apply machine learning methods on 3588 patients with myelodysplastic syndromes and secondary acute myeloid leukemia to identify 14 molecularly distinct clusters. Remarkably, our model shows clinical implications in terms of overall survival and response to treatment even after adjusting to the molecular international prognostic scoring system (IPSS-M). In addition, the model is validated on an external cohort of 412 patients. Our subclassification model is available via a web-based open-access resource ( https://drmz.shinyapps.io/mds_latent ).
Collapse
Affiliation(s)
- Tariq Kewan
- Department of Translational Hematology and Oncology Research, Taussig Cancer Institute, Cleveland Clinic, Cleveland, OH, USA.
- Department of Hematology and Medical Oncology, Yale University, New Haven, CT, USA.
| | - Arda Durmaz
- Department of Translational Hematology and Oncology Research, Taussig Cancer Institute, Cleveland Clinic, Cleveland, OH, USA
- Systems Biology and Bioinformatics Department, School of Medicine, Case Western Reserve University, Cleveland, OH, USA
| | - Waled Bahaj
- Department of Translational Hematology and Oncology Research, Taussig Cancer Institute, Cleveland Clinic, Cleveland, OH, USA
| | - Carmelo Gurnari
- Department of Translational Hematology and Oncology Research, Taussig Cancer Institute, Cleveland Clinic, Cleveland, OH, USA
- Department of Biomedicine and Prevention, Ph.D. in Immunology, Molecular Medicine and Applied Biotechnology, University of Rome Tor Vergata, Rome, Italy
| | - Laila Terkawi
- Department of Translational Hematology and Oncology Research, Taussig Cancer Institute, Cleveland Clinic, Cleveland, OH, USA
| | - Hussein Awada
- Department of Translational Hematology and Oncology Research, Taussig Cancer Institute, Cleveland Clinic, Cleveland, OH, USA
| | - Olisaemeka D Ogbue
- Department of Translational Hematology and Oncology Research, Taussig Cancer Institute, Cleveland Clinic, Cleveland, OH, USA
| | - Ramsha Ahmed
- Department of Translational Hematology and Oncology Research, Taussig Cancer Institute, Cleveland Clinic, Cleveland, OH, USA
| | - Simona Pagliuca
- Department of Translational Hematology and Oncology Research, Taussig Cancer Institute, Cleveland Clinic, Cleveland, OH, USA
- Department of Clinical Hematology, CHRU de Nancy, Nancy, France
| | - Hassan Awada
- Roswell Park Comprehensive Cancer Center, Buffalo, NY, USA
| | - Yasuo Kubota
- Department of Translational Hematology and Oncology Research, Taussig Cancer Institute, Cleveland Clinic, Cleveland, OH, USA
| | - Minako Mori
- Department of Translational Hematology and Oncology Research, Taussig Cancer Institute, Cleveland Clinic, Cleveland, OH, USA
| | - Ben Ponvilawan
- Department of Translational Hematology and Oncology Research, Taussig Cancer Institute, Cleveland Clinic, Cleveland, OH, USA
| | - Bayan Al-Share
- Department of Hematology and Oncology, Karmanos Cancer Institute, Wayne State University, Detroit, MI, USA
| | - Bhumika J Patel
- Department of Translational Hematology and Oncology Research, Taussig Cancer Institute, Cleveland Clinic, Cleveland, OH, USA
| | - Hetty E Carraway
- Department of Translational Hematology and Oncology Research, Taussig Cancer Institute, Cleveland Clinic, Cleveland, OH, USA
| | - Jacob Scott
- Department of Translational Hematology and Oncology Research, Taussig Cancer Institute, Cleveland Clinic, Cleveland, OH, USA
- Systems Biology and Bioinformatics Department, School of Medicine, Case Western Reserve University, Cleveland, OH, USA
| | - Suresh K Balasubramanian
- Department of Hematology and Oncology, Karmanos Cancer Institute, Wayne State University, Detroit, MI, USA
| | - Taha Bat
- Department of Internal Medicine, Division of Hematology and Oncology, University of Texas Southwestern Medical Center, Dallas, TX, USA
| | - Yazan Madanat
- Department of Internal Medicine, Division of Hematology and Oncology, University of Texas Southwestern Medical Center, Dallas, TX, USA
| | - Mikkael A Sekeres
- Division of Hematology, Sylvester Cancer Center, University of Miami, Miami, FL, USA
| | | | - Valeria Visconte
- Department of Translational Hematology and Oncology Research, Taussig Cancer Institute, Cleveland Clinic, Cleveland, OH, USA.
| | - Jaroslaw P Maciejewski
- Department of Translational Hematology and Oncology Research, Taussig Cancer Institute, Cleveland Clinic, Cleveland, OH, USA.
| |
Collapse
|
7
|
Weeks LD, Niroula A, Neuberg D, Wong W, Lindsley RC, Luskin M, Berliner N, Stone RM, DeAngelo DJ, Soiffer R, Uddin MM, Griffin G, Vlasschaert C, Gibson CJ, Jaiswal S, Bick AG, Malcovati L, Natarajan P, Ebert BL. Prediction of risk for myeloid malignancy in clonal hematopoiesis. NEJM EVIDENCE 2023; 2:10.1056/evidoa2200310. [PMID: 37483562 PMCID: PMC10361696 DOI: 10.1056/evidoa2200310] [Citation(s) in RCA: 113] [Impact Index Per Article: 56.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/25/2023]
Abstract
Background Clonal hematopoiesis of indeterminate potential (CHIP) and clonal cytopenia of undetermined significance (CCUS) are defined by somatic mutations in genes associated with myeloid neoplasms (MN) at a variant allele fraction (VAF) ≥ 0.02, in the absence and presence of cytopenia, respectively. CHIP/CCUS is highly prevalent in adults and defining predictors of MN risk would aid clinical management and research. Methods We analyzed sequenced exomes of healthy UK Biobank (UKB) participants (n = 438,890) in separate derivation and validation cohorts. Genetic mutations, laboratory values, and MN outcomes were used in conditional probability-based recursive partitioning and Cox regression to determine predictors of incident MN. Combined statistical weights defined a clonal hematopoiesis risk score (CHRS). Independent CHIP/CCUS patient cohorts were used to test prognostic capability of the CHRS in the clinical setting. Results Recursive partitioning distinguished CHIP/CCUS cases with 10-year probabilities of MN ranging from 0.0078 - 0.85. Multivariable analysis validated partitioning variables as predictors of MN. Key features, including single DNMT3A mutations, high risk mutations, ≥ 2 mutations, VAF ≥ 0.2, age ≥ 65 years, CCUS vs CHIP and red blood cell indices, influenced MN risk in variable direction. The CHRS defined low risk (n = 10018, 88.4%), intermediate risk (n = 1196, 10.5%), and high risk (n = 123, 1.1%) groups. In clinical cohorts, most MN events occurred in high risk CHIP/CCUS patients. Conclusions The CHRS provides simple prognostic framework for CHIP/CCUS, distinguishing a high risk minority from the majority of CHIP/CCUS which has minimal risk for progression to MN.
Collapse
Affiliation(s)
- Lachelle D. Weeks
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA
- Department of Medicine, Harvard Medical School, Boston, MA
- Center for Prevention of Progression, Dana-Farber Cancer Institute, Boston, MA
| | - Abhishek Niroula
- Broad Institute of MIT and Harvard University, Cambridge, MA
- Department of Lab Medicine, Lund University, Lund, Sweden
| | - Donna Neuberg
- Department of Data Science, Dana-Farber Cancer Institute, Boston, MA
| | - Waihay Wong
- Department of Pathology, Brigham and Women’s Hospital, Boston, MA
| | - R. Coleman Lindsley
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA
- Department of Medicine, Harvard Medical School, Boston, MA
| | - Marlise Luskin
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA
- Department of Medicine, Harvard Medical School, Boston, MA
| | - Nancy Berliner
- Department of Medicine, Harvard Medical School, Boston, MA
- Department of Hematology, Brigham and Women’s Hospital, Boston, MA
| | - Richard M. Stone
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA
- Department of Medicine, Harvard Medical School, Boston, MA
| | - Daniel J. DeAngelo
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA
- Department of Medicine, Harvard Medical School, Boston, MA
| | - Robert Soiffer
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA
- Department of Medicine, Harvard Medical School, Boston, MA
- Center for Prevention of Progression, Dana-Farber Cancer Institute, Boston, MA
| | - Md Mesbah Uddin
- Cardiovascular Research Center, Massachusetts General Hospital, 185 Cambridge Street, CPZN 3.184, Boston, MA
- Program in Medical and Population Genetics and Cardiovascular Disease Initiative, Broad Institute of Harvard and MIT, Cambridge, MA
| | - Gabriel Griffin
- Department of Pathology, Brigham and Women’s Hospital, Boston, MA
- Department of Pathology, Dana-Farber Cancer Institute, Boston, MA
| | | | - Christopher J. Gibson
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA
- Department of Medicine, Harvard Medical School, Boston, MA
| | | | - Alexander G. Bick
- Division of Genetic Medicine, Department of Medicine, Vanderbilt University, Nashville, TN
| | - Luca Malcovati
- Department of Molecular Medicine, University of Pavia, Pavia, Italy
- Department of Hematology Oncology, Istituto di Ricovero e Cura a Carattere Scientifico (IRCCS) Policlinico San Matteo Foundation, Pavia, Italy
| | - Pradeep Natarajan
- Department of Medicine, Harvard Medical School, Boston, MA
- Cardiovascular Research Center, Massachusetts General Hospital, 185 Cambridge Street, CPZN 3.184, Boston, MA
- Program in Medical and Population Genetics and Cardiovascular Disease Initiative, Broad Institute of Harvard and MIT, Cambridge, MA
| | - Benjamin L. Ebert
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA
- Department of Medicine, Harvard Medical School, Boston, MA
- Center for Prevention of Progression, Dana-Farber Cancer Institute, Boston, MA
- Broad Institute of MIT and Harvard University, Cambridge, MA
- Howard Hughes Medical Institute, Boston, MA
| |
Collapse
|
8
|
Automated Bone Marrow Cell Classification for Haematological Disease Diagnosis Using Siamese Neural Network. Diagnostics (Basel) 2022; 13:diagnostics13010112. [PMID: 36611404 PMCID: PMC9818919 DOI: 10.3390/diagnostics13010112] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2022] [Revised: 12/10/2022] [Accepted: 12/13/2022] [Indexed: 12/31/2022] Open
Abstract
The critical structure and nature of different bone marrow cells which form a base in the diagnosis of haematological ailments requires a high-grade classification which is a very prolonged approach and accounts for human error if performed manually, even by field experts. Therefore, the aim of this research is to automate the process to study and accurately classify the structure of bone marrow cells which will help in the diagnosis of haematological ailments at a much faster and better rate. Various machine learning algorithms and models, such as CNN + SVM, CNN + XGB Boost and Siamese network, were trained and tested across a dataset of 170,000 expert-annotated cell images from 945 patients' bone marrow smears with haematological disorders. The metrics used for evaluation of this research are accuracy of model, precision and recall of all the different classes of cells. Based on these performance metrics the CNN + SVM, CNN + XGB, resulted in 32%, 28% accuracy, respectively, and therefore these models were discarded. Siamese neural resulted in 91% accuracy and 84% validation accuracy. Moreover, the weighted average recall values of the Siamese neural network were 92% for training and 91% for validation. Hence, the final results are based on Siamese neural network model as it was outperforming all the other algorithms used in this research.
Collapse
|
9
|
Shen J, Casie Chetty S, Shokouhi S, Maharjan J, Chuba Y, Calvert J, Mao Q. Massive external validation of a machine learning algorithm to predict pulmonary embolism in hospitalized patients. Thromb Res 2022; 216:14-21. [DOI: 10.1016/j.thromres.2022.05.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2022] [Revised: 05/26/2022] [Accepted: 05/30/2022] [Indexed: 10/18/2022]
|
10
|
Awada H, Gurnari C, Durmaz A, Awada H, Pagliuca S, Visconte V. Personalized Risk Schemes and Machine Learning to Empower Genomic Prognostication Models in Myelodysplastic Syndromes. Int J Mol Sci 2022; 23:2802. [PMID: 35269943 PMCID: PMC8911403 DOI: 10.3390/ijms23052802] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2022] [Revised: 02/28/2022] [Accepted: 03/01/2022] [Indexed: 02/04/2023] Open
Abstract
Myelodysplastic syndromes (MDS) are characterized by variable clinical manifestations and outcomes. Several prognostic systems relying on clinical factors and cytogenetic abnormalities have been developed to help stratify MDS patients into different risk categories of distinct prognoses and therapeutic implications. The current abundance of molecular information poses the challenges of precisely defining patients' molecular profiles and their incorporation in clinically established diagnostic and prognostic schemes. Perhaps the prognostic power of the current systems can be boosted by incorporating molecular features. Machine learning (ML) algorithms can be helpful in developing more precise prognostication models that integrate complex genomic interactions at a higher dimensional level. These techniques can potentially generate automated diagnostic and prognostic models and assist in advancing personalized therapies. This review highlights the current prognostication models used in MDS while shedding light on the latest achievements in ML-based research.
Collapse
Affiliation(s)
- Hussein Awada
- Department of Translational Hematology and Oncology Research, Taussig Cancer Institute, Cleveland Clinic, Cleveland, OH 44195, USA; (H.A.); (C.G.); (A.D.); (S.P.)
| | - Carmelo Gurnari
- Department of Translational Hematology and Oncology Research, Taussig Cancer Institute, Cleveland Clinic, Cleveland, OH 44195, USA; (H.A.); (C.G.); (A.D.); (S.P.)
- Department of Biomedicine and Prevention, University of Rome Tor Vergata, 00133 Rome, Italy
| | - Arda Durmaz
- Department of Translational Hematology and Oncology Research, Taussig Cancer Institute, Cleveland Clinic, Cleveland, OH 44195, USA; (H.A.); (C.G.); (A.D.); (S.P.)
| | - Hassan Awada
- Roswell Park Comprehensive Cancer Center, Buffalo, NY 14203, USA;
| | - Simona Pagliuca
- Department of Translational Hematology and Oncology Research, Taussig Cancer Institute, Cleveland Clinic, Cleveland, OH 44195, USA; (H.A.); (C.G.); (A.D.); (S.P.)
- Department of Clinical Hematology, CHRU Nancy, CEDEX, 54035 Nancy, France
| | - Valeria Visconte
- Department of Translational Hematology and Oncology Research, Taussig Cancer Institute, Cleveland Clinic, Cleveland, OH 44195, USA; (H.A.); (C.G.); (A.D.); (S.P.)
| |
Collapse
|
11
|
A comparative analysis of machine learning approaches to predict C. difficile infection in hospitalized patients. Am J Infect Control 2022; 50:250-257. [PMID: 35067382 DOI: 10.1016/j.ajic.2021.11.012] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2021] [Revised: 11/03/2021] [Accepted: 11/06/2021] [Indexed: 11/24/2022]
Abstract
BACKGROUND Interventions to better prevent or manage Clostridioides difficile infection (CDI) may significantly reduce morbidity, mortality, and healthcare spending. METHODS We present a retrospective study using electronic health record data from over 700 United States hospitals. A subset of hospitals was used to develop machine learning algorithms (MLAs); the remaining hospitals served as an external test set. Three MLAs were evaluated: gradient-boosted decision trees (XGBoost), Deep Long Short Term Memory neural network, and one-dimensional convolutional neural network. MLA performance was evaluated with area under the receiver operating characteristic curve (AUROC), sensitivity, specificity, diagnostic odds ratios and likelihood ratios. RESULTS The development dataset contained 13,664,840 inpatient encounters with 80,046 CDI encounters; the external dataset contained 1,149,088 inpatient encounters with 7,107 CDI encounters. The highest AUROCs were achieved for XGB, Deep Long Short Term Memory neural network, and one-dimensional convolutional neural network via abstaining from use of specialized training techniques, resampling in isolation, and resampling and output bias in combination, respectively. XGBoost achieved the highest AUROC. CONCLUSIONS MLAs can predict future CDI in hospitalized patients using just 6 hours of data. In clinical practice, a machine-learning based tool may support prophylactic measures, earlier diagnosis, and more timely implementation of infection control measures.
Collapse
|
12
|
Thapa R, Garikipati A, Shokouhi S, Hurtado M, Barnes G, Hoffman J, Calvert J, Katzmann L, Mao Q, Das R. Usability of Electronic Health records in Predicting Short-term falls: Machine learning Applications in Senior Care Facilities (Preprint). JMIR Aging 2021; 5:e35373. [PMID: 35363146 PMCID: PMC9015781 DOI: 10.2196/35373] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2021] [Revised: 01/16/2022] [Accepted: 02/07/2022] [Indexed: 11/23/2022] Open
Abstract
Background Short-term fall prediction models that use electronic health records (EHRs) may enable the implementation of dynamic care practices that specifically address changes in individualized fall risk within senior care facilities. Objective The aim of this study is to implement machine learning (ML) algorithms that use EHR data to predict a 3-month fall risk in residents from a variety of senior care facilities providing different levels of care. Methods This retrospective study obtained EHR data (2007-2021) from Juniper Communities’ proprietary database of 2785 individuals primarily residing in skilled nursing facilities, independent living facilities, and assisted living facilities across the United States. We assessed the performance of 3 ML-based fall prediction models and the Juniper Communities’ fall risk assessment. Additional analyses were conducted to examine how changes in the input features, training data sets, and prediction windows affected the performance of these models. Results The Extreme Gradient Boosting model exhibited the highest performance, with an area under the receiver operating characteristic curve of 0.846 (95% CI 0.794-0.894), specificity of 0.848, diagnostic odds ratio of 13.40, and sensitivity of 0.706, while achieving the best trade-off in balancing true positive and negative rates. The number of active medications was the most significant feature associated with fall risk, followed by a resident’s number of active diseases and several variables associated with vital signs, including diastolic blood pressure and changes in weight and respiratory rates. The combination of vital signs with traditional risk factors as input features achieved higher prediction accuracy than using either group of features alone. Conclusions This study shows that the Extreme Gradient Boosting technique can use a large number of features from EHR data to make short-term fall predictions with a better performance than that of conventional fall risk assessments and other ML models. The integration of routinely collected EHR data, particularly vital signs, into fall prediction models may generate more accurate fall risk surveillance than models without vital signs. Our data support the use of ML models for dynamic, cost-effective, and automated fall predictions in different types of senior care facilities.
Collapse
|