1
|
Laffafchi S, Ebrahimi A, Kafan S. Efficient management of pulmonary embolism diagnosis using a two-step interconnected machine learning model based on electronic health records data. Health Inf Sci Syst 2024; 12:17. [PMID: 38464464 PMCID: PMC10917730 DOI: 10.1007/s13755-024-00276-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Accepted: 01/17/2024] [Indexed: 03/12/2024] Open
Abstract
Pulmonary Embolism (PE) is a life-threatening clinical disease with no specific clinical symptoms and Computed Tomography Angiography (CTA) is used for diagnosis. Clinical decision support scoring systems like Wells and rGeneva based on PE risk factors have been developed to estimate the pre-test probability but are underused, leading to continuous overuse of CTA imaging. This diagnostic study aimed to propose a novel approach for efficient management of PE diagnosis using a two-step interconnected machine learning framework directly by analyzing patients' Electronic Health Records data. First, we performed feature importance analysis according to the result of LightGBM superiority for PE prediction, then four state-of-the-art machine learning methods were applied for PE prediction based on the feature importance results, enabling swift and accurate pre-test diagnosis. Throughout the study patients' data from different departments were collected from Sina educational hospital, affiliated with the Tehran University of medical sciences in Iran. Generally, the Ridge classification method obtained the best performance with an F1 score of 0.96. Extensive experimental findings showed the effectiveness and simplicity of this diagnostic process of PE in comparison with the existing scoring systems. The main strength of this approach centered on PE disease management procedures, which would reduce avoidable invasive CTA imaging and be applied as a primary prognosis of PE, hence assisting the healthcare system, clinicians, and patients by reducing costs and promoting treatment quality and patient satisfaction.
Collapse
Affiliation(s)
- Soroor Laffafchi
- Department of Business Administration and Entrepreneurship, Faculty of Management and Economics, Science and Research Branch, Islamic Azad University, Daneshgah Blvd, Simon Bulivar Blvd, Tehran, Iran
| | - Ahmad Ebrahimi
- Department of Industrial and Technology Management, Faculty of Management and Economics, Science and Research Branch, Islamic Azad University, Daneshgah Blvd, Simon Bulivar Blvd, Tehran, Iran
| | - Samira Kafan
- Department of Pulmonary Medicine, Sina Hospital, International Relations Office, Medical School, Tehran University of Medical Sciences, PourSina St., Tehran, 1417613151 Iran
| |
Collapse
|
2
|
Gómez-Valadés A, Martínez-Tomás R, García-Herranz S, Bjørnerud A, Rincón M. Early detection of mild cognitive impairment through neuropsychological tests in population screenings: a decision support system integrating ontologies and machine learning. Front Neuroinform 2024; 18:1378281. [PMID: 39478874 PMCID: PMC11522961 DOI: 10.3389/fninf.2024.1378281] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2024] [Accepted: 10/04/2024] [Indexed: 11/02/2024] Open
Abstract
Machine learning (ML) methodologies for detecting Mild Cognitive Impairment (MCI) are progressively gaining prevalence to manage the vast volume of processed information. Nevertheless, the black-box nature of ML algorithms and the heterogeneity within the data may result in varied interpretations across distinct studies. To avoid this, in this proposal, we present the design of a decision support system that integrates a machine learning model represented using the Semantic Web Rule Language (SWRL) in an ontology with specialized knowledge in neuropsychological tests, the NIO ontology. The system's ability to detect MCI subjects was evaluated on a database of 520 neuropsychological assessments conducted in Spanish and compared with other well-established ML methods. Using the F2 coefficient to minimize false negatives, results indicate that the system performs similarly to other well-established ML methods (F2TE2 = 0.830, only below bagging, F2BAG = 0.832) while exhibiting other significant attributes such as explanation capability and data standardization to a common framework thanks to the ontological part. On the other hand, the system's versatility and ease of use were demonstrated with three additional use cases: evaluation of new cases even if the acquisition stage is incomplete (the case records have missing values), incorporation of a new database into the integrated system, and use of the ontology capabilities to relate different domains. This makes it a useful tool to support physicians and neuropsychologists in population-based screenings for early detection of MCI.
Collapse
Affiliation(s)
- Alba Gómez-Valadés
- Department of Artificial Intelligence, Universidad Nacional de Educación a Distancia (UNED), Madrid, Spain
| | - Rafael Martínez-Tomás
- Department of Artificial Intelligence, Universidad Nacional de Educación a Distancia (UNED), Madrid, Spain
| | | | - Atle Bjørnerud
- Computational Radiology and Artificial Intelligence Unit, Department of Physics and Computational Radiology, Clinic for Radiology and Nuclear Medicine, Oslo University Hospital, Oslo, Norway
- Department of Physics, University of Oslo, Oslo, Norway
| | - Mariano Rincón
- Department of Artificial Intelligence, Universidad Nacional de Educación a Distancia (UNED), Madrid, Spain
| |
Collapse
|
3
|
Hao T, Zhang M, Song Z, Gou Y, Wang B, Sun J. Reconstruction of Eriocheir sinensis Protein-Protein Interaction Network Based on DGO-SVM Method. Curr Issues Mol Biol 2024; 46:7353-7372. [PMID: 39057077 PMCID: PMC11276262 DOI: 10.3390/cimb46070436] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2024] [Revised: 06/25/2024] [Accepted: 07/10/2024] [Indexed: 07/28/2024] Open
Abstract
Eriocheir sinensis is an economically important aquatic animal. Its regulatory mechanisms underlying many biological processes are still vague due to the lack of systematic analysis tools. The protein-protein interaction network (PIN) is an important tool for the systematic analysis of regulatory mechanisms. In this work, a novel machine learning method, DGO-SVM, was applied to predict the protein-protein interaction (PPI) in E. sinensis, and its PIN was reconstructed. With the domain, biological process, molecular functions and subcellular locations of proteins as the features, DGO-SVM showed excellent performance in Bombyx mori, humans and five aquatic crustaceans, with 92-96% accuracy. With DGO-SVM, the PIN of E. sinensis was reconstructed, containing 14,703 proteins and 7,243,597 interactions, in which 35,604 interactions were associated with 566 novel proteins mainly involved in the response to exogenous stimuli, cellular macromolecular metabolism and regulation. The DGO-SVM demonstrated that the biological process, molecular functions and subcellular locations of proteins are significant factors for the precise prediction of PPIs. We reconstructed the largest PIN for E. sinensis, which provides a systematic tool for the regulatory mechanism analysis. Furthermore, the novel-protein-related PPIs in the PIN may provide important clues for the mechanism analysis of the underlying specific physiological processes in E. sinensis.
Collapse
Affiliation(s)
| | | | | | | | - Bin Wang
- Tianjin Key Laboratory of Animal and Plant Resistance, College of Life Sciences, Tianjin Normal University, Tianjin 300387, China; (T.H.); (M.Z.); (Z.S.); (Y.G.)
| | - Jinsheng Sun
- Tianjin Key Laboratory of Animal and Plant Resistance, College of Life Sciences, Tianjin Normal University, Tianjin 300387, China; (T.H.); (M.Z.); (Z.S.); (Y.G.)
| |
Collapse
|
4
|
Lin RZ, Amith MT, Wang CX, Strickley J, Tao C. Dermoscopy Differential Diagnosis Explorer (D3X) Ontology to Aggregate and Link Dermoscopic Patterns to Differential Diagnoses: Development and Usability Study. JMIR Med Inform 2024; 12:e49613. [PMID: 38904996 PMCID: PMC11226929 DOI: 10.2196/49613] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2023] [Revised: 04/18/2024] [Accepted: 05/04/2024] [Indexed: 06/22/2024] Open
Abstract
BACKGROUND Dermoscopy is a growing field that uses microscopy to allow dermatologists and primary care physicians to identify skin lesions. For a given skin lesion, a wide variety of differential diagnoses exist, which may be challenging for inexperienced users to name and understand. OBJECTIVE In this study, we describe the creation of the dermoscopy differential diagnosis explorer (D3X), an ontology linking dermoscopic patterns to differential diagnoses. METHODS Existing ontologies that were incorporated into D3X include the elements of visuals ontology and dermoscopy elements of visuals ontology, which connect visual features to dermoscopic patterns. A list of differential diagnoses for each pattern was generated from the literature and in consultation with domain experts. Open-source images were incorporated from DermNet, Dermoscopedia, and open-access research papers. RESULTS D3X was encoded in the OWL 2 web ontology language and includes 3041 logical axioms, 1519 classes, 103 object properties, and 20 data properties. We compared D3X with publicly available ontologies in the dermatology domain using a semiotic theory-driven metric to measure the innate qualities of D3X with others. The results indicate that D3X is adequately comparable with other ontologies of the dermatology domain. CONCLUSIONS The D3X ontology is a resource that can link and integrate dermoscopic differential diagnoses and supplementary information with existing ontology-based resources. Future directions include developing a web application based on D3X for dermoscopy education and clinical practice.
Collapse
Affiliation(s)
- Rebecca Z Lin
- Division of Dermatology, Washington University School of Medicine, St. Louis, MO, United States
| | - Muhammad Tuan Amith
- Department of Information Science, University of North Texas, Denton, TX, United States
- Department of Biostatistics and Data Science, The University of Texas Medical Branch, Galveston, TX, United States
- Department of Internal Medicine, The University of Texas Medical Branch, Galveston, TX, United States
| | - Cynthia X Wang
- Department of Dermatology, Kaiser Permanente Redwood City Medical Center, Redwood City, CA, United States
| | - John Strickley
- Division of Dermatology, University of Louisville, Louisville, KY, United States
| | - Cui Tao
- Department of Artificial Intelligence and Informatics, Mayo Clinic, Jacksonville, FL, United States
| |
Collapse
|
5
|
Jandy K, Weichbroth P. A machine learning approach to classifying New York Heart Association (NYHA) heart failure. Sci Rep 2024; 14:11496. [PMID: 38769444 PMCID: PMC11106248 DOI: 10.1038/s41598-024-62555-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2024] [Accepted: 05/18/2024] [Indexed: 05/22/2024] Open
Abstract
According to the European Society of Cardiology, globally the number of patients with heart failure nearly doubled from 33.5 million in 1990 to 64.3 million in 2017, and is further projected to increase dramatically in this decade, still remaining a leading cause of morbidity and mortality. One of the most frequently applied heart failure classification systems that physicians use is the New York Heart Association (NYHA) Functional Classification. Each NYHA class describes a patient's symptoms while performing physical activities, delivering a strong indicator of the heart performance. In each case, a NYHA class is individually determined routinely based on the subjective assessment of the treating physician. However, such diagnosis can suffer from bias, eventually affecting a valid assessment. To tackle this issue, we take advantage of the machine learning approach to develop a decision-tree, along with a set of decision rules, which can serve as additional blinded investigator tool to make unbiased assessment. On a dataset containing 434 observations, the supervised learning approach was initially employed to train a Decision Tree model. In the subsequent phase, ensemble learning techniques were utilized to develop both the Voting Classifier and the Random Forest model. The performance of all models was assessed using 10-fold cross-validation with stratification.The Decision Tree, Random Forest, and Voting Classifier models reported accuracies of 76.28%, 96.77%, and 99.54% respectively. The Voting Classifier led in classifying NYHA I and III with 98.7% and 100% accuracy. Both Random Forest and Voting Classifier flawlessly classified NYHA II at 100%. However, for NYHA IV, Random Forest achieved a perfect score, while the Voting Classifier reported 90%. The Decision Tree showed the least effectiveness among all the models tested. In our opinion, the results seem satisfactory in terms of their supporting role in clinical practice. In particular, the use of a machine learning tool could reduce or even eliminate the bias in the physician's assessment. In addition, future research should consider testing other variables in different datasets to gain a better understanding of the significant factors affecting heart failure.
Collapse
Affiliation(s)
| | - Pawel Weichbroth
- Department of Software Engineering, Faculty of Electronics, Telecommunications and Informatics, Gdansk University of Technology, Gdańsk, Poland.
| |
Collapse
|
6
|
Spoladore D, Tosi M, Lorenzini EC. Ontology-based decision support systems for diabetes nutrition therapy: A systematic literature review. Artif Intell Med 2024; 151:102859. [PMID: 38564880 DOI: 10.1016/j.artmed.2024.102859] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2023] [Revised: 02/05/2024] [Accepted: 03/25/2024] [Indexed: 04/04/2024]
Abstract
Diabetes is a non-communicable disease that has reached epidemic proportions, affecting 537 million people globally. Artificial Intelligence can support patients or clinicians in diabetes nutrition therapy - the first medical therapy in most cases of Type 1 and Type 2 diabetes. In particular, ontology-based recommender and decision support systems can deliver a computable representation of experts' knowledge, thus delivering patient-tailored nutritional recommendations or supporting clinical personnel in identifying the most suitable diet. This work proposes a systematic literature review of the domain ontologies describing diabetes in such systems, identifying their underlying conceptualizations, the users targeted by the systems, the type(s) of diabetes tackled, and the nutritional recommendations provided. This review also delves into the structure of the domain ontologies, highlighting several aspects that may hinder (or foster) their adoption in recommender and decision support systems for diabetes nutrition therapy. The results of this review process allow to underline how recommendations are formulated and the role of clinical experts in developing domain ontologies, outlining the research trends characterizing this research area. The results also allow for identifying research directions that can foster a preeminent role for clinical experts and clinical guidelines in a cooperative effort to make ontologies more interoperable - thus enabling them to play a significant role in the decision-making processes about diabetes nutrition therapy.
Collapse
Affiliation(s)
- Daniele Spoladore
- Institute of Intelligent Industrial Technologies and Systems for Advanced Manufacturing - National Research Council, (CNR-STIIMA), Lecco, Italy.
| | - Martina Tosi
- Department of Health Sciences, University of Milan, 20142 Milan, Italy; Institute of Agricultural Biology and Biotechnology - National Research Council (CNR-IBBA), Milan, Italy.
| | - Erna Cecilia Lorenzini
- Department of Biomedical Sciences for Health, University of Milan, I-20133 Milan, Italy.
| |
Collapse
|
7
|
Lee VK, Wallace J, Meyers B, Racki A, Shah A, Beluk NH, Cabral L, Beers S, Badaly D, Lo C, Panigrahy A, Ceschin R. Cerebral Spinal Fluid Volumetrics and Paralimbic Predictors of Executive Dysfunction in Congenital Heart Disease: A Machine Learning Approach Informing Mechanistic Insights. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.10.16.23297055. [PMID: 37905005 PMCID: PMC10615017 DOI: 10.1101/2023.10.16.23297055] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/02/2023]
Abstract
The relationship between increased cerebral spinal fluid (CSF) ventricular compartments, structural and microstructural dysmaturation, and executive function in patients with congenital heart disease (CHD) is unknown. Here, we leverage a novel machine-learning data-driven technique to delineate interrelationships between CSF ventricular volume, structural and microstructural alterations, clinical risk factors, and sub-domains of executive dysfunction in adolescent CHD patients. We trained random forest regression models to predict measures of executive function (EF) from the NIH Toolbox, the Delis-Kaplan Executive Function System (D-KEFS), and the Behavior Rating Inventory of Executive Function (BRIEF) and across three subdomains of EF - mental flexibility, working memory, and inhibition. We estimated the best parameters for the random forest algorithm via a randomized grid search of parameters using 10-fold cross-validation on the training set only. The best parameters were then used to fit the model on the full training set and validated on the test set. Algorithm performance was measured using root-mean squared-error (RMSE). As predictors, we included patient clinical variables, perioperative clinical measures, microstructural white matter (diffusion tensor imaging- DTI), and structural volumes (volumetric magnetic resonance imaging- MRI). Structural white matter was measured using along-tract diffusivity measures of 13 inter-hemispheric and cortico-association fibers. Structural volumes were measured using FreeSurfer and manual segmentation of key structures. Variable importance was measured by the average Gini-impurity of each feature across all decision trees in which that feature is present in the model, and functional ontology mapping (FOM) was used to measure the degree of overlap in feature importance for each EF subdomain and across subdomains. We found that CSF structural properties (including increased lateral ventricular volume and reduced choroid plexus volumes) in conjunction with proximate cortical projection and paralimbic-related association white matter tracts that straddle the lateral ventricles and distal paralimbic-related subcortical structures (basal ganglia, hippocampus, cerebellum) are predictive of two-specific subdomains of executive dysfunction in CHD patients: cognitive flexibility and inhibition. These findings in conjunction with combined RF models that incorporated clinical risk factors, highlighted important clinical risk factors, including the presence of microbleeds, altered vessel volume, and delayed PDA closure, suggesting that CSF-interstitial fluid clearance, vascular pulsatility, and glymphatic microfluid dynamics may be pathways that are impaired in CHD, providing mechanistic information about the relationship between CSF and executive dysfunction.
Collapse
Affiliation(s)
- Vince K. Lee
- Department of Radiology, University of Pittsburgh School of Medicine
- Department of Bioengineering, University of Pittsburgh School of Medicine
| | - Julia Wallace
- Department of Radiology, University of Pittsburgh School of Medicine
| | - Benjamin Meyers
- Department of Radiology, University of Pittsburgh School of Medicine
| | - Adriana Racki
- Department of Radiology, University of Pittsburgh School of Medicine
| | - Anushka Shah
- Department of Radiology, University of Pittsburgh School of Medicine
| | - Nancy H. Beluk
- Department of Radiology, University of Pittsburgh School of Medicine
| | - Laura Cabral
- Department of Radiology, University of Pittsburgh School of Medicine
- Department of Biomedical Informatics, University of Pittsburgh
| | - Sue Beers
- Department of Psychiatry, University of Pittsburgh Medical Center
- Department of Psychiatry, University of Pittsburgh School of Medicine
| | | | - Cecilia Lo
- Department of Developmental Biology, University of Pittsburgh School of Medicine
| | - Ashok Panigrahy
- Department of Radiology, University of Pittsburgh School of Medicine
- Department of Biomedical Informatics, University of Pittsburgh
| | - Rafael Ceschin
- Department of Radiology, University of Pittsburgh School of Medicine
- Department of Biomedical Informatics, University of Pittsburgh
| |
Collapse
|
8
|
Moreno A, Solanki AA, Xu T, Lin R, Palta J, Daugherty E, Hong D, Hong J, Kamran SC, Katsoulakis E, Brock K, Feng M, Fuller C, Mayo C, Consortium BDSCPC. Identification of Key Elements in Prostate Cancer for Ontology Building via a Multidisciplinary Consensus Agreement. Cancers (Basel) 2023; 15:3121. [PMID: 37370731 PMCID: PMC10295832 DOI: 10.3390/cancers15123121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2023] [Revised: 05/25/2023] [Accepted: 06/01/2023] [Indexed: 06/29/2023] Open
Abstract
BACKGROUND Clinical data collection related to prostate cancer (PCa) care is often unstructured or heterogeneous among providers, resulting in a high risk for ambiguity in its meaning when sharing or analyzing data. Ontologies, which are shareable formal (i.e., computable) representations of knowledge, can address these challenges by enabling machine-readable semantic interoperability. The purpose of this study was to identify PCa-specific key data elements (KDEs) for standardization in clinic and research. METHODS A modified Delphi method using iterative online surveys was performed to report a consensus agreement on KDEs by a multidisciplinary panel of 39 PCa specialists. Data elements were divided into three themes in PCa and included (1) treatment-related toxicities (TRT), (2) patient-reported outcome measures (PROM), and (3) disease control metrics (DCM). RESULTS The panel reached consensus on a thirty-item, two-tiered list of KDEs focusing mainly on urinary and rectal symptoms. The Expanded Prostate Cancer Index Composite (EPIC-26) questionnaire was considered most robust for PROM multi-domain monitoring, and granular KDEs were defined for DCM. CONCLUSIONS This expert consensus on PCa-specific KDEs has served as a foundation for a professional society-endorsed, publicly available operational ontology developed by the American Association of Physicists in Medicine (AAPM) Big Data Sub Committee (BDSC).
Collapse
Affiliation(s)
- Amy Moreno
- Department of Radiation Oncology, University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA;
| | - Abhishek A. Solanki
- Department of Radiation Oncology, Loyola University Medical Center, Berwyn, IL 60402, USA;
| | - Tianlin Xu
- Department of Biostatistics, University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA; (T.X.); (R.L.)
| | - Ruitao Lin
- Department of Biostatistics, University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA; (T.X.); (R.L.)
| | - Jatinder Palta
- Department of Medical Physics, Virginia Commonwealth University, Richmond, VA 23284, USA;
| | - Emily Daugherty
- Department of Radiation Oncology, University of Cincinnati College of Medicine, Cincinnati, OH 45267, USA;
| | - David Hong
- Department of Radiation Oncology, University of Southern California, Los Angeles, CA 90089, USA;
| | - Julian Hong
- Department of Radiation Oncology, University of California San Francisco, San Francisco, CA 93701, USA; (J.H.); (M.F.)
| | - Sophia C. Kamran
- Department of Radiation Oncology, Massachusetts General Hospital, Boston, MA 02129, USA;
| | - Evangelia Katsoulakis
- Department of Radiation Oncology, James A Haley VA Medical Center, Tampa, FL 33612, USA;
| | - Kristy Brock
- Department of Imaging Physics, University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA;
| | - Mary Feng
- Department of Radiation Oncology, University of California San Francisco, San Francisco, CA 93701, USA; (J.H.); (M.F.)
| | - Clifton Fuller
- Department of Radiation Oncology, University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA;
| | - Charles Mayo
- Department of Radiation Physics, University of Michigan, Ann Arbor, MI 48109, USA;
| | | |
Collapse
|