1
|
Yin SQ, Li YH. Advancing the diagnosis of major depressive disorder: Integrating neuroimaging and machine learning. World J Psychiatry 2025; 15:103321. [DOI: 10.5498/wjp.v15.i3.103321] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/20/2024] [Revised: 12/27/2024] [Accepted: 01/08/2025] [Indexed: 02/26/2025] Open
Abstract
Major depressive disorder (MDD), a psychiatric disorder characterized by functional brain deficits, poses considerable diagnostic and treatment challenges, especially in adolescents owing to varying clinical presentations. Biomarkers hold substantial clinical potential in the field of mental health, enabling objective assessments of physiological and pathological states, facilitating early diagnosis, and enhancing clinical decision-making and patient outcomes. Recent breakthroughs combine neuroimaging with machine learning (ML) to distinguish brain activity patterns between MDD patients and healthy controls, paving the way for diagnostic support and personalized treatment. However, the accuracy of the results depends on the selection of neuroimaging features and algorithms. Ensuring privacy protection, ML model accuracy, and fostering trust are essential steps prior to clinical implementation. Future research should prioritize the establishment of comprehensive legal frameworks and regulatory mechanisms for using ML in MDD diagnosis while safeguarding patient privacy and rights. By doing so, we can advance accuracy and personalized care for MDD.
Collapse
Affiliation(s)
- Shi-Qi Yin
- School of Pharmaceutical Sciences, Capital Medical University, Beijing 100069, China
| | - Ying-Huan Li
- School of Pharmaceutical Sciences, Capital Medical University, Beijing 100069, China
| |
Collapse
|
2
|
Wawer A, Chojnicka I, Sarzyńska‐Wawer J, Krawczyk M. A cross-dataset study on automatic detection of autism spectrum disorder from text data. Acta Psychiatr Scand 2025; 151:259-269. [PMID: 39032040 PMCID: PMC11787923 DOI: 10.1111/acps.13737] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/11/2024] [Revised: 06/20/2024] [Accepted: 07/06/2024] [Indexed: 07/22/2024]
Abstract
OBJECTIVE The goals of this article are as follows. First, to investigate the possibility of detecting autism spectrum disorder (ASD) from text data using the latest generation of machine learning tools. Second, to compare model performance on two datasets of transcribed statements, collected using two different diagnostic tools. Third, to investigate the feasibility of knowledge transfer between models trained on both datasets and check if data augmentation can help alleviate the problem of a small number of observations. METHOD We explore two techniques to detect ASD. The first one is based on fine-tuning HerBERT, a BERT-based, monolingual deep transformer neural network. The second one uses the newest, multipurpose text embeddings from OpenAI and a classifier. We apply the methods to two separate datasets of transcribed statements, collected using two different diagnostic tools: thought, language, and communication (TLC) and autism diagnosis observation schedule-2 (ADOS-2). We conducted several cross-dataset experiments in both a zero-shot setting and a setting where models are pretrained on one dataset and then training continues on another to test the possibility of knowledge transfer. RESULTS Unlike previous studies, the models we tested obtained average results on ADOS-2 data but reached very good performance of the models in TLC. We did not observe any benefits from knowledge transfer between datasets. We observed relatively poor performance of models trained on augmented data and hypothesize that data augmentation by back translation obfuscates autism-specific signals. CONCLUSION The quality of machine learning models that detect ASD from text data is improving, but model results are dependent on the type of input data or diagnostic tool.
Collapse
Affiliation(s)
- Aleksander Wawer
- Polish Academy of SciencesInstitute of Computer ScienceWarsawPoland
| | - Izabela Chojnicka
- Department of Health and Rehabilitation Psychology, Faculty of PsychologyUniversity of WarsawWarsawPoland
| | | | | |
Collapse
|
3
|
Chen C, Khanthiyong B, Thaweetee-Sukjai B, Charoenlappanit S, Roytrakul S, Surit P, Phoungpetchara I, Thanoi S, Reynolds GP, Nudmamud-Thanoi S. Proteomic associations with cognitive variability as measured by the Wisconsin Card Sorting Test in a healthy Thai population: A machine learning approach. PLoS One 2025; 20:e0313365. [PMID: 39977438 PMCID: PMC11841870 DOI: 10.1371/journal.pone.0313365] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2024] [Accepted: 01/21/2025] [Indexed: 02/22/2025] Open
Abstract
Inter-individual cognitive variability, influenced by genetic and environmental factors, is crucial for understanding typical cognition and identifying early cognitive disorders. This study investigated the association between serum protein expression profiles and cognitive variability in a healthy Thai population using machine learning algorithms. We included 199 subjects, aged 20 to 70, and measured cognitive performance with the Wisconsin Card Sorting Test. Differentially expressed proteins (DEPs) were identified using label-free proteomics and analyzed with the Linear Model for Microarray Data. We discovered 213 DEPs between lower and higher cognition groups, with 155 upregulated in the lower cognition group and enriched in the IL-17 signaling pathway. Subsequent bioinformatic analysis linked these DEPs to neuroinflammation-related cognitive impairment. A random forest model classified cognitive ability groups with an accuracy of 81.5%, sensitivity of 65%, specificity of 85.9%, and an AUC of 0.79. By targeting a specific Thai cohort, this research provides novel insights into the link between neuroinflammation and cognitive performance, advancing our understanding of cognitive variability, highlighting the role of biological markers in cognitive function, and contributing to developing more accurate machine learning models for diverse populations.
Collapse
Affiliation(s)
- Chen Chen
- Faculty of Medical Science, Medical Science graduate program, Naresuan University, Phitsanulok, Thailand
| | | | | | - Sawanya Charoenlappanit
- National Centre for Genetic Engineering and Biotechnology, National Science and Technology Development Agency, Pathum Thani, Thailand
| | - Sittiruk Roytrakul
- National Centre for Genetic Engineering and Biotechnology, National Science and Technology Development Agency, Pathum Thani, Thailand
| | - Phrutthinun Surit
- Department of Biochemistry, Faculty of Medical Science, Naresuan University, Phitsanulok, Thailand
| | - Ittipon Phoungpetchara
- Department of Anatomy, Faculty of Medical Science, Naresuan University, Phitsanulok, Thailand
- Centre of Excellence in Medical Biotechnology, Faculty of Medical Science, Naresuan University, Phitsanulok, Thailand
| | - Samur Thanoi
- School of Medical Sciences, University of Phayao, Phayao, Thailand
| | - Gavin P. Reynolds
- Centre of Excellence in Medical Biotechnology, Faculty of Medical Science, Naresuan University, Phitsanulok, Thailand
- Biomolecular Sciences Research Centre, Sheffield Hallam University, Sheffield, United Kingdom
| | - Sutisa Nudmamud-Thanoi
- Department of Anatomy, Faculty of Medical Science, Naresuan University, Phitsanulok, Thailand
- Centre of Excellence in Medical Biotechnology, Faculty of Medical Science, Naresuan University, Phitsanulok, Thailand
| |
Collapse
|
4
|
Mari T, Ali SH, Pacinotti L, Powsey S, Fallon N. Machine learning classification of active viewing of pain and non-pain images using EEG does not exceed chance in external validation samples. COGNITIVE, AFFECTIVE & BEHAVIORAL NEUROSCIENCE 2025:10.3758/s13415-025-01268-2. [PMID: 39966304 DOI: 10.3758/s13415-025-01268-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 01/18/2025] [Indexed: 02/20/2025]
Abstract
Previous research has demonstrated that machine learning (ML) could not effectively decode passive observation of neutral versus pain photographs by using electroencephalogram (EEG) data. Consequently, the present study explored whether active viewing, i.e., requiring participant engagement in a task, of neutral and pain stimuli improves ML performance. Random forest (RF) models were trained on cortical event-related potentials (ERPs) during a two-alternative forced choice paradigm, whereby participants determined the presence or absence of pain in photographs of facial expressions and action scenes. Sixty-two participants were recruited for the model development sample. Moreover, a within-subject temporal validation sample was collected, consisting of 27 subjects. In line with our previous research, three RF models were developed to classify images into faces and scenes, neutral and pain scenes, and neutral and pain expressions. The results demonstrated that the RF successfully classified discrete categories of visual stimuli (faces and scenes) with accuracies of 78% and 66% on cross-validation and external validation, respectively. However, despite promising cross-validation results of 61% and 67% for the classification of neutral and pain scenes and neutral and pain faces, respectively, the RF models failed to exceed chance performance on the external validation dataset on both empathy classification attempts. These results align with previous research, highlighting the challenges of classifying complex states, such as pain empathy using ERPs. Moreover, the results suggest that active observation fails to enhance ML performance beyond previous passive studies. Future research should prioritise improving model performance to obtain levels exceeding chance, which would demonstrate increased utility.
Collapse
Affiliation(s)
- Tyler Mari
- Department of Psychology, Institute of Population Health, Faculty of Health and Life Sciences, University of Liverpool, Bedford Street South, Liverpool, L69 7ZA, UK.
| | - S Hasan Ali
- Department of Psychology, Institute of Population Health, Faculty of Health and Life Sciences, University of Liverpool, Bedford Street South, Liverpool, L69 7ZA, UK
| | - Lucrezia Pacinotti
- Department of Psychology, Institute of Population Health, Faculty of Health and Life Sciences, University of Liverpool, Bedford Street South, Liverpool, L69 7ZA, UK
| | - Sarah Powsey
- Department of Psychology, Institute of Population Health, Faculty of Health and Life Sciences, University of Liverpool, Bedford Street South, Liverpool, L69 7ZA, UK
| | - Nicholas Fallon
- Department of Psychology, Institute of Population Health, Faculty of Health and Life Sciences, University of Liverpool, Bedford Street South, Liverpool, L69 7ZA, UK
| |
Collapse
|
5
|
Cheng Y, Petrides KV, Li J. Estimating the Minimum Sample Size for Neural Network Model Fitting-A Monte Carlo Simulation Study. Behav Sci (Basel) 2025; 15:211. [PMID: 40001842 PMCID: PMC11851979 DOI: 10.3390/bs15020211] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2024] [Revised: 02/07/2025] [Accepted: 02/11/2025] [Indexed: 02/27/2025] Open
Abstract
In the era of machine learning, many psychological studies use machine learning methods. Specifically, neural networks, a set of machine learning methods that exhibit exceptional performance in various tasks, have been used on psychometric datasets for supervised model fitting. From the computer scientist's perspective, psychometric independent variables are typically ordinal and low-dimensional-characteristics that can significantly impact model performance. To our knowledge, there is no guidance about the sample planning suggestion for this task. Therefore, we conducted a simulation study to test the performance of an NN with different sample sizes and the simulation of both linear and nonlinear relationships. We proposed the minimum sample size for the neural network model fitting with two criteria: the performance of 95% of the models is close to the theoretical maximum, and 80% of the models can outperform the linear model. The findings of this simulation study show that the performance of neural networks can be unstable with ordinal variables as independent variables, and we suggested that neural networks should not be used on ordinal independent variables with at least common nonlinear relationships in psychology. Further suggestions and research directions are also provided.
Collapse
Affiliation(s)
- Yongtian Cheng
- Division of Psychology and Language Sciences, University College London (UCL), 26 Bedford Way, London WC1H 0AP, UK;
| | | | - Johnson Li
- Department of Psychology, University of Manitoba, Winnipeg, MB R3T 2N2, Canada;
| |
Collapse
|
6
|
Hong T, Xie S, Liu X, Wu J, Chen G. Do Machine Learning Approaches Perform Better Than Regression Models in Mapping Studies? A Systematic Review. VALUE IN HEALTH : THE JOURNAL OF THE INTERNATIONAL SOCIETY FOR PHARMACOECONOMICS AND OUTCOMES RESEARCH 2025:S1098-3015(25)00045-2. [PMID: 39922301 DOI: 10.1016/j.jval.2024.12.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/01/2024] [Revised: 12/06/2024] [Accepted: 12/16/2024] [Indexed: 02/10/2025]
Abstract
OBJECTIVES To identify how machine learning (ML) approaches were implemented in mapping studies and to determine the extent to which ML improved performance compared with regression models (RMs). METHODS A systematic literature search was conducted in 12 databases from inception to December 2023 to identify studies that applied ML to develop mapping algorithms. A data template was applied to extract data set information, source and target measures, ML approaches and RMs, mapping types (direct vs indirect), goodness-of-fit indicators (mean absolute error, mean squared error, root mean squared error, R-squared, and intraclass correlation coefficient), and validation methods. Differences in goodness-of-fit indicators between ML and RMs were summarized. Potential advantages and challenges for ML were further discussed. RESULTS Thirteen mapping studies were identified, in which both ML and RM were adopted. Bayesian networks were the most frequently used ML approach (n = 6), followed by the least absolute shrinkage and selection operator (n = 4). The ordinary least square model was the most used RM (n = 8), followed by the censored least absolute deviation and multinomial logit models (n = 5 each). The average improvement in the goodness-of-fit of ML compared with that of RMs by indicators were 0.007 (mean absolute error), 0.004 (mean squared error), 0.058 (R-squared), 0.016 (intraclass correlation coefficient), and -0.0004 (root mean squared error). CONCLUSIONS There is an increasing number of studies using ML in developing mapping algorithms. Generally, a minor improvement of goodness-of-fit was observed compared with RMs when using mean-based comparisons. Issues such as how to interpret, apply, and externally validate the ML-based outputs would affect their implementation. Future studies are warranted to verify advantages of ML approaches.
Collapse
Affiliation(s)
- Tianqi Hong
- School of Biomedical Engineering, McMaster University, Hamilton, ON, Canada
| | - Shitong Xie
- School of Pharmaceutical Science and Technology, Tianjin University, Tianjin, China; Center for Social Science Survey and Data, Tianjin University, Tianjin, China
| | - Xinran Liu
- School of Pharmaceutical Science and Technology, Tianjin University, Tianjin, China; Center for Social Science Survey and Data, Tianjin University, Tianjin, China
| | - Jing Wu
- School of Pharmaceutical Science and Technology, Tianjin University, Tianjin, China; Center for Social Science Survey and Data, Tianjin University, Tianjin, China.
| | - Gang Chen
- Centre for Health Economics, Monash Business School, Monash University, Melbourne, VIC, Australia; Melbourne School of Population and Global Health, University of Melbourne, Melbourne, VIC, Australia
| |
Collapse
|
7
|
Jiwani R, Pal K, Paolucci I, Odisio B, Brock K, Tannir NM, Shapiro DD, Msaouel P, Sheth RA. Differentiating between renal medullary and clear cell renal carcinoma with a machine learning radiomics approach. Oncologist 2025; 30:oyae337. [PMID: 39963829 PMCID: PMC11833245 DOI: 10.1093/oncolo/oyae337] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2024] [Accepted: 11/01/2024] [Indexed: 02/21/2025] Open
Abstract
BACKGROUND The objective of this study was to develop and validate a radiomics-based machine learning (ML) model to differentiate between renal medullary carcinoma (RMC) and clear cell renal carcinoma (ccRCC). METHODS This retrospective Institutional Review Board -approved study analyzed CT images and clinical data from patients with RMC (n = 87) and ccRCC (n = 93). Patients without contrast-enhanced CT scans obtained before nephrectomy were excluded. A standard volumetric software package (MIM 7.1.4, MIM Software Inc.) was used for contouring, after which 949 radiomics features were extracted with PyRadiomics 3.1.0. Radiomics analysis was then performed with RadAR for differential radiomics analysis. ML was then performed with extreme gradient boosting (XGBoost 2.0.3) to differentiate between RMC and ccRCC. Three separate ML models were created to differentiate between ccRCC and RMC. These models were based on clinical demographics, radiomics, and radiomics incorporating hemoglobin electrophoresis for sickle cell trait, respectively. RESULTS Performance metrics for the 3 developed ML models were as follows: demographic factors only (AUC = 0.777), calibrated radiomics (AUC = 0.915), and calibrated radiomics with sickle cell trait incorporated (AUC = 1.0). The top 4 ranked features from differential radiomic analysis, ranked by their importance, were run entropy (preprocessing filter = original, AUC = 0.67), dependence entropy (preprocessing filter = wavelet, AUC = 0.67), zone entropy (preprocessing filter = original, AUC = 0.67), and dependence entropy (preprocessing filter = original, AUC = 0.66). CONCLUSION A radiomics-based machine learning model effectively differentiates between ccRCC and RMC. This tool can facilitate the radiologist's ability to suspicion and decrease the misdiagnosis rate of RMC.
Collapse
Affiliation(s)
- Rahim Jiwani
- Department of Interventional Radiology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, United States
| | - Koustav Pal
- Department of Interventional Radiology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, United States
| | - Iwan Paolucci
- Department of Interventional Radiology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, United States
| | - Bruno Odisio
- Department of Interventional Radiology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, United States
| | - Kristy Brock
- Department of Imaging Physics, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, United States
| | - Nizar M Tannir
- Department of Genitourinary Medical Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, United States
| | - Daniel D Shapiro
- Department of Urology, University of Wisconsin School of Medicine and Public Health, Madison, WI 77030, United States
| | - Pavlos Msaouel
- Department of Genitourinary Medical Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, United States
- Department of Translational Molecular Pathology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, United States
- David H. Koch Center for Applied Research of Genitourinary Cancers, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, United States
| | - Rahul A Sheth
- Department of Interventional Radiology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, United States
| |
Collapse
|
8
|
Langford DJ, Reichel JF, Zhong H, Basseri BH, Koch MP, Kolady R, Liu J, Sideris A, Dworkin RH, Poeran J, Wu CL. Machine learning research methods to predict postoperative pain and opioid use: a narrative review. Reg Anesth Pain Med 2025; 50:102-109. [PMID: 39909542 DOI: 10.1136/rapm-2024-105603] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2024] [Accepted: 10/20/2024] [Indexed: 02/07/2025]
Abstract
The use of machine learning to predict postoperative pain and opioid use has likely been catalyzed by the availability of complex patient-level data, computational and statistical advancements, the prevalence and impact of chronic postsurgical pain, and the persistence of the opioid crisis. The objectives of this narrative review were to identify and characterize methodological aspects of studies that have developed and/or tested machine learning algorithms to predict acute, subacute, or chronic pain or opioid use after any surgery and to propose considerations for future machine learning studies. Pairs of independent reviewers screened titles and abstracts of 280 PubMed-indexed articles and ultimately extracted data from 61 studies that met entry criteria. We observed a marked increase in the number of relevant publications over time. Studies most commonly focused on machine learning algorithms to predict chronic postsurgical pain or opioid use, using real-world data from patients undergoing orthopedic surgery. We identified variability in sample size, number and type of predictors, and how outcome variables were defined. Patient-reported predictors were highlighted as particularly informative and important to include in such machine learning algorithms, where possible. We hope that findings from this review might inform future applications of machine learning that improve the performance and clinical utility of resultant machine learning algorithms.
Collapse
Affiliation(s)
- Dale J Langford
- Pain Prevention Research Center, Department of Anesthesiology, Critical Care & Pain Management, Hospital for Special Surgery, New York, New York, USA
- Department of Anesthesiology, Weill Cornell Medicine, New York, New York, USA
| | - Julia F Reichel
- Department of Anesthesiology, Critical Care & Pain Management, Hospital for Special Surgery, New York, New York, USA
| | - Haoyan Zhong
- Department of Anesthesiology, Critical Care & Pain Management, Hospital for Special Surgery, New York, New York, USA
| | - Benjamin H Basseri
- Department of Anesthesiology, Critical Care & Pain Management, Hospital for Special Surgery, New York, New York, USA
| | - Marc P Koch
- Davidson College, Davidson, North Carolina, USA
| | | | - Jiabin Liu
- Pain Prevention Research Center, Department of Anesthesiology, Critical Care & Pain Management, Hospital for Special Surgery, New York, New York, USA
- Department of Anesthesiology, Weill Cornell Medicine, New York, New York, USA
| | - Alexandra Sideris
- Pain Prevention Research Center, Department of Anesthesiology, Critical Care & Pain Management, Hospital for Special Surgery, New York, New York, USA
- Department of Anesthesiology, Weill Cornell Medicine, New York, New York, USA
- Hospital for Special Surgery Research Institute, New York, New York, USA
| | - Robert H Dworkin
- Pain Prevention Research Center, Department of Anesthesiology, Critical Care & Pain Management, Hospital for Special Surgery, New York, New York, USA
- Hospital for Special Surgery Research Institute, New York, New York, USA
- Anesthesiology and Perioperative Medicine, University of Rochester, Rochester, New York, USA
| | - Jashvant Poeran
- Pain Prevention Research Center, Department of Anesthesiology, Critical Care & Pain Management, Hospital for Special Surgery, New York, New York, USA
- Hospital for Special Surgery Research Institute, New York, New York, USA
| | - Christopher L Wu
- Pain Prevention Research Center, Department of Anesthesiology, Critical Care & Pain Management, Hospital for Special Surgery, New York, New York, USA
- Department of Anesthesiology, Weill Cornell Medicine, New York, New York, USA
| |
Collapse
|
9
|
Gill SS, Ponniah HS, Giersztein S, Anantharaj RM, Namireddy SR, Killilea J, Ramsay D, Salih A, Thavarajasingam A, Scurtu D, Jankovic D, Russo S, Kramer A, Thavarajasingam SG. The diagnostic and prognostic capability of artificial intelligence in spinal cord injury: A systematic review. BRAIN & SPINE 2025; 5:104208. [PMID: 40027293 PMCID: PMC11871462 DOI: 10.1016/j.bas.2025.104208] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/23/2024] [Revised: 01/20/2025] [Accepted: 02/04/2025] [Indexed: 03/05/2025]
Abstract
Background Artificial intelligence (AI) models have shown potential for diagnosing and prognosticating traumatic spinal cord injury (tSCI), but their clinical utility remains uncertain. Method ology: The primary aim was to evaluate the performance of AI algorithms in diagnosing and prognosticating tSCI. Subsequent systematic searching of seven databases identified studies evaluating AI models. PROBAST and TRIPOD tools were used to assess the quality and reporting of included studies (PROSPERO: CRD42023464722). Fourteen studies, comprising 20 models and 280,817 pooled imaging datasets, were included. Analysis was conducted in line with the SWiM guidelines. Results For prognostication, 11 studies predicted outcomes including AIS improvement (30%), mortality and ambulatory ability (20% each), and discharge or length of stay (10%). The mean AUC was 0.770 (range: 0.682-0.902), indicating moderate predictive performance. Diagnostic models utilising DTI, CT, and T2-weighted MRI with CNN-based segmentation achieved a weighted mean accuracy of 0.898 (range: 0.813-0.938), outperforming prognostic models. Conclusion AI demonstrates strong diagnostic accuracy (mean accuracy: 0.898) and moderate prognostic capability (mean AUC: 0.770) for tSCI. However, the lack of standardised frameworks and external validation limits clinical applicability. Future models should integrate multimodal data, including imaging, patient characteristics, and clinician judgment, to improve utility and alignment with clinical practice.
Collapse
Affiliation(s)
- Saran Singh Gill
- Imperial Brain & Spine Initiative, Imperial College London, London, United Kingdom
- Faculty of Medicine, Imperial College London, London, United Kingdom
| | - Hariharan Subbiah Ponniah
- Imperial Brain & Spine Initiative, Imperial College London, London, United Kingdom
- Faculty of Medicine, Imperial College London, London, United Kingdom
| | - Sho Giersztein
- Imperial Brain & Spine Initiative, Imperial College London, London, United Kingdom
| | | | - Srikar Reddy Namireddy
- Imperial Brain & Spine Initiative, Imperial College London, London, United Kingdom
- Faculty of Medicine, Imperial College London, London, United Kingdom
| | - Joshua Killilea
- Imperial Brain & Spine Initiative, Imperial College London, London, United Kingdom
| | - DanieleS.C. Ramsay
- Imperial Brain & Spine Initiative, Imperial College London, London, United Kingdom
| | - Ahmed Salih
- Imperial Brain & Spine Initiative, Imperial College London, London, United Kingdom
| | | | - Daniel Scurtu
- Department of Neurosurgery, Universitätsmedizin Mainz, Mainz, Germany
| | - Dragan Jankovic
- Department of Neurosurgery, LMU University Hospital, LMU, Munich, Germany
| | - Salvatore Russo
- Imperial College Healthcare NHS Trust, London, United Kingdom
| | - Andreas Kramer
- Department of Neurosurgery, LMU University Hospital, LMU, Munich, Germany
| | - Santhosh G. Thavarajasingam
- Imperial Brain & Spine Initiative, Imperial College London, London, United Kingdom
- Department of Neurosurgery, LMU University Hospital, LMU, Munich, Germany
| |
Collapse
|
10
|
Min SH, Woo K, Song J, Alexander GL, O’Malley T, Moen MD, Topaz M. Understanding Daily Care Experience Preferences Across the Lifespan of Older Adults: Application of Natural Language Processing. West J Nurs Res 2025; 47:71-81. [PMID: 39707813 PMCID: PMC11742706 DOI: 10.1177/01939459241306946] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2024]
Abstract
INTRODUCTION Older adults are a heterogeneous group, and their care experience preferences are likely to be diverse and individualized. Thus, the aim of this study was to identify categories of older adults' care experience preferences and to examine similarities and differences across different age groups. METHODS The initial categories of older adults' care experience preferences were identified through a qualitative review of narrative text (n = 3134) in the ADVault data set. A natural language processing (NLP) algorithm was used to automatically and accurately define older adults' care experience preference categories. Descriptive statistics were used to examine similarities and differences in care experience preference categories across different age groups. RESULTS The overall average performance of NLP algorithms was relatively high (average F-score = 0.88; range: 0.77-0.96). Through a qualitative review of 350 randomly selected texts, a total of 11 categories were identified. The most frequent category was music, followed by photographs, entertainment, family/friends, religion-related, atmosphere, flower/plants, pet, bed/bedding, hobby, and other. After applying the best performing NLP algorithm to each category, older adults' care experience preference categories were music (41.32%), followed by photographs (28.47%), entertainment (13.46%), religion-related (n = 12.69%), pet (12.22%), flower/plants (11.51%), family/friends (8.45%), atmosphere (7.75%), bed/bedding (6.12%), and hobby (5.45%). Young-old and old-old adults had similar care experience preferences with music being the most frequent category while old-old adults had photographs as the most frequent category for their care experience preference. CONCLUSION Clinicians must understand the distinct categories of care experience preferences and incorporate them into personalized care planning.
Collapse
Affiliation(s)
- Se Hee Min
- University of Pennsylvania School of Nursing, Philadelphia, PA, USA
| | - Kyungmi Woo
- Seoul National University College of Nursing, Seoul, South Korea
| | - Jiyoun Song
- University of Pennsylvania School of Nursing, Philadelphia, PA, USA
| | | | | | | | - Maxim Topaz
- Columbia University School of Nursing, New York, NY, USA
| |
Collapse
|
11
|
van Rijswijk RE, Bogdanovic M, Roy J, Yeung KK, Zeebregts CJ, Geelkerken RH, Groot Jebbink E, Wolterink JM, Reijnen MMPJ. Multimodal Artificial Intelligence Model for Prediction of Abdominal Aortic Aneurysm Shrinkage After Endovascular Repair ( the ART in EVAR study). J Endovasc Ther 2025:15266028251314359. [PMID: 39882767 DOI: 10.1177/15266028251314359] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2025]
Abstract
PURPOSE The goal of the study described in this protocol is to build a multimodal artificial intelligence (AI) model to predict abdominal aortic aneurysm (AAA) shrinkage 1 year after endovascular aneurysm repair (EVAR). METHODS In this retrospective observational multicenter study, approximately 1000 patients will be enrolled from hospital records of 5 experienced vascular centers. Patients will be included if they underwent elective EVAR for infrarenal AAA with initial assisted technical success and had imaging available of the same modality preoperatively and at 1-year follow-up (CTA-CTA or US-US). Data collection will include baseline and vascular characteristics, medication use, procedural data, preoperative and postoperative imaging data, follow-up data, and complications. PROPOSED ANALYSES The cohort will be stratified into 3 groups of AAA remodeling based on the maximum AAA diameter difference between the preoperative and 1-year postoperative moment. Patients with a diameter reduction of ≥5 mm will be assigned to the AAA shrinkage group, cases with an increase of ≥5 mm will be assigned to the AAA growth group, and patients with a diameter increase or reduction of <5 mm will be assigned to the stable AAA group. Furthermore, an additional fourth group will include all patients who underwent an AAA-related reintervention within the first year after EVAR, because both the complication and the reintervention might have influenced the state of AAA remodeling at 1 year. The preoperative and postoperative CTA scans will be used for anatomical AAA analysis and biomechanical assessment through semi-automatic segmentation and finite element analysis. All collected clinical, biomechanical, and imaging data will be used to create an AI prediction model for AAA shrinkage. Explainable AI techniques will be used to identify the most descriptive input features in the model. Predicting factors resulting from the AI model will be compared with conventional univariate and multivariate logistic regression analyses to find the best model for the prediction of AAA shrinkage. The study is registered at www.clinicaltrials.gov under the registration number NCT06250998. CLINICAL IMPACT This study aims to develop a robust and high-performance AI model for predicting AAA shrinkage one-year after EVAR, with great potential for optimizing both EVAR treatment and follow-up. The model can identify cases with an initially lower chance of early AAA shrinkage, in whom EVAR-treatment could be tailored by including additional preoperative coil embolization, active sac management and/or postoperative tranexamic acid therapy, which have shown to promote AAA shrinkage rate but are too complex and costly to perform in all patients. The model could aid in stratification of post-EVAR surveillance based on the patient's individual risk and possibly decrease follow-up for the 40-50% of patients who will experience AAA sac shrinkage. Overall, the AI prediction model is expected to improve patient survival and decrease the number of reinterventions after EVAR and associated healthcare costs.
Collapse
Affiliation(s)
- Rianne E van Rijswijk
- Department of Vascular Surgery, Rijnstate, Arnhem, The Netherlands
- Multi-Modality Medical Imaging Group, Technical Medical Centre, University of Twente, Enschede, The Netherlands
| | - Marko Bogdanovic
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden
- Department of Vascular Surgery, Karolinska University Hospital, Stockholm, Sweden
| | - Joy Roy
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden
- Department of Vascular Surgery, Karolinska University Hospital, Stockholm, Sweden
| | - Kak Khee Yeung
- Department of Surgery, Amsterdam University Medical Center, Vrije Universiteit Amsterdam, Amsterdam Cardiovascular Sciences, Amsterdam, The Netherlands
| | - Clark J Zeebregts
- Division of Vascular Surgery, Department of Surgery, University Medical Center Groningen, Groningen, The Netherlands
| | - Robert H Geelkerken
- Multi-Modality Medical Imaging Group, Technical Medical Centre, University of Twente, Enschede, The Netherlands
- Department of Vascular Surgery, Medisch Spectrum Twente, Enschede, The Netherlands
| | - Erik Groot Jebbink
- Department of Vascular Surgery, Rijnstate, Arnhem, The Netherlands
- Multi-Modality Medical Imaging Group, Technical Medical Centre, University of Twente, Enschede, The Netherlands
| | - Jelmer M Wolterink
- Department of Applied Mathematics, Technical Medical Centre, University of Twente, Enschede, The Netherlands
| | - Michel M P J Reijnen
- Department of Vascular Surgery, Rijnstate, Arnhem, The Netherlands
- Multi-Modality Medical Imaging Group, Technical Medical Centre, University of Twente, Enschede, The Netherlands
| |
Collapse
|
12
|
Maiter A, Alabed S, Allen G, Alahdab F. AI in healthcare: an introduction for clinicians. BMJ Evid Based Med 2025:bmjebm-2024-112966. [PMID: 39863401 DOI: 10.1136/bmjebm-2024-112966] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 11/10/2024] [Indexed: 01/27/2025]
Affiliation(s)
- Ahmed Maiter
- Department of Radiology, Sheffield Teaching Hospitals NHS Foundation Trust, Sheffield, UK
| | - Samer Alabed
- Department of Radiology, Sheffield Teaching Hospitals NHS Foundation Trust, Sheffield, UK
| | - Genevera Allen
- Center for Theoretical Neuroscience, Columbia University, New York, New York, USA
| | - Fares Alahdab
- Departments of Biomedical Informatics, Biostatistics, and Epidemiology, and Cardiology, University of Missouri, Columbia, Missouri, USA
| |
Collapse
|
13
|
Zegeye AT, Tilahun BC, Fekadie M, Addisu E, Wassie B, Alelign B, Sharew M, Baykemagn ND, Kebede A, Yehuala TZ. Predicting home delivery and identifying its determinants among women aged 15-49 years in sub-Saharan African countries using a Demographic and Health Surveys 2016-2023: a machine learning algorithm. BMC Public Health 2025; 25:302. [PMID: 39856651 PMCID: PMC11760118 DOI: 10.1186/s12889-025-21334-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2024] [Accepted: 01/06/2025] [Indexed: 01/27/2025] Open
Abstract
BACKGROUND Birth-related mortality is significantly increased by home births without skilled medical assistance during delivery, presenting a major risk to the public's health. The objective of this study is to predict home delivery and identify the determinants using machine learning algorithm in sub-Saharan African. METHODS This study used design science approaches. The data set obtained from demographic health survey in sub-Saharan African weighted sample of 299,759 women was included in the stud. Machine learning models such as Random Forest, Decision Tree, K-Nearest Neighbor, Logistic Regression, Extreme Gradient Boosting, AdaBoost, Artificial Neural Network, and Naive Bayes were used. The predictive model was evaluated by area under the curve, accuracy, precision, recall, and F-measure. RESULTS The final experimentation results indicated that random forest model performed the best to predict home delivery with accuracy (83%) and, ROC curve (89%). The Shapley additive explanation features an importance plot optimized for random forest model to identifying the most predictors of home delivery. Association rules findings showed that inadequate antenatal care visits, marital status married, no education, mobile phone, television, electricity, poor wealth index, infrequent television viewing, and rural residence were predictor of home delivery. CONCLUSION The random forest machine learning model provides greater predictive power for estimating home delivery risk factors. To reduce the prevalence of home delivery, this finding recommends to emphasis on improving antenatal care services, education, and awareness about health facility delivery.
Collapse
Affiliation(s)
- Adem Tsegaw Zegeye
- Department of Health Informatics, Institute of Public Health, College of Medicine and Health Sciences, University of Gondar, Gondar, Ethiopia.
| | - Binyam Chaklu Tilahun
- Department of Health Informatics, Institute of Public Health, College of Medicine and Health Sciences, University of Gondar, Gondar, Ethiopia
| | - Makida Fekadie
- Department of Health Informatics, Institute of Public Health, College of Medicine and Health Sciences, University of Gondar, Gondar, Ethiopia
| | - Eliyas Addisu
- Department of Health Informatics, Institute of Public Health, College of Medicine and Health Sciences, University of Gondar, Gondar, Ethiopia
| | - Birhan Wassie
- Department of Health Informatics, Institute of Public Health, College of Medicine and Health Sciences, University of Gondar, Gondar, Ethiopia
| | - Berihun Alelign
- Department of Health Informatics, Institute of Public Health, College of Medicine and Health Sciences, University of Gondar, Gondar, Ethiopia
| | - Mequannet Sharew
- Department of Health Informatics, Institute of Public Health, College of Medicine and Health Sciences, University of Gondar, Gondar, Ethiopia
| | - Nebebe Demis Baykemagn
- Department of Health Informatics, Institute of Public Health, College of Medicine and Health Sciences, University of Gondar, Gondar, Ethiopia
| | - Abdulaziz Kebede
- Department of Health Informatics, School of Public Health, College of Medicine and Health Sciences, Wollo University, Dessie, Ethiopia
| | - Tirualem Zeleke Yehuala
- Department of Health Informatics, Institute of Public Health, College of Medicine and Health Sciences, University of Gondar, Gondar, Ethiopia
| |
Collapse
|
14
|
Przymus P, Rykaczewski K, Martín-Segura A, Truu J, Carrillo De Santa Pau E, Kolev M, Naskinova I, Gruca A, Sampri A, Frohme M, Nechyporenko A. Deep learning in microbiome analysis: a comprehensive review of neural network models. Front Microbiol 2025; 15:1516667. [PMID: 39911715 PMCID: PMC11794229 DOI: 10.3389/fmicb.2024.1516667] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2024] [Accepted: 12/16/2024] [Indexed: 02/07/2025] Open
Abstract
Microbiome research, the study of microbial communities in diverse environments, has seen significant advances due to the integration of deep learning (DL) methods. These computational techniques have become essential for addressing the inherent complexity and high-dimensionality of microbiome data, which consist of different types of omics datasets. Deep learning algorithms have shown remarkable capabilities in pattern recognition, feature extraction, and predictive modeling, enabling researchers to uncover hidden relationships within microbial ecosystems. By automating the detection of functional genes, microbial interactions, and host-microbiome dynamics, DL methods offer unprecedented precision in understanding microbiome composition and its impact on health, disease, and the environment. However, despite their potential, deep learning approaches face significant challenges in microbiome research. Additionally, the biological variability in microbiome datasets requires tailored approaches to ensure robust and generalizable outcomes. As microbiome research continues to generate vast and complex datasets, addressing these challenges will be crucial for advancing microbiological insights and translating them into practical applications with DL. This review provides an overview of different deep learning models in microbiome research, discussing their strengths, practical uses, and implications for future studies. We examine how these models are being applied to solve key problems and highlight potential pathways to overcome current limitations, emphasizing the transformative impact DL could have on the field moving forward.
Collapse
Affiliation(s)
- Piotr Przymus
- Faculty of Mathematics and Computer Science, Nicolaus Copernicus University in Toruń, Toruń, Pomeranian, Poland
| | - Krzysztof Rykaczewski
- Faculty of Mathematics and Computer Science, Nicolaus Copernicus University in Toruń, Toruń, Pomeranian, Poland
| | | | - Jaak Truu
- Institute of Molecular and Cell Biology, University of Tartu, Tartu, Estonia
| | | | - Mikhail Kolev
- Department of Mathematics, University of Architecture, Civil Engineering and Geodesy, Sofia, Bulgaria
- Department of Applied Computer Science and Mathematical Modeling, Faculty of Mathematics and Computer Science, University of Warmia and Mazury in Olsztyn, Olsztyn, Poland
| | - Irina Naskinova
- Department of Mathematics, University of Architecture, Civil Engineering and Geodesy, Sofia, Bulgaria
| | - Aleksandra Gruca
- Department of Computer Networks and Systems, Silesian University of Technology, Gliwice, Poland
| | - Alexia Sampri
- British Heart Foundation Cardiovascular Epidemiology Unit, Department of Public Health and Primary Care, University of Cambridge, Cambridge, United Kingdom
- Victor Phillip Dahdaleh Heart and Lung Research Institute, University of Cambridge, Cambridge, United Kingdom
| | - Marcus Frohme
- Molecular Biotechnology and Functional Genomics, Technical University of Applied Sciences Wildau, Wildau, Brandenburg, Germany
| | - Alina Nechyporenko
- Molecular Biotechnology and Functional Genomics, Technical University of Applied Sciences Wildau, Wildau, Brandenburg, Germany
- Department of System Engineering, Kharkiv National University of Radioelectronics, Kharkiv, Ukraine
| |
Collapse
|
15
|
Huang CC, Chiang HF, Hsieh CC, Zhu BR, Wu WJ, Shaw JS. Impact of Dataset Size on 3D CNN Performance in Intracranial Hemorrhage Classification. Diagnostics (Basel) 2025; 15:216. [PMID: 39857100 PMCID: PMC11763925 DOI: 10.3390/diagnostics15020216] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2024] [Revised: 01/06/2025] [Accepted: 01/07/2025] [Indexed: 01/27/2025] Open
Abstract
Background: This study aimed to evaluate the effect of sample size on the development of a three-dimensional convolutional neural network (3DCNN) model for predicting the binary classification of three types of intracranial hemorrhage (ICH): intraparenchymal, subarachnoid, and subdural (IPH, SAH, SDH, respectively). Methods: During the training, we compiled all images of each brain computed tomography scan into a single 3D image, which was then fed into the model to classify the presence of ICH. We divided the non-hemorrhage quantities into 20, 30, 40, 50, 100, and 150 and the ICH quantities into 20, 30, 40, and 50. Cross-validation was performed to compute the average area under the curve (AUC) over the last five iterations. The AUC and accuracy were used to evaluate the performance of the models. Results: Fifty patients, each with the three ICH types, and 150 non-hemorrhage cases were enrolled. Larger sample sizes achieved stable and acceptable performance in the artificial intelligence (AI) models, whereas training with a limited number of cases posed the risk of falsely high AUC values or accuracy. The overall trends and fluctuations in AUC values were similar between IPH and SDH but different for SAH. The accuracy of the results was relatively consistent among the three ICH types. Conclusions: The 3DCNN technique can be used to develop AI models capable of detecting ICH from limited case numbers. However, a minimal case number must be provided. The performance of AI models varies across different ICH types and is more stable with larger sample sizes.
Collapse
Affiliation(s)
- Chun-Chao Huang
- Department of Radiology, MacKay Memorial Hospital, Taipei 104, Taiwan; (C.-C.H.); (H.-F.C.); (C.-C.H.)
- Department of Medicine, MacKay Medical College, New Taipei City 252, Taiwan
| | - Hsin-Fan Chiang
- Department of Radiology, MacKay Memorial Hospital, Taipei 104, Taiwan; (C.-C.H.); (H.-F.C.); (C.-C.H.)
- Department of Medicine, MacKay Medical College, New Taipei City 252, Taiwan
- Mackay Junior College of Medicine, Nursing, and Management, Taipei 112, Taiwan
| | - Cheng-Chih Hsieh
- Department of Radiology, MacKay Memorial Hospital, Taipei 104, Taiwan; (C.-C.H.); (H.-F.C.); (C.-C.H.)
- Department of Medicine, MacKay Medical College, New Taipei City 252, Taiwan
- Mackay Junior College of Medicine, Nursing, and Management, Taipei 112, Taiwan
| | - Bo-Rui Zhu
- Institute of Mechatronic Engineering, National Taipei University of Technology, Taipei 106, Taiwan; (B.-R.Z.); (W.-J.W.)
| | - Wen-Jie Wu
- Institute of Mechatronic Engineering, National Taipei University of Technology, Taipei 106, Taiwan; (B.-R.Z.); (W.-J.W.)
| | - Jin-Siang Shaw
- Institute of Mechatronic Engineering, National Taipei University of Technology, Taipei 106, Taiwan; (B.-R.Z.); (W.-J.W.)
| |
Collapse
|
16
|
van Boven MR, Bennis FC, Onland W, Aarnoudse-Moens CSH, Frings M, Tran K, Katz TA, Romijn M, Hoogendoorn M, van Kaam AH, Leemhuis AG, Oosterlaan J, Königs M. Machine learning models for neurocognitive outcome prediction in preterm born infants. Pediatr Res 2025:10.1038/s41390-025-03815-6. [PMID: 39827255 DOI: 10.1038/s41390-025-03815-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/19/2024] [Revised: 11/24/2024] [Accepted: 12/09/2024] [Indexed: 01/22/2025]
Abstract
BACKGROUND Outcome prediction after preterm birth is important for long-term neonatal care, but has proven notoriously challenging for neurocognitive outcome. This study investigated the potential of machine learning to improve neurocognitive outcome prediction at two and five years of corrected age in preterm infants, using readily available predictors from the neonatal setting. METHODS Predictors originating from the antenatal and neonatal period of preterm infants born <30 weeks gestation were used to predict adverse neurocognitive outcome on the Bayley Scale and Wechsler Preschool and Primary Scale of Intelligence. Machine learning models were compared to conventional logistic regression and validated using internal cross-validation. RESULTS Best performing models were a random forest (two-year outcome) and a support vector machine (five-year outcome) with an area under the receiver operating characteristic curve (AUC) of 0.682 and 0.695 respectively, reaching high negative predictive values (95% and 91%, respectively). These models performed significantly better than the conventional models. CONCLUSIONS The models reached moderate overall predictive performance, yet with promising potential for early identification of children without adverse neurocognitive outcome. Machine learning modestly improved neurocognitive outcome prediction. Future research may harvest the predictive potential of a wider variety of routine (clinical) data, such as vital sign time series. IMPACT Early prediction of neurocognitive outcome in preterm infants will enable targeted follow-up and deployment of early (preventative) interventions to improve outcome. Neurocognitive outcome remains notoriously challenging using conventional models, while existing machine learning models depend on advanced MRI-derived predictors with limited potential for implementation into daily clinical practice. This study developed machine learning models for neurocognitive outcome prediction using predictors that are readily available in neonatal settings. Neurocognitive outcome prediction remains challenging due to low AUC and PPV, however, the models demonstrate high NPV, indicating potential for identifying children at low risk for adverse outcome.
Collapse
Affiliation(s)
- Menne R van Boven
- Emma Children's Hospital Amsterdam UMC, location University of Amsterdam, Department of Neonatology, Meibergdreef 9, Amsterdam, The Netherlands.
- Emma Children's Hospital Amsterdam UMC, location University of Amsterdam, Follow-Me program & Emma Neuroscience group, Meibergdreef 9, Amsterdam, The Netherlands.
- Amsterdam Reproduction and Development research institute, Amsterdam, The Netherlands.
| | - Frank C Bennis
- Emma Children's Hospital Amsterdam UMC, location University of Amsterdam, Follow-Me program & Emma Neuroscience group, Meibergdreef 9, Amsterdam, The Netherlands
- Amsterdam Reproduction and Development research institute, Amsterdam, The Netherlands
- Vrije Universiteit Amsterdam, Faculty of Science, Department Computer Science, Quantitative Data Analytics Group, Amsterdam, The Netherlands
| | - Wes Onland
- Emma Children's Hospital Amsterdam UMC, location University of Amsterdam, Department of Neonatology, Meibergdreef 9, Amsterdam, The Netherlands
- Amsterdam Reproduction and Development research institute, Amsterdam, The Netherlands
| | - Cornelieke S H Aarnoudse-Moens
- Emma Children's Hospital Amsterdam UMC, location University of Amsterdam, Department of Neonatology, Meibergdreef 9, Amsterdam, The Netherlands
- Amsterdam Reproduction and Development research institute, Amsterdam, The Netherlands
- Emma Children's Hospital Amsterdam UMC, location University of Amsterdam, Psychosocial Department, Meibergdreef 9, Amsterdam, The Netherlands
| | - Max Frings
- University of Amsterdam, Faculty of Science, Data science, Amsterdam, The Netherlands
| | - Kevin Tran
- University of Amsterdam, Faculty of Science, Data science, Amsterdam, The Netherlands
| | - Trixie A Katz
- Emma Children's Hospital Amsterdam UMC, location University of Amsterdam, Department of Neonatology, Meibergdreef 9, Amsterdam, The Netherlands
- Amsterdam Reproduction and Development research institute, Amsterdam, The Netherlands
| | - Michelle Romijn
- Emma Children's Hospital Amsterdam UMC, location University of Amsterdam, Department of Neonatology, Meibergdreef 9, Amsterdam, The Netherlands
- Amsterdam Reproduction and Development research institute, Amsterdam, The Netherlands
| | - Mark Hoogendoorn
- Vrije Universiteit Amsterdam, Faculty of Science, Department Computer Science, Quantitative Data Analytics Group, Amsterdam, The Netherlands
| | - Anton H van Kaam
- Emma Children's Hospital Amsterdam UMC, location University of Amsterdam, Department of Neonatology, Meibergdreef 9, Amsterdam, The Netherlands
- Amsterdam Reproduction and Development research institute, Amsterdam, The Netherlands
| | - Aleid G Leemhuis
- Emma Children's Hospital Amsterdam UMC, location University of Amsterdam, Department of Neonatology, Meibergdreef 9, Amsterdam, The Netherlands
| | - Jaap Oosterlaan
- Emma Children's Hospital Amsterdam UMC, location University of Amsterdam, Follow-Me program & Emma Neuroscience group, Meibergdreef 9, Amsterdam, The Netherlands
- Amsterdam Reproduction and Development research institute, Amsterdam, The Netherlands
| | - Marsh Königs
- Emma Children's Hospital Amsterdam UMC, location University of Amsterdam, Follow-Me program & Emma Neuroscience group, Meibergdreef 9, Amsterdam, The Netherlands
- Amsterdam Reproduction and Development research institute, Amsterdam, The Netherlands
| |
Collapse
|
17
|
Bozcuk HŞ, Sert L, Kaplan MA, Tatlı AM, Karaca M, Muğlu H, Bilici A, Kılıçtaş BŞ, Artaç M, Erel P, Yumuk PF, Bilgin B, Şendur MAN, Kılıçkap S, Taban H, Ballı S, Demirkazık A, Akdağ F, Hacıbekiroğlu İ, Güzel HG, Koçer M, Gürsoy P, Köylü B, Selçukbiricik F, Karakaya G, Alemdar MS. Enhancing Treatment Decisions for Advanced Non-Small Cell Lung Cancer with Epidermal Growth Factor Receptor Mutations: A Reinforcement Learning Approach. Cancers (Basel) 2025; 17:233. [PMID: 39858018 PMCID: PMC11763509 DOI: 10.3390/cancers17020233] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2024] [Revised: 12/29/2024] [Accepted: 01/07/2025] [Indexed: 01/27/2025] Open
Abstract
BACKGROUND Although higher-generation TKIs are associated with improved progression-free survival in advanced NSCLC patients with EGFR mutations, the optimal selection of TKI treatment remains uncertain. To address this gap, we developed a web application powered by a reinforcement learning (RL) algorithm to assist in guiding initial TKI treatment decisions. METHODS Clinical and mutational data from advanced NSCLC patients were retrospectively collected from 14 medical centers. Only patients with complete data and sufficient follow-up were included. Multiple supervised machine learning models were tested, with the Extra Trees Classifier (ETC) identified as the most effective for predicting progression-free survival. Feature importance scores were calculated by the ETC, and features were then integrated into a Deep Q-Network (DQN) RL algorithm. The RL model was designed to select optimal TKI generation and a treatment line for each patient and was embedded into an open-source web application for experimental clinical use. RESULTS In total, 318 cases of EGFR-mutant advanced NSCLC were analyzed, with a median patient age of 63. A total of 52.2% of patients were female, and 83.3% had ECOG scores of 0 or 1. The top three most influential features identified were neutrophil-to-lymphocyte ratio (log-transformed), age (log-transformed), and the treatment line of TKI administration, as tested by the ETC algorithm, with an area under curve (AUC) value of 0.73, whereas the DQN RL algorithm achieved a higher AUC value of 0.80, assigning distinct Q-values across four TKI treatment categories. This supports the decision-making process in the web-based 'EGFR Mutant NSCLC Treatment Advisory System', where clinicians can input patient-specific data to receive tailored recommendations. CONCLUSIONS The RL-based web application shows promise in assisting TKI treatment selection for EGFR-mutant advanced NSCLC patients, underscoring the potential for reinforcement learning to enhance decision-making in oncology care.
Collapse
Affiliation(s)
- Hakan Şat Bozcuk
- Independent Researcher, Antalya 07100, Turkey
- Private Practice, Burhanettin Onat Caddesi, 1419. Sokak, No:59, Ocean City, C-Blok, Kat:3, Daire:5, Muratpaşa, Antalya 07100, Turkey
| | - Leyla Sert
- Department of Medical Oncology, Dicle University, Diyarbakır 21280, Turkey; (L.S.); (M.A.K.)
| | - Muhammet Ali Kaplan
- Department of Medical Oncology, Dicle University, Diyarbakır 21280, Turkey; (L.S.); (M.A.K.)
| | - Ali Murat Tatlı
- Department of Medical Oncology, Akdeniz University, Antalya 07058, Turkey; (A.M.T.); (M.K.)
| | - Mustafa Karaca
- Department of Medical Oncology, Akdeniz University, Antalya 07058, Turkey; (A.M.T.); (M.K.)
| | - Harun Muğlu
- Department of Medical Oncology, Faculty of Medicine, İstanbul Medipol University, İstanbul 34810, Turkey; (H.M.); (A.B.)
| | - Ahmet Bilici
- Department of Medical Oncology, Faculty of Medicine, İstanbul Medipol University, İstanbul 34810, Turkey; (H.M.); (A.B.)
| | - Bilge Şah Kılıçtaş
- Department of Medical Oncology, Necmettin Erbakan University, Konya 42090, Turkey; (B.Ş.K.); (M.A.)
| | - Mehmet Artaç
- Department of Medical Oncology, Necmettin Erbakan University, Konya 42090, Turkey; (B.Ş.K.); (M.A.)
| | - Pınar Erel
- Department of Medical Oncology, Marmara University, İstanbul 34722, Turkey; (P.E.); (P.F.Y.)
| | - Perran Fulden Yumuk
- Department of Medical Oncology, Marmara University, İstanbul 34722, Turkey; (P.E.); (P.F.Y.)
- Division of Medical Oncology, School of Medicine, Koç University, İstanbul 34450, Turkey; (B.K.); (F.S.)
| | - Burak Bilgin
- Faculty of Medicine, Ankara Yıldırım Beyazıt University, Ankara 06010, Turkey; (B.B.); (M.A.N.Ş.)
| | - Mehmet Ali Nahit Şendur
- Faculty of Medicine, Ankara Yıldırım Beyazıt University, Ankara 06010, Turkey; (B.B.); (M.A.N.Ş.)
| | - Saadettin Kılıçkap
- Department of Medical Oncology, Faculty of Medicine, İstinye University, İstanbul 34010, Turkey; (S.K.); (M.S.A.)
| | - Hakan Taban
- Department of Medical Oncology, Medical Park Keçiören Hospital, Ankara 06120, Turkey;
| | - Sevinç Ballı
- Department of Medical Oncology, Ankara University, Ankara 06100, Turkey; (S.B.); (A.D.)
| | - Ahmet Demirkazık
- Department of Medical Oncology, Ankara University, Ankara 06100, Turkey; (S.B.); (A.D.)
| | - Fatma Akdağ
- Department of Medical Oncology, Sakarya University, Sakarya 54050, Turkey; (F.A.); (İ.H.)
| | - İlhan Hacıbekiroğlu
- Department of Medical Oncology, Sakarya University, Sakarya 54050, Turkey; (F.A.); (İ.H.)
| | - Halil Göksel Güzel
- Antalya Education and Research Hospital, Antalya 07100, Turkey; (H.G.G.); (M.K.)
| | - Murat Koçer
- Antalya Education and Research Hospital, Antalya 07100, Turkey; (H.G.G.); (M.K.)
| | - Pınar Gürsoy
- Department of Medical Oncology, Ege University, İzmir 35040, Turkey;
| | - Bahadır Köylü
- Division of Medical Oncology, School of Medicine, Koç University, İstanbul 34450, Turkey; (B.K.); (F.S.)
| | - Fatih Selçukbiricik
- Division of Medical Oncology, School of Medicine, Koç University, İstanbul 34450, Turkey; (B.K.); (F.S.)
| | - Gökhan Karakaya
- Department of Medical Oncology, ASV Yaşam Hospital, Antalya 07300, Turkey;
| | - Mustafa Serkan Alemdar
- Department of Medical Oncology, Faculty of Medicine, İstinye University, İstanbul 34010, Turkey; (S.K.); (M.S.A.)
| |
Collapse
|
18
|
Yang Z, Teaney NA, Buttermore ED, Sahin M, Afshar-Saber W. Harnessing the potential of human induced pluripotent stem cells, functional assays and machine learning for neurodevelopmental disorders. Front Neurosci 2025; 18:1524577. [PMID: 39844857 PMCID: PMC11750789 DOI: 10.3389/fnins.2024.1524577] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2024] [Accepted: 12/19/2024] [Indexed: 01/24/2025] Open
Abstract
Neurodevelopmental disorders (NDDs) affect 4.7% of the global population and are associated with delays in brain development and a spectrum of impairments that can lead to lifelong disability and even mortality. Identification of biomarkers for accurate diagnosis and medications for effective treatment are lacking, in part due to the historical use of preclinical model systems that do not translate well to the clinic for neurological disorders, such as rodents and heterologous cell lines. Human-induced pluripotent stem cells (hiPSCs) are a promising in vitro system for modeling NDDs, providing opportunities to understand mechanisms driving NDDs in human neurons. Functional assays, including patch clamping, multielectrode array, and imaging-based assays, are popular tools employed with hiPSC disease models for disease investigation. Recent progress in machine learning (ML) algorithms also presents unprecedented opportunities to advance the NDD research process. In this review, we compare two-dimensional and three-dimensional hiPSC formats for disease modeling, discuss the applications of functional assays, and offer insights on incorporating ML into hiPSC-based NDD research and drug screening.
Collapse
Affiliation(s)
- Ziqin Yang
- Rosamund Stone Zander Translational Neuroscience Center, Boston Children’s Hospital, Harvard Medical School, Boston, MA, United States
- FM Kirby Neurobiology Center, Department of Neurology, Boston Children’s Hospital, Harvard Medical School, Boston, MA, United States
| | - Nicole A. Teaney
- Rosamund Stone Zander Translational Neuroscience Center, Boston Children’s Hospital, Harvard Medical School, Boston, MA, United States
- FM Kirby Neurobiology Center, Department of Neurology, Boston Children’s Hospital, Harvard Medical School, Boston, MA, United States
| | - Elizabeth D. Buttermore
- Rosamund Stone Zander Translational Neuroscience Center, Boston Children’s Hospital, Harvard Medical School, Boston, MA, United States
- FM Kirby Neurobiology Center, Department of Neurology, Boston Children’s Hospital, Harvard Medical School, Boston, MA, United States
- Human Neuron Core, Boston Children’s Hospital, Boston, MA, United States
| | - Mustafa Sahin
- Rosamund Stone Zander Translational Neuroscience Center, Boston Children’s Hospital, Harvard Medical School, Boston, MA, United States
- FM Kirby Neurobiology Center, Department of Neurology, Boston Children’s Hospital, Harvard Medical School, Boston, MA, United States
- Human Neuron Core, Boston Children’s Hospital, Boston, MA, United States
| | - Wardiya Afshar-Saber
- Rosamund Stone Zander Translational Neuroscience Center, Boston Children’s Hospital, Harvard Medical School, Boston, MA, United States
- FM Kirby Neurobiology Center, Department of Neurology, Boston Children’s Hospital, Harvard Medical School, Boston, MA, United States
| |
Collapse
|
19
|
Honchar O, Ashcheulova T, Chumachenko T, Chumachenko D. Early prediction of long COVID-19 syndrome persistence at 12 months after hospitalisation: a prospective observational study from Ukraine. BMJ Open 2025; 15:e084311. [PMID: 39762090 PMCID: PMC11748775 DOI: 10.1136/bmjopen-2024-084311] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/18/2024] [Accepted: 12/09/2024] [Indexed: 01/23/2025] Open
Abstract
OBJECTIVE To identify the early predictors of a self-reported persistence of long COVID syndrome (LCS) at 12 months after hospitalisation and to propose the prognostic model of its development. DESIGN A combined cross-sectional and prospective observational study. SETTING A tertiary care hospital. PARTICIPANTS 221 patients hospitalised for COVID-19 who have undergone comprehensive clinical, sonographic and survey-based evaluation predischarge and at 1 month with subsequent 12-month follow-up. The final cohort included 166 patients who had completed the final visit at 12 months. MAIN OUTCOME MEASURE A self-reported persistence of LCS at 12 months after discharge. RESULTS Self-reported LCS was detected in 76% of participants at 3 months and in 43% at 12 months after discharge. Patients who reported incomplete recovery at 1 year were characterised by a higher burden of comorbidities (Charlson index of 0.69±0.96 vs 0.31±0.51, p=0.001) and residual pulmonary consolidations (1.56±1.78 vs 0.98±1.56, p=0.034), worse blood pressure (BP) control (systolic BP of 138.1±16.2 vs 132.2±15.8 mm Hg, p=0.041), renal (estimated glomerular filtration rate of 59.5±14.7 vs 69.8±20.7 mL/min/1.73 m2, p=0.007) and endothelial function (flow-mediated dilation of the brachial artery of 10.4±5.4 vs 12.4±5.6%, p=0.048), higher in-hospital levels of liver enzymes (alanine aminotransferase (ALT) of 76.3±60.8 vs 46.3±25.3 IU/L, p=0.002) and erythrocyte sedimentation rate (ESR) (34.3±12.1 vs 28.3±12.6 mm/h, p=0.008), slightly higher indices of ventricular longitudinal function (left ventricular (LV) global longitudinal strain (GLS) of 18.0±2.4 vs 17.0±2.3%, p=0011) and higher levels of Hospital Anxiety and Depression Scale anxiety (7.3±4.2 vs 5.6±3.8, p=0.011) and depression scores (6.4±3.9 vs 4.9±4.3, p=0.022) and EFTER-COVID study physical symptoms score (12.3±3.8 vs 9.2±4.2, p<0.001). At 1 month postdischarge, the persisting differences included marginally higher LV GLS, mitral E/e' ratio and significantly higher levels of both resting and exertional physical symptoms versus patients who reported complete recovery. Logistic regression and machine learning-based binary classification models have been developed to predict the persistence of LCS symptoms at 12 months after discharge. CONCLUSIONS Compared with post-COVID-19 patients who have completely recovered by 12 months after hospital discharge, those who have subsequently developed 'very long' COVID were characterised by a variety of more pronounced residual predischarge abnormalities that had mostly subsided by 1 month, except for steady differences in the physical symptoms levels. A simple artificial neural networks-based binary classification model using peak ESR, creatinine, ALT and weight loss during the acute phase, predischarge 6-minute walk distance and complex survey-based symptoms assessment as inputs has shown a 92% accuracy with an area under receiver-operator characteristic curve 0.931 in prediction of LCS symptoms persistence at 12 months.
Collapse
Affiliation(s)
- Oleksii Honchar
- Department of Propedeutics of Internal Medicine, Nursing and Bioethics, Kharkiv National Medical University, Kharkiv, Ukraine
| | - Tetiana Ashcheulova
- Department of Propedeutics of Internal Medicine, Nursing and Bioethics, Kharkiv National Medical University, Kharkiv, Ukraine
| | - Tetyana Chumachenko
- Department of Epidemiology, Kharkiv National Medical University, Kharkiv, Ukraine
| | - Dmytro Chumachenko
- Department of Mathematical Modelling and Artificial Intelligence, National Aerospace University Kharkiv Aviation Institute, Kharkiv, Ukraine
| |
Collapse
|
20
|
Rafiepoor H, Ghorbankhanloo A, Soleimani Dorcheh S, Angouraj Taghavi E, Ghanadan A, Shirkoohi R, Aryanian Z, Amanpour S. Diagnostic Power of MicroRNAs in Melanoma: Integrating Machine Learning for Enhanced Accuracy and Pathway Analysis. J Cell Mol Med 2025; 29:e70367. [PMID: 39823244 PMCID: PMC11740884 DOI: 10.1111/jcmm.70367] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2024] [Revised: 12/10/2024] [Accepted: 01/06/2025] [Indexed: 01/19/2025] Open
Abstract
This study identifies microRNAs (miRNAs) with significant discriminatory power in distinguishing melanoma from nevus, notably hsa-miR-26a and hsa-miR-211, which have exhibited diagnostic potential with accuracy of 81% and 78% respectively. To enhance diagnostic accuracy, we integrated miRNAs into various machine-learning (ML) models. Incorporating miRNAs with AUC scores above 0.70 significantly improved diagnostic accuracy to 94%, with a sensitivity of 91%. These findings underscore the potential of ML models to leverage miRNA data for enhanced melanoma diagnosis. Additionally, using the miRNet tool, we constructed a network of miRNA-miRNA interactions, revealing 170 key genes in melanoma pathophysiology. Protein-protein interaction network analysis via Cytoscape identified hub genes including MYC, BRCA1, JUN, AURKB, CDKN2A, DDX5, MAPK14, DDX3X, DDX6, FOXM1 and GSK3B. The identification of hub genes and their interactions with miRNAs enhances our understanding of the molecular mechanisms driving melanoma. Pathway enrichment analyses highlighted key pathways associated with differentially expressed miRNAs, including the PI3K/AKT, TGF-beta signalling pathway and cell cycle regulation. These pathways are implicated in melanoma development and progression, reinforcing the significance of our findings. The functional enrichment of miRNAs suggests their critical role in modulating essential pathways in melanoma, suggesting their potential as therapeutic targets.
Collapse
Affiliation(s)
- Haniyeh Rafiepoor
- Cancer Biology Research Center, Cancer InstituteTehran University of Medical SciencesTehranIran
| | - Alireza Ghorbankhanloo
- Cancer Biology Research Center, Cancer InstituteTehran University of Medical SciencesTehranIran
| | | | - Elham Angouraj Taghavi
- Cancer Biology Research Center, Cancer InstituteTehran University of Medical SciencesTehranIran
| | - Alireza Ghanadan
- Department of Dermatopathology, Razi HospitalTehran University of Medical SciencesTehranIran
| | - Reza Shirkoohi
- Cancer Biology Research Center, Cancer InstituteTehran University of Medical SciencesTehranIran
- Cancer Research Center, Cancer InstituteTehran University of Medical SciencesTehranIran
| | - Zeinab Aryanian
- Autoimmune Bullous Diseases Research Center, Razi HospitalTehran University of Medical SciencesTehranIran
| | - Saeid Amanpour
- Cancer Biology Research Center, Cancer InstituteTehran University of Medical SciencesTehranIran
| |
Collapse
|
21
|
Tsoumtsa Meda L, Lagarde J, Guillier L, Roussel S, Douarre PE. Using GWAS and Machine Learning to Identify and Predict Genetic Variants Associated with Foodborne Bacteria Phenotypic Traits. Methods Mol Biol 2025; 2852:223-253. [PMID: 39235748 DOI: 10.1007/978-1-0716-4100-2_16] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/06/2024]
Abstract
One of the main challenges in food microbiology is to prevent the risk of outbreaks by avoiding the distribution of food contaminated by bacteria. This requires constant monitoring of the circulating strains throughout the food production chain. Bacterial genomes contain signatures of natural evolution and adaptive markers that can be exploited to better understand the behavior of pathogen in the food industry. The monitoring of foodborne strains can therefore be facilitated by the use of these genomic markers capable of rapidly providing essential information on isolated strains, such as the source of contamination, risk of illness, potential for biofilm formation, and tolerance or resistance to biocides. The increasing availability of large genome datasets is enhancing the understanding of the genetic basis of complex traits such as host adaptation, virulence, and persistence. Genome-wide association studies have shown very promising results in the discovery of genomic markers that can be integrated into rapid detection tools. In addition, machine learning has successfully predicted phenotypes and classified important traits. Genome-wide association and machine learning tools have therefore the potential to support decision-making circuits intending at reducing the burden of foodborne diseases. The aim of this chapter review is to provide knowledge on the use of these two methods in food microbiology and to recommend their use in the field.
Collapse
Affiliation(s)
- Landry Tsoumtsa Meda
- ACTALIA, La Roche-sur-Foron, France
- ANSES, Salmonella and Listeria Unit (USEL), University of Paris-Est, Maisons-Alfort Laboratory for Food Safety, Maisons-Alfort, France
| | - Jean Lagarde
- ANSES, Salmonella and Listeria Unit (USEL), University of Paris-Est, Maisons-Alfort Laboratory for Food Safety, Maisons-Alfort, France
- INRAE, Unit of Process Optimisation in Food, Agriculture and the Environment (UR OPAALE), Rennes, France
| | | | - Sophie Roussel
- ANSES, Salmonella and Listeria Unit (USEL), University of Paris-Est, Maisons-Alfort Laboratory for Food Safety, Maisons-Alfort, France
| | - Pierre-Emmanuel Douarre
- ANSES, Salmonella and Listeria Unit (USEL), University of Paris-Est, Maisons-Alfort Laboratory for Food Safety, Maisons-Alfort, France.
| |
Collapse
|
22
|
Teng C, Yang C, Liu Q. Utilising AI technique to identify depression risk among doctoral students. Sci Rep 2024; 14:31978. [PMID: 39738390 DOI: 10.1038/s41598-024-83617-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2024] [Accepted: 12/16/2024] [Indexed: 01/02/2025] Open
Abstract
The phenomenon that the depression risk among doctoral students is higher than that of other groups should not be ignored. Despite this, studies specifically addressing depression risk in doctoral students are relatively scarce, and existing findings are not universally applicable. Using neural network feature extraction technology, this study aims to investigate the factors contributing to the high depression risk of doctoral students and effectively identify doctoral students at depression risk, so as to propose corresponding improvement strategies to prevent and intervene doctoral students with depression risk for universities. Based on the data from the 2019 Nature Global Doctoral Student Survey, we first screened 13 highly relevant features from a total of 37 features potentially related to the risk of depression among doctoral students by Random Forest algorithm. Subsequently, we trained the optimal prediction model to predict the doctoral students with depression risk using a Multilayer Perceptron (MLP), achieving an accuracy of 89.09% on the test set. Additionally, this study constructed a group portrait of doctoral students at risk of depression, and found that overwork, poor work-life balance, and poor supervisor-student relationship, etc., were typical characteristics among these students. Finally, we proposed several improvement strategies for higher education institutions. Our research offers a new perspective on utilising artificial intelligence (AI) methods to tackle educational challenges, particularly in the identification and support of doctoral students at risk of depression, thereby enhancing their mental health.
Collapse
Affiliation(s)
- Changhong Teng
- School of Education, Beijing Institute of Technology, Beijing, 100081, China
| | - Chunmei Yang
- School of Education, Beijing Institute of Technology, Beijing, 100081, China.
| | | |
Collapse
|
23
|
Handra J, James H, Mbilinyi A, Moller-Hansen A, O'Riley C, Andrade J, Deyell M, Hague C, Hawkins N, Ho K, Hu R, Leipsic J, Tam R. The Role of Machine Learning in the Detection of Cardiac Fibrosis in Electrocardiograms: Scoping Review. JMIR Cardio 2024; 8:e60697. [PMID: 39753213 PMCID: PMC11730231 DOI: 10.2196/60697] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2024] [Revised: 09/30/2024] [Accepted: 11/06/2024] [Indexed: 01/14/2025] Open
Abstract
BACKGROUND Cardiovascular disease remains the leading cause of mortality worldwide. Cardiac fibrosis impacts the underlying pathophysiology of many cardiovascular diseases by altering structural integrity and impairing electrical conduction. Identifying cardiac fibrosis is essential for the prognosis and management of cardiovascular disease; however, current diagnostic methods face challenges due to invasiveness, cost, and inaccessibility. Electrocardiograms (ECGs) are widely available and cost-effective for monitoring cardiac electrical activity. While ECG-based methods for inferring fibrosis exist, they are not commonly used due to accuracy limitations and the need for cardiac expertise. However, the ECG shows promise as a target for machine learning (ML) applications in fibrosis detection. OBJECTIVE This study aims to synthesize and critically evaluate the current state of ECG-based ML approaches for cardiac fibrosis detection. METHODS We conducted a scoping review of research in ECG-based ML applications to identify cardiac fibrosis. Comprehensive searches were performed in PubMed, IEEE Xplore, Scopus, Web of Science, and DBLP databases, including publications up to October 2024. Studies were included if they applied ML techniques to detect cardiac fibrosis using ECG or vectorcardiogram data and provided sufficient methodological details and outcome metrics. Two reviewers independently assessed eligibility and extracted data on the ML models used, their performance metrics, study designs, and limitations. RESULTS We identified 11 studies evaluating ML approaches for detecting cardiac fibrosis using ECG data. These studies used various ML techniques, including classical (8/11, 73%), ensemble (3/11, 27%), and deep learning models (4/11, 36%). Support vector machines were the most used classical model (6/11, 55%), with the best-performing models of each study achieving accuracies of 77% to 93%. Among deep learning approaches, convolutional neural networks showed promising results, with one study reporting an area under the receiver operating characteristic curve (AUC) of 0.89 when combined with clinical features. Notably, a large-scale convolutional neural network study (n=14,052) achieved an AUC of 0.84 for detecting cardiac fibrosis, outperforming cardiologists (AUC 0.63-0.66). However, many studies had limited sample sizes and lacked external validation, potentially impacting the generalizability of the findings. Variability in reporting methods may affect the reproducibility and applicability of these ML-based approaches. CONCLUSIONS ML-augmented ECG analysis shows promise for accessible and cost-effective detection of cardiac fibrosis. However, there are common limitations with respect to study design and insufficient external validation, raising concerns about the generalizability and clinical applicability of the findings. Inconsistencies in methodologies and incomplete reporting further impede cross-study comparisons. Future work may benefit from using prospective study designs, larger and more clinically and demographically diverse datasets, advanced ML models, and rigorous external validation. Addressing these challenges could pave the way for the clinical implementation of ML-based ECG detection of cardiac fibrosis to improve patient outcomes and health care resource allocation.
Collapse
Affiliation(s)
- Julia Handra
- Faculty of Medicine, University of British Columbia, Vancouver, BC, Canada
- School of Biomedical Engineering, University of British Columbia, Vancouver, BC, Canada
| | - Hannah James
- Faculty of Medicine, University of British Columbia, Vancouver, BC, Canada
- School of Biomedical Engineering, University of British Columbia, Vancouver, BC, Canada
| | - Ashery Mbilinyi
- School of Biomedical Engineering, University of British Columbia, Vancouver, BC, Canada
| | - Ashley Moller-Hansen
- School of Biomedical Engineering, University of British Columbia, Vancouver, BC, Canada
| | - Callum O'Riley
- School of Biomedical Engineering, University of British Columbia, Vancouver, BC, Canada
| | - Jason Andrade
- Division of Cardiology, Faculty of Medicine, University of British Columbia, Vancouver, BC, Canada
| | - Marc Deyell
- Division of Cardiology, Faculty of Medicine, University of British Columbia, Vancouver, BC, Canada
| | - Cameron Hague
- Department of Radiology, Faculty of Medicine, University of British Columbia, Vancouver, BC, Canada
| | - Nathaniel Hawkins
- Division of Cardiology, Faculty of Medicine, University of British Columbia, Vancouver, BC, Canada
| | - Kendall Ho
- Department of Emergency Medicine, Faculty of Medicine, University of British Columbia, Vancouver, BC, Canada
| | - Ricky Hu
- Faculty of Medicine, University of British Columbia, Vancouver, BC, Canada
| | - Jonathon Leipsic
- Department of Radiology, Faculty of Medicine, University of British Columbia, Vancouver, BC, Canada
| | - Roger Tam
- School of Biomedical Engineering, University of British Columbia, Vancouver, BC, Canada
| |
Collapse
|
24
|
Sánchez-Martínez LJ, Charle-Cuéllar P, Gado AA, Ousmane N, Hernández CL, López-Ejeda N. Using Machine Learning to Fight Child Acute Malnutrition and Predict Weight Gain During Outpatient Treatment with a Simplified Combined Protocol. Nutrients 2024; 16:4213. [PMID: 39683605 DOI: 10.3390/nu16234213] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2024] [Revised: 12/03/2024] [Accepted: 12/04/2024] [Indexed: 12/18/2024] Open
Abstract
BACKGROUND/OBJECTIVES Child acute malnutrition is a global public health problem, affecting 45 million children under 5 years of age. The World Health Organization recommends monitoring weight gain weekly as an indicator of the correct treatment. However, simplified protocols that do not record the weight and base diagnosis and follow-up in arm circumference at discharge are being tested in emergency settings. The present study aims to use machine learning techniques to predict weight gain based on the socio-economic characteristics at admission for the children treated under a simplified protocol in the Diffa region of Niger. METHODS The sample consists of 535 children aged 6-59 months receiving outpatient treatment for acute malnutrition, for whom information on 51 socio-economic variables was collected. First, the Variable Selection Using Random Forest (VSURF) algorithm was used to select the variables associated with weight gain. Subsequently, the dataset was partitioned into training/testing, and an ensemble model was adjusted using five algorithms for prediction, which were combined using a Random Forest meta-algorithm. Afterward, Receiver Operating Characteristic (ROC) curves were used to identify the optimal cut-off point for predicting the group of individuals most vulnerable to developing low weight gain. RESULTS The critical variables that influence weight gain are water, hygiene and sanitation, the caregiver's employment-socio-economic level and access to treatment. The final ensemble prediction model achieved a better fit (R2 = 0.55) with respect to the individual algorithms (R2 = 0.14-0.27). An optimal cut-off point was identified to establish low weight gain, with an Area Under the Curve (AUC) of 0.777 at a value of <6.5 g/kg/day. The ensemble model achieved a success rate of 84% (78/93) at the identification of individuals below <6.5 g/kg/day in the test set. CONCLUSIONS The results highlight the importance of adapting the cut-off points for weight gain to each context, as well as the practical usefulness that these techniques can have in optimizing and adapting to the treatment in humanitarian settings.
Collapse
Affiliation(s)
- Luis Javier Sánchez-Martínez
- Unit of Physical Anthropology, Department of Biodiversity, Ecology and Evolution, Faculty of Biological Sciences, Complutense University of Madrid, 28040 Madrid, Spain
| | | | | | | | - Candela Lucía Hernández
- Unit of Physical Anthropology, Department of Biodiversity, Ecology and Evolution, Faculty of Biological Sciences, Complutense University of Madrid, 28040 Madrid, Spain
| | - Noemí López-Ejeda
- Unit of Physical Anthropology, Department of Biodiversity, Ecology and Evolution, Faculty of Biological Sciences, Complutense University of Madrid, 28040 Madrid, Spain
- EPINUT Research Group, Faculty of Medicine, Complutense University of Madrid, 28040 Madrid, Spain
| |
Collapse
|
25
|
Klontzas ME, Vernardis SI, Batsali A, Papadogiannis F, Panoskaltsis N, Mantalaris A. Machine Learning and Metabolomics Predict Mesenchymal Stem Cell Osteogenic Differentiation in 2D and 3D Cultures. J Funct Biomater 2024; 15:367. [PMID: 39728167 DOI: 10.3390/jfb15120367] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2024] [Revised: 11/27/2024] [Accepted: 12/03/2024] [Indexed: 12/28/2024] Open
Abstract
Stem cells have been widely used to produce artificial bone grafts. Nonetheless, the variability in the degree of stem cell differentiation is an inherent drawback of artificial graft development and requires robust evaluation tools that can certify the quality of stem cell-based products and avoid source-tissue-related and patient-specific variability in outcomes. Omics analyses have been utilised for the evaluation of stem cell attributes in all stages of stem cell biomanufacturing. Herein, metabolomics in combination with machine learning was utilised for the benchmarking of osteogenic differentiation quality in 2D and 3D cultures. Metabolomics analysis was performed with the use of gas chromatography-mass spectrometry (GC-MS). A set of 11 metabolites was used to train an XGboost model which achieved excellent performance in distinguishing between differentiated and undifferentiated umbilical cord blood mesenchymal stem cells (UCB MSCs). The model was benchmarked against samples not present in the training set, being able to efficiently capture osteogenesis in 3D UCB MSC cultures with an area under the curve (AUC) of 82.6%. On the contrary, the model did not capture any differentiation in Wharton's Jelly MSC samples, which are well-known underperformers in osteogenic differentiation (AUC of 56.2%). Mineralisation was significantly correlated with the levels of fumarate, glycerol, and myo-inositol, the four metabolites found most important for model performance (R2 = 0.89, R2 = 0.94, and R2 = 0.96, and p = 0.016, p = 0.0059, and p = 0.0022, respectively). In conclusion, our results indicate that metabolomics in combination with machine learning can be used for the development of reliable potency assays for the evaluation of Advanced Therapy Medicinal Products.
Collapse
Affiliation(s)
- Michail E Klontzas
- Artificial Intelligence and Translational Imaging (ATI) Lab, Department of Radiology, School of Medicine, University of Crete, 71003 Heraklion, Greece
- Computational Biomedicine Laboratory, Institute of Computer Science, Foundation for Research and Technology (ICS-FORTH), 70013 Heraklion, Greece
| | | | - Aristea Batsali
- Haemopoiesis Research Laboratory, School of Medicine, University of Crete, 71003 Heraklion, Greece
| | - Fotios Papadogiannis
- Haemopoiesis Research Laboratory, School of Medicine, University of Crete, 71003 Heraklion, Greece
| | - Nicki Panoskaltsis
- BioMedical Systems Engineering Laboratory, Panoz Institute, School of Pharmacy and Pharmaceutical Sciences, Trinity College, D02 PN40 Dublin, Ireland
| | - Athanasios Mantalaris
- BioMedical Systems Engineering Laboratory, Panoz Institute, School of Pharmacy and Pharmaceutical Sciences, Trinity College, D02 PN40 Dublin, Ireland
- National Institute for Bioprocessing Research and Training (NIBRT), Foster Avenue, Mount Merrion, Blackrock, A94 X099 Dublin, Ireland
| |
Collapse
|
26
|
Ogutu S, Mohammed M, Mwambi H. Cytokine profiles as predictors of HIV incidence using machine learning survival models and statistical interpretable techniques. Sci Rep 2024; 14:29895. [PMID: 39622992 PMCID: PMC11612445 DOI: 10.1038/s41598-024-81510-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2024] [Accepted: 11/27/2024] [Indexed: 12/06/2024] Open
Abstract
HIV remains a critical global health issue, with an estimated 39.9 million people living with the virus worldwide by the end of 2023 (according to WHO). Although the epidemic's impact varies significantly across regions, Africa remains the most affected. In the past decade, considerable efforts have focused on developing preventive measures, such as vaccines and pre-exposure prophylaxis, to combat sexually transmitted HIV. Recently, cytokine profiles have gained attention as potential predictors of HIV incidence due to their involvement in immune regulation and inflammation, presenting new opportunities to enhance preventative strategies. However, the high-dimensional, time-varying nature of cytokine data collected in clinical research, presents challenges for traditional statistical methods like the Cox proportional hazards (PH) model to effectively analyze survival data related to HIV. Machine learning (ML) survival models offer a robust alternative, especially for addressing the limitations of the PH model's assumptions. In this study, we applied survival support vector machine (SSVM) and random survival forest (RSF) models using changes or means in cytokine levels as predictors to assess their association with HIV incidence, evaluate variable importance, measure predictive accuracy using the concordance index (C-index) and integrated Brier score (IBS) and interpret the model's predictions using Shapley additive explanations (SHAP) values. Our results indicated that RSFs models outperformed SSVMs models, with the difference covariate model performing better than the mean covariate model. The highest C-index for SSVM was 0.7180 under the difference covariate model, while for RSF, it reached 0.8801 under the difference covariate model using the log-rank split rule. Key cytokines identified as positive predictors of HIV incidence included TNF-A, BASIC-FGF, IL-5, MCP-3, and EOTAXIN, while 29 cytokines were negative predictors. Baseline factors such as condom use frequency, treatment status, number of partners, and sexual activity also emerged as significant predictors. This study underscored the potential of cytokine profiles for predicting HIV incidence and highlighted the advantages of RSFs models in analyzing high-dimensional, time-varying data over SSVMs. It further through ablation studies emphasized the importance of selecting key features within mean and difference based covariate models to achieve an optimal balance between model complexity and predictive accuracy.
Collapse
Affiliation(s)
- Sarah Ogutu
- School of Mathematics, Statistics and Computer Science, University of KwaZulu-Natal, Pietermaritzburg, 3201, South Africa.
| | - Mohanad Mohammed
- School of Mathematics, Statistics and Computer Science, University of KwaZulu-Natal, Pietermaritzburg, 3201, South Africa
- School of Nursing and Public Health, University of KwaZulu-Natal, Pietermaritzburg, 3201, South Africa
| | - Henry Mwambi
- School of Mathematics, Statistics and Computer Science, University of KwaZulu-Natal, Pietermaritzburg, 3201, South Africa
| |
Collapse
|
27
|
Ngeow AJH, Moosa AS, Tan MG, Zou L, Goh MMR, Lim GH, Tagamolila V, Ereno I, Durnford JR, Cheung SKH, Hong NWJ, Soh SY, Tay YY, Chang ZY, Ong R, Tsang LPM, Yip BKL, Chia KW, Yap K, Lim MH, Ta AWA, Goh HL, Yeo CL, Chan DKL, Tan NC. Development and Validation of a Smartphone Application for Neonatal Jaundice Screening. JAMA Netw Open 2024; 7:e2450260. [PMID: 39661385 PMCID: PMC11635536 DOI: 10.1001/jamanetworkopen.2024.50260] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/17/2024] [Accepted: 10/20/2024] [Indexed: 12/12/2024] Open
Abstract
Importance This diagnostic study describes the merger of domain knowledge (Kramer principle of dermal advancement of icterus) with current machine learning (ML) techniques to create a novel tool for screening of neonatal jaundice (NNJ), which affects 60% of term and 80% of preterm infants. Objective This study aimed to develop and validate a smartphone-based ML app to predict bilirubin (SpB) levels in multiethnic neonates using skin color analysis. Design, Setting, and Participants This diagnostic study was conducted between June 2022 and June 2024 at a tertiary hospital and 4 primary-care clinics in Singapore with a consecutive sample of neonates born at 35 or more weeks' gestation and within 21 days of birth. Exposure The smartphone-based ML app captured skin images via the central aperture of a standardized color calibration sticker card from multiple regions of interest arranged in a cephalocaudal fashion, following the Kramer principle of dermal advancement of icterus. The ML model underwent iterative development and k-folds cross-validation, with performance assessed based on root mean squared error, Pearson correlation, and agreement with total serum bilirubin (TSB). The final ML model underwent temporal validation. Main Outcomes and Measures Linear correlation and statistical agreement between paired SpB and TSB; sensitivity and specificity for detection of TSB equal to or greater than 17mg/dL with SpB equal to or greater than 13 mg/dL were assessed. Results The smartphone-based ML app was validated on 546 neonates (median [IQR] gestational age, 38.0 [35.0-41.0] weeks; 286 [52.4%] male; 315 [57.7%] Chinese, 35 [6.4%] Indian, 169 [31.0%] Malay, and 27 [4.9%] other ethnicities). Iterative development and cross-validation was performed on 352 neonates. The final ML model (ensembled gradient boosted trees) incorporated yellowness indicators from the forehead, sternum, and abdomen. Temporal validation on 194 neonates yielded a Pearson r of 0.84 (95% CI, 0.79-0.88; P < .001), 82% of data pairs within clinically acceptable limits of 3 mg/dL, sensitivity of 100%, specificity of 70%, positive predictive value of 10%, negative predictive value of 100%, positive likelihood ratio of 3.3, negative likelihood ratio of 0, and area under the receiver operating characteristic curve of 0.89 (95% CI, 0.82-0.96). Conclusions and Relevance In this diagnostic study of a new smartphone-based ML app, there was good correlation and statistical agreement with TSB with sensitivity of 100%. The screening tool has the potential to be an NNJ screening tool, with treatment decisions based on TSB (reference standard). Further prospective studies are needed to establish the generalizability and cost-effectiveness of the screening tool in the clinical setting.
Collapse
Affiliation(s)
- Alvin Jia Hao Ngeow
- Department of Neonatal and Developmental Medicine, Singapore General Hospital, Singapore
- Yong Loo Lin School of Medicine, National University of Singapore, Singapore
- Paediatrics Academic Clinical Programme, Duke-NUS Medical School, Singapore
- Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore
| | - Aminath Shiwaza Moosa
- SingHealth Polyclinics, Singapore
- Family Medicine Academic Clinical Programme, Duke-NUS Medical School, Singapore
| | - Mary Grace Tan
- Department of Neonatal and Developmental Medicine, Singapore General Hospital, Singapore
| | - Lin Zou
- Synapxe (formerly Integrated Health Information Systems, IHiS), Singapore
| | | | - Gek Hsiang Lim
- Health Services Research Unit, Singapore General Hospital, Singapore
| | - Vina Tagamolila
- Department of Neonatal and Developmental Medicine, Singapore General Hospital, Singapore
| | - Imelda Ereno
- Department of Neonatal and Developmental Medicine, Singapore General Hospital, Singapore
| | - Jared Ryan Durnford
- Department of Neonatal and Developmental Medicine, Singapore General Hospital, Singapore
- Paediatrics Academic Clinical Programme, Duke-NUS Medical School, Singapore
| | - Samson Kei Him Cheung
- Department of Neonatal and Developmental Medicine, Singapore General Hospital, Singapore
- Paediatrics Academic Clinical Programme, Duke-NUS Medical School, Singapore
| | - Nicholas Wei Jie Hong
- Department of Neonatal and Developmental Medicine, Singapore General Hospital, Singapore
- Paediatrics Academic Clinical Programme, Duke-NUS Medical School, Singapore
| | - Ser Yee Soh
- Department of Neonatal and Developmental Medicine, Singapore General Hospital, Singapore
- Paediatrics Academic Clinical Programme, Duke-NUS Medical School, Singapore
| | - Yih Yann Tay
- Nursing Division, Singapore General Hospital, Singapore
| | - Zi Ying Chang
- SingHealth Polyclinics, Singapore
- Family Medicine Academic Clinical Programme, Duke-NUS Medical School, Singapore
| | - Ruiheng Ong
- SingHealth Polyclinics, Singapore
- Family Medicine Academic Clinical Programme, Duke-NUS Medical School, Singapore
| | - Li Ping Marianne Tsang
- SingHealth Polyclinics, Singapore
- Family Medicine Academic Clinical Programme, Duke-NUS Medical School, Singapore
| | - Benny K. L. Yip
- Department of Future Health System, Singapore General Hospital, Singapore
| | - Kuok Wei Chia
- Department of Future Health System, Singapore General Hospital, Singapore
| | | | - Ming Hwee Lim
- Department of Clinical Pathology, Singapore General Hospital, Singapore
| | - Andy Wee An Ta
- Synapxe (formerly Integrated Health Information Systems, IHiS), Singapore
| | - Han Leong Goh
- Synapxe (formerly Integrated Health Information Systems, IHiS), Singapore
| | - Cheo Lian Yeo
- Department of Neonatal and Developmental Medicine, Singapore General Hospital, Singapore
- Yong Loo Lin School of Medicine, National University of Singapore, Singapore
- Paediatrics Academic Clinical Programme, Duke-NUS Medical School, Singapore
- Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore
| | - Daisy Kwai Lin Chan
- Department of Neonatal and Developmental Medicine, Singapore General Hospital, Singapore
- Yong Loo Lin School of Medicine, National University of Singapore, Singapore
- Paediatrics Academic Clinical Programme, Duke-NUS Medical School, Singapore
- Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore
| | - Ngiap Chuan Tan
- SingHealth Polyclinics, Singapore
- Family Medicine Academic Clinical Programme, Duke-NUS Medical School, Singapore
| |
Collapse
|
28
|
Ghazal H, El-Absawy ESA, Ead W, Hasan ME. Machine learning-guided differential gene expression analysis identifies a highly-connected seven-gene cluster in triple-negative breast cancer. Biomedicine (Taipei) 2024; 14:15-35. [PMID: 39777114 PMCID: PMC11703398 DOI: 10.37796/2211-8039.1467] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2024] [Revised: 08/30/2024] [Accepted: 09/02/2024] [Indexed: 01/11/2025] Open
Abstract
Background One of the most challenging cancers is triple-negative breast cancer, which is subdivided into many molecular subtypes. Due to the high degree of heterogeneity, the role of precision medicine remains challenging. With the use of machine learning (ML)-guided gene selection, the differential gene expression analysis can be optimized, and eventually, the process of precision medicine can see great advancement through biomarker discovery. Purpose Enhancing precision medicine in the oncology field by identification of the most representative differentially-expressed genes to be used as biomarkers or as novel drug targets. Methods By utilizing data from the Gene Expression Omnibus (GEO) repository and The Cancer Genome Atlas (TCGA), we identified the differentially expressed genes using the linear model for microarray analysis (LIMMA) and edgeR algorithms, and applied ML-based feature selection using several algorithms. Results A total of 27 genes were selected by merging features identified with both LIMMA and ML-based feature selection methods. The models with the highest area under the curve (AUC) are CatBoost, Extreme Gradient Boosting (XGBoost), Random Forest, and Multi-Layer Perceptron classifiers. ESR1, FOXA1, GATA3, XBP1, GREB1, AR, and AGR2 were identified as hub genes in a highly interconnected cluster. Conclusion ML-based gene selection shows a great impact on the identification of hub genes. The ML models built can improve precision oncology in diagnosis and prognosis. The identified hub genes can serve as biomarkers and warrant further research for potential drug target development.
Collapse
Affiliation(s)
- Hany Ghazal
- Bioinformatics Department, Genetic Engineering and Biotechnology Research Institute, University of Sadat City, Sadat City,
Egypt
| | - El-Sayed A. El-Absawy
- Bioinformatics Department, Genetic Engineering and Biotechnology Research Institute, University of Sadat City, Sadat City,
Egypt
| | - Waleed Ead
- Information Systems Department, Faculty of Computers and Artificial Intelligence, Beni-Suef University, Beni-Suef,
Egypt
| | - Mohamed E. Hasan
- Bioinformatics Department, Genetic Engineering and Biotechnology Research Institute, University of Sadat City, Sadat City,
Egypt
| |
Collapse
|
29
|
Vachon J, Kerckhoffs J, Buteau S, Smargiassi A. Do machine learning methods improve prediction of ambient air pollutants with high spatial contrast? A systematic review. ENVIRONMENTAL RESEARCH 2024; 262:119751. [PMID: 39117059 DOI: 10.1016/j.envres.2024.119751] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/20/2023] [Revised: 07/18/2024] [Accepted: 08/04/2024] [Indexed: 08/10/2024]
Abstract
BACKGROUND & OBJECTIVE The use of machine learning for air pollution modelling is rapidly increasing. We conducted a systematic review of studies comparing statistical and machine learning models predicting the spatiotemporal variation of ambient nitrogen dioxide (NO2), ultrafine particles (UFPs) and black carbon (BC) to determine whether and in which scenarios machine learning generates more accurate predictions. METHODS Web of Science and Scopus were searched up to June 13, 2024. All records were screened by two independent reviewers. Differences in the coefficient of determination (R2) and Root Mean Square Error (RMSE) between best statistical and machine learning methods were compared across categories of methodological elements. RESULTS A total of 38 studies with 46 model comparisons (30 for NO2, 8 for UFPs and 8 for BC) were included. Linear non-regularized methods and Random Forest were most frequently used. Machine learning outperformed statistical models in 34 comparisons. Mean differences (95% confidence intervals) in R2 and RMSE between best machine learning and statistical models were 0.12 (0.08, 0.17) and 20% (11%, 29%) respectively. Tree-based methods performed best in 12 of 17 multi-model comparisons. Nonlinear or regularization regression methods were used in only 12 comparisons and provided similar performance to machine learning methods. CONCLUSION This systematic review suggests that machine learning methods, especially tree-based methods, may be superior to linear non-regularized methods for predicting ambient concentrations of NO2, UFPs and BC. Additional comparison studies using nonlinear, regularized and a wider array of machine learning methods are needed to confirm their relative performance. Future air pollution studies would also benefit from more explicit and standardized reporting of methodologies and results.
Collapse
Affiliation(s)
- Julien Vachon
- Department of Environmental and Occupational Health, School of Public Health, University of Montreal, Montreal, Canada; Center for Public Health Research (CReSP), University of Montreal and CIUSSS Du Centre-Sud-de-l'Île-de-Montréal, Montreal, Canada
| | - Jules Kerckhoffs
- Institute for Risk Assessment Sciences, Utrecht University, Utrecht, the Netherlands
| | - Stéphane Buteau
- Department of Environmental and Occupational Health, School of Public Health, University of Montreal, Montreal, Canada; Center for Public Health Research (CReSP), University of Montreal and CIUSSS Du Centre-Sud-de-l'Île-de-Montréal, Montreal, Canada
| | - Audrey Smargiassi
- Department of Environmental and Occupational Health, School of Public Health, University of Montreal, Montreal, Canada; Center for Public Health Research (CReSP), University of Montreal and CIUSSS Du Centre-Sud-de-l'Île-de-Montréal, Montreal, Canada.
| |
Collapse
|
30
|
Wlosik J, Granjeaud S, Gorvel L, Olive D, Chretien AS. A beginner's guide to supervised analysis for mass cytometry data in cancer biology. Cytometry A 2024; 105:853-869. [PMID: 39486897 DOI: 10.1002/cyto.a.24901] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2024] [Revised: 09/16/2024] [Accepted: 10/01/2024] [Indexed: 11/04/2024]
Abstract
Mass cytometry enables deep profiling of biological samples at single-cell resolution. This technology is more than relevant in cancer research due to high cellular heterogeneity and complexity. Downstream analysis of high-dimensional datasets increasingly relies on machine learning (ML) to extract clinically relevant information, including supervised algorithms for classification and regression purposes. In cancer research, they are used to develop predictive models that will guide clinical decision making. However, the development of supervised algorithms faces major challenges, such as sufficient validation, before being translated into the clinics. In this work, we provide a framework for the analysis of mass cytometry data with a specific focus on supervised algorithms and practical examples of their applications. We also raise awareness on key issues regarding good practices for researchers curious to implement supervised ML on their mass cytometry data. Finally, we discuss the challenges of supervised ML application to cancer research.
Collapse
Affiliation(s)
- Julia Wlosik
- Team 'Immunity and Cancer', Marseille Cancer Research Center, Inserm U1068, CNRS UMR7258, Paoli-Calmettes Institute, Aix-Marseille University UM105, Marseille, France
- Immunomonitoring Department, Paoli-Calmettes Institute, Marseille, France
| | - Samuel Granjeaud
- Systems Biology Platform, Marseille Cancer Research Center, Inserm U1068, CNRS UMR7258, Paoli-Calmettes Institute, Aix-Marseille University UM105, Marseille, France
| | - Laurent Gorvel
- Team 'Immunity and Cancer', Marseille Cancer Research Center, Inserm U1068, CNRS UMR7258, Paoli-Calmettes Institute, Aix-Marseille University UM105, Marseille, France
- Immunomonitoring Department, Paoli-Calmettes Institute, Marseille, France
| | - Daniel Olive
- Team 'Immunity and Cancer', Marseille Cancer Research Center, Inserm U1068, CNRS UMR7258, Paoli-Calmettes Institute, Aix-Marseille University UM105, Marseille, France
- Immunomonitoring Department, Paoli-Calmettes Institute, Marseille, France
| | - Anne-Sophie Chretien
- Team 'Immunity and Cancer', Marseille Cancer Research Center, Inserm U1068, CNRS UMR7258, Paoli-Calmettes Institute, Aix-Marseille University UM105, Marseille, France
- Immunomonitoring Department, Paoli-Calmettes Institute, Marseille, France
| |
Collapse
|
31
|
López-Rueda A, Rodríguez-Sánchez MÁ, Serrano E, Moreno J, Rodríguez A, Llull L, Amaro S, Oleaga L. Enhancing mortality prediction in patients with spontaneous intracerebral hemorrhage: Radiomics and supervised machine learning on non-contrast computed tomography. Eur J Radiol Open 2024; 13:100618. [PMID: 39687913 PMCID: PMC11648778 DOI: 10.1016/j.ejro.2024.100618] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2024] [Revised: 11/17/2024] [Accepted: 11/28/2024] [Indexed: 12/18/2024] Open
Abstract
Purpose This study aims to develop a Radiomics-based Supervised Machine-Learning model to predict mortality in patients with spontaneous intracerebral hemorrhage (sICH). Methods Retrospective analysis of a prospectively collected clinical registry of patients with sICH consecutively admitted at a single academic comprehensive stroke center between January-2016 and April-2018. We conducted an in-depth analysis of 105 radiomic features extracted from 105 patients. Following the identification and handling of missing values, radiomics values were scaled to 0-1 to train different classifiers. The sample was split into 80-20 % training-test and validation cohort in a stratified fashion. Random Forest(RF), K-Nearest Neighbor(KNN), and Support Vector Machine(SVM) classifiers were evaluated, along with several feature selection methods and hyperparameter optimization strategies, to classify the binary outcome of mortality or survival during hospital admission. A tenfold stratified cross-validation method was used to train the models, and average metrics were calculated. Results RF, KNN, and SVM, with the "DropOut+SelectKBest" feature selection strategy and no hyperparameter optimization, demonstrated the best performances with the least number of radiomic features and the most simplified models, achieving a sensitivity range between 0.90 and 0.95 and AUC range from 0.97 to 1 on the validation dataset. Regarding the confusion matrix, the SVM model did not predict any false negative test (negative predicted value 1). Conclusion Radiomics-based Supervised Machine Learning models can predict mortality during admission in patients with sICH. SVM with the "DropOut+SelectKBest" feature selection strategy and no hyperparameter optimization was the best simplified model to detect mortality during admission in patients with sICH.
Collapse
Affiliation(s)
- Antonio López-Rueda
- Clinical Informatics Department, Hospital Clínic de Barcelona, Barcelona, Spain
- Radiology Department, Hospital Clínic de Barcelona, Barcelona, Spain
| | | | - Elena Serrano
- Radiology Department, Hospital Universitario de Bellvitge, Barcelona, Spain
| | - Javier Moreno
- Radiology Department, Hospital Clínic de Barcelona, Barcelona, Spain
| | | | - Laura Llull
- Neurology Department, Hospital Clínic de Barcelona, Barcelona, Spain
| | - Sergi Amaro
- Neurology Department, Hospital Clínic de Barcelona, Barcelona, Spain
| | - Laura Oleaga
- Radiology Department, Hospital Clínic de Barcelona, Barcelona, Spain
| |
Collapse
|
32
|
Leghissa M, Carrera Á, Iglesias CÁ. FRELSA: A dataset for frailty in elderly people originated from ELSA and evaluated through machine learning models. Int J Med Inform 2024; 192:105603. [PMID: 39232373 DOI: 10.1016/j.ijmedinf.2024.105603] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2024] [Revised: 08/01/2024] [Accepted: 08/13/2024] [Indexed: 09/06/2024]
Abstract
BACKGROUND Frailty is an age-related syndrome characterized by loss of strength and exhaustion and associated with multi-morbidity. Early detection and prediction of the appearance of frailty could help older people age better and prevent them from needing invasive and expensive treatments. Machine learning techniques show promising results in creating a medical support tool for such a task. METHODS This study aims to create a dataset for machine learning-based frailty studies, using Fried's Frailty Phenotype definition. Starting from a longitudinal study on aging in the UK population, we defined a frailty label for each subject. We evaluated the definition by training seven different models for detecting frailty with data that were contemporary to the ones used for the definition. We then integrated more data from two years before to obtain prediction models with a 24-month horizon. Features selection was performed using the MultiSURF algorithm, which ranks all features in order of relevance to the detection or prediction task. RESULTS We present a new frailty dataset of 5303 subjects and more than 6500 available features. It is publicly available, provided one has access to the original English Longitudinal Study of Ageing dataset. The dataset is balanced after grouping frailty with pre-frailty, and it is suitable for multiclass or binary classification and prediction problems. The seven tested architectures performed similarly, forming a solid baseline that can be improved with future work. Linear regression achieved the best F-score and AUROC in detection and prediction tasks. CONCLUSIONS Creating new frailty-annotated datasets of this size is necessary to develop and improve the frailty prediction techniques. We have shown that our dataset can be used to study and test machine learning models to detect and predict frailty. Future work should improve models' architecture and performance, consider explainability, and possibly enrich the dataset with older waves.
Collapse
Affiliation(s)
- Matteo Leghissa
- Universidad Politécnica de Madrid, Av. Complutense, 30, 28040, Madrid, Spain.
| | - Álvaro Carrera
- Universidad Politécnica de Madrid, Av. Complutense, 30, 28040, Madrid, Spain.
| | - Carlos Á Iglesias
- Universidad Politécnica de Madrid, Av. Complutense, 30, 28040, Madrid, Spain.
| |
Collapse
|
33
|
Hannon DM, Syed JDA, McNicholas B, Madden M, Laffey JG. The development of a C5.0 machine learning model in a limited data set to predict early mortality in patients with ARDS undergoing an initial session of prone positioning. Intensive Care Med Exp 2024; 12:103. [PMID: 39540987 PMCID: PMC11564488 DOI: 10.1186/s40635-024-00682-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2024] [Accepted: 10/06/2024] [Indexed: 11/16/2024] Open
Abstract
BACKGROUND Acute Respiratory Distress Syndrome (ARDS) has a high morbidity and mortality. One therapy that can decrease mortality is ventilation in the prone position (PP). Patients undergoing PP are amongst the sickest, and there is a need for early identification of patients at particularly high risk of death. These patients may benefit from an in-depth review of treatment or consideration of rescue therapies. We report the development of a machine learning model trained to predict early mortality in patients undergoing prone positioning as part of the management of their ARDS. METHODS Prospectively collected clinical data were analysed retrospectively from a single tertiary ICU. The records of patients who underwent an initial session of prone positioning whilst receiving invasive mechanical ventilation were identified (n = 131). The decision to perform prone positioning was based on the criteria in the PROSEVA study. A C5.0 classifier algorithm with adaptive boosting was trained on data gathered before, during, and after initial proning. Data was split between training (85% of data) and testing (15% of data). Hyperparameter tuning was achieved through a grid-search using a maximal entropy configuration. Predictions for 7-day mortality after initial proning session were made on the training and testing data. RESULTS The model demonstrated good performance in predicting 7-day mortality (AUROC: 0.89 training, 0.78 testing). Seven variables were used for prediction. Sensitivity was 0.80 and specificity was 0.67 on the testing data set. Patients predicted to survive had 13.3% mortality, while those predicted to die had 66.67% mortality. Among patients in whom the model predicted patient would survive to day 7 based on their response, mortality at day 7 was 13.3%. Conversely, if the model predicted the patient would not survive to day 7, mortality was 66.67%. CONCLUSIONS This proof-of-concept study shows that with a limited data set, a C5.0 classifier can predict 7-day mortality from a number of variables, including the response to initial proning, and identify a cohort at significantly higher risk of death. This can help identify patients failing conventional therapies who may benefit from a thorough review of their management, including consideration of rescue treatments, such as extracorporeal membrane oxygenation. This study shows the potential of a machine learning model to identify ARDS patients at high risk of early mortality following PP. This information can guide clinicians in tailoring treatment strategies and considering rescue therapies. Further validation in larger cohorts is needed.
Collapse
Affiliation(s)
- David M Hannon
- Department of Anaesthesia, Galway University Hospital, and School of Medicine, University of Galway, Galway, Ireland
- Anaesthesia and Intensive Care Medicine, School of Medicine, University of Galway, Galway, Ireland
| | - Jaffar David Abbas Syed
- Department of Anaesthesia, Galway University Hospital, and School of Medicine, University of Galway, Galway, Ireland
| | - Bairbre McNicholas
- Department of Anaesthesia, Galway University Hospital, and School of Medicine, University of Galway, Galway, Ireland
- Anaesthesia and Intensive Care Medicine, School of Medicine, University of Galway, Galway, Ireland
| | - Michael Madden
- School of Computer Science, University of Galway, Galway, Ireland
| | - John G Laffey
- Department of Anaesthesia, Galway University Hospital, and School of Medicine, University of Galway, Galway, Ireland.
- Anaesthesia and Intensive Care Medicine, School of Medicine, University of Galway, Galway, Ireland.
| |
Collapse
|
34
|
Knezevic A, Arsenovic J, Garipi E, Platisa N, Savic A, Aleksandric T, Popovic D, Subic L, Milenovic N, Simic Panic D, Budinski S, Pasternak J, Manojlovic V, Knezevic MJ, Kapetina Radovic M, Jelicic Z. Machine Learning Model for Predicting Walking Ability in Lower Limb Amputees. J Clin Med 2024; 13:6763. [PMID: 39597907 PMCID: PMC11594448 DOI: 10.3390/jcm13226763] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2024] [Revised: 11/04/2024] [Accepted: 11/08/2024] [Indexed: 11/29/2024] Open
Abstract
Background/Objectives: The number of individuals with lower limb loss (LLL) is rising. Therefore, identifying the walking potential in individuals with LLL and prescribing adequate prosthetic systems are crucial. Various factors can influence participants' walking ability, to different extents. The aim of the present study was to apply machine learning methods to develop a predictive mode. This model can assist rehabilitation and limb loss care teams in making informed decisions regarding prosthesis prescription and predicting walking ability in individuals with LLL. Methods: The present study was designed as a prospective cross-sectional study encompassing 104 consecutively recruited participants with LLL (average age 62.1 ± 10.9 years, 80 (76.9%) men) at the Medical Rehabilitation Clinic. Demographic, physical, psychological, and social status data of patients were collected at the beginning of the rehabilitation program. At the end of the treatment, K-level estimation of functional ability, a Timed Up and Go Test (TUG), and a Two-Minute Walking Test (TMWT) were performed. Support vector machines (SVM) were used to develop the prediction model. Results: Three decision trees were created, one for each output, as follows: K-level, TUG, and TMWT. For all three outputs, there were eight significant predictors (balance, body mass index, age, Beck depression inventory, amputation level, muscle strength of the residual extremity hip extensors, intact extremity (IE) plantar flexors, and IE hip extensors). For the K-level, the ninth predictor was The Multidimensional Scale of Perceived Social Support (MSPSS). Conclusions: Using the SVM model, we can predict the K-level, TUG, and TMWT with high accuracy. These clinical assessments could be incorporated into routine clinical practice to guide clinicians and inform patients of their potential level of ambulation.
Collapse
Affiliation(s)
- Aleksandar Knezevic
- Faculty of Medicine, University of Novi Sad, 21000 Novi Sad, Serbia; (E.G.); (A.S.); (T.A.); (D.P.); (L.S.); (N.M.); (D.S.P.); (S.B.); (J.P.); (V.M.); (M.J.K.)
- Center for Physical Medicine and Rehabilitation, University Clinical Center of Vojvodina, 21000 Novi Sad, Serbia;
| | - Jovana Arsenovic
- Faculty of Technical Sciences, University of Novi Sad, 21000 Novi Sad, Serbia; (J.A.); (M.K.R.); (Z.J.)
| | - Enis Garipi
- Faculty of Medicine, University of Novi Sad, 21000 Novi Sad, Serbia; (E.G.); (A.S.); (T.A.); (D.P.); (L.S.); (N.M.); (D.S.P.); (S.B.); (J.P.); (V.M.); (M.J.K.)
- Center for Physical Medicine and Rehabilitation, University Clinical Center of Vojvodina, 21000 Novi Sad, Serbia;
| | - Nedeljko Platisa
- Center for Physical Medicine and Rehabilitation, University Clinical Center of Vojvodina, 21000 Novi Sad, Serbia;
| | - Aleksandra Savic
- Faculty of Medicine, University of Novi Sad, 21000 Novi Sad, Serbia; (E.G.); (A.S.); (T.A.); (D.P.); (L.S.); (N.M.); (D.S.P.); (S.B.); (J.P.); (V.M.); (M.J.K.)
- Center for Physical Medicine and Rehabilitation, University Clinical Center of Vojvodina, 21000 Novi Sad, Serbia;
| | - Tijana Aleksandric
- Faculty of Medicine, University of Novi Sad, 21000 Novi Sad, Serbia; (E.G.); (A.S.); (T.A.); (D.P.); (L.S.); (N.M.); (D.S.P.); (S.B.); (J.P.); (V.M.); (M.J.K.)
- Center for Physical Medicine and Rehabilitation, University Clinical Center of Vojvodina, 21000 Novi Sad, Serbia;
| | - Dunja Popovic
- Faculty of Medicine, University of Novi Sad, 21000 Novi Sad, Serbia; (E.G.); (A.S.); (T.A.); (D.P.); (L.S.); (N.M.); (D.S.P.); (S.B.); (J.P.); (V.M.); (M.J.K.)
- Center for Physical Medicine and Rehabilitation, University Clinical Center of Vojvodina, 21000 Novi Sad, Serbia;
| | - Larisa Subic
- Faculty of Medicine, University of Novi Sad, 21000 Novi Sad, Serbia; (E.G.); (A.S.); (T.A.); (D.P.); (L.S.); (N.M.); (D.S.P.); (S.B.); (J.P.); (V.M.); (M.J.K.)
- Center for Physical Medicine and Rehabilitation, University Clinical Center of Vojvodina, 21000 Novi Sad, Serbia;
| | - Natasa Milenovic
- Faculty of Medicine, University of Novi Sad, 21000 Novi Sad, Serbia; (E.G.); (A.S.); (T.A.); (D.P.); (L.S.); (N.M.); (D.S.P.); (S.B.); (J.P.); (V.M.); (M.J.K.)
- Special Hospital for Rheumatic Diseases, 21000 Novi Sad, Serbia
| | - Dusica Simic Panic
- Faculty of Medicine, University of Novi Sad, 21000 Novi Sad, Serbia; (E.G.); (A.S.); (T.A.); (D.P.); (L.S.); (N.M.); (D.S.P.); (S.B.); (J.P.); (V.M.); (M.J.K.)
- Center for Physical Medicine and Rehabilitation, University Clinical Center of Vojvodina, 21000 Novi Sad, Serbia;
| | - Slavko Budinski
- Faculty of Medicine, University of Novi Sad, 21000 Novi Sad, Serbia; (E.G.); (A.S.); (T.A.); (D.P.); (L.S.); (N.M.); (D.S.P.); (S.B.); (J.P.); (V.M.); (M.J.K.)
- Clinic for Vascular and Endovascular Surgery, University Clinical Center of Vojvodina, 21000 Novi Sad, Serbia
| | - Janko Pasternak
- Faculty of Medicine, University of Novi Sad, 21000 Novi Sad, Serbia; (E.G.); (A.S.); (T.A.); (D.P.); (L.S.); (N.M.); (D.S.P.); (S.B.); (J.P.); (V.M.); (M.J.K.)
- Clinic for Vascular and Endovascular Surgery, University Clinical Center of Vojvodina, 21000 Novi Sad, Serbia
| | - Vladimir Manojlovic
- Faculty of Medicine, University of Novi Sad, 21000 Novi Sad, Serbia; (E.G.); (A.S.); (T.A.); (D.P.); (L.S.); (N.M.); (D.S.P.); (S.B.); (J.P.); (V.M.); (M.J.K.)
- Clinic for Vascular and Endovascular Surgery, University Clinical Center of Vojvodina, 21000 Novi Sad, Serbia
| | - Milica Jeremic Knezevic
- Faculty of Medicine, University of Novi Sad, 21000 Novi Sad, Serbia; (E.G.); (A.S.); (T.A.); (D.P.); (L.S.); (N.M.); (D.S.P.); (S.B.); (J.P.); (V.M.); (M.J.K.)
| | - Mirna Kapetina Radovic
- Faculty of Technical Sciences, University of Novi Sad, 21000 Novi Sad, Serbia; (J.A.); (M.K.R.); (Z.J.)
| | - Zoran Jelicic
- Faculty of Technical Sciences, University of Novi Sad, 21000 Novi Sad, Serbia; (J.A.); (M.K.R.); (Z.J.)
| |
Collapse
|
35
|
Knecht S, Morandini P, Biehler-Gomez L, Nogueira L, Adalian P, Cattaneo C. Sex estimation from patellar measurements in a contemporary Italian population: a machine learning approach. Int J Legal Med 2024:10.1007/s00414-024-03359-0. [PMID: 39495285 DOI: 10.1007/s00414-024-03359-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2024] [Accepted: 10/23/2024] [Indexed: 11/05/2024]
Abstract
Biological sex estimation in forensic anthropology is a crucial topic, and the patella has shown promise in this regard due to its sexual dimorphism. This study uses 12 machine learning models for sex estimation based on three patellar measurements (maximum height, breadth, and thickness). Data was collected from 180 skeletons of a contemporary Italian population (83 males and 97 females) as well as from an independent sample of 21 forensic cases (13 males and 8 females). Statistical analyses indicated that each of the variables exhibited significant sexual dimorphism. To predict biological sex, the classifiers were built using 70% of a reference sample, then tested on the remaining 30% of the original sample and then tested again on the independent sample. The different classifiers generated accuracies varied between 0.85 and 0.91 on the reference sample and between 0.71 and 0.95 for the validation sample. SVM classifier stood out with the highest accuracy and seemed the best model for our study.This study contributes to the growing application of machine learning in forensic anthropology by being the first to apply such techniques to patellar measurements in an Italian population. It aims to enhance the accuracy and efficiency of biological sex estimation from the patella, building on promising results observed with other skeletal elements.
Collapse
Affiliation(s)
- Siam Knecht
- Aix Marseille Université, CNRS, EFS, ADES, Marseille, 13007, France
| | - Paolo Morandini
- LABANOF (Laboratorio di Antropologia e Odontologia Forense), Department of Biomedical Science for Health, University of Milan, Via Mangiagalli 37, Milan, 20133, Italy
| | - Lucie Biehler-Gomez
- LABANOF (Laboratorio di Antropologia e Odontologia Forense), Department of Biomedical Science for Health, University of Milan, Via Mangiagalli 37, Milan, 20133, Italy.
| | - Luisa Nogueira
- Faculté de Médecine, Institut Universitaire d'Anthropologie Médico-Légale, Université Côte d'Azur, 28 Avenue de Valombrose, Cedex 2 Nice, 06107, France
| | - Pascal Adalian
- Aix Marseille Université, CNRS, EFS, ADES, Marseille, 13007, France
| | - Cristina Cattaneo
- LABANOF (Laboratorio di Antropologia e Odontologia Forense), Department of Biomedical Science for Health, University of Milan, Via Mangiagalli 37, Milan, 20133, Italy
| |
Collapse
|
36
|
Salari E, Wang J, Wynne JF, Chang C, Wu Y, Yang X. Artificial intelligence-based motion tracking in cancer radiotherapy: A review. J Appl Clin Med Phys 2024; 25:e14500. [PMID: 39194360 PMCID: PMC11540048 DOI: 10.1002/acm2.14500] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Revised: 07/13/2024] [Accepted: 07/27/2024] [Indexed: 08/29/2024] Open
Abstract
Radiotherapy aims to deliver a prescribed dose to the tumor while sparing neighboring organs at risk (OARs). Increasingly complex treatment techniques such as volumetric modulated arc therapy (VMAT), stereotactic radiosurgery (SRS), stereotactic body radiotherapy (SBRT), and proton therapy have been developed to deliver doses more precisely to the target. While such technologies have improved dose delivery, the implementation of intra-fraction motion management to verify tumor position at the time of treatment has become increasingly relevant. Artificial intelligence (AI) has recently demonstrated great potential for real-time tracking of tumors during treatment. However, AI-based motion management faces several challenges, including bias in training data, poor transparency, difficult data collection, complex workflows and quality assurance, and limited sample sizes. This review presents the AI algorithms used for chest, abdomen, and pelvic tumor motion management/tracking for radiotherapy and provides a literature summary on the topic. We will also discuss the limitations of these AI-based studies and propose potential improvements.
Collapse
Affiliation(s)
- Elahheh Salari
- Department of Radiation OncologyEmory UniversityAtlantaGeorgiaUSA
| | - Jing Wang
- Radiation OncologyIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
| | | | - Chih‐Wei Chang
- Department of Radiation OncologyEmory UniversityAtlantaGeorgiaUSA
| | - Yizhou Wu
- School of Electrical and Computer EngineeringGeorgia Institute of TechnologyAtlantaGeorgiaUSA
| | - Xiaofeng Yang
- Department of Radiation OncologyEmory UniversityAtlantaGeorgiaUSA
| |
Collapse
|
37
|
Lukovikov DA, Kolesnikova TO, Ikrin AN, Prokhorenko NO, Shevlyakov AD, Korotaev AA, Yang L, Bley V, de Abreu MS, Kalueff AV. A novel open-access artificial-intelligence-driven platform for CNS drug discovery utilizing adult zebrafish. J Neurosci Methods 2024; 411:110256. [PMID: 39182516 DOI: 10.1016/j.jneumeth.2024.110256] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2024] [Revised: 07/30/2024] [Accepted: 08/17/2024] [Indexed: 08/27/2024]
Abstract
BACKGROUND Although zebrafish are increasingly utilized in biomedicine for CNS disease modelling and drug discovery, this generates big data necessitating objective, precise and reproducible analyses. The artificial intelligence (AI) applications have empowered automated image recognition and video-tracking to ensure more efficient behavioral testing. NEW METHOD Capitalizing on several AI tools that most recently became available, here we present a novel open-access AI-driven platform to analyze tracks of adult zebrafish collected from in vivo neuropharmacological experiments. For this, we trained the AI system to distinguish zebrafish behavioral patterns following systemic treatment with several well-studied psychoactive drugs - nicotine, caffeine and ethanol. RESULTS Experiment 1 showed the ability of the AI system to distinguish nicotine and caffeine with 75 % and ethanol with 88 % probability and high (81 %) accuracy following a post-training exposure to these drugs. Experiment 2 further validated our system with additional, previously unexposed compounds (cholinergic arecoline and varenicline, and serotonergic fluoxetine), used as positive and negative controls, respectively. COMPARISON WITH EXISTING METHODS The present study introduces a novel open-access AI-driven approach to analyze locomotor activity of adult zebrafish. CONCLUSIONS Taken together, these findings support the value of custom-made AI tools for unlocking full potential of zebrafish CNS drug research by monitoring, processing and interpreting the results of in vivo experiments.
Collapse
Affiliation(s)
- Danil A Lukovikov
- Graduate Program in Bioinformatics and Genomics, Sirius University of Science and Technology, Sochi 354340, Russia; Neuroscience Department, Sirius University of Science and Technology, Sochi 354340, Russia
| | - Tatiana O Kolesnikova
- Neuroscience Department, Sirius University of Science and Technology, Sochi 354340, Russia
| | - Aleksey N Ikrin
- Graduate Program in Genetics and Genetic Technologies, Sirius University of Science and Technology, Sochi 354340, Russia; Neuroscience Department, Sirius University of Science and Technology, Sochi 354340, Russia
| | - Nikita O Prokhorenko
- Neuroscience Department, Sirius University of Science and Technology, Sochi 354340, Russia
| | - Anton D Shevlyakov
- Graduate Program in Bioinformatics and Genomics, Sirius University of Science and Technology, Sochi 354340, Russia; Neuroscience Department, Sirius University of Science and Technology, Sochi 354340, Russia
| | - Andrei A Korotaev
- Neuroscience Department, Sirius University of Science and Technology, Sochi 354340, Russia
| | - Longen Yang
- Department of Biological Sciences, School of Science, Xi'an Jiaotong-Liverpool University, Suzhou 215123, China; Suzhou Key Laboratory of Neurobiology and Cell Signaling, School of Science, Xi'an Jiaotong-Liverpool University, Suzhou 215123, China
| | - Vea Bley
- Department of Biological Sciences, School of Science, Xi'an Jiaotong-Liverpool University, Suzhou 215123, China; Biology Program, University of Florida, Gainesville, FL 32610, USA
| | - Murilo S de Abreu
- Graduate Program in Health Sciences, Federal University of Health Sciences of Porto Alegre, Porto Alegre, Brazil; Western Caspian University, Baku, Azerbaijan.
| | - Allan V Kalueff
- Neuroscience Department, Sirius University of Science and Technology, Sochi 354340, Russia; Department of Biological Sciences, School of Science, Xi'an Jiaotong-Liverpool University, Suzhou 215123, China; Suzhou Key Laboratory of Neurobiology and Cell Signaling, School of Science, Xi'an Jiaotong-Liverpool University, Suzhou 215123, China; Institute of Translational Biomedicine, St. Petersburg State University, St. Petersburg 199034, Russia; Institute of Experimental Medicine, Almazov National Medical Research Centre, Ministry of Healthcare of Russian Federation, St. Petersburg 194021, Russia.
| |
Collapse
|
38
|
Delgado-Álvarez A, Hernández-Lorenzo L, Nielsen TR, Díez-Cirarda M, Cuevas C, Montero-Escribano P, Delgado-Alonso C, Valles-Salgado M, Gil-Moreno MJ, Matias-Guiu J, Matias-Guiu JA. European cross-cultural neuropsychological test battery (CNTB) for the assessment of cognitive impairment in multiple sclerosis: Cognitive phenotyping and classification supported by machine learning techniques. Mult Scler Relat Disord 2024; 91:105907. [PMID: 39366169 DOI: 10.1016/j.msard.2024.105907] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2023] [Revised: 07/06/2024] [Accepted: 09/24/2024] [Indexed: 10/06/2024]
Abstract
BACKGROUND The European Cross-Cultural Neuropsychological Test Battery (CNTB) has been proposed as a comprehensive battery for cognitive assessment, reducing the potential impact of cultural variables. In this validation study, we aimed to evaluate the diagnostic capacity of CNTB for the assessment of participants with multiple sclerosis (pwMS) compared to the Neuronorma battery (NN) according to the International Classification of Cognitive Disorders in MS criteria, and to develop machine learning (ML) algorithms to improve the diagnostic capacity of CNTB and to select the most relevant tests. METHODS Sixty pwMS and 60 healthy controls (HC) with no differences in sex, age, or years of education were enrolled. All participants completed the CNTB and pwMS were also examined with NN, depression, and fatigue scales. Impaired domains and cognitive phenotypes were defined following ICCoDiMS based on CNTB scores and compared to NN, according to -1SD and -1.5SD cutoff scores. To select the most relevant tests, random forest (RF) was performed for different binary classifications. RESULTS PwMS showed a lower performance compared to HC with medium-large effect sizes, in episodic memory, executive function, attention, and processing speed, in accordance with their characteristic cognitive profile. There were no differences in impaired domains or cognitive phenotypes between CNTB and NN, highlighting the role of episodic memory, executive function, attention, and processing speed tests. The most relevant tests identified by RF were consistent with inter-group comparisons and allowed a better classification than SD cutoff scores. CONCLUSION CNTB is a valid test for cognitive diagnosis in pwMS, including key tests for the most frequently impaired cognitive domains in MS. The use of ML techniques may also be useful to improve diagnosis, especially in some tests with lower sensitivity.
Collapse
Affiliation(s)
- Alfonso Delgado-Álvarez
- Department of Neurology, Hospital Clinico San Carlos, San Carlos Institute for Health Research (IdiSSC), Universidad Complutense, ES, Madrid 28040, Spain; Department of Biological and Health Psychology, Faculty of Psychology, Universidad Autónoma de Madrid, Madrid, Spain
| | - Laura Hernández-Lorenzo
- Department of Computer Architecture and Automation, Faculty of Informatics, Universidad Complutense, Madrid, Spain
| | - T Rune Nielsen
- Danish Dementia Research Centre, Department of Neurology, University of Copenhagen-Rigshospitalet, Copenhagen, Denmark
| | - María Díez-Cirarda
- Department of Neurology, Hospital Clinico San Carlos, San Carlos Institute for Health Research (IdiSSC), Universidad Complutense, ES, Madrid 28040, Spain
| | - Constanza Cuevas
- Department of Neurology, Hospital Clinico San Carlos, San Carlos Institute for Health Research (IdiSSC), Universidad Complutense, ES, Madrid 28040, Spain
| | - Paloma Montero-Escribano
- Department of Neurology, Hospital Clinico San Carlos, San Carlos Institute for Health Research (IdiSSC), Universidad Complutense, ES, Madrid 28040, Spain
| | - Cristina Delgado-Alonso
- Department of Neurology, Hospital Clinico San Carlos, San Carlos Institute for Health Research (IdiSSC), Universidad Complutense, ES, Madrid 28040, Spain
| | - María Valles-Salgado
- Department of Neurology, Hospital Clinico San Carlos, San Carlos Institute for Health Research (IdiSSC), Universidad Complutense, ES, Madrid 28040, Spain
| | - María José Gil-Moreno
- Department of Neurology, Hospital Clinico San Carlos, San Carlos Institute for Health Research (IdiSSC), Universidad Complutense, ES, Madrid 28040, Spain
| | - Jorge Matias-Guiu
- Department of Neurology, Hospital Clinico San Carlos, San Carlos Institute for Health Research (IdiSSC), Universidad Complutense, ES, Madrid 28040, Spain
| | - Jordi A Matias-Guiu
- Department of Neurology, Hospital Clinico San Carlos, San Carlos Institute for Health Research (IdiSSC), Universidad Complutense, ES, Madrid 28040, Spain.
| |
Collapse
|
39
|
Xin X, Wu S, Xu H, Ma Y, Bao N, Gao M, Han X, Gao S, Zhang S, Zhao X, Qi J, Zhang X, Tan J. Non-invasive prediction of human embryonic ploidy using artificial intelligence: a systematic review and meta-analysis. EClinicalMedicine 2024; 77:102897. [PMID: 39513188 PMCID: PMC11541425 DOI: 10.1016/j.eclinm.2024.102897] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/10/2024] [Revised: 10/06/2024] [Accepted: 10/07/2024] [Indexed: 11/15/2024] Open
Abstract
Background Embryonic ploidy is critical for the success of embryo transfer. Currently, preimplantation genetic testing for aneuploidy (PGT-A) is the gold standard for detecting ploidy abnormalities. However, PGT-A has several inherent limitations, including invasive biopsy, high economic burden, and ethical constraints. This paper provides the first comprehensive systematic review and meta-analysis of the performance of artificial intelligence (AI) algorithms using embryonic images for non-invasive prediction of embryonic ploidy. Methods Comprehensive searches of studies that developed or utilized AI algorithms to predict embryonic ploidy from embryonic imaging, published up until August 10, 2024, across PubMed, MEDLINE, Embase, IEEE, SCOPUS, Web of Science, and the Cochrane Central Register of Controlled Trials were performed. Studies with prospective or retrospective designs were included without language restrictions. The summary receiver operating characteristic curve, along with pooled sensitivity and specificity, was estimated using a bivariate random-effects model. The risk of bias and study quality were evaluated using the QUADAS-AI tool. Heterogeneity was quantified using the inconsistency index (I 2 ), derived from Cochran's Q test. Predefined subgroup analyses and bivariate meta-regression were conducted to explore potential sources of heterogeneity. This study was registered with PROSPERO (CRD42024500409). Findings Twenty eligible studies were identified, with twelve studies included in the meta-analysis. The pooled sensitivity, specificity, and area under the curve of AI for predicting embryonic euploidy were 0.71 (95% CI: 0.59-0.81), 0.75 (95% CI: 0.69-0.80), and 0.80 (95% CI: 0.76-0.83), respectively, based on a total of 6879 embryos (3110 euploid and 3769 aneuploid). Meta-regression and subgroup analyses identified the type of AI-driven decision support system, external validation, risk of bias, and year of publication as the primary contributors to the observed heterogeneity. There was no evidence of publication bias. Interpretation Our findings indicate that AI algorithms exhibit promising performance in predicting embryonic euploidy based on embryonic imaging. Although the current AI models developed cannot entirely replace invasive methods for determining embryo ploidy, AI demonstrates promise as an auxiliary decision-making tool for embryo selection, particularly for individuals who are unable to undergo PGT-A. To enhance the quality of future research, it is essential to overcome the specific challenges and limitations associated with AI studies in reproductive medicine. Funding This work was supported by the National Key R&D Program of China (2022YFC2702905), the Shengjing Freelance Researcher Plan of Shengjing Hospital and the 345 talent project of Shengjing Hospital.
Collapse
Affiliation(s)
- Xing Xin
- Centre of Reproductive Medicine, Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, No. 39 Huaxiang Road, Tiexi District, Shenyang 110022, China
- Key Laboratory of Reproductive Dysfunction Disease and Fertility Remodeling of Liaoning Province, No. 39 Huaxiang Road, Tiexi District, Shenyang 110022, China
| | - Shanshan Wu
- Centre of Reproductive Medicine, Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, No. 39 Huaxiang Road, Tiexi District, Shenyang 110022, China
- Key Laboratory of Reproductive Dysfunction Disease and Fertility Remodeling of Liaoning Province, No. 39 Huaxiang Road, Tiexi District, Shenyang 110022, China
| | - Heli Xu
- Department of Clinical Epidemiology, Shengjing Hospital of China Medical University, Shenyang 110022, China
| | - Yujiu Ma
- Centre of Reproductive Medicine, Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, No. 39 Huaxiang Road, Tiexi District, Shenyang 110022, China
- Key Laboratory of Reproductive Dysfunction Disease and Fertility Remodeling of Liaoning Province, No. 39 Huaxiang Road, Tiexi District, Shenyang 110022, China
| | - Nan Bao
- The College of Medicine and Biological Information Engineering, Northeastern University, Shenyang 110167, China
| | - Man Gao
- Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, No. 36 Sanhao Street, Heping District, Shenyang 110004, China
| | - Xue Han
- Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, No. 36 Sanhao Street, Heping District, Shenyang 110004, China
| | - Shan Gao
- Centre of Reproductive Medicine, Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, No. 39 Huaxiang Road, Tiexi District, Shenyang 110022, China
- Key Laboratory of Reproductive Dysfunction Disease and Fertility Remodeling of Liaoning Province, No. 39 Huaxiang Road, Tiexi District, Shenyang 110022, China
| | - Siwen Zhang
- Centre of Reproductive Medicine, Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, No. 39 Huaxiang Road, Tiexi District, Shenyang 110022, China
- Key Laboratory of Reproductive Dysfunction Disease and Fertility Remodeling of Liaoning Province, No. 39 Huaxiang Road, Tiexi District, Shenyang 110022, China
| | - Xinyang Zhao
- Centre of Reproductive Medicine, Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, No. 39 Huaxiang Road, Tiexi District, Shenyang 110022, China
- Key Laboratory of Reproductive Dysfunction Disease and Fertility Remodeling of Liaoning Province, No. 39 Huaxiang Road, Tiexi District, Shenyang 110022, China
| | - Jiarui Qi
- Centre of Reproductive Medicine, Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, No. 39 Huaxiang Road, Tiexi District, Shenyang 110022, China
- Key Laboratory of Reproductive Dysfunction Disease and Fertility Remodeling of Liaoning Province, No. 39 Huaxiang Road, Tiexi District, Shenyang 110022, China
| | - Xudong Zhang
- Centre of Reproductive Medicine, Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, No. 39 Huaxiang Road, Tiexi District, Shenyang 110022, China
- Key Laboratory of Reproductive Dysfunction Disease and Fertility Remodeling of Liaoning Province, No. 39 Huaxiang Road, Tiexi District, Shenyang 110022, China
| | - Jichun Tan
- Centre of Reproductive Medicine, Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, No. 39 Huaxiang Road, Tiexi District, Shenyang 110022, China
- Key Laboratory of Reproductive Dysfunction Disease and Fertility Remodeling of Liaoning Province, No. 39 Huaxiang Road, Tiexi District, Shenyang 110022, China
| |
Collapse
|
40
|
Sourlos N, Vliegenthart R, Santinha J, Klontzas ME, Cuocolo R, Huisman M, van Ooijen P. Recommendations for the creation of benchmark datasets for reproducible artificial intelligence in radiology. Insights Imaging 2024; 15:248. [PMID: 39400639 PMCID: PMC11473745 DOI: 10.1186/s13244-024-01833-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2024] [Accepted: 09/20/2024] [Indexed: 10/15/2024] Open
Abstract
Various healthcare domains have witnessed successful preliminary implementation of artificial intelligence (AI) solutions, including radiology, though limited generalizability hinders their widespread adoption. Currently, most research groups and industry have limited access to the data needed for external validation studies. The creation and accessibility of benchmark datasets to validate such solutions represents a critical step towards generalizability, for which an array of aspects ranging from preprocessing to regulatory issues and biostatistical principles come into play. In this article, the authors provide recommendations for the creation of benchmark datasets in radiology, explain current limitations in this realm, and explore potential new approaches. CLINICAL RELEVANCE STATEMENT: Benchmark datasets, facilitating validation of AI software performance can contribute to the adoption of AI in clinical practice. KEY POINTS: Benchmark datasets are essential for the validation of AI software performance. Factors like image quality and representativeness of cases should be considered. Benchmark datasets can help adoption by increasing the trustworthiness and robustness of AI.
Collapse
Affiliation(s)
- Nikos Sourlos
- Department of Radiology, University Medical Center of Groningen, Groningen, The Netherlands
- DataScience Center in Health, University Medical Center Groningen, Groningen, The Netherlands
| | - Rozemarijn Vliegenthart
- Department of Radiology, University Medical Center of Groningen, Groningen, The Netherlands
- DataScience Center in Health, University Medical Center Groningen, Groningen, The Netherlands
| | - Joao Santinha
- Digital Surgery LAB, Champalimaud Foundation, Champalimaud Clinical Centre, Lisbon, Portugal
| | - Michail E Klontzas
- Department of Medical Imaging, University Hospital of Heraklion, Heraklion, Greece
- Department of Radiology, School of Medicine, University of Crete, Heraklion, Greece
| | - Renato Cuocolo
- Department of Medicine, Surgery, and Dentistry, University of Salerno, Baronissi, Italy
| | - Merel Huisman
- Department of Radiology and Nuclear Medicine, Radboud University Medical Center, Nijmegen, The Netherlands
| | - Peter van Ooijen
- DataScience Center in Health, University Medical Center Groningen, Groningen, The Netherlands.
- Department of Radiation Oncology, University Medical Center Groningen, Groningen, The Netherlands.
| |
Collapse
|
41
|
Rivera KP, Whang K, Joshi K, Son H, Kim YS, Flores M. Dental Composite Performance Prediction Using Artificial Intelligence. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.10.08.24314998. [PMID: 39417148 PMCID: PMC11482990 DOI: 10.1101/2024.10.08.24314998] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 10/19/2024]
Abstract
Objective There is a need to increase the performance and longevity of dental composites and accelerate the translation of novel composites to the market. This study explores artificial intelligence (AI), specifically machine learning (ML), to predict the performance outcomes (POs) of dental composites from their composite attributes (CAs). Methods An extensive dataset from over 200 publications was built and refined to 233 samples with 17 CAs and 7 POs. Nine ML models were evaluated for PO prediction performance using classified data, and Five ML models were evaluated for PO regression analysis. Results The KNN model excelled in predicting flexural modulus (FlexMod), Decision Tree model in flexural strength (FlexStr) and volumetric shrinkage (ShrinkV), and Logistic Regression and SVM models in shrinkage stress (ShrinkStr). Receiver operating characteristic area under the curve (ROC AUC) analysis confirmed these results but found that Random Forest was more effective for FlexStr and ShrinkV, suggesting the possibility of Decision Tree overfitting the data. Regression analysis revealed that the Voting Regressor was superior for FlexMod and ShrinkV predictions, while Decision Tree Regression was optimal for FlexStr and ShrinkStr. Feature importance analysis indicated TEGDMA is a key contributor to FlexMod and ShrinkV, BisGMA and UDMA to FlexStr, and depth of cure, degree of monomer-to-polymer conversion, and filler loading to ShrinkStr. Significance There is a need to conduct a full analysis using multiple ML models because different models predict different POs better, and for a large, comprehensive dataset to train robust AI models to facilitate the prediction and optimization of composite properties and support the development of new dental materials.
Collapse
Affiliation(s)
- Karla Paniagua Rivera
- Department of Electrical and Computer Engineering, the University of Texas at San Antonio, San Antonio, TX, 78249, USA
| | - Kyumin Whang
- Department of Comprehensive Dentistry, the University of Texas Health Science Center at San Antonio, San Antonio, TX 78229, USA
| | - Krishna Joshi
- Department of Comprehensive Dentistry, the University of Texas Health Science Center at San Antonio, San Antonio, TX 78229, USA
| | - Hyeonwi Son
- Department of Oral & Maxillofacial Surgery, School of Dentistry
| | - Yu Shin Kim
- Department of Oral & Maxillofacial Surgery, School of Dentistry
- Programs in Integrated Biomedical Sciences, Translational Sciences, Biomedical Engineering, Radiological Sciences, University of Texas Health Science Center at San Antonio, San Antonio, TX 78229, USA
| | - Mario Flores
- Department of Electrical and Computer Engineering, the University of Texas at San Antonio, San Antonio, TX, 78249, USA
| |
Collapse
|
42
|
Shu CH, Zebda R, Espinosa C, Reiss J, Debuyserie A, Reber K, Aghaeepour N, Pammi M. Early prediction of mortality and morbidities in VLBW preterm neonates using machine learning. Pediatr Res 2024:10.1038/s41390-024-03604-7. [PMID: 39379627 DOI: 10.1038/s41390-024-03604-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/16/2024] [Revised: 09/05/2024] [Accepted: 09/12/2024] [Indexed: 10/10/2024]
Abstract
BACKGROUND Predicting mortality and specific morbidities before they occur may allow for interventions that may improve health trajectories. HYPOTHESIS Integrating key maternal and postnatal infant variables in the first 2 weeks of age into machine learning (ML) algorithms will reliably predict survival and specific morbidities in VLBW preterm infants. METHODS ML algorithms were developed to integrate 47 features for predicting mortality, bronchopulmonary dysplasia (BPD), neonatal sepsis, necrotizing enterocolitis (NEC), intraventricular hemorrhage (IVH), cystic periventricular leukomalacia (PVL), and retinopathy of prematurity (ROP). A retrospective cohort (n = 3341) was used to train and validate the models with a repeated 10-fold cross-validation strategy. These models were then tested on a separate cohort (n = 447) to evaluate the final model performance. RESULTS Among the seven ML algorithms employed, tree-based ensemble models, specifically Random Forest (RF) and XGBoost, had the best performance metrics. The area under the receiver operating characteristic curve (AUROC) of sepsis with or without meningitis (0.73), NEC (0.73), BPD (0.71), and mortality (0.74) exceeded 0.7, while the area under Precision-Recall curve (AUPRC) for all outcomes was greater than the prevalence, demonstrating effective risk stratification in VLBW preterm infants. CONCLUSIONS Our study demonstrates the potential of predictive analytics leveraging ML techniques in advancing precision medicine. IMPACT Reliable prediction of adverse outcomes before they occur has the potential to institute interventions and possibly improve health trajectories in VLBW preterm infants. We used machine learning to develop and test predictive models for mortality and five major morbidities in VLBW preterm infants. Individualized prediction of outcomes and individualized interventions will advance Precision Medicine in Neonatology.
Collapse
Affiliation(s)
- Chi-Hung Shu
- Department of Anesthesiology, Pain, and Perioperative Medicine, Stanford University School of Medicine, Stanford, CA, USA
| | - Rema Zebda
- Department of Pediatrics and Neonatology, Texas Children's Hospital, Baylor College of Medicine, Houston, TX, USA
| | - Camilo Espinosa
- Department of Anesthesiology, Pain, and Perioperative Medicine, Stanford University School of Medicine, Stanford, CA, USA
| | - Jonathan Reiss
- Department of Pediatrics, Stanford University School of Medicine, Stanford, CA, USA
| | - Anne Debuyserie
- Department of Pediatrics and Neonatology, Texas Children's Hospital, Baylor College of Medicine, Houston, TX, USA
| | - Kristina Reber
- Department of Pediatrics and Neonatology, Texas Children's Hospital, Baylor College of Medicine, Houston, TX, USA
| | - Nima Aghaeepour
- Department of Anesthesiology, Pain, and Perioperative Medicine, Stanford University School of Medicine, Stanford, CA, USA
- Department of Pediatrics, Stanford University School of Medicine, Stanford, CA, USA
- Department of Biomedical Data Science, Stanford University School of Medicine, Stanford, CA, USA
| | - Mohan Pammi
- Department of Pediatrics and Neonatology, Texas Children's Hospital, Baylor College of Medicine, Houston, TX, USA.
| |
Collapse
|
43
|
Xu D, Xu Z. Machine learning applications in preventive healthcare: A systematic literature review on predictive analytics of disease comorbidity from multiple perspectives. Artif Intell Med 2024; 156:102950. [PMID: 39163727 DOI: 10.1016/j.artmed.2024.102950] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2023] [Revised: 06/17/2024] [Accepted: 08/13/2024] [Indexed: 08/22/2024]
Abstract
Artificial intelligence is constantly revolutionizing biomedical research and healthcare management. Disease comorbidity is a major threat to the quality of life for susceptible groups, especially middle-aged and elderly patients. The presence of multiple chronic diseases makes precision diagnosis challenging to realize and imposes a heavy burden on the healthcare system and economy. Given an enormous amount of accumulated health data, machine learning techniques show their capability in handling this puzzle. The present study conducts a review to uncover current research efforts in applying these methods to understanding comorbidity mechanisms and making clinical predictions considering these complex patterns. A descriptive metadata analysis of 791 unique publications aims to capture the overall research progression between January 2012 and June 2023. To delve into comorbidity-focused research, 61 of these scientific papers are systematically assessed. Four predictive analytics of tasks are detected: disease comorbidity data extraction, clustering, network, and risk prediction. It is observed that some machine learning-driven applications address inherent data deficiencies in healthcare datasets and provide a model interpretation that identifies significant risk factors of comorbidity development. Based on insights, both technical and practical, gained from relevant literature, this study intends to guide future interests in comorbidity research and draw conclusions about chronic disease prevention and diagnosis with managerial implications.
Collapse
Affiliation(s)
- Duo Xu
- School of Economics and Management, Southeast University, Nanjing 211189, China.
| | - Zeshui Xu
- School of Economics and Management, Southeast University, Nanjing 211189, China; Business School, Sichuan University, Chengdu 610064, China.
| |
Collapse
|
44
|
Ancona RM, Cooper BP, Foraker R, Kaser T, Adeoye O, Mueller KL. Machine learning classification of new firearm injury encounters in the St Louis region: 2010-2020. J Am Med Inform Assoc 2024; 31:2165-2172. [PMID: 38976592 DOI: 10.1093/jamia/ocae173] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2024] [Revised: 06/20/2024] [Accepted: 06/25/2024] [Indexed: 07/10/2024] Open
Abstract
OBJECTIVES To improve firearm injury encounter classification (new vs follow-up) using machine learning (ML) and compare our ML model to other common approaches. MATERIALS AND METHODS This retrospective study used data from the St Louis region-wide hospital-based violence intervention program data repository (2010-2020). We randomly selected 500 patients with a firearm injury diagnosis for inclusion, with 808 total firearm injury encounters split (70/30) for training and testing. We trained a least absolute shrinkage and selection operator (LASSO) regression model with the following predictors: admission type, time between firearm injury visits, number of prior firearm injury emergency department (ED) visits, encounter type (ED or other), and diagnostic codes. Our gold standard for new firearm injury encounter classification was manual chart review. We then used our test data to compare the performance of our ML model to other commonly used approaches (proxy measures of ED visits and time between firearm injury encounters, and diagnostic code encounter type designation [initial vs subsequent or sequela]). Performance metrics included area under the curve (AUC), sensitivity, and specificity with 95% confidence intervals (CIs). RESULTS The ML model had excellent discrimination (0.92, 0.88-0.96) with high sensitivity (0.95, 0.90-0.98) and specificity (0.89, 0.81-0.95). AUC was significantly higher than time-based outcomes, sensitivity was slightly (but not significantly) lower than other approaches, and specificity was higher than all other methods. DISCUSSION ML successfully delineated new firearm injury encounters, outperforming other approaches in ruling out encounters for follow-up. CONCLUSION ML can be used to identify new firearm injury encounters and may be particularly useful in studies assessing re-injuries.
Collapse
Affiliation(s)
- Rachel M Ancona
- Department of Emergency Medicine, Washington University in St Louis, St Louis, MO 63110, United States
| | - Benjamin P Cooper
- Institute for Public Health, Washington University in St Louis, St Louis, MO 63110, United States
| | - Randi Foraker
- Department of Medicine, Washington University in St Louis, St Louis, MO 63110, United States
| | - Taylor Kaser
- Department of Emergency Medicine, Washington University in St Louis, St Louis, MO 63110, United States
| | - Opeolu Adeoye
- Department of Emergency Medicine, Washington University in St Louis, St Louis, MO 63110, United States
| | - Kristen L Mueller
- Department of Emergency Medicine, Washington University in St Louis, St Louis, MO 63110, United States
| |
Collapse
|
45
|
Li X, Wen X, Shang X, Liu J, Zhang L, Cui Y, Luo X, Zhang G, Xie J, Huang T, Chen Z, Lyu Z, Wu X, Lan Y, Meng Q. Identification of diabetic retinopathy classification using machine learning algorithms on clinical data and optical coherence tomography angiography. Eye (Lond) 2024; 38:2813-2821. [PMID: 38871934 PMCID: PMC11427469 DOI: 10.1038/s41433-024-03173-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2023] [Revised: 04/10/2024] [Accepted: 06/06/2024] [Indexed: 06/15/2024] Open
Abstract
BACKGROUND To apply machine learning (ML) algorithms to perform multiclass diabetic retinopathy (DR) classification using both clinical data and optical coherence tomography angiography (OCTA). METHODS In this cross-sectional observational study, clinical data and OCTA parameters from 203 diabetic patients (203 eye) were used to establish the ML models, and those from 169 diabetic patients (169 eye) were used for independent external validation. The random forest, gradient boosting machine (GBM), deep learning and logistic regression algorithms were used to identify the presence of DR, referable DR (RDR) and vision-threatening DR (VTDR). Four different variable patterns based on clinical data and OCTA variables were examined. The algorithms' performance were evaluated using receiver operating characteristic curves and the area under the curve (AUC) was used to assess predictive accuracy. RESULTS The random forest algorithm on OCTA+clinical data-based variables and OCTA+non-laboratory factor-based variables provided the higher AUC values for DR, RDR and VTDR. The GBM algorithm produced similar results, albeit with slightly lower AUC values. Leading predictors of DR status included vessel density, retinal thickness and GCC thickness, as well as the body mass index, waist-to-hip ratio and glucose-lowering treatment. CONCLUSIONS ML-based multiclass DR classification using OCTA and clinical data can provide reliable assistance for screening, referral, and management DR populations.
Collapse
Affiliation(s)
- Xiaoli Li
- Department of Ophthalmology, Guangdong Eye Institute, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, China
| | - Xin Wen
- Department of Ophthalmology, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, Guangzhou, China
| | - Xianwen Shang
- Department of Ophthalmology, Guangdong Eye Institute, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, China
| | - Junbin Liu
- Department of Ophthalmology, Guangdong Eye Institute, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, China
| | - Liang Zhang
- Department of Ophthalmology, Guangdong Eye Institute, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, China
| | - Ying Cui
- Department of Ophthalmology, Guangdong Eye Institute, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, China
| | - Xiaoyang Luo
- Department of Ophthalmology, Guangdong Eye Institute, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, China
| | - Guanrong Zhang
- Statistics Section, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, Guangdong, China
| | - Jie Xie
- Department of Ophthalmology, Heyuan People's Hospital, Heyuan, China
| | - Tian Huang
- Department of Ophthalmology, Zhujiang Hospital, Southern Medical University, Guangzhou, China
| | - Zhifan Chen
- Department of Ophthalmology, The Fourth Affiliated Hospital of Guangzhou Medical University, Guangzhou, China
| | - Zheng Lyu
- Department of Ophthalmology, Guangdong Eye Institute, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, China
| | - Xiyu Wu
- Department of Ophthalmology, Guangdong Eye Institute, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, China
| | - Yuqing Lan
- Department of Ophthalmology, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, Guangzhou, China.
| | - Qianli Meng
- Department of Ophthalmology, Guangdong Eye Institute, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, China.
| |
Collapse
|
46
|
Chen D, Cao C, Kloosterman R, Parsa R, Raman S. Trial Factors Associated With Completion of Clinical Trials Evaluating AI: Retrospective Case-Control Study. J Med Internet Res 2024; 26:e58578. [PMID: 39312296 PMCID: PMC11459098 DOI: 10.2196/58578] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2024] [Revised: 05/02/2024] [Accepted: 07/11/2024] [Indexed: 09/25/2024] Open
Abstract
BACKGROUND Evaluation of artificial intelligence (AI) tools in clinical trials remains the gold standard for translation into clinical settings. However, design factors associated with successful trial completion and the common reasons for trial failure are unknown. OBJECTIVE This study aims to compare trial design factors of complete and incomplete clinical trials testing AI tools. We conducted a case-control study of complete (n=485) and incomplete (n=51) clinical trials that evaluated AI as an intervention of ClinicalTrials.gov. METHODS Trial design factors, including area of clinical application, intended use population, and intended role of AI, were extracted. Trials that did not evaluate AI as an intervention and active trials were excluded. The assessed trial design factors related to AI interventions included the domain of clinical application related to organ systems; intended use population for patients or health care providers; and the role of AI for different applications in patient-facing clinical workflows, such as diagnosis, screening, and treatment. In addition, we also assessed general trial design factors including study type, allocation, intervention model, masking, age, sex, funder, continent, length of time, sample size, number of enrollment sites, and study start year. The main outcome was the completion of the clinical trial. Odds ratio (OR) and 95% CI values were calculated for all trial design factors using propensity-matched, multivariable logistic regression. RESULTS We queried ClinicalTrials.gov on December 23, 2023, using AI keywords to identify complete and incomplete trials testing AI technologies as a primary intervention, yielding 485 complete and 51 incomplete trials for inclusion in this study. Our nested propensity-matched, case-control results suggest that trials conducted in Europe were significantly associated with trial completion when compared with North American trials (OR 2.85, 95% CI 1.14-7.10; P=.03), and the trial sample size was positively associated with trial completion (OR 1.00, 95% CI 1.00-1.00; P=.02). CONCLUSIONS Our case-control study is one of the first to identify trial design factors associated with completion of AI trials and catalog study-reported reasons for AI trial failure. We observed that trial design factors positively associated with trial completion include trials conducted in Europe and sample size. Given the promising clinical use of AI tools in health care, our results suggest that future translational research should prioritize addressing the design factors of AI clinical trials associated with trial incompletion and common reasons for study failure.
Collapse
Affiliation(s)
- David Chen
- Temerty Faculty of Medicine, University of Toronto, Toronto, ON, Canada
| | - Christian Cao
- Temerty Faculty of Medicine, University of Toronto, Toronto, ON, Canada
| | | | - Rod Parsa
- Michael G. DeGroote School of Medicine, McMaster University, Hamilton, ON, Canada
| | - Srinivas Raman
- Department of Radiation Oncology, University of Toronto, Toronto, ON, Canada
| |
Collapse
|
47
|
Mehrbakhsh Z, Hassanzadeh R, Behnampour N, Tapak L, Zarrin Z, Khazaei S, Dinu I. Machine learning-based evaluation of prognostic factors for mortality and relapse in patients with acute lymphoblastic leukemia: a comparative simulation study. BMC Med Inform Decis Mak 2024; 24:261. [PMID: 39285373 PMCID: PMC11404043 DOI: 10.1186/s12911-024-02645-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2024] [Accepted: 08/21/2024] [Indexed: 09/22/2024] Open
Abstract
BACKGROUND Predicting mortality and relapse in children with acute lymphoblastic leukemia (ALL) is crucial for effective treatment and follow-up management. ALL is a common and deadly childhood cancer that often relapses after remission. In this study, we aimed to apply and evaluate machine learning-based models for predicting mortality and relapse in pediatric ALL patients. METHODS This retrospective cohort study was conducted on 161 children aged less than 16 years with ALL. Survival status (dead/alive) and patient experience of relapse (yes/no) were considered as the outcome variables. Ten machine learning (ML) algorithms were used to predict mortality and relapse. The performance of the algorithms was evaluated by cross-validation and reported as mean sensitivity, specificity, accuracy and area under the curve (AUC). Finally, prognostic factors were identified based on the best algorithms. RESULTS The mean accuracy of the ML algorithms for prediction of patient mortality ranged from 64 to 74% and for prediction of relapse, it varied from 64 to 84% on test data sets. The mean AUC of the ML algorithms for mortality and relapse was above 64%. The most important prognostic factors for predicting both mortality and relapse were identified as age at diagnosis, hemoglobin and platelets. In addition, significant prognostic factors for predicting mortality included clinical side effects such as splenomegaly, hepatomegaly and lymphadenopathy. CONCLUSIONS Our results showed that artificial neural networks and bagging algorithms outperformed other algorithms in predicting mortality, while boosting and random forest algorithms excelled in predicting relapse in ALL patients across all criteria. These results offer significant clinical insights into the prognostic factors for children with ALL, which can inform treatment decisions and improve patient outcomes.
Collapse
Affiliation(s)
- Zahra Mehrbakhsh
- Department of Biostatistics, School of Public Health, Hamadan University of Medical Sciences, Hamadan, Iran
- Student Research Committee, Hamadan University of Medical Sciences, Hamadan, Iran
| | - Roghayyeh Hassanzadeh
- Department of Biostatistics, School of Public Health, Hamadan University of Medical Sciences, Hamadan, Iran
- Student Research Committee, Hamadan University of Medical Sciences, Hamadan, Iran
| | - Nasser Behnampour
- Department of Biostatistics and Epidemiology, School of Health, Golestan University of Medical Sciences, Gorgan, Iran
| | - Leili Tapak
- Department of Biostatistics, School of Public Health, Hamadan University of Medical Sciences, Hamadan, Iran.
- Modeling of Noncommunicable Diseases Research Center, Hamadan University of Medical Sciences, Hamadan, Iran.
| | - Ziba Zarrin
- Department of Photogrammetry and Remote Sensing, K.N. Toosi University of Technology, Tehran, Iran
| | - Salman Khazaei
- Health Sciences Research Center, Health Sciences & Technology Research Institute, Hamadan University of Medical Science, Hamadan, Iran
| | - Irina Dinu
- School of Public Health, University of Alberta, Edmonton, Canada
| |
Collapse
|
48
|
Fiz F, Rossi N, Langella S, Conci S, Serenari M, Ardito F, Cucchetti A, Gallo T, Zamboni GA, Mosconi C, Boldrini L, Mirarchi M, Cirillo S, Ruzzenente A, Pecorella I, Russolillo N, Borzi M, Vara G, Mele C, Ercolani G, Giuliante F, Cescon M, Guglielmi A, Ferrero A, Sollini M, Chiti A, Torzilli G, Ieva F, Viganò L. Radiomics of Intrahepatic Cholangiocarcinoma and Peritumoral Tissue Predicts Postoperative Survival: Development of a CT-Based Clinical-Radiomic Model. Ann Surg Oncol 2024; 31:5604-5614. [PMID: 38797789 DOI: 10.1245/s10434-024-15457-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2024] [Accepted: 04/28/2024] [Indexed: 05/29/2024]
Abstract
BACKGROUND For many tumors, radiomics provided a relevant prognostic contribution. This study tested whether the computed tomography (CT)-based textural features of intrahepatic cholangiocarcinoma (ICC) and peritumoral tissue improve the prediction of survival after resection compared with the standard clinical indices. METHODS All consecutive patients affected by ICC who underwent hepatectomy at six high-volume centers (2009-2019) were considered for the study. The arterial and portal phases of CT performed fewer than 60 days before surgery were analyzed. A manual segmentation of the tumor was performed (Tumor-VOI). A 5-mm volume expansion then was applied to identify the peritumoral tissue (Margin-VOI). RESULTS The study enrolled 215 patients. After a median follow-up period of 28 months, the overall survival (OS) rate was 57.0%, and the progression-free survival (PFS) rate was 34.9% at 3 years. The clinical predictive model of OS had a C-index of 0.681. The addition of radiomic features led to a progressive improvement of performances (C-index of 0.71, including the portal Tumor-VOI, C-index of 0.752 including the portal Tumor- and Margin-VOI, C-index of 0.764, including all VOIs of the portal and arterial phases). The latter model combined clinical variables (CA19-9 and tumor pattern), tumor indices (density, homogeneity), margin data (kurtosis, compacity, shape), and GLRLM indices. The model had performance equivalent to that of the postoperative clinical model including the pathology data (C-index of 0.765). The same results were observed for PFS. CONCLUSIONS The radiomics of ICC and peritumoral tissue extracted from preoperative CT improves the prediction of survival. Both the portal and arterial phases should be considered. Radiomic and clinical data are complementary and achieve a preoperative estimation of prognosis equivalent to that achieved in the postoperative setting.
Collapse
Affiliation(s)
- Francesco Fiz
- Nuclear Medicine Unit, Department of Diagnostic Imaging, Ente Ospedaliero "Ospedali Galliera", Genoa, Italy
- Department of Nuclear Medicine and Clinical Molecular Imaging, University Hospital, Tübingen, Germany
| | - Noemi Rossi
- MOX Laboratory, Department of Mathematics, Politecnico di Milano, Milan, Italy
| | - Serena Langella
- Department of Digestive and Hepatobiliary Surgery, Mauriziano Umberto I Hospital, Turin, Italy
| | - Simone Conci
- Division of General and Hepatobiliary Surgery, Department of Surgical Sciences, Dentistry, Gynaecology and Pediatrics, University Hospital G.B. Rossi, University of Verona, Verona, Italy
| | - Matteo Serenari
- General Surgery and Transplant Unit, IRCCS, Azienda Ospedaliero-Universitaria di Bologna, Sant'Orsola-Malpighi Hospital, Bologna, Italy
- Department of Medical and Surgical Sciences, Alma Mater Studiorum, University of Bologna, Bologna, Italy
| | - Francesco Ardito
- Hepatobiliary Surgery Unit, A. Gemelli Hospital, Università Cattolica del Sacro Cuore, Rome, Italy
| | - Alessandro Cucchetti
- Department of Medical and Surgical Sciences, Alma Mater Studiorum, University of Bologna, Bologna, Italy
- Department of General Surgery, Morgagni-Pierantoni Hospital, Forlì, Italy
| | - Teresa Gallo
- Department of Radiology, Mauriziano Umberto I Hospital, Turin, Italy
| | - Giulia A Zamboni
- Department of Radiology, University Hospital G.B. Rossi, University of Verona, Verona, Italy
| | - Cristina Mosconi
- Department of Radiology, IRCCS, Azienda Ospedaliero-Universitaria di Bologna, Sant'Orsola-Malpighi Hospital, Bologna, Italy
| | - Luca Boldrini
- Department of Radiology, Radiation Oncology and Hematology, Fondazione Policlinico Universitario "A. Gemelli" IRCCS, Rome, Italy
| | | | - Stefano Cirillo
- Department of Radiology, Mauriziano Umberto I Hospital, Turin, Italy
| | - Andrea Ruzzenente
- Division of General and Hepatobiliary Surgery, Department of Surgical Sciences, Dentistry, Gynaecology and Pediatrics, University Hospital G.B. Rossi, University of Verona, Verona, Italy
| | - Ilaria Pecorella
- Department of Biomedical Sciences, Humanitas University, Pieve Emanuele, Milan, Italy
| | - Nadia Russolillo
- Department of Digestive and Hepatobiliary Surgery, Mauriziano Umberto I Hospital, Turin, Italy
| | - Martina Borzi
- Department of Radiology, University Hospital G.B. Rossi, University of Verona, Verona, Italy
| | - Giulio Vara
- Department of Radiology, IRCCS, Azienda Ospedaliero-Universitaria di Bologna, Sant'Orsola-Malpighi Hospital, Bologna, Italy
| | - Caterina Mele
- Hepatobiliary Surgery Unit, A. Gemelli Hospital, Università Cattolica del Sacro Cuore, Rome, Italy
| | - Giorgio Ercolani
- Department of Medical and Surgical Sciences, Alma Mater Studiorum, University of Bologna, Bologna, Italy
- Department of General Surgery, Morgagni-Pierantoni Hospital, Forlì, Italy
| | - Felice Giuliante
- Hepatobiliary Surgery Unit, A. Gemelli Hospital, Università Cattolica del Sacro Cuore, Rome, Italy
| | - Matteo Cescon
- General Surgery and Transplant Unit, IRCCS, Azienda Ospedaliero-Universitaria di Bologna, Sant'Orsola-Malpighi Hospital, Bologna, Italy
- Department of Medical and Surgical Sciences, Alma Mater Studiorum, University of Bologna, Bologna, Italy
| | - Alfredo Guglielmi
- Division of General and Hepatobiliary Surgery, Department of Surgical Sciences, Dentistry, Gynaecology and Pediatrics, University Hospital G.B. Rossi, University of Verona, Verona, Italy
| | - Alessandro Ferrero
- Department of Digestive and Hepatobiliary Surgery, Mauriziano Umberto I Hospital, Turin, Italy
| | - Martina Sollini
- Department of Nuclear Medicine, IRCCS San Raffaele, Milan, Italy
- Faculty of Medicine, Università Vita-Salute San Raffaele, Milan, Italy
| | - Arturo Chiti
- Department of Nuclear Medicine, IRCCS San Raffaele, Milan, Italy
- Faculty of Medicine, Università Vita-Salute San Raffaele, Milan, Italy
| | - Guido Torzilli
- Department of Biomedical Sciences, Humanitas University, Pieve Emanuele, Milan, Italy
- Division of Hepatobiliary and General Surgery, Department of Surgery, IRCCS Humanitas Research Hospital, Rozzano, Milan, Italy
| | - Francesca Ieva
- MOX Laboratory, Department of Mathematics, Politecnico di Milano, Milan, Italy
- CHDS - Center for Health Data Science, Human Technopole, Milan, Italy
| | - Luca Viganò
- Department of Biomedical Sciences, Humanitas University, Pieve Emanuele, Milan, Italy.
- Hepatobiliary Unit, Department of Minimally Invasive General and Oncologic Surgery, Humanitas Gavazzeni University Hospital, Bergamo, Italy.
| |
Collapse
|
49
|
Stahl D. New horizons in prediction modelling using machine learning in older people's healthcare research. Age Ageing 2024; 53:afae201. [PMID: 39311424 PMCID: PMC11417961 DOI: 10.1093/ageing/afae201] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2023] [Revised: 06/26/2024] [Indexed: 09/26/2024] Open
Abstract
Machine learning (ML) and prediction modelling have become increasingly influential in healthcare, providing critical insights and supporting clinical decisions, particularly in the age of big data. This paper serves as an introductory guide for health researchers and readers interested in prediction modelling and explores how these technologies support clinical decisions, particularly with big data, and covers all aspects of the development, assessment and reporting of a model using ML. The paper starts with the importance of prediction modelling for precision medicine. It outlines different types of prediction and machine learning approaches, including supervised, unsupervised and semi-supervised learning, and provides an overview of popular algorithms for various outcomes and settings. It also introduces key theoretical ML concepts. The importance of data quality, preprocessing and unbiased model performance evaluation is highlighted. Concepts of apparent, internal and external validation will be introduced along with metrics for discrimination and calibration for different types of outcomes. Additionally, the paper addresses model interpretation, fairness and implementation in clinical practice. Finally, the paper provides recommendations for reporting and identifies common pitfalls in prediction modelling and machine learning. The aim of the paper is to help readers understand and critically evaluate research papers that present ML models and to serve as a first guide for developing, assessing and implementing their own.
Collapse
Affiliation(s)
- Daniel Stahl
- Department of Biostatistics and Health Informatics, Institute of Psychiatry, Psychology & Neuroscience, King’s College London, London, UK
| |
Collapse
|
50
|
Park SH, Song SH, Burton F, Arsan C, Jobst B, Feldman M. Machine learning characterization of a rare neurologic disease via electronic health records: a proof-of-principle study on stiff person syndrome. BMC Neurol 2024; 24:272. [PMID: 39097681 PMCID: PMC11297611 DOI: 10.1186/s12883-024-03760-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2024] [Accepted: 07/12/2024] [Indexed: 08/05/2024] Open
Abstract
BACKGROUND Despite the frequent diagnostic delays of rare neurologic diseases (RND), it remains difficult to study RNDs and their comorbidities due to their rarity and hence the statistical underpowering. Affecting one to two in a million annually, stiff person syndrome (SPS) is an RND characterized by painful muscle spasms and rigidity. Leveraging underutilized electronic health records (EHR), this study showcased a machine-learning-based framework to identify clinical features that optimally characterize the diagnosis of SPS. METHODS A machine-learning-based feature selection approach was employed on 319 items from the past medical histories of 48 individuals (23 with a diagnosis of SPS and 25 controls) with elevated serum autoantibodies against glutamic-acid-decarboxylase-65 (anti-GAD65) in Dartmouth Health's EHR to determine features with the highest discriminatory power. Each iteration of the algorithm implemented a Support Vector Machine (SVM) model, generating importance scores-SHapley Additive exPlanation (SHAP) values-for each feature and removing one with the least salient. Evaluation metrics were calculated through repeated stratified cross-validation. RESULTS Depression, hypothyroidism, GERD, and joint pain were the most characteristic features of SPS. Utilizing these features, the SVM model attained precision of 0.817 (95% CI 0.795-0.840), sensitivity of 0.766 (95% CI 0.743-0.790), F-score of 0.761 (95% CI 0.744-0.778), AUC of 0.808 (95% CI 0.791-0.825), and accuracy of 0.775 (95% CI 0.759-0.790). CONCLUSIONS This framework discerned features that, with further research, may help fully characterize the pathologic mechanism of SPS: depression, hypothyroidism, and GERD may respectively represent comorbidities through common inflammatory, genetic, and dysautonomic links. This methodology could address diagnostic challenges in neurology by uncovering latent associations and generating hypotheses for RNDs.
Collapse
Affiliation(s)
- Soo Hwan Park
- Geisel School of Medicine at Dartmouth, Hanover, NH, USA
- Department of Neurology, Dartmouth Health, Lebanon, NH, USA
| | - Seo Ho Song
- Department of Psychiatry, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, USA
| | - Frederick Burton
- Department of Psychiatry, University of California Los Angeles Health, Los Angeles, CA, USA
| | - Cybèle Arsan
- Department of Psychiatry, Oakland Medical Center, Kaiser Permanente, Oakland, CA, USA
| | - Barbara Jobst
- Geisel School of Medicine at Dartmouth, Hanover, NH, USA
- Department of Neurology, Dartmouth Health, Lebanon, NH, USA
| | - Mary Feldman
- Geisel School of Medicine at Dartmouth, Hanover, NH, USA.
- Department of Neurology, Dartmouth Health, Lebanon, NH, USA.
| |
Collapse
|