1
|
Tsoumtsa Meda L, Lagarde J, Guillier L, Roussel S, Douarre PE. Using GWAS and Machine Learning to Identify and Predict Genetic Variants Associated with Foodborne Bacteria Phenotypic Traits. Methods Mol Biol 2025; 2852:223-253. [PMID: 39235748 DOI: 10.1007/978-1-0716-4100-2_16] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/06/2024]
Abstract
One of the main challenges in food microbiology is to prevent the risk of outbreaks by avoiding the distribution of food contaminated by bacteria. This requires constant monitoring of the circulating strains throughout the food production chain. Bacterial genomes contain signatures of natural evolution and adaptive markers that can be exploited to better understand the behavior of pathogen in the food industry. The monitoring of foodborne strains can therefore be facilitated by the use of these genomic markers capable of rapidly providing essential information on isolated strains, such as the source of contamination, risk of illness, potential for biofilm formation, and tolerance or resistance to biocides. The increasing availability of large genome datasets is enhancing the understanding of the genetic basis of complex traits such as host adaptation, virulence, and persistence. Genome-wide association studies have shown very promising results in the discovery of genomic markers that can be integrated into rapid detection tools. In addition, machine learning has successfully predicted phenotypes and classified important traits. Genome-wide association and machine learning tools have therefore the potential to support decision-making circuits intending at reducing the burden of foodborne diseases. The aim of this chapter review is to provide knowledge on the use of these two methods in food microbiology and to recommend their use in the field.
Collapse
Affiliation(s)
- Landry Tsoumtsa Meda
- ACTALIA, La Roche-sur-Foron, France
- ANSES, Salmonella and Listeria Unit (USEL), University of Paris-Est, Maisons-Alfort Laboratory for Food Safety, Maisons-Alfort, France
| | - Jean Lagarde
- ANSES, Salmonella and Listeria Unit (USEL), University of Paris-Est, Maisons-Alfort Laboratory for Food Safety, Maisons-Alfort, France
- INRAE, Unit of Process Optimisation in Food, Agriculture and the Environment (UR OPAALE), Rennes, France
| | | | - Sophie Roussel
- ANSES, Salmonella and Listeria Unit (USEL), University of Paris-Est, Maisons-Alfort Laboratory for Food Safety, Maisons-Alfort, France
| | - Pierre-Emmanuel Douarre
- ANSES, Salmonella and Listeria Unit (USEL), University of Paris-Est, Maisons-Alfort Laboratory for Food Safety, Maisons-Alfort, France.
| |
Collapse
|
2
|
Teng C, Yang C, Liu Q. Utilising AI technique to identify depression risk among doctoral students. Sci Rep 2024; 14:31978. [PMID: 39738390 DOI: 10.1038/s41598-024-83617-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2024] [Accepted: 12/16/2024] [Indexed: 01/02/2025] Open
Abstract
The phenomenon that the depression risk among doctoral students is higher than that of other groups should not be ignored. Despite this, studies specifically addressing depression risk in doctoral students are relatively scarce, and existing findings are not universally applicable. Using neural network feature extraction technology, this study aims to investigate the factors contributing to the high depression risk of doctoral students and effectively identify doctoral students at depression risk, so as to propose corresponding improvement strategies to prevent and intervene doctoral students with depression risk for universities. Based on the data from the 2019 Nature Global Doctoral Student Survey, we first screened 13 highly relevant features from a total of 37 features potentially related to the risk of depression among doctoral students by Random Forest algorithm. Subsequently, we trained the optimal prediction model to predict the doctoral students with depression risk using a Multilayer Perceptron (MLP), achieving an accuracy of 89.09% on the test set. Additionally, this study constructed a group portrait of doctoral students at risk of depression, and found that overwork, poor work-life balance, and poor supervisor-student relationship, etc., were typical characteristics among these students. Finally, we proposed several improvement strategies for higher education institutions. Our research offers a new perspective on utilising artificial intelligence (AI) methods to tackle educational challenges, particularly in the identification and support of doctoral students at risk of depression, thereby enhancing their mental health.
Collapse
Affiliation(s)
- Changhong Teng
- School of Education, Beijing Institute of Technology, Beijing, 100081, China
| | - Chunmei Yang
- School of Education, Beijing Institute of Technology, Beijing, 100081, China.
| | | |
Collapse
|
3
|
Min SH, Woo K, Song J, Alexander GL, O'Malley T, Moen MD, Topaz M. Understanding Daily Care Experience Preferences Across the Lifespan of Older Adults: Application of Natural Language Processing. West J Nurs Res 2024:1939459241306946. [PMID: 39707813 DOI: 10.1177/01939459241306946] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2024]
Abstract
INTRODUCTION Older adults are a heterogeneous group, and their care experience preferences are likely to be diverse and individualized. Thus, the aim of this study was to identify categories of older adults' care experience preferences and to examine similarities and differences across different age groups. METHODS The initial categories of older adults' care experience preferences were identified through a qualitative review of narrative text (n = 3134) in the ADVault data set. A natural language processing (NLP) algorithm was used to automatically and accurately define older adults' care experience preference categories. Descriptive statistics were used to examine similarities and differences in care experience preference categories across different age groups. RESULTS The overall average performance of NLP algorithms was relatively high (average F-score = 0.88; range: 0.77-0.96). Through a qualitative review of 350 randomly selected texts, a total of 11 categories were identified. The most frequent category was music, followed by photographs, entertainment, family/friends, religion-related, atmosphere, flower/plants, pet, bed/bedding, hobby, and other. After applying the best performing NLP algorithm to each category, older adults' care experience preference categories were music (41.32%), followed by photographs (28.47%), entertainment (13.46%), religion-related (n = 12.69%), pet (12.22%), flower/plants (11.51%), family/friends (8.45%), atmosphere (7.75%), bed/bedding (6.12%), and hobby (5.45%). Young-old and old-old adults had similar care experience preferences with music being the most frequent category while old-old adults had photographs as the most frequent category for their care experience preference. CONCLUSION Clinicians must understand the distinct categories of care experience preferences and incorporate them into personalized care planning.
Collapse
Affiliation(s)
- Se Hee Min
- University of Pennsylvania School of Nursing, Philadelphia, PA, USA
| | - Kyungmi Woo
- Seoul National University College of Nursing, Seoul, South Korea
| | - Jiyoun Song
- University of Pennsylvania School of Nursing, Philadelphia, PA, USA
| | | | | | | | - Maxim Topaz
- Columbia University School of Nursing, New York, NY, USA
| |
Collapse
|
4
|
Sánchez-Martínez LJ, Charle-Cuéllar P, Gado AA, Ousmane N, Hernández CL, López-Ejeda N. Using Machine Learning to Fight Child Acute Malnutrition and Predict Weight Gain During Outpatient Treatment with a Simplified Combined Protocol. Nutrients 2024; 16:4213. [PMID: 39683605 DOI: 10.3390/nu16234213] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2024] [Revised: 12/03/2024] [Accepted: 12/04/2024] [Indexed: 12/18/2024] Open
Abstract
BACKGROUND/OBJECTIVES Child acute malnutrition is a global public health problem, affecting 45 million children under 5 years of age. The World Health Organization recommends monitoring weight gain weekly as an indicator of the correct treatment. However, simplified protocols that do not record the weight and base diagnosis and follow-up in arm circumference at discharge are being tested in emergency settings. The present study aims to use machine learning techniques to predict weight gain based on the socio-economic characteristics at admission for the children treated under a simplified protocol in the Diffa region of Niger. METHODS The sample consists of 535 children aged 6-59 months receiving outpatient treatment for acute malnutrition, for whom information on 51 socio-economic variables was collected. First, the Variable Selection Using Random Forest (VSURF) algorithm was used to select the variables associated with weight gain. Subsequently, the dataset was partitioned into training/testing, and an ensemble model was adjusted using five algorithms for prediction, which were combined using a Random Forest meta-algorithm. Afterward, Receiver Operating Characteristic (ROC) curves were used to identify the optimal cut-off point for predicting the group of individuals most vulnerable to developing low weight gain. RESULTS The critical variables that influence weight gain are water, hygiene and sanitation, the caregiver's employment-socio-economic level and access to treatment. The final ensemble prediction model achieved a better fit (R2 = 0.55) with respect to the individual algorithms (R2 = 0.14-0.27). An optimal cut-off point was identified to establish low weight gain, with an Area Under the Curve (AUC) of 0.777 at a value of <6.5 g/kg/day. The ensemble model achieved a success rate of 84% (78/93) at the identification of individuals below <6.5 g/kg/day in the test set. CONCLUSIONS The results highlight the importance of adapting the cut-off points for weight gain to each context, as well as the practical usefulness that these techniques can have in optimizing and adapting to the treatment in humanitarian settings.
Collapse
Affiliation(s)
- Luis Javier Sánchez-Martínez
- Unit of Physical Anthropology, Department of Biodiversity, Ecology and Evolution, Faculty of Biological Sciences, Complutense University of Madrid, 28040 Madrid, Spain
| | | | | | | | - Candela Lucía Hernández
- Unit of Physical Anthropology, Department of Biodiversity, Ecology and Evolution, Faculty of Biological Sciences, Complutense University of Madrid, 28040 Madrid, Spain
| | - Noemí López-Ejeda
- Unit of Physical Anthropology, Department of Biodiversity, Ecology and Evolution, Faculty of Biological Sciences, Complutense University of Madrid, 28040 Madrid, Spain
- EPINUT Research Group, Faculty of Medicine, Complutense University of Madrid, 28040 Madrid, Spain
| |
Collapse
|
5
|
Klontzas ME, Vernardis SI, Batsali A, Papadogiannis F, Panoskaltsis N, Mantalaris A. Machine Learning and Metabolomics Predict Mesenchymal Stem Cell Osteogenic Differentiation in 2D and 3D Cultures. J Funct Biomater 2024; 15:367. [PMID: 39728167 DOI: 10.3390/jfb15120367] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2024] [Revised: 11/27/2024] [Accepted: 12/03/2024] [Indexed: 12/28/2024] Open
Abstract
Stem cells have been widely used to produce artificial bone grafts. Nonetheless, the variability in the degree of stem cell differentiation is an inherent drawback of artificial graft development and requires robust evaluation tools that can certify the quality of stem cell-based products and avoid source-tissue-related and patient-specific variability in outcomes. Omics analyses have been utilised for the evaluation of stem cell attributes in all stages of stem cell biomanufacturing. Herein, metabolomics in combination with machine learning was utilised for the benchmarking of osteogenic differentiation quality in 2D and 3D cultures. Metabolomics analysis was performed with the use of gas chromatography-mass spectrometry (GC-MS). A set of 11 metabolites was used to train an XGboost model which achieved excellent performance in distinguishing between differentiated and undifferentiated umbilical cord blood mesenchymal stem cells (UCB MSCs). The model was benchmarked against samples not present in the training set, being able to efficiently capture osteogenesis in 3D UCB MSC cultures with an area under the curve (AUC) of 82.6%. On the contrary, the model did not capture any differentiation in Wharton's Jelly MSC samples, which are well-known underperformers in osteogenic differentiation (AUC of 56.2%). Mineralisation was significantly correlated with the levels of fumarate, glycerol, and myo-inositol, the four metabolites found most important for model performance (R2 = 0.89, R2 = 0.94, and R2 = 0.96, and p = 0.016, p = 0.0059, and p = 0.0022, respectively). In conclusion, our results indicate that metabolomics in combination with machine learning can be used for the development of reliable potency assays for the evaluation of Advanced Therapy Medicinal Products.
Collapse
Affiliation(s)
- Michail E Klontzas
- Artificial Intelligence and Translational Imaging (ATI) Lab, Department of Radiology, School of Medicine, University of Crete, 71003 Heraklion, Greece
- Computational Biomedicine Laboratory, Institute of Computer Science, Foundation for Research and Technology (ICS-FORTH), 70013 Heraklion, Greece
| | | | - Aristea Batsali
- Haemopoiesis Research Laboratory, School of Medicine, University of Crete, 71003 Heraklion, Greece
| | - Fotios Papadogiannis
- Haemopoiesis Research Laboratory, School of Medicine, University of Crete, 71003 Heraklion, Greece
| | - Nicki Panoskaltsis
- BioMedical Systems Engineering Laboratory, Panoz Institute, School of Pharmacy and Pharmaceutical Sciences, Trinity College, D02 PN40 Dublin, Ireland
| | - Athanasios Mantalaris
- BioMedical Systems Engineering Laboratory, Panoz Institute, School of Pharmacy and Pharmaceutical Sciences, Trinity College, D02 PN40 Dublin, Ireland
- National Institute for Bioprocessing Research and Training (NIBRT), Foster Avenue, Mount Merrion, Blackrock, A94 X099 Dublin, Ireland
| |
Collapse
|
6
|
Ogutu S, Mohammed M, Mwambi H. Cytokine profiles as predictors of HIV incidence using machine learning survival models and statistical interpretable techniques. Sci Rep 2024; 14:29895. [PMID: 39622992 PMCID: PMC11612445 DOI: 10.1038/s41598-024-81510-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2024] [Accepted: 11/27/2024] [Indexed: 12/06/2024] Open
Abstract
HIV remains a critical global health issue, with an estimated 39.9 million people living with the virus worldwide by the end of 2023 (according to WHO). Although the epidemic's impact varies significantly across regions, Africa remains the most affected. In the past decade, considerable efforts have focused on developing preventive measures, such as vaccines and pre-exposure prophylaxis, to combat sexually transmitted HIV. Recently, cytokine profiles have gained attention as potential predictors of HIV incidence due to their involvement in immune regulation and inflammation, presenting new opportunities to enhance preventative strategies. However, the high-dimensional, time-varying nature of cytokine data collected in clinical research, presents challenges for traditional statistical methods like the Cox proportional hazards (PH) model to effectively analyze survival data related to HIV. Machine learning (ML) survival models offer a robust alternative, especially for addressing the limitations of the PH model's assumptions. In this study, we applied survival support vector machine (SSVM) and random survival forest (RSF) models using changes or means in cytokine levels as predictors to assess their association with HIV incidence, evaluate variable importance, measure predictive accuracy using the concordance index (C-index) and integrated Brier score (IBS) and interpret the model's predictions using Shapley additive explanations (SHAP) values. Our results indicated that RSFs models outperformed SSVMs models, with the difference covariate model performing better than the mean covariate model. The highest C-index for SSVM was 0.7180 under the difference covariate model, while for RSF, it reached 0.8801 under the difference covariate model using the log-rank split rule. Key cytokines identified as positive predictors of HIV incidence included TNF-A, BASIC-FGF, IL-5, MCP-3, and EOTAXIN, while 29 cytokines were negative predictors. Baseline factors such as condom use frequency, treatment status, number of partners, and sexual activity also emerged as significant predictors. This study underscored the potential of cytokine profiles for predicting HIV incidence and highlighted the advantages of RSFs models in analyzing high-dimensional, time-varying data over SSVMs. It further through ablation studies emphasized the importance of selecting key features within mean and difference based covariate models to achieve an optimal balance between model complexity and predictive accuracy.
Collapse
Affiliation(s)
- Sarah Ogutu
- School of Mathematics, Statistics and Computer Science, University of KwaZulu-Natal, Pietermaritzburg, 3201, South Africa.
| | - Mohanad Mohammed
- School of Mathematics, Statistics and Computer Science, University of KwaZulu-Natal, Pietermaritzburg, 3201, South Africa
- School of Nursing and Public Health, University of KwaZulu-Natal, Pietermaritzburg, 3201, South Africa
| | - Henry Mwambi
- School of Mathematics, Statistics and Computer Science, University of KwaZulu-Natal, Pietermaritzburg, 3201, South Africa
| |
Collapse
|
7
|
Ngeow AJH, Moosa AS, Tan MG, Zou L, Goh MMR, Lim GH, Tagamolila V, Ereno I, Durnford JR, Cheung SKH, Hong NWJ, Soh SY, Tay YY, Chang ZY, Ong R, Tsang LPM, Yip BKL, Chia KW, Yap K, Lim MH, Ta AWA, Goh HL, Yeo CL, Chan DKL, Tan NC. Development and Validation of a Smartphone Application for Neonatal Jaundice Screening. JAMA Netw Open 2024; 7:e2450260. [PMID: 39661385 PMCID: PMC11635536 DOI: 10.1001/jamanetworkopen.2024.50260] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/17/2024] [Accepted: 10/20/2024] [Indexed: 12/12/2024] Open
Abstract
Importance This diagnostic study describes the merger of domain knowledge (Kramer principle of dermal advancement of icterus) with current machine learning (ML) techniques to create a novel tool for screening of neonatal jaundice (NNJ), which affects 60% of term and 80% of preterm infants. Objective This study aimed to develop and validate a smartphone-based ML app to predict bilirubin (SpB) levels in multiethnic neonates using skin color analysis. Design, Setting, and Participants This diagnostic study was conducted between June 2022 and June 2024 at a tertiary hospital and 4 primary-care clinics in Singapore with a consecutive sample of neonates born at 35 or more weeks' gestation and within 21 days of birth. Exposure The smartphone-based ML app captured skin images via the central aperture of a standardized color calibration sticker card from multiple regions of interest arranged in a cephalocaudal fashion, following the Kramer principle of dermal advancement of icterus. The ML model underwent iterative development and k-folds cross-validation, with performance assessed based on root mean squared error, Pearson correlation, and agreement with total serum bilirubin (TSB). The final ML model underwent temporal validation. Main Outcomes and Measures Linear correlation and statistical agreement between paired SpB and TSB; sensitivity and specificity for detection of TSB equal to or greater than 17mg/dL with SpB equal to or greater than 13 mg/dL were assessed. Results The smartphone-based ML app was validated on 546 neonates (median [IQR] gestational age, 38.0 [35.0-41.0] weeks; 286 [52.4%] male; 315 [57.7%] Chinese, 35 [6.4%] Indian, 169 [31.0%] Malay, and 27 [4.9%] other ethnicities). Iterative development and cross-validation was performed on 352 neonates. The final ML model (ensembled gradient boosted trees) incorporated yellowness indicators from the forehead, sternum, and abdomen. Temporal validation on 194 neonates yielded a Pearson r of 0.84 (95% CI, 0.79-0.88; P < .001), 82% of data pairs within clinically acceptable limits of 3 mg/dL, sensitivity of 100%, specificity of 70%, positive predictive value of 10%, negative predictive value of 100%, positive likelihood ratio of 3.3, negative likelihood ratio of 0, and area under the receiver operating characteristic curve of 0.89 (95% CI, 0.82-0.96). Conclusions and Relevance In this diagnostic study of a new smartphone-based ML app, there was good correlation and statistical agreement with TSB with sensitivity of 100%. The screening tool has the potential to be an NNJ screening tool, with treatment decisions based on TSB (reference standard). Further prospective studies are needed to establish the generalizability and cost-effectiveness of the screening tool in the clinical setting.
Collapse
Affiliation(s)
- Alvin Jia Hao Ngeow
- Department of Neonatal and Developmental Medicine, Singapore General Hospital, Singapore
- Yong Loo Lin School of Medicine, National University of Singapore, Singapore
- Paediatrics Academic Clinical Programme, Duke-NUS Medical School, Singapore
- Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore
| | - Aminath Shiwaza Moosa
- SingHealth Polyclinics, Singapore
- Family Medicine Academic Clinical Programme, Duke-NUS Medical School, Singapore
| | - Mary Grace Tan
- Department of Neonatal and Developmental Medicine, Singapore General Hospital, Singapore
| | - Lin Zou
- Synapxe (formerly Integrated Health Information Systems, IHiS), Singapore
| | | | - Gek Hsiang Lim
- Health Services Research Unit, Singapore General Hospital, Singapore
| | - Vina Tagamolila
- Department of Neonatal and Developmental Medicine, Singapore General Hospital, Singapore
| | - Imelda Ereno
- Department of Neonatal and Developmental Medicine, Singapore General Hospital, Singapore
| | - Jared Ryan Durnford
- Department of Neonatal and Developmental Medicine, Singapore General Hospital, Singapore
- Paediatrics Academic Clinical Programme, Duke-NUS Medical School, Singapore
| | - Samson Kei Him Cheung
- Department of Neonatal and Developmental Medicine, Singapore General Hospital, Singapore
- Paediatrics Academic Clinical Programme, Duke-NUS Medical School, Singapore
| | - Nicholas Wei Jie Hong
- Department of Neonatal and Developmental Medicine, Singapore General Hospital, Singapore
- Paediatrics Academic Clinical Programme, Duke-NUS Medical School, Singapore
| | - Ser Yee Soh
- Department of Neonatal and Developmental Medicine, Singapore General Hospital, Singapore
- Paediatrics Academic Clinical Programme, Duke-NUS Medical School, Singapore
| | - Yih Yann Tay
- Nursing Division, Singapore General Hospital, Singapore
| | - Zi Ying Chang
- SingHealth Polyclinics, Singapore
- Family Medicine Academic Clinical Programme, Duke-NUS Medical School, Singapore
| | - Ruiheng Ong
- SingHealth Polyclinics, Singapore
- Family Medicine Academic Clinical Programme, Duke-NUS Medical School, Singapore
| | - Li Ping Marianne Tsang
- SingHealth Polyclinics, Singapore
- Family Medicine Academic Clinical Programme, Duke-NUS Medical School, Singapore
| | - Benny K. L. Yip
- Department of Future Health System, Singapore General Hospital, Singapore
| | - Kuok Wei Chia
- Department of Future Health System, Singapore General Hospital, Singapore
| | | | - Ming Hwee Lim
- Department of Clinical Pathology, Singapore General Hospital, Singapore
| | - Andy Wee An Ta
- Synapxe (formerly Integrated Health Information Systems, IHiS), Singapore
| | - Han Leong Goh
- Synapxe (formerly Integrated Health Information Systems, IHiS), Singapore
| | - Cheo Lian Yeo
- Department of Neonatal and Developmental Medicine, Singapore General Hospital, Singapore
- Yong Loo Lin School of Medicine, National University of Singapore, Singapore
- Paediatrics Academic Clinical Programme, Duke-NUS Medical School, Singapore
- Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore
| | - Daisy Kwai Lin Chan
- Department of Neonatal and Developmental Medicine, Singapore General Hospital, Singapore
- Yong Loo Lin School of Medicine, National University of Singapore, Singapore
- Paediatrics Academic Clinical Programme, Duke-NUS Medical School, Singapore
- Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore
| | - Ngiap Chuan Tan
- SingHealth Polyclinics, Singapore
- Family Medicine Academic Clinical Programme, Duke-NUS Medical School, Singapore
| |
Collapse
|
8
|
Vachon J, Kerckhoffs J, Buteau S, Smargiassi A. Do machine learning methods improve prediction of ambient air pollutants with high spatial contrast? A systematic review. ENVIRONMENTAL RESEARCH 2024; 262:119751. [PMID: 39117059 DOI: 10.1016/j.envres.2024.119751] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/20/2023] [Revised: 07/18/2024] [Accepted: 08/04/2024] [Indexed: 08/10/2024]
Abstract
BACKGROUND & OBJECTIVE The use of machine learning for air pollution modelling is rapidly increasing. We conducted a systematic review of studies comparing statistical and machine learning models predicting the spatiotemporal variation of ambient nitrogen dioxide (NO2), ultrafine particles (UFPs) and black carbon (BC) to determine whether and in which scenarios machine learning generates more accurate predictions. METHODS Web of Science and Scopus were searched up to June 13, 2024. All records were screened by two independent reviewers. Differences in the coefficient of determination (R2) and Root Mean Square Error (RMSE) between best statistical and machine learning methods were compared across categories of methodological elements. RESULTS A total of 38 studies with 46 model comparisons (30 for NO2, 8 for UFPs and 8 for BC) were included. Linear non-regularized methods and Random Forest were most frequently used. Machine learning outperformed statistical models in 34 comparisons. Mean differences (95% confidence intervals) in R2 and RMSE between best machine learning and statistical models were 0.12 (0.08, 0.17) and 20% (11%, 29%) respectively. Tree-based methods performed best in 12 of 17 multi-model comparisons. Nonlinear or regularization regression methods were used in only 12 comparisons and provided similar performance to machine learning methods. CONCLUSION This systematic review suggests that machine learning methods, especially tree-based methods, may be superior to linear non-regularized methods for predicting ambient concentrations of NO2, UFPs and BC. Additional comparison studies using nonlinear, regularized and a wider array of machine learning methods are needed to confirm their relative performance. Future air pollution studies would also benefit from more explicit and standardized reporting of methodologies and results.
Collapse
Affiliation(s)
- Julien Vachon
- Department of Environmental and Occupational Health, School of Public Health, University of Montreal, Montreal, Canada; Center for Public Health Research (CReSP), University of Montreal and CIUSSS Du Centre-Sud-de-l'Île-de-Montréal, Montreal, Canada
| | - Jules Kerckhoffs
- Institute for Risk Assessment Sciences, Utrecht University, Utrecht, the Netherlands
| | - Stéphane Buteau
- Department of Environmental and Occupational Health, School of Public Health, University of Montreal, Montreal, Canada; Center for Public Health Research (CReSP), University of Montreal and CIUSSS Du Centre-Sud-de-l'Île-de-Montréal, Montreal, Canada
| | - Audrey Smargiassi
- Department of Environmental and Occupational Health, School of Public Health, University of Montreal, Montreal, Canada; Center for Public Health Research (CReSP), University of Montreal and CIUSSS Du Centre-Sud-de-l'Île-de-Montréal, Montreal, Canada.
| |
Collapse
|
9
|
Wlosik J, Granjeaud S, Gorvel L, Olive D, Chretien AS. A beginner's guide to supervised analysis for mass cytometry data in cancer biology. Cytometry A 2024; 105:853-869. [PMID: 39486897 DOI: 10.1002/cyto.a.24901] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2024] [Revised: 09/16/2024] [Accepted: 10/01/2024] [Indexed: 11/04/2024]
Abstract
Mass cytometry enables deep profiling of biological samples at single-cell resolution. This technology is more than relevant in cancer research due to high cellular heterogeneity and complexity. Downstream analysis of high-dimensional datasets increasingly relies on machine learning (ML) to extract clinically relevant information, including supervised algorithms for classification and regression purposes. In cancer research, they are used to develop predictive models that will guide clinical decision making. However, the development of supervised algorithms faces major challenges, such as sufficient validation, before being translated into the clinics. In this work, we provide a framework for the analysis of mass cytometry data with a specific focus on supervised algorithms and practical examples of their applications. We also raise awareness on key issues regarding good practices for researchers curious to implement supervised ML on their mass cytometry data. Finally, we discuss the challenges of supervised ML application to cancer research.
Collapse
Affiliation(s)
- Julia Wlosik
- Team 'Immunity and Cancer', Marseille Cancer Research Center, Inserm U1068, CNRS UMR7258, Paoli-Calmettes Institute, Aix-Marseille University UM105, Marseille, France
- Immunomonitoring Department, Paoli-Calmettes Institute, Marseille, France
| | - Samuel Granjeaud
- Systems Biology Platform, Marseille Cancer Research Center, Inserm U1068, CNRS UMR7258, Paoli-Calmettes Institute, Aix-Marseille University UM105, Marseille, France
| | - Laurent Gorvel
- Team 'Immunity and Cancer', Marseille Cancer Research Center, Inserm U1068, CNRS UMR7258, Paoli-Calmettes Institute, Aix-Marseille University UM105, Marseille, France
- Immunomonitoring Department, Paoli-Calmettes Institute, Marseille, France
| | - Daniel Olive
- Team 'Immunity and Cancer', Marseille Cancer Research Center, Inserm U1068, CNRS UMR7258, Paoli-Calmettes Institute, Aix-Marseille University UM105, Marseille, France
- Immunomonitoring Department, Paoli-Calmettes Institute, Marseille, France
| | - Anne-Sophie Chretien
- Team 'Immunity and Cancer', Marseille Cancer Research Center, Inserm U1068, CNRS UMR7258, Paoli-Calmettes Institute, Aix-Marseille University UM105, Marseille, France
- Immunomonitoring Department, Paoli-Calmettes Institute, Marseille, France
| |
Collapse
|
10
|
López-Rueda A, Rodríguez-Sánchez MÁ, Serrano E, Moreno J, Rodríguez A, Llull L, Amaro S, Oleaga L. Enhancing mortality prediction in patients with spontaneous intracerebral hemorrhage: Radiomics and supervised machine learning on non-contrast computed tomography. Eur J Radiol Open 2024; 13:100618. [PMID: 39687913 PMCID: PMC11648778 DOI: 10.1016/j.ejro.2024.100618] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2024] [Revised: 11/17/2024] [Accepted: 11/28/2024] [Indexed: 12/18/2024] Open
Abstract
Purpose This study aims to develop a Radiomics-based Supervised Machine-Learning model to predict mortality in patients with spontaneous intracerebral hemorrhage (sICH). Methods Retrospective analysis of a prospectively collected clinical registry of patients with sICH consecutively admitted at a single academic comprehensive stroke center between January-2016 and April-2018. We conducted an in-depth analysis of 105 radiomic features extracted from 105 patients. Following the identification and handling of missing values, radiomics values were scaled to 0-1 to train different classifiers. The sample was split into 80-20 % training-test and validation cohort in a stratified fashion. Random Forest(RF), K-Nearest Neighbor(KNN), and Support Vector Machine(SVM) classifiers were evaluated, along with several feature selection methods and hyperparameter optimization strategies, to classify the binary outcome of mortality or survival during hospital admission. A tenfold stratified cross-validation method was used to train the models, and average metrics were calculated. Results RF, KNN, and SVM, with the "DropOut+SelectKBest" feature selection strategy and no hyperparameter optimization, demonstrated the best performances with the least number of radiomic features and the most simplified models, achieving a sensitivity range between 0.90 and 0.95 and AUC range from 0.97 to 1 on the validation dataset. Regarding the confusion matrix, the SVM model did not predict any false negative test (negative predicted value 1). Conclusion Radiomics-based Supervised Machine Learning models can predict mortality during admission in patients with sICH. SVM with the "DropOut+SelectKBest" feature selection strategy and no hyperparameter optimization was the best simplified model to detect mortality during admission in patients with sICH.
Collapse
Affiliation(s)
- Antonio López-Rueda
- Clinical Informatics Department, Hospital Clínic de Barcelona, Barcelona, Spain
- Radiology Department, Hospital Clínic de Barcelona, Barcelona, Spain
| | | | - Elena Serrano
- Radiology Department, Hospital Universitario de Bellvitge, Barcelona, Spain
| | - Javier Moreno
- Radiology Department, Hospital Clínic de Barcelona, Barcelona, Spain
| | | | - Laura Llull
- Neurology Department, Hospital Clínic de Barcelona, Barcelona, Spain
| | - Sergi Amaro
- Neurology Department, Hospital Clínic de Barcelona, Barcelona, Spain
| | - Laura Oleaga
- Radiology Department, Hospital Clínic de Barcelona, Barcelona, Spain
| |
Collapse
|
11
|
Leghissa M, Carrera Á, Iglesias CÁ. FRELSA: A dataset for frailty in elderly people originated from ELSA and evaluated through machine learning models. Int J Med Inform 2024; 192:105603. [PMID: 39232373 DOI: 10.1016/j.ijmedinf.2024.105603] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2024] [Revised: 08/01/2024] [Accepted: 08/13/2024] [Indexed: 09/06/2024]
Abstract
BACKGROUND Frailty is an age-related syndrome characterized by loss of strength and exhaustion and associated with multi-morbidity. Early detection and prediction of the appearance of frailty could help older people age better and prevent them from needing invasive and expensive treatments. Machine learning techniques show promising results in creating a medical support tool for such a task. METHODS This study aims to create a dataset for machine learning-based frailty studies, using Fried's Frailty Phenotype definition. Starting from a longitudinal study on aging in the UK population, we defined a frailty label for each subject. We evaluated the definition by training seven different models for detecting frailty with data that were contemporary to the ones used for the definition. We then integrated more data from two years before to obtain prediction models with a 24-month horizon. Features selection was performed using the MultiSURF algorithm, which ranks all features in order of relevance to the detection or prediction task. RESULTS We present a new frailty dataset of 5303 subjects and more than 6500 available features. It is publicly available, provided one has access to the original English Longitudinal Study of Ageing dataset. The dataset is balanced after grouping frailty with pre-frailty, and it is suitable for multiclass or binary classification and prediction problems. The seven tested architectures performed similarly, forming a solid baseline that can be improved with future work. Linear regression achieved the best F-score and AUROC in detection and prediction tasks. CONCLUSIONS Creating new frailty-annotated datasets of this size is necessary to develop and improve the frailty prediction techniques. We have shown that our dataset can be used to study and test machine learning models to detect and predict frailty. Future work should improve models' architecture and performance, consider explainability, and possibly enrich the dataset with older waves.
Collapse
Affiliation(s)
- Matteo Leghissa
- Universidad Politécnica de Madrid, Av. Complutense, 30, 28040, Madrid, Spain.
| | - Álvaro Carrera
- Universidad Politécnica de Madrid, Av. Complutense, 30, 28040, Madrid, Spain.
| | - Carlos Á Iglesias
- Universidad Politécnica de Madrid, Av. Complutense, 30, 28040, Madrid, Spain.
| |
Collapse
|
12
|
Hannon DM, Syed JDA, McNicholas B, Madden M, Laffey JG. The development of a C5.0 machine learning model in a limited data set to predict early mortality in patients with ARDS undergoing an initial session of prone positioning. Intensive Care Med Exp 2024; 12:103. [PMID: 39540987 PMCID: PMC11564488 DOI: 10.1186/s40635-024-00682-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2024] [Accepted: 10/06/2024] [Indexed: 11/16/2024] Open
Abstract
BACKGROUND Acute Respiratory Distress Syndrome (ARDS) has a high morbidity and mortality. One therapy that can decrease mortality is ventilation in the prone position (PP). Patients undergoing PP are amongst the sickest, and there is a need for early identification of patients at particularly high risk of death. These patients may benefit from an in-depth review of treatment or consideration of rescue therapies. We report the development of a machine learning model trained to predict early mortality in patients undergoing prone positioning as part of the management of their ARDS. METHODS Prospectively collected clinical data were analysed retrospectively from a single tertiary ICU. The records of patients who underwent an initial session of prone positioning whilst receiving invasive mechanical ventilation were identified (n = 131). The decision to perform prone positioning was based on the criteria in the PROSEVA study. A C5.0 classifier algorithm with adaptive boosting was trained on data gathered before, during, and after initial proning. Data was split between training (85% of data) and testing (15% of data). Hyperparameter tuning was achieved through a grid-search using a maximal entropy configuration. Predictions for 7-day mortality after initial proning session were made on the training and testing data. RESULTS The model demonstrated good performance in predicting 7-day mortality (AUROC: 0.89 training, 0.78 testing). Seven variables were used for prediction. Sensitivity was 0.80 and specificity was 0.67 on the testing data set. Patients predicted to survive had 13.3% mortality, while those predicted to die had 66.67% mortality. Among patients in whom the model predicted patient would survive to day 7 based on their response, mortality at day 7 was 13.3%. Conversely, if the model predicted the patient would not survive to day 7, mortality was 66.67%. CONCLUSIONS This proof-of-concept study shows that with a limited data set, a C5.0 classifier can predict 7-day mortality from a number of variables, including the response to initial proning, and identify a cohort at significantly higher risk of death. This can help identify patients failing conventional therapies who may benefit from a thorough review of their management, including consideration of rescue treatments, such as extracorporeal membrane oxygenation. This study shows the potential of a machine learning model to identify ARDS patients at high risk of early mortality following PP. This information can guide clinicians in tailoring treatment strategies and considering rescue therapies. Further validation in larger cohorts is needed.
Collapse
Affiliation(s)
- David M Hannon
- Department of Anaesthesia, Galway University Hospital, and School of Medicine, University of Galway, Galway, Ireland
- Anaesthesia and Intensive Care Medicine, School of Medicine, University of Galway, Galway, Ireland
| | - Jaffar David Abbas Syed
- Department of Anaesthesia, Galway University Hospital, and School of Medicine, University of Galway, Galway, Ireland
| | - Bairbre McNicholas
- Department of Anaesthesia, Galway University Hospital, and School of Medicine, University of Galway, Galway, Ireland
- Anaesthesia and Intensive Care Medicine, School of Medicine, University of Galway, Galway, Ireland
| | - Michael Madden
- School of Computer Science, University of Galway, Galway, Ireland
| | - John G Laffey
- Department of Anaesthesia, Galway University Hospital, and School of Medicine, University of Galway, Galway, Ireland.
- Anaesthesia and Intensive Care Medicine, School of Medicine, University of Galway, Galway, Ireland.
| |
Collapse
|
13
|
Knezevic A, Arsenovic J, Garipi E, Platisa N, Savic A, Aleksandric T, Popovic D, Subic L, Milenovic N, Simic Panic D, Budinski S, Pasternak J, Manojlovic V, Knezevic MJ, Kapetina Radovic M, Jelicic Z. Machine Learning Model for Predicting Walking Ability in Lower Limb Amputees. J Clin Med 2024; 13:6763. [PMID: 39597907 PMCID: PMC11594448 DOI: 10.3390/jcm13226763] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2024] [Revised: 11/04/2024] [Accepted: 11/08/2024] [Indexed: 11/29/2024] Open
Abstract
Background/Objectives: The number of individuals with lower limb loss (LLL) is rising. Therefore, identifying the walking potential in individuals with LLL and prescribing adequate prosthetic systems are crucial. Various factors can influence participants' walking ability, to different extents. The aim of the present study was to apply machine learning methods to develop a predictive mode. This model can assist rehabilitation and limb loss care teams in making informed decisions regarding prosthesis prescription and predicting walking ability in individuals with LLL. Methods: The present study was designed as a prospective cross-sectional study encompassing 104 consecutively recruited participants with LLL (average age 62.1 ± 10.9 years, 80 (76.9%) men) at the Medical Rehabilitation Clinic. Demographic, physical, psychological, and social status data of patients were collected at the beginning of the rehabilitation program. At the end of the treatment, K-level estimation of functional ability, a Timed Up and Go Test (TUG), and a Two-Minute Walking Test (TMWT) were performed. Support vector machines (SVM) were used to develop the prediction model. Results: Three decision trees were created, one for each output, as follows: K-level, TUG, and TMWT. For all three outputs, there were eight significant predictors (balance, body mass index, age, Beck depression inventory, amputation level, muscle strength of the residual extremity hip extensors, intact extremity (IE) plantar flexors, and IE hip extensors). For the K-level, the ninth predictor was The Multidimensional Scale of Perceived Social Support (MSPSS). Conclusions: Using the SVM model, we can predict the K-level, TUG, and TMWT with high accuracy. These clinical assessments could be incorporated into routine clinical practice to guide clinicians and inform patients of their potential level of ambulation.
Collapse
Affiliation(s)
- Aleksandar Knezevic
- Faculty of Medicine, University of Novi Sad, 21000 Novi Sad, Serbia; (E.G.); (A.S.); (T.A.); (D.P.); (L.S.); (N.M.); (D.S.P.); (S.B.); (J.P.); (V.M.); (M.J.K.)
- Center for Physical Medicine and Rehabilitation, University Clinical Center of Vojvodina, 21000 Novi Sad, Serbia;
| | - Jovana Arsenovic
- Faculty of Technical Sciences, University of Novi Sad, 21000 Novi Sad, Serbia; (J.A.); (M.K.R.); (Z.J.)
| | - Enis Garipi
- Faculty of Medicine, University of Novi Sad, 21000 Novi Sad, Serbia; (E.G.); (A.S.); (T.A.); (D.P.); (L.S.); (N.M.); (D.S.P.); (S.B.); (J.P.); (V.M.); (M.J.K.)
- Center for Physical Medicine and Rehabilitation, University Clinical Center of Vojvodina, 21000 Novi Sad, Serbia;
| | - Nedeljko Platisa
- Center for Physical Medicine and Rehabilitation, University Clinical Center of Vojvodina, 21000 Novi Sad, Serbia;
| | - Aleksandra Savic
- Faculty of Medicine, University of Novi Sad, 21000 Novi Sad, Serbia; (E.G.); (A.S.); (T.A.); (D.P.); (L.S.); (N.M.); (D.S.P.); (S.B.); (J.P.); (V.M.); (M.J.K.)
- Center for Physical Medicine and Rehabilitation, University Clinical Center of Vojvodina, 21000 Novi Sad, Serbia;
| | - Tijana Aleksandric
- Faculty of Medicine, University of Novi Sad, 21000 Novi Sad, Serbia; (E.G.); (A.S.); (T.A.); (D.P.); (L.S.); (N.M.); (D.S.P.); (S.B.); (J.P.); (V.M.); (M.J.K.)
- Center for Physical Medicine and Rehabilitation, University Clinical Center of Vojvodina, 21000 Novi Sad, Serbia;
| | - Dunja Popovic
- Faculty of Medicine, University of Novi Sad, 21000 Novi Sad, Serbia; (E.G.); (A.S.); (T.A.); (D.P.); (L.S.); (N.M.); (D.S.P.); (S.B.); (J.P.); (V.M.); (M.J.K.)
- Center for Physical Medicine and Rehabilitation, University Clinical Center of Vojvodina, 21000 Novi Sad, Serbia;
| | - Larisa Subic
- Faculty of Medicine, University of Novi Sad, 21000 Novi Sad, Serbia; (E.G.); (A.S.); (T.A.); (D.P.); (L.S.); (N.M.); (D.S.P.); (S.B.); (J.P.); (V.M.); (M.J.K.)
- Center for Physical Medicine and Rehabilitation, University Clinical Center of Vojvodina, 21000 Novi Sad, Serbia;
| | - Natasa Milenovic
- Faculty of Medicine, University of Novi Sad, 21000 Novi Sad, Serbia; (E.G.); (A.S.); (T.A.); (D.P.); (L.S.); (N.M.); (D.S.P.); (S.B.); (J.P.); (V.M.); (M.J.K.)
- Special Hospital for Rheumatic Diseases, 21000 Novi Sad, Serbia
| | - Dusica Simic Panic
- Faculty of Medicine, University of Novi Sad, 21000 Novi Sad, Serbia; (E.G.); (A.S.); (T.A.); (D.P.); (L.S.); (N.M.); (D.S.P.); (S.B.); (J.P.); (V.M.); (M.J.K.)
- Center for Physical Medicine and Rehabilitation, University Clinical Center of Vojvodina, 21000 Novi Sad, Serbia;
| | - Slavko Budinski
- Faculty of Medicine, University of Novi Sad, 21000 Novi Sad, Serbia; (E.G.); (A.S.); (T.A.); (D.P.); (L.S.); (N.M.); (D.S.P.); (S.B.); (J.P.); (V.M.); (M.J.K.)
- Clinic for Vascular and Endovascular Surgery, University Clinical Center of Vojvodina, 21000 Novi Sad, Serbia
| | - Janko Pasternak
- Faculty of Medicine, University of Novi Sad, 21000 Novi Sad, Serbia; (E.G.); (A.S.); (T.A.); (D.P.); (L.S.); (N.M.); (D.S.P.); (S.B.); (J.P.); (V.M.); (M.J.K.)
- Clinic for Vascular and Endovascular Surgery, University Clinical Center of Vojvodina, 21000 Novi Sad, Serbia
| | - Vladimir Manojlovic
- Faculty of Medicine, University of Novi Sad, 21000 Novi Sad, Serbia; (E.G.); (A.S.); (T.A.); (D.P.); (L.S.); (N.M.); (D.S.P.); (S.B.); (J.P.); (V.M.); (M.J.K.)
- Clinic for Vascular and Endovascular Surgery, University Clinical Center of Vojvodina, 21000 Novi Sad, Serbia
| | - Milica Jeremic Knezevic
- Faculty of Medicine, University of Novi Sad, 21000 Novi Sad, Serbia; (E.G.); (A.S.); (T.A.); (D.P.); (L.S.); (N.M.); (D.S.P.); (S.B.); (J.P.); (V.M.); (M.J.K.)
| | - Mirna Kapetina Radovic
- Faculty of Technical Sciences, University of Novi Sad, 21000 Novi Sad, Serbia; (J.A.); (M.K.R.); (Z.J.)
| | - Zoran Jelicic
- Faculty of Technical Sciences, University of Novi Sad, 21000 Novi Sad, Serbia; (J.A.); (M.K.R.); (Z.J.)
| |
Collapse
|
14
|
Knecht S, Morandini P, Biehler-Gomez L, Nogueira L, Adalian P, Cattaneo C. Sex estimation from patellar measurements in a contemporary Italian population: a machine learning approach. Int J Legal Med 2024:10.1007/s00414-024-03359-0. [PMID: 39495285 DOI: 10.1007/s00414-024-03359-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2024] [Accepted: 10/23/2024] [Indexed: 11/05/2024]
Abstract
Biological sex estimation in forensic anthropology is a crucial topic, and the patella has shown promise in this regard due to its sexual dimorphism. This study uses 12 machine learning models for sex estimation based on three patellar measurements (maximum height, breadth, and thickness). Data was collected from 180 skeletons of a contemporary Italian population (83 males and 97 females) as well as from an independent sample of 21 forensic cases (13 males and 8 females). Statistical analyses indicated that each of the variables exhibited significant sexual dimorphism. To predict biological sex, the classifiers were built using 70% of a reference sample, then tested on the remaining 30% of the original sample and then tested again on the independent sample. The different classifiers generated accuracies varied between 0.85 and 0.91 on the reference sample and between 0.71 and 0.95 for the validation sample. SVM classifier stood out with the highest accuracy and seemed the best model for our study.This study contributes to the growing application of machine learning in forensic anthropology by being the first to apply such techniques to patellar measurements in an Italian population. It aims to enhance the accuracy and efficiency of biological sex estimation from the patella, building on promising results observed with other skeletal elements.
Collapse
Affiliation(s)
- Siam Knecht
- Aix Marseille Université, CNRS, EFS, ADES, Marseille, 13007, France
| | - Paolo Morandini
- LABANOF (Laboratorio di Antropologia e Odontologia Forense), Department of Biomedical Science for Health, University of Milan, Via Mangiagalli 37, Milan, 20133, Italy
| | - Lucie Biehler-Gomez
- LABANOF (Laboratorio di Antropologia e Odontologia Forense), Department of Biomedical Science for Health, University of Milan, Via Mangiagalli 37, Milan, 20133, Italy.
| | - Luisa Nogueira
- Faculté de Médecine, Institut Universitaire d'Anthropologie Médico-Légale, Université Côte d'Azur, 28 Avenue de Valombrose, Cedex 2 Nice, 06107, France
| | - Pascal Adalian
- Aix Marseille Université, CNRS, EFS, ADES, Marseille, 13007, France
| | - Cristina Cattaneo
- LABANOF (Laboratorio di Antropologia e Odontologia Forense), Department of Biomedical Science for Health, University of Milan, Via Mangiagalli 37, Milan, 20133, Italy
| |
Collapse
|
15
|
Lukovikov DA, Kolesnikova TO, Ikrin AN, Prokhorenko NO, Shevlyakov AD, Korotaev AA, Yang L, Bley V, de Abreu MS, Kalueff AV. A novel open-access artificial-intelligence-driven platform for CNS drug discovery utilizing adult zebrafish. J Neurosci Methods 2024; 411:110256. [PMID: 39182516 DOI: 10.1016/j.jneumeth.2024.110256] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2024] [Revised: 07/30/2024] [Accepted: 08/17/2024] [Indexed: 08/27/2024]
Abstract
BACKGROUND Although zebrafish are increasingly utilized in biomedicine for CNS disease modelling and drug discovery, this generates big data necessitating objective, precise and reproducible analyses. The artificial intelligence (AI) applications have empowered automated image recognition and video-tracking to ensure more efficient behavioral testing. NEW METHOD Capitalizing on several AI tools that most recently became available, here we present a novel open-access AI-driven platform to analyze tracks of adult zebrafish collected from in vivo neuropharmacological experiments. For this, we trained the AI system to distinguish zebrafish behavioral patterns following systemic treatment with several well-studied psychoactive drugs - nicotine, caffeine and ethanol. RESULTS Experiment 1 showed the ability of the AI system to distinguish nicotine and caffeine with 75 % and ethanol with 88 % probability and high (81 %) accuracy following a post-training exposure to these drugs. Experiment 2 further validated our system with additional, previously unexposed compounds (cholinergic arecoline and varenicline, and serotonergic fluoxetine), used as positive and negative controls, respectively. COMPARISON WITH EXISTING METHODS The present study introduces a novel open-access AI-driven approach to analyze locomotor activity of adult zebrafish. CONCLUSIONS Taken together, these findings support the value of custom-made AI tools for unlocking full potential of zebrafish CNS drug research by monitoring, processing and interpreting the results of in vivo experiments.
Collapse
Affiliation(s)
- Danil A Lukovikov
- Graduate Program in Bioinformatics and Genomics, Sirius University of Science and Technology, Sochi 354340, Russia; Neuroscience Department, Sirius University of Science and Technology, Sochi 354340, Russia
| | - Tatiana O Kolesnikova
- Neuroscience Department, Sirius University of Science and Technology, Sochi 354340, Russia
| | - Aleksey N Ikrin
- Graduate Program in Genetics and Genetic Technologies, Sirius University of Science and Technology, Sochi 354340, Russia; Neuroscience Department, Sirius University of Science and Technology, Sochi 354340, Russia
| | - Nikita O Prokhorenko
- Neuroscience Department, Sirius University of Science and Technology, Sochi 354340, Russia
| | - Anton D Shevlyakov
- Graduate Program in Bioinformatics and Genomics, Sirius University of Science and Technology, Sochi 354340, Russia; Neuroscience Department, Sirius University of Science and Technology, Sochi 354340, Russia
| | - Andrei A Korotaev
- Neuroscience Department, Sirius University of Science and Technology, Sochi 354340, Russia
| | - Longen Yang
- Department of Biological Sciences, School of Science, Xi'an Jiaotong-Liverpool University, Suzhou 215123, China; Suzhou Key Laboratory of Neurobiology and Cell Signaling, School of Science, Xi'an Jiaotong-Liverpool University, Suzhou 215123, China
| | - Vea Bley
- Department of Biological Sciences, School of Science, Xi'an Jiaotong-Liverpool University, Suzhou 215123, China; Biology Program, University of Florida, Gainesville, FL 32610, USA
| | - Murilo S de Abreu
- Graduate Program in Health Sciences, Federal University of Health Sciences of Porto Alegre, Porto Alegre, Brazil; Western Caspian University, Baku, Azerbaijan.
| | - Allan V Kalueff
- Neuroscience Department, Sirius University of Science and Technology, Sochi 354340, Russia; Department of Biological Sciences, School of Science, Xi'an Jiaotong-Liverpool University, Suzhou 215123, China; Suzhou Key Laboratory of Neurobiology and Cell Signaling, School of Science, Xi'an Jiaotong-Liverpool University, Suzhou 215123, China; Institute of Translational Biomedicine, St. Petersburg State University, St. Petersburg 199034, Russia; Institute of Experimental Medicine, Almazov National Medical Research Centre, Ministry of Healthcare of Russian Federation, St. Petersburg 194021, Russia.
| |
Collapse
|
16
|
Salari E, Wang J, Wynne JF, Chang C, Wu Y, Yang X. Artificial intelligence-based motion tracking in cancer radiotherapy: A review. J Appl Clin Med Phys 2024; 25:e14500. [PMID: 39194360 PMCID: PMC11540048 DOI: 10.1002/acm2.14500] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Revised: 07/13/2024] [Accepted: 07/27/2024] [Indexed: 08/29/2024] Open
Abstract
Radiotherapy aims to deliver a prescribed dose to the tumor while sparing neighboring organs at risk (OARs). Increasingly complex treatment techniques such as volumetric modulated arc therapy (VMAT), stereotactic radiosurgery (SRS), stereotactic body radiotherapy (SBRT), and proton therapy have been developed to deliver doses more precisely to the target. While such technologies have improved dose delivery, the implementation of intra-fraction motion management to verify tumor position at the time of treatment has become increasingly relevant. Artificial intelligence (AI) has recently demonstrated great potential for real-time tracking of tumors during treatment. However, AI-based motion management faces several challenges, including bias in training data, poor transparency, difficult data collection, complex workflows and quality assurance, and limited sample sizes. This review presents the AI algorithms used for chest, abdomen, and pelvic tumor motion management/tracking for radiotherapy and provides a literature summary on the topic. We will also discuss the limitations of these AI-based studies and propose potential improvements.
Collapse
Affiliation(s)
- Elahheh Salari
- Department of Radiation OncologyEmory UniversityAtlantaGeorgiaUSA
| | - Jing Wang
- Radiation OncologyIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
| | | | - Chih‐Wei Chang
- Department of Radiation OncologyEmory UniversityAtlantaGeorgiaUSA
| | - Yizhou Wu
- School of Electrical and Computer EngineeringGeorgia Institute of TechnologyAtlantaGeorgiaUSA
| | - Xiaofeng Yang
- Department of Radiation OncologyEmory UniversityAtlantaGeorgiaUSA
| |
Collapse
|
17
|
Delgado-Álvarez A, Hernández-Lorenzo L, Nielsen TR, Díez-Cirarda M, Cuevas C, Montero-Escribano P, Delgado-Alonso C, Valles-Salgado M, Gil-Moreno MJ, Matias-Guiu J, Matias-Guiu JA. European cross-cultural neuropsychological test battery (CNTB) for the assessment of cognitive impairment in multiple sclerosis: Cognitive phenotyping and classification supported by machine learning techniques. Mult Scler Relat Disord 2024; 91:105907. [PMID: 39366169 DOI: 10.1016/j.msard.2024.105907] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2023] [Revised: 07/06/2024] [Accepted: 09/24/2024] [Indexed: 10/06/2024]
Abstract
BACKGROUND The European Cross-Cultural Neuropsychological Test Battery (CNTB) has been proposed as a comprehensive battery for cognitive assessment, reducing the potential impact of cultural variables. In this validation study, we aimed to evaluate the diagnostic capacity of CNTB for the assessment of participants with multiple sclerosis (pwMS) compared to the Neuronorma battery (NN) according to the International Classification of Cognitive Disorders in MS criteria, and to develop machine learning (ML) algorithms to improve the diagnostic capacity of CNTB and to select the most relevant tests. METHODS Sixty pwMS and 60 healthy controls (HC) with no differences in sex, age, or years of education were enrolled. All participants completed the CNTB and pwMS were also examined with NN, depression, and fatigue scales. Impaired domains and cognitive phenotypes were defined following ICCoDiMS based on CNTB scores and compared to NN, according to -1SD and -1.5SD cutoff scores. To select the most relevant tests, random forest (RF) was performed for different binary classifications. RESULTS PwMS showed a lower performance compared to HC with medium-large effect sizes, in episodic memory, executive function, attention, and processing speed, in accordance with their characteristic cognitive profile. There were no differences in impaired domains or cognitive phenotypes between CNTB and NN, highlighting the role of episodic memory, executive function, attention, and processing speed tests. The most relevant tests identified by RF were consistent with inter-group comparisons and allowed a better classification than SD cutoff scores. CONCLUSION CNTB is a valid test for cognitive diagnosis in pwMS, including key tests for the most frequently impaired cognitive domains in MS. The use of ML techniques may also be useful to improve diagnosis, especially in some tests with lower sensitivity.
Collapse
Affiliation(s)
- Alfonso Delgado-Álvarez
- Department of Neurology, Hospital Clinico San Carlos, San Carlos Institute for Health Research (IdiSSC), Universidad Complutense, ES, Madrid 28040, Spain; Department of Biological and Health Psychology, Faculty of Psychology, Universidad Autónoma de Madrid, Madrid, Spain
| | - Laura Hernández-Lorenzo
- Department of Computer Architecture and Automation, Faculty of Informatics, Universidad Complutense, Madrid, Spain
| | - T Rune Nielsen
- Danish Dementia Research Centre, Department of Neurology, University of Copenhagen-Rigshospitalet, Copenhagen, Denmark
| | - María Díez-Cirarda
- Department of Neurology, Hospital Clinico San Carlos, San Carlos Institute for Health Research (IdiSSC), Universidad Complutense, ES, Madrid 28040, Spain
| | - Constanza Cuevas
- Department of Neurology, Hospital Clinico San Carlos, San Carlos Institute for Health Research (IdiSSC), Universidad Complutense, ES, Madrid 28040, Spain
| | - Paloma Montero-Escribano
- Department of Neurology, Hospital Clinico San Carlos, San Carlos Institute for Health Research (IdiSSC), Universidad Complutense, ES, Madrid 28040, Spain
| | - Cristina Delgado-Alonso
- Department of Neurology, Hospital Clinico San Carlos, San Carlos Institute for Health Research (IdiSSC), Universidad Complutense, ES, Madrid 28040, Spain
| | - María Valles-Salgado
- Department of Neurology, Hospital Clinico San Carlos, San Carlos Institute for Health Research (IdiSSC), Universidad Complutense, ES, Madrid 28040, Spain
| | - María José Gil-Moreno
- Department of Neurology, Hospital Clinico San Carlos, San Carlos Institute for Health Research (IdiSSC), Universidad Complutense, ES, Madrid 28040, Spain
| | - Jorge Matias-Guiu
- Department of Neurology, Hospital Clinico San Carlos, San Carlos Institute for Health Research (IdiSSC), Universidad Complutense, ES, Madrid 28040, Spain
| | - Jordi A Matias-Guiu
- Department of Neurology, Hospital Clinico San Carlos, San Carlos Institute for Health Research (IdiSSC), Universidad Complutense, ES, Madrid 28040, Spain.
| |
Collapse
|
18
|
Xin X, Wu S, Xu H, Ma Y, Bao N, Gao M, Han X, Gao S, Zhang S, Zhao X, Qi J, Zhang X, Tan J. Non-invasive prediction of human embryonic ploidy using artificial intelligence: a systematic review and meta-analysis. EClinicalMedicine 2024; 77:102897. [PMID: 39513188 PMCID: PMC11541425 DOI: 10.1016/j.eclinm.2024.102897] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/10/2024] [Revised: 10/06/2024] [Accepted: 10/07/2024] [Indexed: 11/15/2024] Open
Abstract
Background Embryonic ploidy is critical for the success of embryo transfer. Currently, preimplantation genetic testing for aneuploidy (PGT-A) is the gold standard for detecting ploidy abnormalities. However, PGT-A has several inherent limitations, including invasive biopsy, high economic burden, and ethical constraints. This paper provides the first comprehensive systematic review and meta-analysis of the performance of artificial intelligence (AI) algorithms using embryonic images for non-invasive prediction of embryonic ploidy. Methods Comprehensive searches of studies that developed or utilized AI algorithms to predict embryonic ploidy from embryonic imaging, published up until August 10, 2024, across PubMed, MEDLINE, Embase, IEEE, SCOPUS, Web of Science, and the Cochrane Central Register of Controlled Trials were performed. Studies with prospective or retrospective designs were included without language restrictions. The summary receiver operating characteristic curve, along with pooled sensitivity and specificity, was estimated using a bivariate random-effects model. The risk of bias and study quality were evaluated using the QUADAS-AI tool. Heterogeneity was quantified using the inconsistency index (I 2 ), derived from Cochran's Q test. Predefined subgroup analyses and bivariate meta-regression were conducted to explore potential sources of heterogeneity. This study was registered with PROSPERO (CRD42024500409). Findings Twenty eligible studies were identified, with twelve studies included in the meta-analysis. The pooled sensitivity, specificity, and area under the curve of AI for predicting embryonic euploidy were 0.71 (95% CI: 0.59-0.81), 0.75 (95% CI: 0.69-0.80), and 0.80 (95% CI: 0.76-0.83), respectively, based on a total of 6879 embryos (3110 euploid and 3769 aneuploid). Meta-regression and subgroup analyses identified the type of AI-driven decision support system, external validation, risk of bias, and year of publication as the primary contributors to the observed heterogeneity. There was no evidence of publication bias. Interpretation Our findings indicate that AI algorithms exhibit promising performance in predicting embryonic euploidy based on embryonic imaging. Although the current AI models developed cannot entirely replace invasive methods for determining embryo ploidy, AI demonstrates promise as an auxiliary decision-making tool for embryo selection, particularly for individuals who are unable to undergo PGT-A. To enhance the quality of future research, it is essential to overcome the specific challenges and limitations associated with AI studies in reproductive medicine. Funding This work was supported by the National Key R&D Program of China (2022YFC2702905), the Shengjing Freelance Researcher Plan of Shengjing Hospital and the 345 talent project of Shengjing Hospital.
Collapse
Affiliation(s)
- Xing Xin
- Centre of Reproductive Medicine, Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, No. 39 Huaxiang Road, Tiexi District, Shenyang 110022, China
- Key Laboratory of Reproductive Dysfunction Disease and Fertility Remodeling of Liaoning Province, No. 39 Huaxiang Road, Tiexi District, Shenyang 110022, China
| | - Shanshan Wu
- Centre of Reproductive Medicine, Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, No. 39 Huaxiang Road, Tiexi District, Shenyang 110022, China
- Key Laboratory of Reproductive Dysfunction Disease and Fertility Remodeling of Liaoning Province, No. 39 Huaxiang Road, Tiexi District, Shenyang 110022, China
| | - Heli Xu
- Department of Clinical Epidemiology, Shengjing Hospital of China Medical University, Shenyang 110022, China
| | - Yujiu Ma
- Centre of Reproductive Medicine, Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, No. 39 Huaxiang Road, Tiexi District, Shenyang 110022, China
- Key Laboratory of Reproductive Dysfunction Disease and Fertility Remodeling of Liaoning Province, No. 39 Huaxiang Road, Tiexi District, Shenyang 110022, China
| | - Nan Bao
- The College of Medicine and Biological Information Engineering, Northeastern University, Shenyang 110167, China
| | - Man Gao
- Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, No. 36 Sanhao Street, Heping District, Shenyang 110004, China
| | - Xue Han
- Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, No. 36 Sanhao Street, Heping District, Shenyang 110004, China
| | - Shan Gao
- Centre of Reproductive Medicine, Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, No. 39 Huaxiang Road, Tiexi District, Shenyang 110022, China
- Key Laboratory of Reproductive Dysfunction Disease and Fertility Remodeling of Liaoning Province, No. 39 Huaxiang Road, Tiexi District, Shenyang 110022, China
| | - Siwen Zhang
- Centre of Reproductive Medicine, Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, No. 39 Huaxiang Road, Tiexi District, Shenyang 110022, China
- Key Laboratory of Reproductive Dysfunction Disease and Fertility Remodeling of Liaoning Province, No. 39 Huaxiang Road, Tiexi District, Shenyang 110022, China
| | - Xinyang Zhao
- Centre of Reproductive Medicine, Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, No. 39 Huaxiang Road, Tiexi District, Shenyang 110022, China
- Key Laboratory of Reproductive Dysfunction Disease and Fertility Remodeling of Liaoning Province, No. 39 Huaxiang Road, Tiexi District, Shenyang 110022, China
| | - Jiarui Qi
- Centre of Reproductive Medicine, Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, No. 39 Huaxiang Road, Tiexi District, Shenyang 110022, China
- Key Laboratory of Reproductive Dysfunction Disease and Fertility Remodeling of Liaoning Province, No. 39 Huaxiang Road, Tiexi District, Shenyang 110022, China
| | - Xudong Zhang
- Centre of Reproductive Medicine, Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, No. 39 Huaxiang Road, Tiexi District, Shenyang 110022, China
- Key Laboratory of Reproductive Dysfunction Disease and Fertility Remodeling of Liaoning Province, No. 39 Huaxiang Road, Tiexi District, Shenyang 110022, China
| | - Jichun Tan
- Centre of Reproductive Medicine, Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, No. 39 Huaxiang Road, Tiexi District, Shenyang 110022, China
- Key Laboratory of Reproductive Dysfunction Disease and Fertility Remodeling of Liaoning Province, No. 39 Huaxiang Road, Tiexi District, Shenyang 110022, China
| |
Collapse
|
19
|
Sourlos N, Vliegenthart R, Santinha J, Klontzas ME, Cuocolo R, Huisman M, van Ooijen P. Recommendations for the creation of benchmark datasets for reproducible artificial intelligence in radiology. Insights Imaging 2024; 15:248. [PMID: 39400639 PMCID: PMC11473745 DOI: 10.1186/s13244-024-01833-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2024] [Accepted: 09/20/2024] [Indexed: 10/15/2024] Open
Abstract
Various healthcare domains have witnessed successful preliminary implementation of artificial intelligence (AI) solutions, including radiology, though limited generalizability hinders their widespread adoption. Currently, most research groups and industry have limited access to the data needed for external validation studies. The creation and accessibility of benchmark datasets to validate such solutions represents a critical step towards generalizability, for which an array of aspects ranging from preprocessing to regulatory issues and biostatistical principles come into play. In this article, the authors provide recommendations for the creation of benchmark datasets in radiology, explain current limitations in this realm, and explore potential new approaches. CLINICAL RELEVANCE STATEMENT: Benchmark datasets, facilitating validation of AI software performance can contribute to the adoption of AI in clinical practice. KEY POINTS: Benchmark datasets are essential for the validation of AI software performance. Factors like image quality and representativeness of cases should be considered. Benchmark datasets can help adoption by increasing the trustworthiness and robustness of AI.
Collapse
Affiliation(s)
- Nikos Sourlos
- Department of Radiology, University Medical Center of Groningen, Groningen, The Netherlands
- DataScience Center in Health, University Medical Center Groningen, Groningen, The Netherlands
| | - Rozemarijn Vliegenthart
- Department of Radiology, University Medical Center of Groningen, Groningen, The Netherlands
- DataScience Center in Health, University Medical Center Groningen, Groningen, The Netherlands
| | - Joao Santinha
- Digital Surgery LAB, Champalimaud Foundation, Champalimaud Clinical Centre, Lisbon, Portugal
| | - Michail E Klontzas
- Department of Medical Imaging, University Hospital of Heraklion, Heraklion, Greece
- Department of Radiology, School of Medicine, University of Crete, Heraklion, Greece
| | - Renato Cuocolo
- Department of Medicine, Surgery, and Dentistry, University of Salerno, Baronissi, Italy
| | - Merel Huisman
- Department of Radiology and Nuclear Medicine, Radboud University Medical Center, Nijmegen, The Netherlands
| | - Peter van Ooijen
- DataScience Center in Health, University Medical Center Groningen, Groningen, The Netherlands.
- Department of Radiation Oncology, University Medical Center Groningen, Groningen, The Netherlands.
| |
Collapse
|
20
|
Rivera KP, Whang K, Joshi K, Son H, Kim YS, Flores M. Dental Composite Performance Prediction Using Artificial Intelligence. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.10.08.24314998. [PMID: 39417148 PMCID: PMC11482990 DOI: 10.1101/2024.10.08.24314998] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 10/19/2024]
Abstract
Objective There is a need to increase the performance and longevity of dental composites and accelerate the translation of novel composites to the market. This study explores artificial intelligence (AI), specifically machine learning (ML), to predict the performance outcomes (POs) of dental composites from their composite attributes (CAs). Methods An extensive dataset from over 200 publications was built and refined to 233 samples with 17 CAs and 7 POs. Nine ML models were evaluated for PO prediction performance using classified data, and Five ML models were evaluated for PO regression analysis. Results The KNN model excelled in predicting flexural modulus (FlexMod), Decision Tree model in flexural strength (FlexStr) and volumetric shrinkage (ShrinkV), and Logistic Regression and SVM models in shrinkage stress (ShrinkStr). Receiver operating characteristic area under the curve (ROC AUC) analysis confirmed these results but found that Random Forest was more effective for FlexStr and ShrinkV, suggesting the possibility of Decision Tree overfitting the data. Regression analysis revealed that the Voting Regressor was superior for FlexMod and ShrinkV predictions, while Decision Tree Regression was optimal for FlexStr and ShrinkStr. Feature importance analysis indicated TEGDMA is a key contributor to FlexMod and ShrinkV, BisGMA and UDMA to FlexStr, and depth of cure, degree of monomer-to-polymer conversion, and filler loading to ShrinkStr. Significance There is a need to conduct a full analysis using multiple ML models because different models predict different POs better, and for a large, comprehensive dataset to train robust AI models to facilitate the prediction and optimization of composite properties and support the development of new dental materials.
Collapse
Affiliation(s)
- Karla Paniagua Rivera
- Department of Electrical and Computer Engineering, the University of Texas at San Antonio, San Antonio, TX, 78249, USA
| | - Kyumin Whang
- Department of Comprehensive Dentistry, the University of Texas Health Science Center at San Antonio, San Antonio, TX 78229, USA
| | - Krishna Joshi
- Department of Comprehensive Dentistry, the University of Texas Health Science Center at San Antonio, San Antonio, TX 78229, USA
| | - Hyeonwi Son
- Department of Oral & Maxillofacial Surgery, School of Dentistry
| | - Yu Shin Kim
- Department of Oral & Maxillofacial Surgery, School of Dentistry
- Programs in Integrated Biomedical Sciences, Translational Sciences, Biomedical Engineering, Radiological Sciences, University of Texas Health Science Center at San Antonio, San Antonio, TX 78229, USA
| | - Mario Flores
- Department of Electrical and Computer Engineering, the University of Texas at San Antonio, San Antonio, TX, 78249, USA
| |
Collapse
|
21
|
Shu CH, Zebda R, Espinosa C, Reiss J, Debuyserie A, Reber K, Aghaeepour N, Pammi M. Early prediction of mortality and morbidities in VLBW preterm neonates using machine learning. Pediatr Res 2024:10.1038/s41390-024-03604-7. [PMID: 39379627 DOI: 10.1038/s41390-024-03604-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/16/2024] [Revised: 09/05/2024] [Accepted: 09/12/2024] [Indexed: 10/10/2024]
Abstract
BACKGROUND Predicting mortality and specific morbidities before they occur may allow for interventions that may improve health trajectories. HYPOTHESIS Integrating key maternal and postnatal infant variables in the first 2 weeks of age into machine learning (ML) algorithms will reliably predict survival and specific morbidities in VLBW preterm infants. METHODS ML algorithms were developed to integrate 47 features for predicting mortality, bronchopulmonary dysplasia (BPD), neonatal sepsis, necrotizing enterocolitis (NEC), intraventricular hemorrhage (IVH), cystic periventricular leukomalacia (PVL), and retinopathy of prematurity (ROP). A retrospective cohort (n = 3341) was used to train and validate the models with a repeated 10-fold cross-validation strategy. These models were then tested on a separate cohort (n = 447) to evaluate the final model performance. RESULTS Among the seven ML algorithms employed, tree-based ensemble models, specifically Random Forest (RF) and XGBoost, had the best performance metrics. The area under the receiver operating characteristic curve (AUROC) of sepsis with or without meningitis (0.73), NEC (0.73), BPD (0.71), and mortality (0.74) exceeded 0.7, while the area under Precision-Recall curve (AUPRC) for all outcomes was greater than the prevalence, demonstrating effective risk stratification in VLBW preterm infants. CONCLUSIONS Our study demonstrates the potential of predictive analytics leveraging ML techniques in advancing precision medicine. IMPACT Reliable prediction of adverse outcomes before they occur has the potential to institute interventions and possibly improve health trajectories in VLBW preterm infants. We used machine learning to develop and test predictive models for mortality and five major morbidities in VLBW preterm infants. Individualized prediction of outcomes and individualized interventions will advance Precision Medicine in Neonatology.
Collapse
Affiliation(s)
- Chi-Hung Shu
- Department of Anesthesiology, Pain, and Perioperative Medicine, Stanford University School of Medicine, Stanford, CA, USA
| | - Rema Zebda
- Department of Pediatrics and Neonatology, Texas Children's Hospital, Baylor College of Medicine, Houston, TX, USA
| | - Camilo Espinosa
- Department of Anesthesiology, Pain, and Perioperative Medicine, Stanford University School of Medicine, Stanford, CA, USA
| | - Jonathan Reiss
- Department of Pediatrics, Stanford University School of Medicine, Stanford, CA, USA
| | - Anne Debuyserie
- Department of Pediatrics and Neonatology, Texas Children's Hospital, Baylor College of Medicine, Houston, TX, USA
| | - Kristina Reber
- Department of Pediatrics and Neonatology, Texas Children's Hospital, Baylor College of Medicine, Houston, TX, USA
| | - Nima Aghaeepour
- Department of Anesthesiology, Pain, and Perioperative Medicine, Stanford University School of Medicine, Stanford, CA, USA
- Department of Pediatrics, Stanford University School of Medicine, Stanford, CA, USA
- Department of Biomedical Data Science, Stanford University School of Medicine, Stanford, CA, USA
| | - Mohan Pammi
- Department of Pediatrics and Neonatology, Texas Children's Hospital, Baylor College of Medicine, Houston, TX, USA.
| |
Collapse
|
22
|
Xu D, Xu Z. Machine learning applications in preventive healthcare: A systematic literature review on predictive analytics of disease comorbidity from multiple perspectives. Artif Intell Med 2024; 156:102950. [PMID: 39163727 DOI: 10.1016/j.artmed.2024.102950] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2023] [Revised: 06/17/2024] [Accepted: 08/13/2024] [Indexed: 08/22/2024]
Abstract
Artificial intelligence is constantly revolutionizing biomedical research and healthcare management. Disease comorbidity is a major threat to the quality of life for susceptible groups, especially middle-aged and elderly patients. The presence of multiple chronic diseases makes precision diagnosis challenging to realize and imposes a heavy burden on the healthcare system and economy. Given an enormous amount of accumulated health data, machine learning techniques show their capability in handling this puzzle. The present study conducts a review to uncover current research efforts in applying these methods to understanding comorbidity mechanisms and making clinical predictions considering these complex patterns. A descriptive metadata analysis of 791 unique publications aims to capture the overall research progression between January 2012 and June 2023. To delve into comorbidity-focused research, 61 of these scientific papers are systematically assessed. Four predictive analytics of tasks are detected: disease comorbidity data extraction, clustering, network, and risk prediction. It is observed that some machine learning-driven applications address inherent data deficiencies in healthcare datasets and provide a model interpretation that identifies significant risk factors of comorbidity development. Based on insights, both technical and practical, gained from relevant literature, this study intends to guide future interests in comorbidity research and draw conclusions about chronic disease prevention and diagnosis with managerial implications.
Collapse
Affiliation(s)
- Duo Xu
- School of Economics and Management, Southeast University, Nanjing 211189, China.
| | - Zeshui Xu
- School of Economics and Management, Southeast University, Nanjing 211189, China; Business School, Sichuan University, Chengdu 610064, China.
| |
Collapse
|
23
|
Ancona RM, Cooper BP, Foraker R, Kaser T, Adeoye O, Mueller KL. Machine learning classification of new firearm injury encounters in the St Louis region: 2010-2020. J Am Med Inform Assoc 2024; 31:2165-2172. [PMID: 38976592 DOI: 10.1093/jamia/ocae173] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2024] [Revised: 06/20/2024] [Accepted: 06/25/2024] [Indexed: 07/10/2024] Open
Abstract
OBJECTIVES To improve firearm injury encounter classification (new vs follow-up) using machine learning (ML) and compare our ML model to other common approaches. MATERIALS AND METHODS This retrospective study used data from the St Louis region-wide hospital-based violence intervention program data repository (2010-2020). We randomly selected 500 patients with a firearm injury diagnosis for inclusion, with 808 total firearm injury encounters split (70/30) for training and testing. We trained a least absolute shrinkage and selection operator (LASSO) regression model with the following predictors: admission type, time between firearm injury visits, number of prior firearm injury emergency department (ED) visits, encounter type (ED or other), and diagnostic codes. Our gold standard for new firearm injury encounter classification was manual chart review. We then used our test data to compare the performance of our ML model to other commonly used approaches (proxy measures of ED visits and time between firearm injury encounters, and diagnostic code encounter type designation [initial vs subsequent or sequela]). Performance metrics included area under the curve (AUC), sensitivity, and specificity with 95% confidence intervals (CIs). RESULTS The ML model had excellent discrimination (0.92, 0.88-0.96) with high sensitivity (0.95, 0.90-0.98) and specificity (0.89, 0.81-0.95). AUC was significantly higher than time-based outcomes, sensitivity was slightly (but not significantly) lower than other approaches, and specificity was higher than all other methods. DISCUSSION ML successfully delineated new firearm injury encounters, outperforming other approaches in ruling out encounters for follow-up. CONCLUSION ML can be used to identify new firearm injury encounters and may be particularly useful in studies assessing re-injuries.
Collapse
Affiliation(s)
- Rachel M Ancona
- Department of Emergency Medicine, Washington University in St Louis, St Louis, MO 63110, United States
| | - Benjamin P Cooper
- Institute for Public Health, Washington University in St Louis, St Louis, MO 63110, United States
| | - Randi Foraker
- Department of Medicine, Washington University in St Louis, St Louis, MO 63110, United States
| | - Taylor Kaser
- Department of Emergency Medicine, Washington University in St Louis, St Louis, MO 63110, United States
| | - Opeolu Adeoye
- Department of Emergency Medicine, Washington University in St Louis, St Louis, MO 63110, United States
| | - Kristen L Mueller
- Department of Emergency Medicine, Washington University in St Louis, St Louis, MO 63110, United States
| |
Collapse
|
24
|
Li X, Wen X, Shang X, Liu J, Zhang L, Cui Y, Luo X, Zhang G, Xie J, Huang T, Chen Z, Lyu Z, Wu X, Lan Y, Meng Q. Identification of diabetic retinopathy classification using machine learning algorithms on clinical data and optical coherence tomography angiography. Eye (Lond) 2024; 38:2813-2821. [PMID: 38871934 PMCID: PMC11427469 DOI: 10.1038/s41433-024-03173-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2023] [Revised: 04/10/2024] [Accepted: 06/06/2024] [Indexed: 06/15/2024] Open
Abstract
BACKGROUND To apply machine learning (ML) algorithms to perform multiclass diabetic retinopathy (DR) classification using both clinical data and optical coherence tomography angiography (OCTA). METHODS In this cross-sectional observational study, clinical data and OCTA parameters from 203 diabetic patients (203 eye) were used to establish the ML models, and those from 169 diabetic patients (169 eye) were used for independent external validation. The random forest, gradient boosting machine (GBM), deep learning and logistic regression algorithms were used to identify the presence of DR, referable DR (RDR) and vision-threatening DR (VTDR). Four different variable patterns based on clinical data and OCTA variables were examined. The algorithms' performance were evaluated using receiver operating characteristic curves and the area under the curve (AUC) was used to assess predictive accuracy. RESULTS The random forest algorithm on OCTA+clinical data-based variables and OCTA+non-laboratory factor-based variables provided the higher AUC values for DR, RDR and VTDR. The GBM algorithm produced similar results, albeit with slightly lower AUC values. Leading predictors of DR status included vessel density, retinal thickness and GCC thickness, as well as the body mass index, waist-to-hip ratio and glucose-lowering treatment. CONCLUSIONS ML-based multiclass DR classification using OCTA and clinical data can provide reliable assistance for screening, referral, and management DR populations.
Collapse
Affiliation(s)
- Xiaoli Li
- Department of Ophthalmology, Guangdong Eye Institute, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, China
| | - Xin Wen
- Department of Ophthalmology, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, Guangzhou, China
| | - Xianwen Shang
- Department of Ophthalmology, Guangdong Eye Institute, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, China
| | - Junbin Liu
- Department of Ophthalmology, Guangdong Eye Institute, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, China
| | - Liang Zhang
- Department of Ophthalmology, Guangdong Eye Institute, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, China
| | - Ying Cui
- Department of Ophthalmology, Guangdong Eye Institute, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, China
| | - Xiaoyang Luo
- Department of Ophthalmology, Guangdong Eye Institute, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, China
| | - Guanrong Zhang
- Statistics Section, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, Guangdong, China
| | - Jie Xie
- Department of Ophthalmology, Heyuan People's Hospital, Heyuan, China
| | - Tian Huang
- Department of Ophthalmology, Zhujiang Hospital, Southern Medical University, Guangzhou, China
| | - Zhifan Chen
- Department of Ophthalmology, The Fourth Affiliated Hospital of Guangzhou Medical University, Guangzhou, China
| | - Zheng Lyu
- Department of Ophthalmology, Guangdong Eye Institute, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, China
| | - Xiyu Wu
- Department of Ophthalmology, Guangdong Eye Institute, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, China
| | - Yuqing Lan
- Department of Ophthalmology, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, Guangzhou, China.
| | - Qianli Meng
- Department of Ophthalmology, Guangdong Eye Institute, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, China.
| |
Collapse
|
25
|
Chen D, Cao C, Kloosterman R, Parsa R, Raman S. Trial Factors Associated With Completion of Clinical Trials Evaluating AI: Retrospective Case-Control Study. J Med Internet Res 2024; 26:e58578. [PMID: 39312296 PMCID: PMC11459098 DOI: 10.2196/58578] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2024] [Revised: 05/02/2024] [Accepted: 07/11/2024] [Indexed: 09/25/2024] Open
Abstract
BACKGROUND Evaluation of artificial intelligence (AI) tools in clinical trials remains the gold standard for translation into clinical settings. However, design factors associated with successful trial completion and the common reasons for trial failure are unknown. OBJECTIVE This study aims to compare trial design factors of complete and incomplete clinical trials testing AI tools. We conducted a case-control study of complete (n=485) and incomplete (n=51) clinical trials that evaluated AI as an intervention of ClinicalTrials.gov. METHODS Trial design factors, including area of clinical application, intended use population, and intended role of AI, were extracted. Trials that did not evaluate AI as an intervention and active trials were excluded. The assessed trial design factors related to AI interventions included the domain of clinical application related to organ systems; intended use population for patients or health care providers; and the role of AI for different applications in patient-facing clinical workflows, such as diagnosis, screening, and treatment. In addition, we also assessed general trial design factors including study type, allocation, intervention model, masking, age, sex, funder, continent, length of time, sample size, number of enrollment sites, and study start year. The main outcome was the completion of the clinical trial. Odds ratio (OR) and 95% CI values were calculated for all trial design factors using propensity-matched, multivariable logistic regression. RESULTS We queried ClinicalTrials.gov on December 23, 2023, using AI keywords to identify complete and incomplete trials testing AI technologies as a primary intervention, yielding 485 complete and 51 incomplete trials for inclusion in this study. Our nested propensity-matched, case-control results suggest that trials conducted in Europe were significantly associated with trial completion when compared with North American trials (OR 2.85, 95% CI 1.14-7.10; P=.03), and the trial sample size was positively associated with trial completion (OR 1.00, 95% CI 1.00-1.00; P=.02). CONCLUSIONS Our case-control study is one of the first to identify trial design factors associated with completion of AI trials and catalog study-reported reasons for AI trial failure. We observed that trial design factors positively associated with trial completion include trials conducted in Europe and sample size. Given the promising clinical use of AI tools in health care, our results suggest that future translational research should prioritize addressing the design factors of AI clinical trials associated with trial incompletion and common reasons for study failure.
Collapse
Affiliation(s)
- David Chen
- Temerty Faculty of Medicine, University of Toronto, Toronto, ON, Canada
| | - Christian Cao
- Temerty Faculty of Medicine, University of Toronto, Toronto, ON, Canada
| | | | - Rod Parsa
- Michael G. DeGroote School of Medicine, McMaster University, Hamilton, ON, Canada
| | - Srinivas Raman
- Department of Radiation Oncology, University of Toronto, Toronto, ON, Canada
| |
Collapse
|
26
|
Mehrbakhsh Z, Hassanzadeh R, Behnampour N, Tapak L, Zarrin Z, Khazaei S, Dinu I. Machine learning-based evaluation of prognostic factors for mortality and relapse in patients with acute lymphoblastic leukemia: a comparative simulation study. BMC Med Inform Decis Mak 2024; 24:261. [PMID: 39285373 PMCID: PMC11404043 DOI: 10.1186/s12911-024-02645-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2024] [Accepted: 08/21/2024] [Indexed: 09/22/2024] Open
Abstract
BACKGROUND Predicting mortality and relapse in children with acute lymphoblastic leukemia (ALL) is crucial for effective treatment and follow-up management. ALL is a common and deadly childhood cancer that often relapses after remission. In this study, we aimed to apply and evaluate machine learning-based models for predicting mortality and relapse in pediatric ALL patients. METHODS This retrospective cohort study was conducted on 161 children aged less than 16 years with ALL. Survival status (dead/alive) and patient experience of relapse (yes/no) were considered as the outcome variables. Ten machine learning (ML) algorithms were used to predict mortality and relapse. The performance of the algorithms was evaluated by cross-validation and reported as mean sensitivity, specificity, accuracy and area under the curve (AUC). Finally, prognostic factors were identified based on the best algorithms. RESULTS The mean accuracy of the ML algorithms for prediction of patient mortality ranged from 64 to 74% and for prediction of relapse, it varied from 64 to 84% on test data sets. The mean AUC of the ML algorithms for mortality and relapse was above 64%. The most important prognostic factors for predicting both mortality and relapse were identified as age at diagnosis, hemoglobin and platelets. In addition, significant prognostic factors for predicting mortality included clinical side effects such as splenomegaly, hepatomegaly and lymphadenopathy. CONCLUSIONS Our results showed that artificial neural networks and bagging algorithms outperformed other algorithms in predicting mortality, while boosting and random forest algorithms excelled in predicting relapse in ALL patients across all criteria. These results offer significant clinical insights into the prognostic factors for children with ALL, which can inform treatment decisions and improve patient outcomes.
Collapse
Affiliation(s)
- Zahra Mehrbakhsh
- Department of Biostatistics, School of Public Health, Hamadan University of Medical Sciences, Hamadan, Iran
- Student Research Committee, Hamadan University of Medical Sciences, Hamadan, Iran
| | - Roghayyeh Hassanzadeh
- Department of Biostatistics, School of Public Health, Hamadan University of Medical Sciences, Hamadan, Iran
- Student Research Committee, Hamadan University of Medical Sciences, Hamadan, Iran
| | - Nasser Behnampour
- Department of Biostatistics and Epidemiology, School of Health, Golestan University of Medical Sciences, Gorgan, Iran
| | - Leili Tapak
- Department of Biostatistics, School of Public Health, Hamadan University of Medical Sciences, Hamadan, Iran.
- Modeling of Noncommunicable Diseases Research Center, Hamadan University of Medical Sciences, Hamadan, Iran.
| | - Ziba Zarrin
- Department of Photogrammetry and Remote Sensing, K.N. Toosi University of Technology, Tehran, Iran
| | - Salman Khazaei
- Health Sciences Research Center, Health Sciences & Technology Research Institute, Hamadan University of Medical Science, Hamadan, Iran
| | - Irina Dinu
- School of Public Health, University of Alberta, Edmonton, Canada
| |
Collapse
|
27
|
Fiz F, Rossi N, Langella S, Conci S, Serenari M, Ardito F, Cucchetti A, Gallo T, Zamboni GA, Mosconi C, Boldrini L, Mirarchi M, Cirillo S, Ruzzenente A, Pecorella I, Russolillo N, Borzi M, Vara G, Mele C, Ercolani G, Giuliante F, Cescon M, Guglielmi A, Ferrero A, Sollini M, Chiti A, Torzilli G, Ieva F, Viganò L. Radiomics of Intrahepatic Cholangiocarcinoma and Peritumoral Tissue Predicts Postoperative Survival: Development of a CT-Based Clinical-Radiomic Model. Ann Surg Oncol 2024; 31:5604-5614. [PMID: 38797789 DOI: 10.1245/s10434-024-15457-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2024] [Accepted: 04/28/2024] [Indexed: 05/29/2024]
Abstract
BACKGROUND For many tumors, radiomics provided a relevant prognostic contribution. This study tested whether the computed tomography (CT)-based textural features of intrahepatic cholangiocarcinoma (ICC) and peritumoral tissue improve the prediction of survival after resection compared with the standard clinical indices. METHODS All consecutive patients affected by ICC who underwent hepatectomy at six high-volume centers (2009-2019) were considered for the study. The arterial and portal phases of CT performed fewer than 60 days before surgery were analyzed. A manual segmentation of the tumor was performed (Tumor-VOI). A 5-mm volume expansion then was applied to identify the peritumoral tissue (Margin-VOI). RESULTS The study enrolled 215 patients. After a median follow-up period of 28 months, the overall survival (OS) rate was 57.0%, and the progression-free survival (PFS) rate was 34.9% at 3 years. The clinical predictive model of OS had a C-index of 0.681. The addition of radiomic features led to a progressive improvement of performances (C-index of 0.71, including the portal Tumor-VOI, C-index of 0.752 including the portal Tumor- and Margin-VOI, C-index of 0.764, including all VOIs of the portal and arterial phases). The latter model combined clinical variables (CA19-9 and tumor pattern), tumor indices (density, homogeneity), margin data (kurtosis, compacity, shape), and GLRLM indices. The model had performance equivalent to that of the postoperative clinical model including the pathology data (C-index of 0.765). The same results were observed for PFS. CONCLUSIONS The radiomics of ICC and peritumoral tissue extracted from preoperative CT improves the prediction of survival. Both the portal and arterial phases should be considered. Radiomic and clinical data are complementary and achieve a preoperative estimation of prognosis equivalent to that achieved in the postoperative setting.
Collapse
Affiliation(s)
- Francesco Fiz
- Nuclear Medicine Unit, Department of Diagnostic Imaging, Ente Ospedaliero "Ospedali Galliera", Genoa, Italy
- Department of Nuclear Medicine and Clinical Molecular Imaging, University Hospital, Tübingen, Germany
| | - Noemi Rossi
- MOX Laboratory, Department of Mathematics, Politecnico di Milano, Milan, Italy
| | - Serena Langella
- Department of Digestive and Hepatobiliary Surgery, Mauriziano Umberto I Hospital, Turin, Italy
| | - Simone Conci
- Division of General and Hepatobiliary Surgery, Department of Surgical Sciences, Dentistry, Gynaecology and Pediatrics, University Hospital G.B. Rossi, University of Verona, Verona, Italy
| | - Matteo Serenari
- General Surgery and Transplant Unit, IRCCS, Azienda Ospedaliero-Universitaria di Bologna, Sant'Orsola-Malpighi Hospital, Bologna, Italy
- Department of Medical and Surgical Sciences, Alma Mater Studiorum, University of Bologna, Bologna, Italy
| | - Francesco Ardito
- Hepatobiliary Surgery Unit, A. Gemelli Hospital, Università Cattolica del Sacro Cuore, Rome, Italy
| | - Alessandro Cucchetti
- Department of Medical and Surgical Sciences, Alma Mater Studiorum, University of Bologna, Bologna, Italy
- Department of General Surgery, Morgagni-Pierantoni Hospital, Forlì, Italy
| | - Teresa Gallo
- Department of Radiology, Mauriziano Umberto I Hospital, Turin, Italy
| | - Giulia A Zamboni
- Department of Radiology, University Hospital G.B. Rossi, University of Verona, Verona, Italy
| | - Cristina Mosconi
- Department of Radiology, IRCCS, Azienda Ospedaliero-Universitaria di Bologna, Sant'Orsola-Malpighi Hospital, Bologna, Italy
| | - Luca Boldrini
- Department of Radiology, Radiation Oncology and Hematology, Fondazione Policlinico Universitario "A. Gemelli" IRCCS, Rome, Italy
| | | | - Stefano Cirillo
- Department of Radiology, Mauriziano Umberto I Hospital, Turin, Italy
| | - Andrea Ruzzenente
- Division of General and Hepatobiliary Surgery, Department of Surgical Sciences, Dentistry, Gynaecology and Pediatrics, University Hospital G.B. Rossi, University of Verona, Verona, Italy
| | - Ilaria Pecorella
- Department of Biomedical Sciences, Humanitas University, Pieve Emanuele, Milan, Italy
| | - Nadia Russolillo
- Department of Digestive and Hepatobiliary Surgery, Mauriziano Umberto I Hospital, Turin, Italy
| | - Martina Borzi
- Department of Radiology, University Hospital G.B. Rossi, University of Verona, Verona, Italy
| | - Giulio Vara
- Department of Radiology, IRCCS, Azienda Ospedaliero-Universitaria di Bologna, Sant'Orsola-Malpighi Hospital, Bologna, Italy
| | - Caterina Mele
- Hepatobiliary Surgery Unit, A. Gemelli Hospital, Università Cattolica del Sacro Cuore, Rome, Italy
| | - Giorgio Ercolani
- Department of Medical and Surgical Sciences, Alma Mater Studiorum, University of Bologna, Bologna, Italy
- Department of General Surgery, Morgagni-Pierantoni Hospital, Forlì, Italy
| | - Felice Giuliante
- Hepatobiliary Surgery Unit, A. Gemelli Hospital, Università Cattolica del Sacro Cuore, Rome, Italy
| | - Matteo Cescon
- General Surgery and Transplant Unit, IRCCS, Azienda Ospedaliero-Universitaria di Bologna, Sant'Orsola-Malpighi Hospital, Bologna, Italy
- Department of Medical and Surgical Sciences, Alma Mater Studiorum, University of Bologna, Bologna, Italy
| | - Alfredo Guglielmi
- Division of General and Hepatobiliary Surgery, Department of Surgical Sciences, Dentistry, Gynaecology and Pediatrics, University Hospital G.B. Rossi, University of Verona, Verona, Italy
| | - Alessandro Ferrero
- Department of Digestive and Hepatobiliary Surgery, Mauriziano Umberto I Hospital, Turin, Italy
| | - Martina Sollini
- Department of Nuclear Medicine, IRCCS San Raffaele, Milan, Italy
- Faculty of Medicine, Università Vita-Salute San Raffaele, Milan, Italy
| | - Arturo Chiti
- Department of Nuclear Medicine, IRCCS San Raffaele, Milan, Italy
- Faculty of Medicine, Università Vita-Salute San Raffaele, Milan, Italy
| | - Guido Torzilli
- Department of Biomedical Sciences, Humanitas University, Pieve Emanuele, Milan, Italy
- Division of Hepatobiliary and General Surgery, Department of Surgery, IRCCS Humanitas Research Hospital, Rozzano, Milan, Italy
| | - Francesca Ieva
- MOX Laboratory, Department of Mathematics, Politecnico di Milano, Milan, Italy
- CHDS - Center for Health Data Science, Human Technopole, Milan, Italy
| | - Luca Viganò
- Department of Biomedical Sciences, Humanitas University, Pieve Emanuele, Milan, Italy.
- Hepatobiliary Unit, Department of Minimally Invasive General and Oncologic Surgery, Humanitas Gavazzeni University Hospital, Bergamo, Italy.
| |
Collapse
|
28
|
Stahl D. New horizons in prediction modelling using machine learning in older people's healthcare research. Age Ageing 2024; 53:afae201. [PMID: 39311424 PMCID: PMC11417961 DOI: 10.1093/ageing/afae201] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2023] [Revised: 06/26/2024] [Indexed: 09/26/2024] Open
Abstract
Machine learning (ML) and prediction modelling have become increasingly influential in healthcare, providing critical insights and supporting clinical decisions, particularly in the age of big data. This paper serves as an introductory guide for health researchers and readers interested in prediction modelling and explores how these technologies support clinical decisions, particularly with big data, and covers all aspects of the development, assessment and reporting of a model using ML. The paper starts with the importance of prediction modelling for precision medicine. It outlines different types of prediction and machine learning approaches, including supervised, unsupervised and semi-supervised learning, and provides an overview of popular algorithms for various outcomes and settings. It also introduces key theoretical ML concepts. The importance of data quality, preprocessing and unbiased model performance evaluation is highlighted. Concepts of apparent, internal and external validation will be introduced along with metrics for discrimination and calibration for different types of outcomes. Additionally, the paper addresses model interpretation, fairness and implementation in clinical practice. Finally, the paper provides recommendations for reporting and identifies common pitfalls in prediction modelling and machine learning. The aim of the paper is to help readers understand and critically evaluate research papers that present ML models and to serve as a first guide for developing, assessing and implementing their own.
Collapse
Affiliation(s)
- Daniel Stahl
- Department of Biostatistics and Health Informatics, Institute of Psychiatry, Psychology & Neuroscience, King’s College London, London, UK
| |
Collapse
|
29
|
Park SH, Song SH, Burton F, Arsan C, Jobst B, Feldman M. Machine learning characterization of a rare neurologic disease via electronic health records: a proof-of-principle study on stiff person syndrome. BMC Neurol 2024; 24:272. [PMID: 39097681 PMCID: PMC11297611 DOI: 10.1186/s12883-024-03760-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2024] [Accepted: 07/12/2024] [Indexed: 08/05/2024] Open
Abstract
BACKGROUND Despite the frequent diagnostic delays of rare neurologic diseases (RND), it remains difficult to study RNDs and their comorbidities due to their rarity and hence the statistical underpowering. Affecting one to two in a million annually, stiff person syndrome (SPS) is an RND characterized by painful muscle spasms and rigidity. Leveraging underutilized electronic health records (EHR), this study showcased a machine-learning-based framework to identify clinical features that optimally characterize the diagnosis of SPS. METHODS A machine-learning-based feature selection approach was employed on 319 items from the past medical histories of 48 individuals (23 with a diagnosis of SPS and 25 controls) with elevated serum autoantibodies against glutamic-acid-decarboxylase-65 (anti-GAD65) in Dartmouth Health's EHR to determine features with the highest discriminatory power. Each iteration of the algorithm implemented a Support Vector Machine (SVM) model, generating importance scores-SHapley Additive exPlanation (SHAP) values-for each feature and removing one with the least salient. Evaluation metrics were calculated through repeated stratified cross-validation. RESULTS Depression, hypothyroidism, GERD, and joint pain were the most characteristic features of SPS. Utilizing these features, the SVM model attained precision of 0.817 (95% CI 0.795-0.840), sensitivity of 0.766 (95% CI 0.743-0.790), F-score of 0.761 (95% CI 0.744-0.778), AUC of 0.808 (95% CI 0.791-0.825), and accuracy of 0.775 (95% CI 0.759-0.790). CONCLUSIONS This framework discerned features that, with further research, may help fully characterize the pathologic mechanism of SPS: depression, hypothyroidism, and GERD may respectively represent comorbidities through common inflammatory, genetic, and dysautonomic links. This methodology could address diagnostic challenges in neurology by uncovering latent associations and generating hypotheses for RNDs.
Collapse
Affiliation(s)
- Soo Hwan Park
- Geisel School of Medicine at Dartmouth, Hanover, NH, USA
- Department of Neurology, Dartmouth Health, Lebanon, NH, USA
| | - Seo Ho Song
- Department of Psychiatry, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, USA
| | - Frederick Burton
- Department of Psychiatry, University of California Los Angeles Health, Los Angeles, CA, USA
| | - Cybèle Arsan
- Department of Psychiatry, Oakland Medical Center, Kaiser Permanente, Oakland, CA, USA
| | - Barbara Jobst
- Geisel School of Medicine at Dartmouth, Hanover, NH, USA
- Department of Neurology, Dartmouth Health, Lebanon, NH, USA
| | - Mary Feldman
- Geisel School of Medicine at Dartmouth, Hanover, NH, USA.
- Department of Neurology, Dartmouth Health, Lebanon, NH, USA.
| |
Collapse
|
30
|
Perski O, Kale D, Leppin C, Okpako T, Simons D, Goldstein SP, Hekler E, Brown J. Supervised machine learning to predict smoking lapses from Ecological Momentary Assessments and sensor data: Implications for just-in-time adaptive intervention development. PLOS DIGITAL HEALTH 2024; 3:e0000594. [PMID: 39178183 PMCID: PMC11343380 DOI: 10.1371/journal.pdig.0000594] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/08/2024] [Accepted: 07/28/2024] [Indexed: 08/25/2024]
Abstract
Specific moments of lapse among smokers attempting to quit often lead to full relapse, which highlights a need for interventions that target lapses before they might occur, such as just-in-time adaptive interventions (JITAIs). To inform the decision points and tailoring variables of a lapse prevention JITAI, we trained and tested supervised machine learning algorithms that use Ecological Momentary Assessments (EMAs) and wearable sensor data of potential lapse triggers and lapse incidence. We aimed to identify a best-performing and feasible algorithm to take forwards in a JITAI. For 10 days, adult smokers attempting to quit were asked to complete 16 hourly EMAs/day assessing cravings, mood, activity, social context, physical context, and lapse incidence, and to wear a Fitbit Charge 4 during waking hours to passively collect data on steps and heart rate. A series of group-level supervised machine learning algorithms (e.g., Random Forest, XGBoost) were trained and tested, without and with the sensor data. Their ability to predict lapses for out-of-sample (i) observations and (ii) individuals were evaluated. Next, a series of individual-level and hybrid (i.e., group- and individual-level) algorithms were trained and tested. Participants (N = 38) responded to 6,124 EMAs (with 6.9% of responses reporting a lapse). Without sensor data, the best-performing group-level algorithm had an area under the receiver operating characteristic curve (AUC) of 0.899 (95% CI = 0.871-0.928). Its ability to classify lapses for out-of-sample individuals ranged from poor to excellent (AUCper person = 0.524-0.994; median AUC = 0.639). 15/38 participants had adequate data for individual-level algorithms to be constructed, with a median AUC of 0.855 (range: 0.451-1.000). Hybrid algorithms could be constructed for 25/38 participants, with a median AUC of 0.692 (range: 0.523 to 0.998). With sensor data, the best-performing group-level algorithm had an AUC of 0.952 (95% CI = 0.933-0.970). Its ability to classify lapses for out-of-sample individuals ranged from poor to excellent (AUCper person = 0.494-0.979; median AUC = 0.745). 11/30 participants had adequate data for individual-level algorithms to be constructed, with a median AUC of 0.983 (range: 0.549-1.000). Hybrid algorithms could be constructed for 20/30 participants, with a median AUC of 0.772 (range: 0.444 to 0.968). In conclusion, high-performing group-level lapse prediction algorithms without and with sensor data had variable performance when applied to out-of-sample individuals. Individual-level and hybrid algorithms could be constructed for a limited number of individuals but had improved performance, particularly when incorporating sensor data for participants with sufficient wear time. Feasibility constraints and the need to balance multiple success criteria in the JITAI development and implementation process are discussed.
Collapse
Affiliation(s)
- Olga Perski
- Faculty of Social Sciences, Tampere University, Finland
- Herbert Wertheim School of Public Health and Human Longevity Science, University of California, San Diego, California, United States of America
- Department of Behavioural Science and Health, University College London, United Kingdom
| | - Dimitra Kale
- Department of Behavioural Science and Health, University College London, United Kingdom
| | - Corinna Leppin
- Department of Behavioural Science and Health, University College London, United Kingdom
| | - Tosan Okpako
- Department of Behavioural Science and Health, University College London, United Kingdom
| | - David Simons
- Centre for Emerging, Endemic and Exotic Diseases, Royal Veterinary College, United Kingdom
| | - Stephanie P. Goldstein
- Department of Psychiatry and Human Behavior, Warren Alpert Medical School of Brown University & The Miriam Hospital/Weight Control and Diabetes Research Center, United States of America
| | - Eric Hekler
- Herbert Wertheim School of Public Health and Human Longevity Science, University of California, San Diego, California, United States of America
| | - Jamie Brown
- Department of Behavioural Science and Health, University College London, United Kingdom
| |
Collapse
|
31
|
Frank GKW, Stoddard JJ, Brown T, Gowin J, Kaye WH. Weight gained during treatment predicts 6-month body mass index in a large sample of patients with anorexia nervosa using ensemble machine learning. Int J Eat Disord 2024; 57:1653-1667. [PMID: 38610100 DOI: 10.1002/eat.24208] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/13/2023] [Revised: 03/22/2024] [Accepted: 03/24/2024] [Indexed: 04/14/2024]
Abstract
OBJECTIVE This study used machine learning methods to analyze data on treatment outcomes from individuals with anorexia nervosa admitted to a specialized eating disorders treatment program. METHODS Of 368 individuals with anorexia nervosa (209 adolescents and 159 adults), 160 individuals had data available for a 6-month follow-up analysis. Participants were treated in a 6-day-per-week partial-hospital program. Participants were assessed for eating disorder-specific and non-specific psychopathology. The analyses used established machine learning procedures combined in an ensemble model from support vector machine learning, random forest prediction, and the elastic net regularized regression with an exploration (training; 75%) and confirmation (test; 25%) split of the data. RESULTS The models predicting body mass index (BMI) at 6-month follow-up explained a 28.6% variance in the training set (n = 120). The model had good performance in predicting 6-month BMI in the test dataset (n = 40), with predicted BMI significantly correlating with actual BMI (r = .51, p = 0.01). The change in BMI from admission to discharge was the most important predictor, strongly correlating with reported BMI at 6-month follow-up (r = .55). Behavioral variables were much less predictive of BMI outcome. Results were similar for z-transformed BMI in the adolescent-only group. Length of stay was most predictive of weight gain in treatment (r = .56) but did not predict longer-term BMI. CONCLUSIONS This study, using an agnostic ensemble machine learning approach in the largest to-date sample of individuals with anorexia nervosa, suggests that achieving weight gain goals in treatment predicts longer-term weight-related outcomes. Other potential predictors, personality, mood, or eating disorder-specific symptoms were relatively much less predictive. PUBLIC SIGNIFICANCE The results from this study indicate that the amount of weight gained during treatment predicts BMI 6 months after discharge from a high level of care. This suggests that patients require sufficient time in a higher level of care treatment to meet their specific weight goals and be able to maintain normal weight.
Collapse
Affiliation(s)
- Guido K W Frank
- Department of Psychiatry, University of California San Diego, San Diego, California, USA
| | - Joel J Stoddard
- Department of Psychiatry, University of Colorado Anschutz Medical Campus, Aurora, Colorado, USA
| | - Tiffany Brown
- Department of Psychological Sciences, Auburn University, Auburn, Alabama, USA
| | - Josh Gowin
- Department of Radiology, University of Colorado Anschutz Medical Campus, Aurora, Colorado, USA
| | - Walter H Kaye
- Department of Psychiatry, University of California San Diego, San Diego, California, USA
| |
Collapse
|
32
|
Li Q, Li P, Chen J, Ren R, Ren N, Xia Y. Machine Learning for Predicting Stillbirth: A Systematic Review. Reprod Sci 2024:10.1007/s43032-024-01655-z. [PMID: 39078567 DOI: 10.1007/s43032-024-01655-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2024] [Accepted: 07/11/2024] [Indexed: 07/31/2024]
Abstract
Stillbirth is a major global issue, with over 5 million cases each year. The multifactorial nature of stillbirth makes it difficult to predict. Artificial intelligence (AI) and machine learning (ML) have the potential to enhance clinical decision-making and enable precise assessments. This study reviewed the literature on predictive ML models for stillbirth highlighting input characteristics, performance metrics, and validation. The PubMed, Cochrane, and Web of Science databases were searched for studies using AI to develop predictive models for stillbirth. Findings were analyzed qualitatively using narrative synthesis and graphics. Risk of bias and the applicability of the studies were assessed using PROBAST. Model design and performance were discussed. Eight studies involving 14,840,654 women with gestational ages ranging from 20 weeks to full term were included in the qualitative analysis. Most studies used neural networks, random forests, and logistic regression algorithms. The number of predictive features varied from 14 to 53. Only 50% of studies validated the models. Cross-validation was commonly employed, and only 25% of studies performed external validation. All studies reported area under the curve as a performance metric (range 0.54-0.9), and five studies reported sensitivity (range, 60- 90%) and specificity (range, 64 - 93.3%). A stacked ensemble model that analyzed 53 features performed better than other models (AUC = 0.9; sensitivity and specificity > 85%). Available ML models can attain a considerable degree of accuracy for prediction of stillbirth; however, these models require further development before they can be applied in a clinical setting.
Collapse
Affiliation(s)
- Qingyuan Li
- Department of Clinical Medicine, International Medical College of Chongqing Medical University, Yixueyuan Road No.1, Yuzhong District, Chongqing, 400016, China
| | - Pan Li
- Department of Clinical Medicine, Southwest Medical University, Zhongshan Road, No.319 Section 3, Luzhou, 646000, China
| | - Junyu Chen
- School of Public Health, Chongqing Medical University, Yixueyuan Road No.1, Yuzhong District, Chongqing, 400016, China
| | - Ruyu Ren
- School of Public Health, Chongqing Medical University, Yixueyuan Road No.1, Yuzhong District, Chongqing, 400016, China
| | - Ni Ren
- School of Public Health, Chongqing Medical University, Yixueyuan Road No.1, Yuzhong District, Chongqing, 400016, China
| | - Yinyin Xia
- School of Public Health, Chongqing Medical University, Yixueyuan Road No.1, Yuzhong District, Chongqing, 400016, China.
| |
Collapse
|
33
|
Wawer A, Chojnicka I, Sarzyńska-Wawer J, Krawczyk M. A cross-dataset study on automatic detection of autism spectrum disorder from text data. Acta Psychiatr Scand 2024. [PMID: 39032040 DOI: 10.1111/acps.13737] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/11/2024] [Revised: 06/20/2024] [Accepted: 07/06/2024] [Indexed: 07/22/2024]
Abstract
OBJECTIVE The goals of this article are as follows. First, to investigate the possibility of detecting autism spectrum disorder (ASD) from text data using the latest generation of machine learning tools. Second, to compare model performance on two datasets of transcribed statements, collected using two different diagnostic tools. Third, to investigate the feasibility of knowledge transfer between models trained on both datasets and check if data augmentation can help alleviate the problem of a small number of observations. METHOD We explore two techniques to detect ASD. The first one is based on fine-tuning HerBERT, a BERT-based, monolingual deep transformer neural network. The second one uses the newest, multipurpose text embeddings from OpenAI and a classifier. We apply the methods to two separate datasets of transcribed statements, collected using two different diagnostic tools: thought, language, and communication (TLC) and autism diagnosis observation schedule-2 (ADOS-2). We conducted several cross-dataset experiments in both a zero-shot setting and a setting where models are pretrained on one dataset and then training continues on another to test the possibility of knowledge transfer. RESULTS Unlike previous studies, the models we tested obtained average results on ADOS-2 data but reached very good performance of the models in TLC. We did not observe any benefits from knowledge transfer between datasets. We observed relatively poor performance of models trained on augmented data and hypothesize that data augmentation by back translation obfuscates autism-specific signals. CONCLUSION The quality of machine learning models that detect ASD from text data is improving, but model results are dependent on the type of input data or diagnostic tool.
Collapse
Affiliation(s)
- Aleksander Wawer
- Polish Academy of Sciences, Institute of Computer Science, Warsaw, Poland
| | - Izabela Chojnicka
- Department of Health and Rehabilitation Psychology, Faculty of Psychology, University of Warsaw, Warsaw, Poland
| | | | | |
Collapse
|
34
|
Bai C, Sun Y, Zhang X, Zuo Z. Assessment of AURKA expression and prognosis prediction in lung adenocarcinoma using machine learning-based pathomics signature. Heliyon 2024; 10:e33107. [PMID: 39022022 PMCID: PMC11253280 DOI: 10.1016/j.heliyon.2024.e33107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2024] [Revised: 06/07/2024] [Accepted: 06/14/2024] [Indexed: 07/20/2024] Open
Abstract
Objective This study aimed to develop quantitative feature-based models from histopathological images to assess aurora kinase A (AURKA) expression and predict the prognosis of patients with lung adenocarcinoma (LUAD). Methods A dataset of patients with LUAD was derived from the cancer genome atlas (TCGA) with information on clinical characteristics, RNA sequencing and histopathological images. The TCGA-LUAD cohort was randomly divided into training (n = 229) and testing (n = 98) sets. We extracted quantitative image features from histopathological slides of patients with LUAD using computational approaches, constructed a predictive model for AURKA expression in the training set, and estimated their predictive performance in the test set. A Cox proportional hazards model was used to assess whether the pathomic scores (PS) generated by the model independently predicted LUAD survival. Results High AURKA expression was an independent risk factor for overall survival (OS) in patients with LUAD (hazard ratio = 1.816, 95 % confidence intervals = 1.257-2.623, P = 0.001). The model based on histopathological image features had significant predictive value for AURKA expression: the area under the curve of the receiver operating characteristic curve in the training set and validation set was 0.809 and 0.739, respectively. Decision curve analysis showed that the model had clinical utility. Patients with high PS and low PS had different survival rates (P = 0.019). Multivariate analysis suggested that PS was an independent prognostic factor for LUAD (hazard ratio = 1.615, 95 % confidence intervals = 1.071-2.438, P = 0.022). Conclusion Pathomics models based on machine learning can accurately predict AURKA expression and the PS generated by the model can predict LUAD prognosis.
Collapse
Affiliation(s)
- Cuiqing Bai
- Department of Respiratory Disease, Affiliated Hospital of Jiangnan University, Wuxi, China
| | - Yan Sun
- Department of Respiratory Disease, Affiliated Hospital of Jiangnan University, Wuxi, China
| | - Xiuqin Zhang
- Department of Respiratory Disease, Affiliated Hospital of Jiangnan University, Wuxi, China
| | - Zhitong Zuo
- Department of Respiratory Disease, Affiliated Hospital of Jiangnan University, Wuxi, China
| |
Collapse
|
35
|
Zhang Q, Fotaki A, Ghadimi S, Wang Y, Doneva M, Wetzl J, Delfino JG, O'Regan DP, Prieto C, Epstein FH. Improving the efficiency and accuracy of cardiovascular magnetic resonance with artificial intelligence-review of evidence and proposition of a roadmap to clinical translation. J Cardiovasc Magn Reson 2024; 26:101051. [PMID: 38909656 PMCID: PMC11331970 DOI: 10.1016/j.jocmr.2024.101051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2024] [Revised: 06/09/2024] [Accepted: 06/18/2024] [Indexed: 06/25/2024] Open
Abstract
BACKGROUND Cardiovascular magnetic resonance (CMR) is an important imaging modality for the assessment of heart disease; however, limitations of CMR include long exam times and high complexity compared to other cardiac imaging modalities. Recently advancements in artificial intelligence (AI) technology have shown great potential to address many CMR limitations. While the developments are remarkable, translation of AI-based methods into real-world CMR clinical practice remains at a nascent stage and much work lies ahead to realize the full potential of AI for CMR. METHODS Herein we review recent cutting-edge and representative examples demonstrating how AI can advance CMR in areas such as exam planning, accelerated image reconstruction, post-processing, quality control, classification and diagnosis. RESULTS These advances can be applied to speed up and simplify essentially every application including cine, strain, late gadolinium enhancement, parametric mapping, 3D whole heart, flow, perfusion and others. AI is a unique technology based on training models using data. Beyond reviewing the literature, this paper discusses important AI-specific issues in the context of CMR, including (1) properties and characteristics of datasets for training and validation, (2) previously published guidelines for reporting CMR AI research, (3) considerations around clinical deployment, (4) responsibilities of clinicians and the need for multi-disciplinary teams in the development and deployment of AI in CMR, (5) industry considerations, and (6) regulatory perspectives. CONCLUSIONS Understanding and consideration of all these factors will contribute to the effective and ethical deployment of AI to improve clinical CMR.
Collapse
Affiliation(s)
- Qiang Zhang
- Oxford Centre for Clinical Magnetic Resonance Research, Division of Cardiovascular Medicine, Radcliffe Department of Medicine, University of Oxford, Oxford, UK; Big Data Institute, University of Oxford, Oxford, UK.
| | - Anastasia Fotaki
- School of Biomedical Engineering and Imaging Sciences, King's College London, London, UK; Royal Brompton Hospital, Guy's and St Thomas' NHS Foundation Trust, London, UK.
| | - Sona Ghadimi
- Department of Biomedical Engineering, University of Virginia, Charlottesville, VA, USA.
| | - Yu Wang
- Department of Biomedical Engineering, University of Virginia, Charlottesville, VA, USA.
| | | | - Jens Wetzl
- Siemens Healthineers AG, Erlangen, Germany.
| | - Jana G Delfino
- US Food and Drug Administration, Center for Devices and Radiological Health (CDRH), Office of Science and Engineering Laboratories (OSEL), Silver Spring, MD, USA.
| | - Declan P O'Regan
- MRC Laboratory of Medical Sciences, Imperial College London, London, UK.
| | - Claudia Prieto
- School of Biomedical Engineering and Imaging Sciences, King's College London, London, UK; School of Engineering, Pontificia Universidad Católica de Chile, Santiago, Chile.
| | - Frederick H Epstein
- Department of Biomedical Engineering, University of Virginia, Charlottesville, VA, USA.
| |
Collapse
|
36
|
Yehuala TZ, Agimas MC, Derseh NM, Wubante SM, Fente BM, Yismaw GA, Tesfie TK. Machine learning algorithms to predict healthcare-seeking behaviors of mothers for acute respiratory infections and their determinants among children under five in sub-Saharan Africa. Front Public Health 2024; 12:1362392. [PMID: 38962762 PMCID: PMC11220189 DOI: 10.3389/fpubh.2024.1362392] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2024] [Accepted: 06/03/2024] [Indexed: 07/05/2024] Open
Abstract
Background Acute respiratory infections (ARIs) are the leading cause of death in children under the age of 5 globally. Maternal healthcare-seeking behavior may help minimize mortality associated with ARIs since they make decisions about the kind and frequency of healthcare services for their children. Therefore, this study aimed to predict the absence of maternal healthcare-seeking behavior and identify its associated factors among children under the age 5 in sub-Saharan Africa (SSA) using machine learning models. Methods The sub-Saharan African countries' demographic health survey was the source of the dataset. We used a weighted sample of 16,832 under-five children in this study. The data were processed using Python (version 3.9), and machine learning models such as extreme gradient boosting (XGB), random forest, decision tree, logistic regression, and Naïve Bayes were applied. In this study, we used evaluation metrics, including the AUC ROC curve, accuracy, precision, recall, and F-measure, to assess the performance of the predictive models. Result In this study, a weighted sample of 16,832 under-five children was used in the final analysis. Among the proposed machine learning models, the random forest (RF) was the best-predicted model with an accuracy of 88.89%, a precision of 89.5%, an F-measure of 83%, an AUC ROC curve of 95.8%, and a recall of 77.6% in predicting the absence of mothers' healthcare-seeking behavior for ARIs. The accuracy for Naïve Bayes was the lowest (66.41%) when compared to other proposed models. No media exposure, living in rural areas, not breastfeeding, poor wealth status, home delivery, no ANC visit, no maternal education, mothers' age group of 35-49 years, and distance to health facilities were significant predictors for the absence of mothers' healthcare-seeking behaviors for ARIs. On the other hand, undernourished children with stunting, underweight, and wasting status, diarrhea, birth size, married women, being a male or female sex child, and having a maternal occupation were significantly associated with good maternal healthcare-seeking behaviors for ARIs among under-five children. Conclusion The RF model provides greater predictive power for estimating mothers' healthcare-seeking behaviors based on ARI risk factors. Machine learning could help achieve early prediction and intervention in children with high-risk ARIs. This leads to a recommendation for policy direction to reduce child mortality due to ARIs in sub-Saharan countries.
Collapse
Affiliation(s)
- Tirualem Zeleke Yehuala
- Department Health Informatics, Institute of Public Health, College of Medicine and Health Sciences, University of Gondar, Gondar, Ethiopia
| | - Muluken Chanie Agimas
- Department of Epidemiology and Biostatistics, Institute of Public Health, College of Medicine and Health Sciences, University of Gondar, Gondar, Ethiopia
| | - Nebiyu Mekonnen Derseh
- Department of Epidemiology and Biostatistics, Institute of Public Health, College of Medicine and Health Sciences, University of Gondar, Gondar, Ethiopia
| | - Sisay Maru Wubante
- Department Health Informatics, Institute of Public Health, College of Medicine and Health Sciences, University of Gondar, Gondar, Ethiopia
| | - Bezawit Melak Fente
- Department of General Midwifery, School of Midwifery, College of Medicine and Health Sciences, University of Gondar, Gondar, Ethiopia
| | - Getaneh Awoke Yismaw
- Department of Epidemiology and Biostatistics, Institute of Public Health, College of Medicine and Health Sciences, University of Gondar, Gondar, Ethiopia
| | - Tigabu Kidie Tesfie
- Department of Epidemiology and Biostatistics, Institute of Public Health, College of Medicine and Health Sciences, University of Gondar, Gondar, Ethiopia
| |
Collapse
|
37
|
Wang Q, Yang L, Xiu S, Shen Y, Jin H, Lin L. A Prediction Model for Identifying Seasonal Influenza Vaccination Uptake Among Children in Wuxi, China: Prospective Observational Study. JMIR Public Health Surveill 2024; 10:e56064. [PMID: 38885032 PMCID: PMC11217706 DOI: 10.2196/56064] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2024] [Revised: 05/07/2024] [Accepted: 05/15/2024] [Indexed: 06/18/2024] Open
Abstract
BACKGROUND Predicting vaccination behaviors accurately could provide insights for health care professionals to develop targeted interventions. OBJECTIVE The aim of this study was to develop predictive models for influenza vaccination behavior among children in China. METHODS We obtained data from a prospective observational study in Wuxi, eastern China. The predicted outcome was individual-level vaccine uptake and covariates included sociodemographics of the child and parent, parental vaccine hesitancy, perceptions of convenience to the clinic, satisfaction with clinic services, and willingness to vaccinate. Bayesian networks, logistic regression, least absolute shrinkage and selection operator (LASSO) regression, support vector machine (SVM), naive Bayes (NB), random forest (RF), and decision tree classifiers were used to construct prediction models. Various performance metrics, including area under the receiver operating characteristic curve (AUC), were used to evaluate the predictive performance of the different models. Receiver operating characteristic curves and calibration plots were used to assess model performance. RESULTS A total of 2383 participants were included in the study; 83.2% of these children (n=1982) were <5 years old and 6.6% (n=158) had previously received an influenza vaccine. More than half (1356/2383, 56.9%) the parents indicated a willingness to vaccinate their child against influenza. Among the 2383 children, 26.3% (n=627) received influenza vaccination during the 2020-2021 season. Within the training set, the RF model showed the best performance across all metrics. In the validation set, the logistic regression model and NB model had the highest AUC values; the SVM model had the highest precision; the NB model had the highest recall; and the logistic regression model had the highest accuracy, F1 score, and Cohen κ value. The LASSO and logistic regression models were well-calibrated. CONCLUSIONS The developed prediction model can be used to quantify the uptake of seasonal influenza vaccination for children in China. The stepwise logistic regression model may be better suited for prediction purposes.
Collapse
Affiliation(s)
- Qiang Wang
- Department of Epidemiology, School of Public Health, Fudan University, Shanghai, China
- Department of Epidemiology and Health Statistics, School of Public Health, Southeast University, Nanjing, China
| | - Liuqing Yang
- Department of Epidemiology and Health Statistics, School of Public Health, Southeast University, Nanjing, China
| | - Shixin Xiu
- Department of Immunization, Affiliated Wuxi Center for Disease Control and Prevention of Nanjing Medical University, Wuxi Center for Disease Control and Prevention, Wuxi, China
| | - Yuan Shen
- Department of Immunization, Affiliated Wuxi Center for Disease Control and Prevention of Nanjing Medical University, Wuxi Center for Disease Control and Prevention, Wuxi, China
| | - Hui Jin
- Department of Epidemiology and Health Statistics, School of Public Health, Southeast University, Nanjing, China
| | - Leesa Lin
- Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, United Kingdom
- Laboratory of Data Discovery for Health (D24H), Hong Kong Science Park, Hong Kong Special Administrative Region, China
- WHO Collaborating Centre for Infectious Disease Epidemiology and Control, School of Public Health, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong Special Administrative Region, China
| |
Collapse
|
38
|
Beak W, Park J, Ji S. Data-driven prediction model for periodontal disease based on correlational feature analysis and clinical validation. Heliyon 2024; 10:e32496. [PMID: 38912435 PMCID: PMC11193031 DOI: 10.1016/j.heliyon.2024.e32496] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2023] [Revised: 06/03/2024] [Accepted: 06/05/2024] [Indexed: 06/25/2024] Open
Abstract
Objectives This study aimed to investigate the performance and reliability of data-driven models employing correlational feature analysis and clinical validation for predicting periodontal disease. Methods The 7th Korea National Health and Nutrition Examination Survey (n = 10,654) was used for correlation analysis to identify significant risk factors for periodontitis. Periodontal prediction models were developed with the selected factors and database, followed by internal validation with 5-fold cross-validation and 1000 bootstrap resampling. External validation was conducted with clinical data (n = 120) collected through self-reported questionnaires, clinical periodontal parameters, and radiographic image analysis. Predictive performance was assessed for logistics regression, support vector machine, random forest, XGBoost, and neural network algorithms using the area under the receiver operating characteristic curves (AUC) and other performance metrics. Results Correlation analysis identified 16 features from over 1000 potential risk factors for periodontitis. The best data-driven model (XGBoost) showed AUC values of 0.823 and 0.796 for internal and external validations, respectively. Modeling with clinical data revealed those same measures to be 0.836 and 0.649, respectively. In addition, the data-driven model could predict other clinical periodontal parameters including severe bone loss (AUC = 0.813), gingival bleeding (AUC = 0.694), and tooth loss (AUC = 0.734). A patient case study about prognostic predictions revealed that the probability of periodontitis can be reduced by 6.0 % (stop smoking) and 0.6 % (stop drinking) on average. Conclusions Data-driven models for predicting periodontitis and other periodontal parameters were developed from 16 risk factors, demonstrating enhanced prediction performance and reproducibility in internal-external validations.
Collapse
Affiliation(s)
- Woosun Beak
- Department of Dental Public Health, Ajou University Graduate School of Clinical Dentistry, Suwon, Republic of Korea
- Department of Dentistry, Gyeonggi Provincial Medical Center Suwon Hospital, Suwon, Republic of Korea
| | - Jihun Park
- Department of Materials Science and Engineering, University of Maryland, College Park, MD, USA
| | - Suk Ji
- Department of Dental Public Health, Ajou University Graduate School of Clinical Dentistry, Suwon, Republic of Korea
- Department of Periodontology, Institute of Oral Health Science, Ajou University School of Medicine, Suwon, Republic of Korea
| |
Collapse
|
39
|
Philip MM, Watts J, McKiddie F, Welch A, Nath M. Development and Validation of Prognostic Models Using Radiomic Features from Pre-Treatment Positron Emission Tomography (PET) Images in Head and Neck Squamous Cell Carcinoma (HNSCC) Patients. Cancers (Basel) 2024; 16:2195. [PMID: 38927901 PMCID: PMC11202084 DOI: 10.3390/cancers16122195] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2024] [Accepted: 06/04/2024] [Indexed: 06/28/2024] Open
Abstract
High-dimensional radiomics features derived from pre-treatment positron emission tomography (PET) images offer prognostic insights for patients with head and neck squamous cell carcinoma (HNSCC). Using 124 PET radiomics features and clinical variables (age, sex, stage of cancer, site of cancer) from a cohort of 232 patients, we evaluated four survival models-penalized Cox model, random forest, gradient boosted model and support vector machine-to predict all-cause mortality (ACM), locoregional recurrence/residual disease (LR) and distant metastasis (DM) probability during 36, 24 and 24 months of follow-up, respectively. We developed models with five-fold cross-validation, selected the best-performing model for each outcome based on the concordance index (C-statistic) and the integrated Brier score (IBS) and validated them in an independent cohort of 102 patients. The penalized Cox model demonstrated better performance for ACM (C-statistic = 0.70, IBS = 0.12) and DM (C-statistic = 0.70, IBS = 0.08) while the random forest model displayed better performance for LR (C-statistic = 0.76, IBS = 0.07). We conclude that the ML-based prognostic model can aid clinicians in quantifying prognosis and determining effective treatment strategies, thereby improving favorable outcomes in HNSCC patients.
Collapse
Affiliation(s)
- Mahima Merin Philip
- Institute of Applied Health Sciences, University of Aberdeen, Aberdeen AB25 2ZD, UK;
| | - Jessica Watts
- National Health Service Grampian, Aberdeen AB15 6RE, UK; (J.W.); (F.M.)
| | - Fergus McKiddie
- National Health Service Grampian, Aberdeen AB15 6RE, UK; (J.W.); (F.M.)
| | - Andy Welch
- Institute of Education in Healthcare and Medical Sciences, University of Aberdeen, Aberdeen AB25 2ZD, UK;
| | - Mintu Nath
- Institute of Applied Health Sciences, University of Aberdeen, Aberdeen AB25 2ZD, UK;
| |
Collapse
|
40
|
Perez-Lopez R, Ghaffari Laleh N, Mahmood F, Kather JN. A guide to artificial intelligence for cancer researchers. Nat Rev Cancer 2024; 24:427-441. [PMID: 38755439 DOI: 10.1038/s41568-024-00694-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 04/09/2024] [Indexed: 05/18/2024]
Abstract
Artificial intelligence (AI) has been commoditized. It has evolved from a specialty resource to a readily accessible tool for cancer researchers. AI-based tools can boost research productivity in daily workflows, but can also extract hidden information from existing data, thereby enabling new scientific discoveries. Building a basic literacy in these tools is useful for every cancer researcher. Researchers with a traditional biological science focus can use AI-based tools through off-the-shelf software, whereas those who are more computationally inclined can develop their own AI-based software pipelines. In this article, we provide a practical guide for non-computational cancer researchers to understand how AI-based tools can benefit them. We convey general principles of AI for applications in image analysis, natural language processing and drug discovery. In addition, we give examples of how non-computational researchers can get started on the journey to productively use AI in their own work.
Collapse
Affiliation(s)
- Raquel Perez-Lopez
- Radiomics Group, Vall d'Hebron Institute of Oncology, Vall d'Hebron Barcelona Hospital Campus, Barcelona, Spain
| | - Narmin Ghaffari Laleh
- Else Kroener Fresenius Center for Digital Health, Technical University Dresden, Dresden, Germany
| | - Faisal Mahmood
- Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
- Department of Pathology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
- Cancer Program, Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Cancer Data Science Program, Dana-Farber Cancer Institute, Boston, MA, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
- Harvard Data Science Initiative, Harvard University, Cambridge, MA, USA
| | - Jakob Nikolas Kather
- Else Kroener Fresenius Center for Digital Health, Technical University Dresden, Dresden, Germany.
- Department of Medicine I, University Hospital Dresden, Dresden, Germany.
- Medical Oncology, National Center for Tumour Diseases (NCT), University Hospital Heidelberg, Heidelberg, Germany.
| |
Collapse
|
41
|
Winder AJ, Stanley EA, Fiehler J, Forkert ND. Challenges and Potential of Artificial Intelligence in Neuroradiology. Clin Neuroradiol 2024; 34:293-305. [PMID: 38285239 DOI: 10.1007/s00062-024-01382-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2023] [Accepted: 01/03/2024] [Indexed: 01/30/2024]
Abstract
PURPOSE Artificial intelligence (AI) has emerged as a transformative force in medical research and is garnering increased attention in the public consciousness. This represents a critical time period in which medical researchers, healthcare providers, insurers, regulatory agencies, and patients are all developing and shaping their beliefs and policies regarding the use of AI in the healthcare sector. The successful deployment of AI will require support from all these groups. This commentary proposes that widespread support for medical AI must be driven by clear and transparent scientific reporting, beginning at the earliest stages of scientific research. METHODS A review of relevant guidelines and literature describing how scientific reporting plays a central role at key stages in the life cycle of an AI software product was conducted. To contextualize this principle within a specific medical domain, we discuss the current state of predictive tissue outcome modeling in acute ischemic stroke and the unique challenges presented therein. RESULTS AND CONCLUSION Translating AI methods from the research to the clinical domain is complicated by challenges related to model design and validation studies, medical product regulations, and healthcare providers' reservations regarding AI's efficacy and affordability. However, each of these limitations is also an opportunity for high-impact research that will help to accelerate the clinical adoption of state-of-the-art medical AI. In all cases, establishing and adhering to appropriate reporting standards is an important responsibility that is shared by all of the parties involved in the life cycle of a prospective AI software product.
Collapse
Affiliation(s)
- Anthony J Winder
- Department of Radiology, University of Calgary, Calgary, Canada.
- Hotchkiss Brain Institute, University of Calgary, Calgary, Canada.
| | - Emma Am Stanley
- Department of Radiology, University of Calgary, Calgary, Canada
- Hotchkiss Brain Institute, University of Calgary, Calgary, Canada
- Biomedical Engineering Graduate Program, University of Calgary, Calgary, Canada
- Alberta Children's Hospital Research Institute, University of Calgary, Calgary, Canada
| | - Jens Fiehler
- Department of Diagnostic and Interventional Neuroradiology, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
| | - Nils D Forkert
- Department of Radiology, University of Calgary, Calgary, Canada
- Hotchkiss Brain Institute, University of Calgary, Calgary, Canada
- Alberta Children's Hospital Research Institute, University of Calgary, Calgary, Canada
- Department of Clinical Neuroscience, University of Calgary, Calgary, Canada
- Department of Electrical and Software Engineering, University of Calgary, Calgary, Canada
| |
Collapse
|
42
|
Mugambi P, Carreiro S. Best of Both Worlds: Bridging One Model for All and Group-Specific Model Approaches using Ensemble-based Subpopulation Modeling. AMIA JOINT SUMMITS ON TRANSLATIONAL SCIENCE PROCEEDINGS. AMIA JOINT SUMMITS ON TRANSLATIONAL SCIENCE 2024; 2024:354-363. [PMID: 38827055 PMCID: PMC11141864] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/04/2024]
Abstract
Subpopulation models have become of increasing interest in prediction of clinical outcomes because they promise to perform better for underrepresented patient subgroups. However, the personalization benefits gained from these models tradeoff their statistical power, and can be impractical when the subpopulation's sample size is small. We hypothesize that a hierarchical model in which population information is integrated into subpopulation models would preserve the personalization benefits and offset the loss of power. In this work, we integrate ideas from ensemble modeling, personalization, and hierarchical modeling and build ensemble-based subpopulation models in which specialization relies on whole group samples. This approach significantly improves the precision of the positive class, especially for the underrepresented subgroups, with minimal cost to the recall. It consistently outperforms one model for all and one model for each subgroup approaches, especially in the presence of a high class-imbalance, for subgroups with at least 380 training samples.
Collapse
|
43
|
Hagen M, Dass R, Westhues C, Blom J, Schultheiss SJ, Patz S. Interpretable machine learning decodes soil microbiome's response to drought stress. ENVIRONMENTAL MICROBIOME 2024; 19:35. [PMID: 38812054 PMCID: PMC11138018 DOI: 10.1186/s40793-024-00578-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/30/2023] [Accepted: 05/10/2024] [Indexed: 05/31/2024]
Abstract
BACKGROUND Extreme weather events induced by climate change, particularly droughts, have detrimental consequences for crop yields and food security. Concurrently, these conditions provoke substantial changes in the soil bacterial microbiota and affect plant health. Early recognition of soil affected by drought enables farmers to implement appropriate agricultural management practices. In this context, interpretable machine learning holds immense potential for drought stress classification of soil based on marker taxa. RESULTS This study demonstrates that the 16S rRNA-based metagenomic approach of Differential Abundance Analysis methods and machine learning-based Shapley Additive Explanation values provide similar information. They exhibit their potential as complementary approaches for identifying marker taxa and investigating their enrichment or depletion under drought stress in grass lineages. Additionally, the Random Forest Classifier trained on a diverse range of relative abundance data from the soil bacterial micobiome of various plant species achieves a high accuracy of 92.3 % at the genus rank for drought stress prediction. It demonstrates its generalization capacity for the lineages tested. CONCLUSIONS In the detection of drought stress in soil bacterial microbiota, this study emphasizes the potential of an optimized and generalized location-based ML classifier. By identifying marker taxa, this approach holds promising implications for microbe-assisted plant breeding programs and contributes to the development of sustainable agriculture practices. These findings are crucial for preserving global food security in the face of climate change.
Collapse
Affiliation(s)
- Michelle Hagen
- Computomics GmbH, Eisenbahnstraße 1, 72072, Tübingen, Baden-Württemberg, Germany
| | - Rupashree Dass
- Computomics GmbH, Eisenbahnstraße 1, 72072, Tübingen, Baden-Württemberg, Germany
| | - Cathy Westhues
- Computomics GmbH, Eisenbahnstraße 1, 72072, Tübingen, Baden-Württemberg, Germany
| | - Jochen Blom
- Bioinformatics and Systems Biology, Justus Liebig University Gießen, Heinrich-Buff-Ring 58, 35390, Gießen, Hesse, Germany
| | | | - Sascha Patz
- Computomics GmbH, Eisenbahnstraße 1, 72072, Tübingen, Baden-Württemberg, Germany.
| |
Collapse
|
44
|
Yan Z, Dube V, Heselton J, Johnson K, Yan C, Jones V, Blaskewicz Boron J, Shade M. Understanding older people's voice interactions with smart voice assistants: a new modified rule-based natural language processing model with human input. Front Digit Health 2024; 6:1329910. [PMID: 38812806 PMCID: PMC11135128 DOI: 10.3389/fdgth.2024.1329910] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Accepted: 05/06/2024] [Indexed: 05/31/2024] Open
Abstract
The COVID-19 pandemic has expedited the integration of Smart Voice Assistants (SVA) among older people. The qualitative data derived from user commands on SVA is pivotal for elucidating the engagement patterns of older individuals with such systems. However, the sheer volume of user-generated voice interaction data presents a formidable challenge for manual coding. Compounding this issue, age-related cognitive decline and alterations in speech patterns further complicate the interpretation of older users' SVA voice interactions. Conventional dictionary-based textual analysis tools, which count word frequencies, are inadequate in capturing the evolving and communicative essence of these interactions that unfold over a series of dialogues and modify with time. To address these challenges, our study introduces a novel, modified rule-based Natural Language Processing (MR-NLP) model augmented with human input. This reproducible approach capitalizes on human-derived insights to establish a lexicon of critical keywords and to formulate rules for the iterative refinement of the NLP model. English speakers, aged 50 or older and residing alone, were enlisted to engage with Amazon Alexa™ via predefined daily routines for a minimum of 30 min daily spanning three months (N = 35, mean age = 77). We amassed time-stamped, textual data comprising participants' user commands and responses from Alexa™. Initially, a subset constituting 20% of the data (1,020 instances) underwent manual coding by human coder, predicated on keywords and commands. Separately, a rule-based Natural Language Processing (NLP) methodology was employed to code the identical subset. Discrepancies arising between human coder and the NLP model programmer were deliberated upon and reconciled to refine the rule-based NLP coding framework for the entire dataset. The modified rule-based NLP approach demonstrated notable enhancements in efficiency and scalability and reduced susceptibility to inadvertent errors in comparison to manual coding. Furthermore, human input was instrumental in augmenting the NLP model, yielding insights germane to the aging adult demographic, such as recurring speech patterns or ambiguities. By disseminating this innovative software solution to the scientific community, we endeavor to advance research and innovation in NLP model formulation, subsequently contributing to the understanding of older people's interactions with SVA and other AI-powered systems.
Collapse
Affiliation(s)
- Zhengxu Yan
- College of Computing, Data Science and Society, University of California-Berkeley, Berkeley, CA, United States
| | - Victoria Dube
- Department of Gerontology, University of Nebraska-Omaha, Omaha, NE, United States
| | - Judith Heselton
- Department of Gerontology, University of Nebraska-Omaha, Omaha, NE, United States
| | - Kate Johnson
- College of Journalism and Mass Communications, University of Nebraska-Lincoln, Lincoln, NE, United States
| | - Changmin Yan
- College of Journalism and Mass Communications, University of Nebraska-Lincoln, Lincoln, NE, United States
| | - Valerie Jones
- College of Journalism and Mass Communications, University of Nebraska-Lincoln, Lincoln, NE, United States
| | | | - Marcia Shade
- College of Nursing, University of Nebraska Medical Center, Omaha, NE, United States
| |
Collapse
|
45
|
Xing Z, Chen H, Alman AC. Discriminating insulin resistance in middle-aged nondiabetic women using machine learning approaches. AIMS Public Health 2024; 11:667-687. [PMID: 39027391 PMCID: PMC11252584 DOI: 10.3934/publichealth.2024034] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2024] [Revised: 03/28/2024] [Accepted: 04/08/2024] [Indexed: 07/20/2024] Open
Abstract
Objective We employed machine learning algorithms to discriminate insulin resistance (IR) in middle-aged nondiabetic women. Methods The data was from the National Health and Nutrition Examination Survey (2007-2018). The study subjects were 2084 nondiabetic women aged 45-64. The analysis included 48 predictors. We randomly divided the data into training (n = 1667) and testing (n = 417) datasets. Four machine learning techniques were employed to discriminate IR: extreme gradient boosting (XGBoosting), random forest (RF), gradient boosting machine (GBM), and decision tree (DT). The area under the curve (AUC) of receiver operating characteristic (ROC), accuracy, sensitivity, specificity, positive predictive value, negative predictive value, and F1 score were compared as performance metrics to select the optimal technique. Results The XGBoosting algorithm achieved a relatively high AUC of 0.93 in the training dataset and 0.86 in the testing dataset to discriminate IR using 48 predictors and was followed by the RF, GBM, and DT models. After selecting the top five predictors to build models, the XGBoost algorithm with the AUC of 0.90 (training dataset) and 0.86 (testing dataset) remained the optimal prediction model. The SHapley Additive exPlanations (SHAP) values revealed the associations between the five predictors and IR, namely BMI (strongly positive impact on IR), fasting glucose (strongly positive), HDL-C (medium negative), triglycerides (medium positive), and glycohemoglobin (medium positive). The threshold values for identifying IR were 29 kg/m2, 100 mg/dL, 54.5 mg/dL, 89 mg/dL, and 5.6% for BMI, glucose, HDL-C, triglycerides, and glycohemoglobin, respectively. Conclusion The XGBoosting algorithm demonstrated superior performance metrics for discriminating IR in middle-aged nondiabetic women, with BMI, glucose, HDL-C, glycohemoglobin, and triglycerides as the top five predictors.
Collapse
Affiliation(s)
- Zailing Xing
- College of Public Health, University of South Florida, 13201 Bruce B. Downs Blvd, MDC 56, Tampa, FL 33612, USA
| | | | | |
Collapse
|
46
|
Kartal MT, Depren Ö, Kılıç Depren S. A comprehensive analysis of key factors' impact on environmental performance: Evidence from Globe by novel super learner algorithm. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2024; 359:121040. [PMID: 38718609 DOI: 10.1016/j.jenvman.2024.121040] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/19/2024] [Revised: 04/13/2024] [Accepted: 04/27/2024] [Indexed: 05/22/2024]
Abstract
This study aims to analyze comprehensively the impact of different economic and demographic factors, which affect economic development, on environmental performance. In this context, the study considers the Environmental Performance Index as the response variable, uses GDP per capita, tariff rate, tax burden, government expenditure, inflation, unemployment, population, income tax rate, public debt, FDI inflow, and corporate tax rate as the explanatory variables, examines 181 countries, performs a novel Super Learner (SL) algorithm, which includes a total of six machine learning (ML) algorithms, and uses data for the years 2018, 2020, and 2022. The results demonstrate that (i) the SL algorithm has a superior capacity with regard to other ML algorithms; (ii) gross domestic product per capita is the most crucial factor in the environmental performance followed by tariff rates, tax burden, government expenditure, and inflation, in order; (iii) among all, the corporate tax rate has the lowest importance on the environmental performance followed by also foreign direct investment, public debt, income tax rate, population, and unemployment; (iv) there are some critical thresholds, which imply that the impact of the factors on the environmental performance change according to these barriers. Overall, the study reveals the nonlinear impact of the variables on environmental performance as well as their relative importance and critical threshold. Thus, the study provides policymakers valuable insights in re-formulating their environmental policies to increase environmental performance. Accordingly, various policy options are discussed.
Collapse
Affiliation(s)
- Mustafa Tevfik Kartal
- Department of Economics and Management, Khazar University, Baku, Azerbaijan; Department of Finance and Banking, European University of Lefke, Lefke, Northern Cyprus, TR-10 Mersin, Türkiye; Adnan Kassar School of Business, Lebanese American University, Beirut, Lebanon; Clinic of Economics, Azerbaijan State University of Economics (UNEC), Baku, Azerbaijan; GUST Center for Sustainable Development, Gulf University for Science and Technology, Kuwait
| | - Özer Depren
- Clinic of Economics, Azerbaijan State University of Economics (UNEC), Baku, Azerbaijan; Customer Experience Research Lab, Yapı Kredi Bank, İstanbul, Türkiye
| | | |
Collapse
|
47
|
Minhas R, Peker NY, Hakkoz MA, Arbatli S, Celik Y, Erdem CE, Semiz B, Peker Y. Association of Visual-Based Signals with Electroencephalography Patterns in Enhancing the Drowsiness Detection in Drivers with Obstructive Sleep Apnea. SENSORS (BASEL, SWITZERLAND) 2024; 24:2625. [PMID: 38676243 PMCID: PMC11055081 DOI: 10.3390/s24082625] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/11/2024] [Revised: 04/08/2024] [Accepted: 04/15/2024] [Indexed: 04/28/2024]
Abstract
Individuals with obstructive sleep apnea (OSA) face increased accident risks due to excessive daytime sleepiness. PERCLOS, a recognized drowsiness detection method, encounters challenges from image quality, eyewear interference, and lighting variations, impacting its performance, and requiring validation through physiological signals. We propose visual-based scoring using adaptive thresholding for eye aspect ratio with OpenCV for face detection and Dlib for eye detection from video recordings. This technique identified 453 drowsiness (PERCLOS ≥ 0.3 || CLOSDUR ≥ 2 s) and 474 wakefulness episodes (PERCLOS < 0.3 and CLOSDUR < 2 s) among fifty OSA drivers in a 50 min driving simulation while wearing six-channel EEG electrodes. Applying discrete wavelet transform, we derived ten EEG features, correlated them with visual-based episodes using various criteria, and assessed the sensitivity of brain regions and individual EEG channels. Among these features, theta-alpha-ratio exhibited robust mapping (94.7%) with visual-based scoring, followed by delta-alpha-ratio (87.2%) and delta-theta-ratio (86.7%). Frontal area (86.4%) and channel F4 (75.4%) aligned most episodes with theta-alpha-ratio, while frontal, and occipital regions, particularly channels F4 and O2, displayed superior alignment across multiple features. Adding frontal or occipital channels could correlate all episodes with EEG patterns, reducing hardware needs. Our work could potentially enhance real-time drowsiness detection reliability and assess fitness to drive in OSA drivers.
Collapse
Affiliation(s)
- Riaz Minhas
- College of Engineering, Koc University, Istanbul 34450, Turkey; (R.M.); (B.S.)
| | - Nur Yasin Peker
- Department of Mechatronics Engineering, Sakarya University of Applied Sciences, Sakarya 54050, Turkey;
| | - Mustafa Abdullah Hakkoz
- Graduate School of Computer Engineering, Istanbul Technical University, Istanbul 34469, Turkey;
| | - Semih Arbatli
- Graduate School of Health Sciences, Koc University, Istanbul 34010, Turkey;
| | - Yeliz Celik
- Research Center for Translational Medicine (KUTTAM), Koc University, Istanbul 34010, Turkey;
| | - Cigdem Eroglu Erdem
- Department of Electrical and Electronics Engineering, Ozyegin University, Istanbul 34794, Turkey;
| | - Beren Semiz
- College of Engineering, Koc University, Istanbul 34450, Turkey; (R.M.); (B.S.)
| | - Yuksel Peker
- Research Center for Translational Medicine (KUTTAM), Koc University, Istanbul 34010, Turkey;
- Department of Pulmonary Medicine, School of Medicine, Koc University, Istanbul 34010, Turkey
- Sahlgrenska Academy, University of Gothenburg, 40530 Gothenburg, Sweden
- School of Medicine, Lund University, 22185 Lund, Sweden
- School of Medicine, University of Pittsburgh, Pittsburgh, PA 15213, USA
| |
Collapse
|
48
|
Han X, Bai Z, Mogushi K, Hase T, Takeuchi K, Iida Y, Sumita YI, Wakabayashi N. Machine Learning Prediction of Tongue Pressure in Elderly Patients with Head and Neck Tumor: A Cross-Sectional Study. J Clin Med 2024; 13:2363. [PMID: 38673635 PMCID: PMC11051183 DOI: 10.3390/jcm13082363] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2024] [Revised: 04/05/2024] [Accepted: 04/07/2024] [Indexed: 04/28/2024] Open
Abstract
Background: This investigation sought to cross validate the predictors of tongue pressure recovery in elderly patients' post-treatment for head and neck tumors, leveraging advanced machine learning techniques. Methods: By employing logistic regression, support vector regression, random forest, and extreme gradient boosting, the study analyzed an array of variables including patient demographics, surgery types, dental health status, and age, drawn from comprehensive medical records and direct tongue pressure assessments. Results: Among the models, logistic regression emerged as the most effective, demonstrating an accuracy of 0.630 [95% confidence interval (CI): 0.370-0.778], F1 score of 0.688 [95% confidence interval (CI): 0.435-0.853], precision of 0.611 [95% confidence interval (CI): 0.313-0.801], recall of 0.786 [95% confidence interval (CI): 0.413-0.938] and an area under the receiver operating characteristic curve of 0.626 [95% confidence interval (CI): 0.409-0.806]. This model distinctly highlighted the significance of glossectomy (p = 0.039), the presence of functional teeth (p = 0.043), and the patient's age (p = 0.044) as pivotal factors influencing tongue pressure, setting the threshold for statistical significance at p < 0.05. Conclusions: The analysis underscored the critical role of glossectomy, the presence of functional natural teeth, and age as determinants of tongue pressure in logistics regression, with the presence of natural teeth and the tumor site located in the tongue consistently emerging as the key predictors across all computational models employed in this study.
Collapse
Affiliation(s)
- Xuewei Han
- Department of Advanced Prosthodontics, Graduate School, Medical and Dental Sciences, Tokyo Medical and Dental University, Tokyo 1138510, Japan; (X.H.); (Z.B.); (N.W.)
| | - Ziyi Bai
- Department of Advanced Prosthodontics, Graduate School, Medical and Dental Sciences, Tokyo Medical and Dental University, Tokyo 1138510, Japan; (X.H.); (Z.B.); (N.W.)
| | - Kaoru Mogushi
- Institute of Education, Tokyo Medical and Dental University, Tokyo 1138510, Japan; (K.M.); (T.H.)
| | - Takeshi Hase
- Institute of Education, Tokyo Medical and Dental University, Tokyo 1138510, Japan; (K.M.); (T.H.)
- Faculty of Pharmacy, Keio University, Tokyo 1088345, Japan
- Center for Mathematical Modelling and Data Science, Osaka University, Osaka 5608531, Japan
- The Systems Biology Institute, Tokyo 1410022, Japan
| | - Katsuyuki Takeuchi
- Institute of Education, Tokyo Medical and Dental University, Tokyo 1138510, Japan; (K.M.); (T.H.)
| | - Yoritsugu Iida
- Institute of Education, Tokyo Medical and Dental University, Tokyo 1138510, Japan; (K.M.); (T.H.)
| | - Yuka I. Sumita
- Department of Advanced Prosthodontics, Graduate School, Medical and Dental Sciences, Tokyo Medical and Dental University, Tokyo 1138510, Japan; (X.H.); (Z.B.); (N.W.)
- Department of Partial and Complete Denture, The Nippon Dental University School of Life Dentistry, Tokyo 1028159, Japan
| | - Noriyuki Wakabayashi
- Department of Advanced Prosthodontics, Graduate School, Medical and Dental Sciences, Tokyo Medical and Dental University, Tokyo 1138510, Japan; (X.H.); (Z.B.); (N.W.)
| |
Collapse
|
49
|
Zygomalas A, Kalles D, Katsiakis N, Anastasopoulos A, Skroubis G. Artificial Intelligence Assisted Recognition of Anatomical Landmarks and Laparoscopic Instruments in Transabdominal Preperitoneal Inguinal Hernia Repair. Surg Innov 2024; 31:178-184. [PMID: 38195405 DOI: 10.1177/15533506241226502] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2024]
Abstract
Laparoscopic TAPP (Trans-Abdominal PrePeritoneal) is a minimally invasive surgical procedure used to repair inguinal hernias. Arguably, one important aspect to TAPP hernia repair is the identification of anatomical landmarks and the correct use of various laparoscopic instruments. There are very few studies regarding the use of artificial intelligence in laparoscopic inguinal hernia repair and more specifically in TAPP. The aim of this study is to evaluate the feasibility and usefulness of AI in the recognition of anatomical landmarks and tools in laparoscopic TAPP videos. Imaging data have been exported from 20 Laparoscopic TAPP videos that have been performed by the authors and another 5 high quality TAPP videos from the internet (free access) performed by other surgeons. In total 1095 selected images have been exported for annotation. To accomplish the AI result of computer vision, the YOLOv8 model of deep learning was used. In total 2716 segmented areas of interest have been exported. The AI model was able to detect the various classes with a maximum F1 score of .82 when the confidence threshold was set to .406. The mAP50 was .873 for all classes. The Precision was above 50% when the confidence was over 10%. The Recall rate was above 50% when confidence was less than 70%. These results suggest that the model is effective at balancing precision and recall, capturing both true positives and minimizing false negatives. Artificial Intelligence recognition of anatomical landmarks and laparoscopic instruments in TAPP is feasible with acceptable success rates.
Collapse
Affiliation(s)
- Apollon Zygomalas
- AI Assisted Laparoscopy Research Program, Hellenic Open University, Patras, Greece
- Department of Minimally Invasive Surgery, Olympion General Clinic of Patras, Patras, Greece
| | - Dimitrios Kalles
- AI Assisted Laparoscopy Research Program, Hellenic Open University, Patras, Greece
| | - Nikolaos Katsiakis
- Department of Minimally Invasive Surgery, Olympion General Clinic of Patras, Patras, Greece
| | - Andreas Anastasopoulos
- Department of Minimally Invasive Surgery, Olympion General Clinic of Patras, Patras, Greece
| | - Georgios Skroubis
- AI Assisted Laparoscopy Research Program, Hellenic Open University, Patras, Greece
- Department of General Surgery, University Hospital of Patras, Patras, Greece
| |
Collapse
|
50
|
Xiong D, Marcus M, Maida CA, Lyu Y, Hays RD, Wang Y, Shen J, Spolsky VW, Lee SY, Crall JJ, Liu H. Development of short forms for screening children's dental caries and urgent treatment needs using item response theory and machine learning methods. PLoS One 2024; 19:e0299947. [PMID: 38517846 PMCID: PMC10959356 DOI: 10.1371/journal.pone.0299947] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2021] [Accepted: 02/20/2024] [Indexed: 03/24/2024] Open
Abstract
OBJECTIVES Surveys can assist in screening oral diseases in populations to enhance the early detection of disease and intervention strategies for children in need. This paper aims to develop short forms of child-report and proxy-report survey screening instruments for active dental caries and urgent treatment needs in school-age children. METHODS This cross-sectional study recruited 497 distinct dyads of children aged 8-17 and their parents between 2015 to 2019 from 14 dental clinics and private practices in Los Angeles County. We evaluated responses to 88 child-reported and 64 proxy-reported oral health questions to select and calibrate short forms using Item Response Theory. Seven classical Machine Learning algorithms were employed to predict children's active caries and urgent treatment needs using the short forms together with family demographic variables. The candidate algorithms include CatBoost, Logistic Regression, K-Nearest Neighbors (KNN), Naïve Bayes, Neural Network, Random Forest, and Support Vector Machine. Predictive performance was assessed using repeated 5-fold nested cross-validations. RESULTS We developed and calibrated four ten-item short forms. Naïve Bayes outperformed other algorithms with the highest median of cross-validated area under the ROC curve. The means of best testing sensitivities and specificities using both child-reported and proxy-reported responses were 0.84 and 0.30 for active caries, and 0.81 and 0.31 for urgent treatment needs respectively. Models incorporating both response types showed a slightly higher predictive accuracy than those relying on either child-reported or proxy-reported responses. CONCLUSIONS The combination of Item Response Theory and Machine Learning algorithms yielded potentially useful screening instruments for both active caries and urgent treatment needs of children. The survey screening approach is relatively cost-effective and convenient when dealing with oral health assessment in large populations. Future studies are needed to further leverage the customize and refine the instruments based on the estimated item characteristics for specific subgroups of the populations to enhance predictive accuracy.
Collapse
Affiliation(s)
- Di Xiong
- Section of Public and Population Health, Division of Oral and Systemic Health Sciences, School of Dentistry, University of California, Los Angeles, Los Angeles, California, United States of America
- Department of Biostatistics, Fielding School of Public Health, University of California, Los Angeles, Los Angeles, California, United States of America
| | - Marvin Marcus
- Section of Public and Population Health, Division of Oral and Systemic Health Sciences, School of Dentistry, University of California, Los Angeles, Los Angeles, California, United States of America
| | - Carl A. Maida
- Section of Public and Population Health, Division of Oral and Systemic Health Sciences, School of Dentistry, University of California, Los Angeles, Los Angeles, California, United States of America
| | - Yuetong Lyu
- Section of Public and Population Health, Division of Oral and Systemic Health Sciences, School of Dentistry, University of California, Los Angeles, Los Angeles, California, United States of America
- Department of Biostatistics, Fielding School of Public Health, University of California, Los Angeles, Los Angeles, California, United States of America
| | - Ron D. Hays
- Division of General Internal Medicine and Health Services Research, Department of Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, California, United States of America
- Department of Health Policy and Management, Fielding School of Public Health, University of California, Los Angeles, Los Angeles, California, United States of America
- RAND Corporation, Santa Monica, California, United States of America
| | - Yan Wang
- Section of Public and Population Health, Division of Oral and Systemic Health Sciences, School of Dentistry, University of California, Los Angeles, Los Angeles, California, United States of America
| | - Jie Shen
- Section of Public and Population Health, Division of Oral and Systemic Health Sciences, School of Dentistry, University of California, Los Angeles, Los Angeles, California, United States of America
| | - Vladimir W. Spolsky
- Section of Public and Population Health, Division of Oral and Systemic Health Sciences, School of Dentistry, University of California, Los Angeles, Los Angeles, California, United States of America
| | - Steve Y. Lee
- Sectopm of Interdisciplinary Dentistry, Division of Diagnostic and Surgical Sciences, School of Dentistry, University of California, Los Angeles, Los Angeles, California, United States of America
| | - James J. Crall
- Section of Public and Population Health, Division of Oral and Systemic Health Sciences, School of Dentistry, University of California, Los Angeles, Los Angeles, California, United States of America
| | - Honghu Liu
- Section of Public and Population Health, Division of Oral and Systemic Health Sciences, School of Dentistry, University of California, Los Angeles, Los Angeles, California, United States of America
- Department of Biostatistics, Fielding School of Public Health, University of California, Los Angeles, Los Angeles, California, United States of America
- Division of General Internal Medicine and Health Services Research, Department of Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, California, United States of America
| |
Collapse
|