Reference Citation Analysis

For: Kuhle S, Maguire B, Zhang H, Hamilton D, Allen AC, Joseph KS, Allen VM. Comparison of logistic regression with machine learning methods for the prediction of fetal growth abnormalities: a retrospective cohort study. BMC Pregnancy Childbirth 2018;18:333. [PMID: 30111303 PMCID: PMC6094446 DOI: 10.1186/s12884-018-1971-2] [Citation(s) in RCA: 60] [Impact Index Per Article: 10.0] [Received: 02/26/2018] [Accepted: 08/07/2018] [Indexed: 11/10/2022] Open

Cited by Other Article(s)

Ewington L, Black N, Leeson C, Al Wattar BH, Quenby S. Multivariable prediction models for fetal macrosomia and large for gestational age: A systematic review. BJOG 2024;131:1591-1602. [PMID: 38465451 DOI: 10.1111/1471-0528.17802] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/10/2023] [Revised: 02/08/2024] [Accepted: 02/22/2024] [Indexed: 03/12/2024]

Abstract

BACKGROUND

The identification of large for gestational age (LGA) and macrosomic fetuses is essential for counselling and managing these pregnancies.

OBJECTIVES

To systematically review the literature for multivariable prediction models for LGA and macrosomia, assessing the performance, quality and applicability of the included models in clinical practice.

SEARCH STRATEGY

MEDLINE, EMBASE and Cochrane Library were searched until June 2022.

SELECTION CRITERIA

We included observational and experimental studies reporting the development and/or validation of any multivariable prediction model for fetal macrosomia and/or LGA. We excluded studies that used a single variable or did not evaluate model performance.

DATA COLLECTION AND ANALYSIS

Data were extracted using the Checklist for critical appraisal and data extraction for systematic reviews of prediction modelling studies (CHARMS). The model performance measures discrimination, calibration and validation were extracted. The quality and completeness of reporting within each study was assessed by its adherence to the TRIPOD (Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis) checklist. The risk of bias and applicability were measured using PROBAST (Prediction model Risk Of Bias Assessment Tool).

MAIN RESULTS

A total of 8442 citations were identified, with 58 included in the analysis: 32/58 (55.2%) developed, 21/58 (36.2%) developed and internally validated and 2/58 (3.4%) developed and externally validated a model. Only three studies externally validated pre-existing models. Macrosomia and LGA were defined differently across studies. In total, 111 multivariable prediction models were developed using 112 different variables. Model discrimination varied widely (area under the receiver operating characteristic curve [AUROC] 0.56-0.96) and few studies reported calibration (11/58, 19.0%). Only 5/58 (8.6%) studies had a low risk of bias.

CONCLUSIONS

There are currently no multivariable prediction models for macrosomia/LGA that are ready for clinical implementation.

Pirompud P, Sivapirunthep P, Punyapornwithaya V, Chaosap C. Machine learning predictive modeling for condemnation risk assessment in antibiotic-free raised broilers. Poult Sci 2024;103:104270. [PMID: 39260246 DOI: 10.1016/j.psj.2024.104270] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/14/2024] [Revised: 08/20/2024] [Accepted: 08/22/2024] [Indexed: 09/13/2024] Open

Allotey J, Archer L, Snell KIE, Coomar D, Massé J, Sletner L, Wolf H, Daskalakis G, Saito S, Ganzevoort W, Ohkuchi A, Mistry H, Farrar D, Mone F, Zhang J, Seed PT, Teede H, Da Silva Costa F, Souka AP, Smuk M, Ferrazzani S, Salvi S, Prefumo F, Gabbay-Benziv R, Nagata C, Takeda S, Sequeira E, Lapaire O, Cecatti JG, Morris RK, Baschat AA, Salvesen K, Smits L, Anggraini D, Rumbold A, van Gelder M, Coomarasamy A, Kingdom J, Heinonen S, Khalil A, Goffinet F, Haqnawaz S, Zamora J, Riley RD, Thangaratinam S, Kwong A, Savitri AI, Bhattacharya S, Uiterwaal CSPM, Staff AC, Andersen LB, Olive EL, Redman C, Macleod M, Thilaganathan B, Ramírez JA, Audibert F, Magnus PM, Jenum AK, McAuliffe FM, West J, Askie LM, Zimmerman PA, Riddell C, van de Post J, Illanes SE, Holzman C, van Kuijk SMJ, Carbillon L, Villa PM, Eskild A, Chappell L, Velauthar L, van Oostwaard M, Verlohren S, Poston L, Ferrazzi E, Vinter CA, Brown M, Vollebregt KC, Langenveld J, Widmer M, Haavaldsen C, Carroli G, Olsen J, Zavaleta N, Eisensee I, Vergani P, Lumbiganon P, Makrides M, Facchinetti F, Temmerman M, Gibson R, Frusca T, Norman JE, Figueiró-Filho EA, Laivuori H, Lykke JA, Conde-Agudelo A, Galindo A, Mbah A, Betran AP, Herraiz I, Trogstad L, Smith GGS, Steegers EAP, Salim R, Huang T, Adank A, Meschino WS, Browne JL, Allen RE, Klipstein-Grobusch K, Crowther CA, Jørgensen JS, Forest JC, Mol BW, Giguère Y, Kenny LC, Odibo AO, Myers J, Yeo S, McCowan L, Pajkrt E, Haddad BG, Dekker G, Kleinrouweler EC, LeCarpentier É, Roberts CT, Groen H, Skråstad RB, Eero K, Pilalis A, Souza RT, Hawkins LA, Figueras F, Crovetto F. Development and validation of a prognostic model to predict birth weight: individual participant data meta-analysis. BMJ MEDICINE 2024;3:e000784. 
[PMID: 39184566 PMCID: PMC11344865 DOI: 10.1136/bmjmed-2023-000784] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 10/12/2023] [Accepted: 06/04/2024] [Indexed: 08/27/2024]

Abstract

Objective

To predict birth weight at various potential gestational ages of delivery based on data routinely available at the first antenatal visit.

Design

Individual participant data meta-analysis.

Data sources

Individual participant data of four cohorts (237 228 pregnancies) from the International Prediction of Pregnancy Complications (IPPIC) network dataset.

Eligibility criteria for selecting studies

Studies in the IPPIC network were identified by searching major databases for studies reporting risk factors for adverse pregnancy outcomes, such as pre-eclampsia, fetal growth restriction, and stillbirth, from database inception to August 2019. Data of four IPPIC cohorts (237 228 pregnancies) from the US (National Institute of Child Health and Human Development, 2018; 233 483 pregnancies), UK (Allen et al, 2017; 1045 pregnancies), Norway (STORK Groruddalen research programme, 2010; 823 pregnancies), and Australia (Rumbold et al, 2006; 1877 pregnancies) were included in the development of the model.

Results

The IPPIC birth weight model was developed with random intercept regression models with backward elimination for variable selection. Internal-external cross validation was performed to assess the study specific and pooled performance of the model, reported as calibration slope, calibration-in-the-large, and observed versus expected average birth weight ratio. Meta-analysis showed that the apparent performance of the model had good calibration (calibration slope 0.99, 95% confidence interval (CI) 0.88 to 1.10; calibration-in-the-large 44.5 g, -18.4 to 107.3) with an observed versus expected average birth weight ratio of 1.02 (95% CI 0.97 to 1.07). The proportion of variation in birth weight explained by the model (R²) was 46.9% (range 32.7-56.1% in each cohort). On internal-external cross validation, the model showed good calibration and predictive performance when validated in three cohorts with a calibration slope of 0.90 (Allen cohort), 1.04 (STORK Groruddalen cohort), and 1.07 (Rumbold cohort), calibration-in-the-large of -22.3 g (Allen cohort), -33.42 g (Rumbold cohort), and 86.4 g (STORK Groruddalen cohort), and observed versus expected ratio of 0.99 (Rumbold cohort), 1.00 (Allen cohort), and 1.03 (STORK Groruddalen cohort); respective pooled estimates were 1.00 (95% CI 0.78 to 1.23; calibration slope), 9.7 g (-154.3 to 173.8; calibration-in-the-large), and 1.00 (0.94 to 1.07; observed v expected ratio). The model predictions were more accurate (smaller mean square error) in the lower end of predicted birth weight, which is important in informing clinical decision making.

Conclusions

The IPPIC birth weight model allowed birth weight predictions for a range of possible gestational ages. The model explained about 50% of individual variation in birth weights, was well calibrated (especially in babies at high risk of fetal growth restriction and its complications), and showed promising performance in four different populations included in the individual participant data meta-analysis. Further research to examine the generalisability of performance in other countries, settings, and subgroups is required.

Trial registration

PROSPERO CRD42019135045.
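The three calibration measures named in the Results above can be illustrated for a continuous outcome such as birth weight. The sketch below is not the IPPIC authors' code; it assumes the common definitions (slope of observed regressed on predicted values, difference in means, and ratio of means), and the function name and the example weights are hypothetical.

```python
from statistics import mean


def calibration_summary(observed, predicted):
    """Return (slope, calibration_in_the_large, observed_expected_ratio).

    Assumed definitions for a continuous outcome:
      slope  = least-squares slope of observed regressed on predicted
      CITL   = mean(observed) - mean(predicted), in the outcome's units
      O/E    = mean(observed) / mean(predicted)
    """
    if len(observed) != len(predicted) or len(observed) < 2:
        raise ValueError("need two equal-length sequences with at least 2 values")
    mean_obs, mean_pred = mean(observed), mean(predicted)
    # Sum of squares of predictions and cross-products around the means.
    sxx = sum((p - mean_pred) ** 2 for p in predicted)
    if sxx == 0:
        raise ValueError("predictions are constant; slope is undefined")
    sxy = sum((p - mean_pred) * (o - mean_obs) for p, o in zip(predicted, observed))
    return sxy / sxx, mean_obs - mean_pred, mean_obs / mean_pred


if __name__ == "__main__":
    # Hypothetical birth weights in grams, for illustration only.
    obs = [2900, 3250, 3400, 3700, 4100]
    pred = [3000, 3200, 3450, 3600, 3950]
    slope, citl, oe = calibration_summary(obs, pred)
    print(f"slope={slope:.2f}  CITL={citl:.1f} g  O/E={oe:.3f}")
```

Under these definitions a slope near 1, a calibration-in-the-large near 0 g, and a ratio near 1 indicate good calibration, which is how the pooled estimates in the abstract are read.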

Liu L, Jung SH. Repeated Sieving for Prediction Model Building with High-Dimensional Data. J Pers Med 2024;14:769. [PMID: 39064023 PMCID: PMC11277592 DOI: 10.3390/jpm14070769] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/19/2024] [Revised: 07/11/2024] [Accepted: 07/16/2024] [Indexed: 07/28/2024] Open

Abstract

Background: The prediction of patients' outcomes is a key component in personalized medicine. Oftentimes, a prediction model is developed using a large number of candidate predictors, called high-dimensional data, including genomic data, lab tests, electronic health records, etc. Variable selection, also called dimension reduction, is a critical step in developing a prediction model using high-dimensional data. Methods: In this paper, we compare the variable selection and prediction performance of popular machine learning (ML) methods with our proposed method. LASSO is a popular ML method that selects variables by imposing an L1-norm penalty on the likelihood. By this approach, LASSO selects features based on the size of regression estimates, rather than their statistical significance. As a result, LASSO can miss significant features while it is known to over-select features. Elastic net (EN), another popular ML method, tends to select even more features than LASSO since it uses a combination of L1- and L2-norm penalties that is less strict than an L1-norm penalty. Insignificant features included in a fitted prediction model act like white noise, so that the fitted model will lose prediction accuracy. Furthermore, for the future use of a fitted prediction model, we have to collect the data of all the features included in the model, which is costly and may lower the accuracy of the data if there are too many features. Therefore, we propose an ML method, called repeated sieving, extending the standard regression methods with stepwise variable selection. By selecting features based on their statistical significance, it resolves the over-selection issue with high-dimensional data. Results: Through extensive numerical studies and real data examples, our results show that the repeated sieving method selects far fewer features than LASSO and EN, but has higher prediction accuracy than the existing ML methods. Conclusions: We conclude that our repeated sieving method performs well in both variable selection and prediction, and it saves the cost of future investigation on the selected factors.
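The abstract's remark that elastic net tends to keep more features than LASSO can be seen in a simplified setting. The sketch below is an illustration only, not the authors' repeated sieving method: it assumes orthonormal predictors and a glmnet-style penalty λ[α|β| + (1-α)β²/2], under which each coefficient has a closed form. The function names and the input values are hypothetical.

```python
def soft_threshold(z, t):
    """Shrink z toward zero by t; values within [-t, t] become exactly 0."""
    if z > t:
        return z - t
    if z < -t:
        return z + t
    return 0.0


def lasso_coef(z, lam):
    # Pure L1 penalty: the full penalty weight lam is the selection threshold.
    return soft_threshold(z, lam)


def elastic_net_coef(z, lam, alpha):
    # Mixed penalty: only lam * alpha acts as the threshold (alpha < 1 lowers it),
    # while the L2 part shrinks surviving coefficients without zeroing them.
    return soft_threshold(z, lam * alpha) / (1.0 + lam * (1.0 - alpha))


# Hypothetical unpenalised estimates for four features (assumed orthonormal design).
z_values = [0.9, 0.4, -0.3, 0.1]
lam, alpha = 0.35, 0.5

lasso = [lasso_coef(z, lam) for z in z_values]
enet = [elastic_net_coef(z, lam, alpha) for z in z_values]
n_lasso = sum(c != 0.0 for c in lasso)
n_enet = sum(c != 0.0 for c in enet)

if __name__ == "__main__":
    print("LASSO keeps", n_lasso, "features; elastic net keeps", n_enet)
```

With the same λ, the elastic net threshold λα is lower than the LASSO threshold λ, so a feature with a moderate estimate can survive under elastic net while LASSO sets it to zero.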

Jazayeri SB, Maroufi SF, Akbarinejad S, Ghodsi Z, Rahimi-Movaghar V. Development of a regional-based predictive model of incidence of traumatic spinal cord injury using machine learning algorithms. World Neurosurg X 2024;23:100280. [PMID: 38497064 PMCID: PMC10943041 DOI: 10.1016/j.wnsx.2024.100280] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/05/2022] [Revised: 01/26/2024] [Accepted: 02/20/2024] [Indexed: 03/19/2024] Open

Van SN, Cui J, Wang Y, Jiang H, Sha F, Li Y. Identifying First-Trimester Risk Factors for SGA-LGA Using Weighted Inheritance Voting Ensemble Learning. Bioengineering (Basel) 2024;11:657. [PMID: 39061738 PMCID: PMC11274223 DOI: 10.3390/bioengineering11070657] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/27/2024] [Revised: 06/13/2024] [Accepted: 06/21/2024] [Indexed: 07/28/2024] Open

Abstract

The classification of fetuses as Small for Gestational Age (SGA) and Large for Gestational Age (LGA) is a critical aspect of neonatal health assessment. SGA and LGA, terms used to describe fetal weights that fall below or above the expected weights for Appropriate for Gestational Age (AGA) fetuses, indicate intrauterine growth restriction and excessive fetal growth, respectively. Early prediction and assessment of latent risk factors associated with these classifications can facilitate timely medical interventions, thereby optimizing the health outcomes for both the infant and the mother. This study aims to leverage first-trimester data to achieve these objectives. This study analyzed data from 7943 pregnant women, including 424 SGA, 928 LGA, and 6591 AGA cases, collected from 2015 to 2021 at the Third Affiliated Hospital of Sun Yat-sen University in Guangzhou, China. We propose a novel algorithm, named the Weighted Inheritance Voting Ensemble Learning Algorithm (WIVELA), to predict the classification of fetuses into SGA, LGA, and AGA categories based on biochemical parameters, maternal factors, and morbidity during pregnancy. Additionally, we proposed algorithms for relevance determination based on the classifier to ascertain the importance of features associated with SGA and LGA. The proposed classification solution demonstrated a notable average accuracy rate of 92.12% on 10-fold cross-validation over 100 loops, outperforming five state-of-the-art machine learning algorithms. Furthermore, we identified significant latent maternal risk factors directly associated with SGA and LGA conditions, such as weight change during the first trimester, prepregnancy weight, height, age, and obstetric factors like fetal growth restriction and birthing LGA baby. This study also underscored the importance of biomarker features at the end of the first trimester, including HDL, TG, OGTT-1h, OGTT-0h, OGTT-2h, TC, FPG, and LDL, which reflect the status of SGA or LGA fetuses. 
This study presents innovative solutions for classifying and identifying relevant attributes, offering valuable tools for medical teams in the clinical monitoring of fetuses predisposed to SGA and LGA conditions during the initial stage of pregnancy. These proposed solutions facilitate early intervention in nutritional care and prenatal healthcare, thereby contributing to enhanced strategies for managing the health and well-being of both the fetus and the expectant mother.

Bai X, Zhou Z, Zheng Z, Li Y, Liu K, Zheng Y, Yang H, Zhu H, Chen S, Pan H. Development and evaluation of machine learning models for predicting large-for-gestational-age newborns in women exposed to radiation prior to pregnancy. BMC Med Inform Decis Mak 2024;24:174. [PMID: 38902714 PMCID: PMC11188254 DOI: 10.1186/s12911-024-02556-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/17/2023] [Accepted: 05/28/2024] [Indexed: 06/22/2024] Open

Affiliation(s)

Xi Bai Key Laboratory of Endocrine Glucose & Lipids Metabolism and Brain Aging, Department of Endocrinology, Ministry of Education, Shandong Provincial Hospital Affiliated to Shandong First Medical University, Jinan, Shandong, China; Key Laboratory of Endocrinology of National Health Commission, Department of Endocrinology, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Chinese Academy of Medical Science and Peking Union Medical College, Beijing, 100730, China
Zhibo Zhou Key Laboratory of Endocrinology of National Health Commission, Department of Endocrinology, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Chinese Academy of Medical Science and Peking Union Medical College, Beijing, 100730, China
Zeyan Zheng Key Laboratory of Endocrinology of National Health Commission, Department of Endocrinology, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Chinese Academy of Medical Science and Peking Union Medical College, Beijing, 100730, China
Yansheng Li DHC Mediway Technology CO., Ltd, Beijing, China
Kejia Liu DHC Mediway Technology CO., Ltd, Beijing, China
Yuanjun Zheng DHC Mediway Technology CO., Ltd, Beijing, China
Hongbo Yang Key Laboratory of Endocrinology of National Health Commission, Department of Endocrinology, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Chinese Academy of Medical Science and Peking Union Medical College, Beijing, 100730, China
Huijuan Zhu Key Laboratory of Endocrinology of National Health Commission, Department of Endocrinology, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Chinese Academy of Medical Science and Peking Union Medical College, Beijing, 100730, China
Shi Chen Key Laboratory of Endocrinology of National Health Commission, Department of Endocrinology, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Chinese Academy of Medical Science and Peking Union Medical College, Beijing, 100730, China.
Hui Pan Key Laboratory of Endocrinology of National Health Commission, Department of Endocrinology, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Chinese Academy of Medical Science and Peking Union Medical College, Beijing, 100730, China.

Wu H, Chen Z, Gu J, Jiang Y, Gao S, Chen W, Miao C. Predicting Chronic Pain and Treatment Outcomes Using Machine Learning Models Based on High-dimensional Clinical Data From a Large Retrospective Cohort. Clin Ther 2024;46:490-498. [PMID: 38824080 DOI: 10.1016/j.clinthera.2024.04.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/29/2023] [Revised: 04/13/2024] [Accepted: 04/27/2024] [Indexed: 06/03/2024]

Karabacak M, Bhimani AD, Schupper AJ, Carr MT, Steinberger J, Margetis K. Machine learning models on a web application to predict short-term postoperative outcomes following anterior cervical discectomy and fusion. BMC Musculoskelet Disord 2024;25:401. [PMID: 38773464 PMCID: PMC11110429 DOI: 10.1186/s12891-024-07528-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/17/2024] [Accepted: 05/15/2024] [Indexed: 05/23/2024] Open

Abstract

BACKGROUND

The frequency of anterior cervical discectomy and fusion (ACDF) has increased up to 400% since 2011, underscoring the need to preoperatively anticipate adverse postoperative outcomes given the procedure's expanding use. Our study aims to accomplish two goals: firstly, to develop a suite of explainable machine learning (ML) models capable of predicting adverse postoperative outcomes following ACDF surgery, and secondly, to embed these models in a user-friendly web application, demonstrating their potential utility.

METHODS

We utilized data from the National Surgical Quality Improvement Program database to identify patients who underwent ACDF surgery. The outcomes of interest were four short-term postoperative adverse outcomes: prolonged length of stay (LOS), non-home discharges, 30-day readmissions, and major complications. We utilized five ML algorithms - TabPFN, TabNET, XGBoost, LightGBM, and Random Forest - coupled with the Optuna optimization library for hyperparameter tuning. To bolster the interpretability of our models, we employed SHapley Additive exPlanations (SHAP) for evaluating predictor variables' relative importance and used partial dependence plots to illustrate the impact of individual variables on the predictions generated by our top-performing models. We visualized model performance using receiver operating characteristic (ROC) curves and precision-recall curves (PRC). Quantitative metrics calculated were the area under the ROC curve (AUROC), balanced accuracy, weighted area under the PRC (AUPRC), weighted precision, and weighted recall. Models with the highest AUROC values were selected for inclusion in a web application.

RESULTS

The analysis included 57,760 patients for prolonged LOS [11.1% with prolonged LOS], 57,780 for non-home discharges [3.3% non-home discharges], 57,790 for 30-day readmissions [2.9% readmitted], and 57,800 for major complications [1.4% with major complications]. The top-performing models, which were the ones built with the Random Forest algorithm, yielded mean AUROCs of 0.776, 0.846, 0.775, and 0.747 for predicting prolonged LOS, non-home discharges, readmissions, and complications, respectively.

CONCLUSIONS

Our study employs advanced ML methodologies to enhance the prediction of adverse postoperative outcomes following ACDF. We designed an accessible web application to integrate these models into clinical practice. Our findings affirm that ML tools serve as vital supplements in risk stratification, facilitating the prediction of diverse outcomes and enhancing patient counseling for ACDF.

Wahbah M, Zitouni MS, Al Sakaji R, Funamoto K, Widatalla N, Krishnan A, Kimura Y, Khandoker AH. A deep learning framework for noninvasive fetal ECG signal extraction. Front Physiol 2024;15:1329313. [PMID: 38711954 PMCID: PMC11073781 DOI: 10.3389/fphys.2024.1329313] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/28/2023] [Accepted: 03/22/2024] [Indexed: 05/08/2024] Open

Pirompud P, Sivapirunthep P, Punyapornwithaya V, Chaosap C. Application of machine learning algorithms to predict dead on arrival of broiler chickens raised without antibiotic program. Poult Sci 2024;103:103504. [PMID: 38335671 PMCID: PMC10864801 DOI: 10.1016/j.psj.2024.103504] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/03/2023] [Revised: 01/20/2024] [Accepted: 01/23/2024] [Indexed: 02/12/2024] Open

Abstract

Understanding the factors of dead-on-arrival (DOA) incidents during pre-slaughter handling is crucial for informed decision-making, improving broiler welfare, and optimizing farm profitability. In this study, 3 different machine learning (ML) algorithms - least absolute shrinkage and selection operator (LASSO), classification tree (CT), and random forest (RF) - were used together with 4 sampling techniques to optimize imbalanced data. The dataset comes from 22,115 broiler truckloads from a large producer in Thailand (2021-2022) and includes 14 independent variables covering the rearing, catching, and transportation stages. The study focuses on DOA% in the range of 0.10 to 1.20%, with a threshold for high DOA% above 0.3%, and records DOA% per truckload during pre-slaughter ante-mortem inspection. With a high DOA rate of 25.2%, the imbalanced dataset prompts the implementation of 4 methods to tune the imbalance parameters: random over sampling (ROS), random under sampling (RUS), both sampling (BOTH), and synthetic sampling or random over sampling example (ROSE). The aim is to improve the performance of the prediction model in classifying and predicting high DOA%. The comparative analysis of the different error metrics shows that RF outperforms the other models in a balanced dataset. In particular, RUS shows a significant improvement in prediction performance across all models compared to the original unbalanced dataset. The identification of the 4 most important variables for predicting high DOA percentages - mortality and culling rate, rearing stocking density, season, and mean body weight - emphasizes their importance for broiler production. This study provides valuable insights into the prediction of DOA status using an ML approach and contributes to the development of more effective strategies to mitigate high DOA percentages in commercial broiler production.
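Random under sampling (RUS), the technique the abstract reports as giving the largest improvement, can be sketched in a few lines. This is a generic illustration rather than the study's implementation: the function name, the class labels, and the 25/75 split (chosen to resemble the reported 25.2% high-DOA rate) are assumptions.

```python
import random


def random_under_sample(rows, labels, seed=0):
    """Balance classes by randomly discarding rows from the larger classes.

    Every class is reduced to the size of the smallest class.
    Returns (sampled_rows, sampled_labels).
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    by_class = {}
    for row, label in zip(rows, labels):
        by_class.setdefault(label, []).append(row)
    n_min = min(len(group) for group in by_class.values())

    out_rows, out_labels = [], []
    for label, group in by_class.items():
        # Sample without replacement; the minority class is kept in full.
        for row in rng.sample(group, n_min):
            out_rows.append(row)
            out_labels.append(label)
    return out_rows, out_labels


if __name__ == "__main__":
    # Hypothetical truckload IDs: 25 "high" DOA loads and 75 "low" DOA loads.
    loads = list(range(100))
    status = ["high"] * 25 + ["low"] * 75
    sampled_loads, sampled_status = random_under_sample(loads, status)
    print(sampled_status.count("high"), sampled_status.count("low"))
```

Under sampling discards majority-class rows, so it trades training-set size for balance; random over sampling instead duplicates minority-class rows and keeps all of the original data.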

Halomoan Harahap T, Mansouri S, Salim Abdullah O, Uinarni H, Askar S, Jabbar TL, Hussien Alawadi A, Yaseen Hassan A. An artificial intelligence approach to predict infants' health status at birth. Int J Med Inform 2024;183:105338. [PMID: 38211423 DOI: 10.1016/j.ijmedinf.2024.105338] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/18/2023] [Revised: 12/28/2023] [Accepted: 12/31/2023] [Indexed: 01/13/2024]

Yu QY, Lin Y, Zhou YR, Yang XJ, Hemelaar J. Predicting risk of preterm birth in singleton pregnancies using machine learning algorithms. Front Big Data 2024;7:1291196. [PMID: 38495848 PMCID: PMC10941650 DOI: 10.3389/fdata.2024.1291196] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/08/2023] [Accepted: 02/12/2024] [Indexed: 03/19/2024] Open

Han W, Chen S, Kong L, Li Q, Zhang J, Shan G, He H. Lifestyle and clinical factors as predictive indicators of cardiometabolic multimorbidity in Chinese adults: Baseline findings of the Beijing Health Management Cohort (BHMC) study. Comput Biol Med 2024;168:107792. [PMID: 38070203 DOI: 10.1016/j.compbiomed.2023.107792] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/06/2023] [Revised: 11/23/2023] [Accepted: 11/28/2023] [Indexed: 01/10/2024]

Abstract

BACKGROUND

Cardiometabolic multimorbidity (CMM) is increasing globally as a result of lifestyle changes and the aging population. Even though previous studies have examined risk factors associated with CMM, there is a shortage of prediction models that can accurately identify high-risk individuals for early prevention.

METHODS

In the baseline survey of the Beijing Health Management Cohort, a total of 77,752 adults aged 18 years or older were recruited from 2020 to 2021. Data on lifestyle factors, clinical profiles, and diagnoses of diabetes, coronary heart disease, and stroke were collected. Logistic regression models were used to identify risk factors for CMM. Nomograms were developed to estimate an individual's probability of CMM based on the identified risk factors. The performance of the model was evaluated using the area under the receiver operating characteristic curve (AUC).

RESULTS

In men, the top three risk factors for CMM were hypertension (OR: 3.52, 95% CI: 2.97-4.18), eating very fast (3.43, 2.27-5.16), and dyslipidemia (2.59, 2.20-3.06). In women, hypertension showed the strongest association with CMM (3.62, 2.90-4.52), followed by night sleep duration less than 5 h per day (2.41, 1.67-3.50) and dyslipidemia (1.91, 1.58-2.32). The ORs for holding passive and depressed psychological traits were 1.49 (95% CI: 1.08-2.06) in men and 1.58 (1.03-2.43) in women. Prediction models incorporating these factors demonstrated good discrimination in the test set, with AUC 0.84 (0.83-0.86) for men and 0.90 (0.89-0.91) for women. The sex-specific nomograms were established based on selected predictors.

CONCLUSIONS

Modifiable lifestyle factors, metabolic health and psychological traits are associated with the risk of CMM. The developed prediction models and nomograms could facilitate early identification of individuals at high risk of CMM.

Peng C, Yang F, Yu J, Peng L, Zhang C, Chen C, Lin Z, Li Y, He J, Jin Z. Machine Learning Prediction Algorithm for In-Hospital Mortality following Body Contouring. Plast Reconstr Surg 2023;152:1103e-1113e. [PMID: 36940163 DOI: 10.1097/prs.0000000000010436] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 03/21/2023]

Khan W, Zaki N, Ahmad A, Masud MM, Govender R, Rojas-Perilla N, Ali L, Ghenimi N, Ahmed LA. Node embedding-based graph autoencoder outlier detection for adverse pregnancy outcomes. Sci Rep 2023;13:19817. [PMID: 37963898 PMCID: PMC10645849 DOI: 10.1038/s41598-023-46726-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/07/2023] [Accepted: 11/04/2023] [Indexed: 11/16/2023] Open

Wang K, Tang Y, Zhang F, Guo X, Gao L. Combined application of inflammation-related biomarkers to predict postoperative complications of rectal cancer patients: a retrospective study by machine learning analysis. Langenbecks Arch Surg 2023;408:400. [PMID: 37831218 DOI: 10.1007/s00423-023-03127-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/10/2023] [Accepted: 09/29/2023] [Indexed: 10/14/2023]

Miranda J, Paules C, Noell G, Youssef L, Paternina-Caicedo A, Crovetto F, Cañellas N, Garcia-Martín ML, Amigó N, Eixarch E, Faner R, Figueras F, Simões RV, Crispi F, Gratacós E. Similarity network fusion to identify phenotypes of small-for-gestational-age fetuses. iScience 2023;26:107620. [PMID: 37694157 PMCID: PMC10485038 DOI: 10.1016/j.isci.2023.107620] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/05/2023] [Revised: 04/19/2023] [Accepted: 08/09/2023] [Indexed: 09/12/2023] Open

Affiliation(s)

Jezid Miranda BCNatal – Barcelona Center for Maternal-Fetal and Neonatal Medicine (Hospital Clínic and Hospital Sant Joan de Deu), IDIBAPS, University of Barcelona, and Centre for Biomedical Research on Rare Diseases (CIBER-ER), Barcelona, Spain; Department of Obstetrics and Gynecology, Faculty of Medicine, Universidad de Cartagena, Cartagena de Indias, Colombia
Cristina Paules BCNatal – Barcelona Center for Maternal-Fetal and Neonatal Medicine (Hospital Clínic and Hospital Sant Joan de Deu), IDIBAPS, University of Barcelona, and Centre for Biomedical Research on Rare Diseases (CIBER-ER), Barcelona, Spain; Aragon Institute of Health Research (IIS Aragon), Obstetrics Department, Hospital Clínico Universitario Lozano Blesa, Zaragoza, Spain
Guillaume Noell University of Barcelona, Biomedicine Department, IDIBAPS, Centre for Biomedical Research on Respiratory Diseases (CIBERES), Barcelona, Spain
Lina Youssef BCNatal – Barcelona Center for Maternal-Fetal and Neonatal Medicine (Hospital Clínic and Hospital Sant Joan de Deu), IDIBAPS, University of Barcelona, and Centre for Biomedical Research on Rare Diseases (CIBER-ER), Barcelona, Spain
Angel Paternina-Caicedo Faculty of Medicine, Universidad del Sinu, Cartagena de Indias, Colombia
Francesca Crovetto BCNatal – Barcelona Center for Maternal-Fetal and Neonatal Medicine (Hospital Clínic and Hospital Sant Joan de Deu), IDIBAPS, University of Barcelona, and Centre for Biomedical Research on Rare Diseases (CIBER-ER), Barcelona, Spain
Nicolau Cañellas Metabolomics Platform, IISPV, DEEiA, Universidad Rovira i Virgili, Biomedical Research Centre in Diabetes and Associated Metabolic Disorders (CIBERDEM), Tarragona, Spain
María L. Garcia-Martín BIONAND, Andalusian Centre for Nanomedicine and Biotechnology, Junta de Andalucía, Universidad de Málaga, Málaga, Spain
Nuria Amigó Biosfer Teslab, Reus, Spain
Elisenda Eixarch BCNatal – Barcelona Center for Maternal-Fetal and Neonatal Medicine (Hospital Clínic and Hospital Sant Joan de Deu), IDIBAPS, University of Barcelona, and Centre for Biomedical Research on Rare Diseases (CIBER-ER), Barcelona, Spain
Rosa Faner University of Barcelona, Biomedicine Department, IDIBAPS, Centre for Biomedical Research on Respiratory Diseases (CIBERES), Barcelona, Spain
Francesc Figueras BCNatal – Barcelona Center for Maternal-Fetal and Neonatal Medicine (Hospital Clínic and Hospital Sant Joan de Deu), IDIBAPS, University of Barcelona, and Centre for Biomedical Research on Rare Diseases (CIBER-ER), Barcelona, Spain
Rui V. Simões BCNatal – Barcelona Center for Maternal-Fetal and Neonatal Medicine (Hospital Clínic and Hospital Sant Joan de Deu), IDIBAPS, University of Barcelona, and Centre for Biomedical Research on Rare Diseases (CIBER-ER), Barcelona, Spain Institute for Research & Innovation in Health (i3S), University of Porto, Porto, Portugal
Fàtima Crispi BCNatal – Barcelona Center for Maternal-Fetal and Neonatal Medicine (Hospital Clínic and Hospital Sant Joan de Deu), IDIBAPS, University of Barcelona, and Centre for Biomedical Research on Rare Diseases (CIBER-ER), Barcelona, Spain
Eduard Gratacós BCNatal – Barcelona Center for Maternal-Fetal and Neonatal Medicine (Hospital Clínic and Hospital Sant Joan de Deu), IDIBAPS, University of Barcelona, and Centre for Biomedical Research on Rare Diseases (CIBER-ER), Barcelona, Spain

Ozkara BB, Karabacak M, Kotha A, Cristiano BC, Wintermark M, Yedavalli VS. Development of machine learning models for predicting outcome in patients with distal medium vessel occlusions: a retrospective study. Quant Imaging Med Surg 2023;13:5815-5830. [PMID: 37711830 PMCID: PMC10498209 DOI: 10.21037/qims-23-154] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 02/08/2023] [Accepted: 06/30/2023] [Indexed: 09/16/2023]

Abstract

Background

While numerous prognostic factors have been reported for large vessel occlusion (LVO)-acute ischemic stroke (AIS) patients, the same cannot be said for distal medium vessel occlusions (DMVOs). We used machine learning (ML) algorithms to develop a model predicting the short-term outcome of AIS patients with DMVOs using demographic, clinical, and laboratory variables and baseline computed tomography (CT) perfusion (CTP) postprocessing quantitative parameters.

Methods

In this retrospective cohort study, consecutive patients with AIS admitted to two comprehensive stroke centers between January 1, 2017, and September 1, 2022, were screened. Demographic, clinical, and radiological data were extracted from electronic medical records. The clinical outcome was divided into two categories, with a cut-off defined by the median National Institutes of Health Stroke Scale (NIHSS) shift score. Data preprocessing involved addressing missing values through imputation, scaling with a robust scaler, normalization using min-max normalization, and encoding of categorical variables. The data were split into training and test sets (70:30), and recursive feature elimination (RFE) was employed for feature selection. For ML analyses, XGBoost, LightGBM, CatBoost, multi-layer perceptron, random forest, and logistic regression algorithms were utilized. Performance evaluation involved the receiver operating characteristic (ROC) curve, precision-recall curve (PRC), the area under these curves, accuracy, precision, recall, and Matthews correlation coefficient (MCC). The relative weights of predictor variables were examined using Shapley additive explanations (SHAP).

Results

Sixty-nine patients were included and divided into two groups: 35 patients with favorable outcomes and 34 patients with unfavorable outcomes. Utilizing ten selected features, the XGBoost algorithm achieved the best performance in predicting unfavorable outcomes, with an area under the ROC curve (AUROC) of 0.894 and an area under the PRC curve (AUPRC) of 0.756. The SHAP analysis ranked the following features in order of importance for the XGBoost model: mismatch volume, time-to-maximum of the tissue residue function (Tmax) >6 s, diffusion-weighted imaging (DWI) volume, neutrophil-to-platelet ratio (NPR), mean corpuscular volume (MCV), Tmax >10 s, hemoglobin, potassium, hypoperfusion index (HI), and Tmax >8 s.

Conclusions

Our ML models, trained on baseline quantitative laboratory and CT parameters, accurately predicted the short-term outcome in patients with DMVOs. These findings may aid clinicians in predicting prognosis and may be helpful for future research.
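
The Methods above list accuracy, precision, recall, and the Matthews correlation coefficient (MCC) among the evaluation metrics. As a minimal sketch of how these are derived from a binary confusion matrix (this is not the authors' code; the function names and the toy labels are illustrative assumptions):

```python
import math


def confusion_counts(y_true, y_pred):
    """Count TP, TN, FP, FN for binary labels, where 1 marks the positive class."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp, tn, fp, fn


def classification_metrics(y_true, y_pred):
    """Return accuracy, precision, recall, and MCC as a dictionary."""
    tp, tn, fp, fn = confusion_counts(y_true, y_pred)
    total = tp + tn + fp + fn
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # MCC = (TP*TN - FP*FN) / sqrt((TP+FP)(TP+FN)(TN+FP)(TN+FN));
    # by convention it is reported as 0 when the denominator is 0.
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "mcc": mcc}


if __name__ == "__main__":
    # Hypothetical labels, not data from the study.
    truth = [1, 1, 1, 0, 0, 0]
    preds = [1, 1, 0, 0, 0, 1]
    print(classification_metrics(truth, preds))
```

Unlike accuracy, MCC uses all four cells of the confusion matrix, which is one reason it is often reported alongside accuracy for small or imbalanced cohorts.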

Patterson JK, Thorsten VR, Eggleston B, Nolen T, Lokangaka A, Tshefu A, Goudar SS, Derman RJ, Chomba E, Carlo WA, Mazariegos M, Krebs NF, Saleem S, Goldenberg RL, Patel A, Hibberd PL, Esamai F, Liechty EA, Haque R, Petri B, Koso-Thomas M, McClure EM, Bose CL, Bauserman M. Building a predictive model of low birth weight in low- and middle-income countries: a prospective cohort study. BMC Pregnancy Childbirth 2023;23:600. [PMID: 37608358 PMCID: PMC10464177 DOI: 10.1186/s12884-023-05866-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/13/2023] [Accepted: 07/21/2023] [Indexed: 08/24/2023] Open

Abstract

BACKGROUND

Low birth weight (LBW, < 2500 g) infants are at significant risk for death and disability. Improving outcomes for LBW infants requires access to advanced neonatal care, which is a limited resource in low- and middle-income countries (LMICs). Predictive modeling might be useful in LMICs to identify mothers at high-risk of delivering a LBW infant to facilitate referral to centers capable of treating these infants.

METHODS

We developed predictive models for LBW using the NICHD Global Network for Women's and Children's Health Research Maternal and Newborn Health Registry. This registry enrolled pregnant women from research sites in the Democratic Republic of the Congo, Zambia, Kenya, Guatemala, India (2 sites: Belagavi, Nagpur), Pakistan, and Bangladesh between January 2017 and December 2020. We tested five predictive models: decision tree, random forest, logistic regression, K-nearest neighbor, and support vector machine.

RESULTS

We report a rate of LBW of 13.8% among the eight Global Network sites from 2017 to 2020, ranging from 3.8% (Kenya) to approximately 20% (in each Asian site). Of the five models tested, the logistic regression model performed best, with an area under the curve of 0.72, an accuracy of 61%, and a recall of 72%. All of the top-performing models identified clinical site, maternal weight, hypertensive disorders, severe antepartum hemorrhage, and antenatal care as key variables in predicting LBW.

CONCLUSIONS

Predictive modeling can identify women at high risk for delivering a LBW infant with good sensitivity using clinical variables available prior to delivery in LMICs. Such modeling is the first step in the development of a clinical decision support tool to assist providers in decision-making regarding referral of these women prior to delivery. Consistent referral of women at high-risk for delivering a LBW infant could have extensive public health consequences in LMICs by directing limited resources for advanced neonatal care to the infants at highest risk.

Affiliation(s)

Jackie K Patterson Department of Pediatrics, University of North Carolina at Chapel Hill School of Medicine, 101 Manning Dr, Chapel Hill, NC, 27514, USA.
Vanessa R Thorsten RTI International, Research Triangle Park, Durham, NC, USA
Barry Eggleston RTI International, Research Triangle Park, Durham, NC, USA
Tracy Nolen RTI International, Research Triangle Park, Durham, NC, USA
Adrien Lokangaka Kinshasa School of Public Health, University of Kinshasa, Kinshasa, Democratic Republic of the Congo
Antoinette Tshefu Kinshasa School of Public Health, University of Kinshasa, Kinshasa, Democratic Republic of the Congo
Shivaprasad S Goudar Jawaharlal Nehru Medical College, KLE University, Belagavi, India
Richard J Derman Department of Obstetrics and Gynecology, Thomas Jefferson University, Philadelphia, PA, USA
Elwyn Chomba University Teaching Hospital, Lusaka, Zambia
Waldemar A Carlo University of Alabama at Birmingham, Birmingham, AL, USA
Manolo Mazariegos Institute of Nutrition of Central America and Panama, Guatemala City, Guatemala
Nancy F Krebs School of Medicine, University of Colorado, Aurora, CO, USA
Sarah Saleem Department of Community Health Sciences, Aga Khan University, Karachi, Pakistan
Robert L Goldenberg Department of Obstetrics and Gynecology, Columbia University, New York, NY, USA
Archana Patel Lata Medical Research Foundation, Nagpur & Datta Meghe Institute of Medical Sciences, Sawangi, India
Patricia L Hibberd School of Public Health, Boston University, Boston, MA, USA
Fabian Esamai Department of Child Health and Paediatrics, School of Medicine, Moi University, Eldoret, Kenya
Edward A Liechty School of Medicine, Indiana University, Indianapolis, IN, USA
Rashidul Haque International Centre for Diarrhoeal Disease Research, Dhaka, Bangladesh
Bill Petri Division of Infectious Diseases, University of Virginia, Charlottesville, VA, USA
Marion Koso-Thomas Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, MD, USA
Elizabeth M McClure RTI International, Research Triangle Park, Durham, NC, USA
Carl L Bose Department of Pediatrics, University of North Carolina at Chapel Hill School of Medicine, 101 Manning Dr, Chapel Hill, NC, 27514, USA
Melissa Bauserman Department of Pediatrics, University of North Carolina at Chapel Hill School of Medicine, 101 Manning Dr, Chapel Hill, NC, 27514, USA

Yan FJ, Chen XH, Quan XQ, Wang LL, Wei XY, Zhu JL. Development and validation of an interpretable machine learning model-Predicting mild cognitive impairment in a high-risk stroke population. Front Aging Neurosci 2023;15:1180351. [PMID: 37396650 PMCID: PMC10308219 DOI: 10.3389/fnagi.2023.1180351] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/06/2023] [Accepted: 05/30/2023] [Indexed: 07/04/2023] Open

Yalçın N, Kaşıkcı M, Çelik HT, Demirkan K, Yiğit Ş, Yurdakök M. Development and validation of machine learning-based clinical decision support tool for identifying malnutrition in NICU patients. Sci Rep 2023;13:5227. [PMID: 36997630 PMCID: PMC10063679 DOI: 10.1038/s41598-023-32570-z] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Received: 09/27/2022] [Accepted: 03/29/2023] [Indexed: 04/01/2023] Open

Sarkar S, Mukherjee A, Chakraborty M, Quamar MT, Duttagupta S, Bhattacharya A. Prediction of elevated groundwater fluoride across India using multi-model approach: insights on the influence of geologic and environmental factors. Environ Sci Pollut Res Int 2023;30:31998-32013. [PMID: 36459318 DOI: 10.1007/s11356-022-24328-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/08/2022] [Accepted: 11/16/2022] [Indexed: 06/17/2023]

Karabacak M, Margetis K. A Machine Learning-Based Online Prediction Tool for Predicting Short-Term Postoperative Outcomes Following Spinal Tumor Resections. Cancers (Basel) 2023;15:812. [PMID: 36765771 PMCID: PMC9913622 DOI: 10.3390/cancers15030812] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Received: 01/04/2023] [Revised: 01/24/2023] [Accepted: 01/26/2023] [Indexed: 01/31/2023] Open

Arowosegbe A, Oyelade T. Application of Natural Language Processing (NLP) in Detecting and Preventing Suicide Ideation: A Systematic Review. Int J Environ Res Public Health 2023;20:1514. [PMID: 36674270 PMCID: PMC9859480 DOI: 10.3390/ijerph20021514] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 12/08/2022] [Revised: 01/04/2023] [Accepted: 01/11/2023] [Indexed: 06/17/2023]

Du Y, Rafferty AR, McAuliffe FM, Mehegan J, Mooney C. Towards an explainable clinical decision support system for large-for-gestational-age births. PLoS One 2023;18:e0281821. [PMID: 36809384 PMCID: PMC9942967 DOI: 10.1371/journal.pone.0281821] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/30/2021] [Accepted: 01/31/2023] [Indexed: 02/23/2023] Open

Abstract

A myriad of maternal and neonatal complications can result from delivery of a large-for-gestational-age (LGA) infant. LGA birth rates have increased in many countries since the late 20th century, partially due to a rise in maternal body mass index, which is associated with LGA risk. The objective of the current study was to develop LGA prediction models for women with overweight and obesity for the purpose of clinical decision support in a clinical setting. Maternal characteristics, serum biomarkers and fetal anatomy scan measurements for 465 pregnant women with overweight and obesity before and at approximately 21 weeks gestation were obtained from the PEARS (Pregnancy Exercise and Nutrition with smart phone application support) study data. Random forest, support vector machine, adaptive boosting and extreme gradient boosting algorithms were applied with synthetic minority over-sampling technique to develop probabilistic prediction models. Two models were developed for use in different settings: a clinical setting for white women (AUC-ROC of 0.75); and a clinical setting for women of all ethnicity and regions (AUC-ROC of 0.57). Maternal age, mid upper arm circumference, white cell count at the first antenatal visit, fetal biometry and gestational age at fetal anatomy scan were found to be important predictors of LGA. Pobal HP deprivation index and fetal biometry centiles, which are population-specific, are also important. Moreover, we explained our models with Local Interpretable Model-agnostic Explanations (LIME) to improve explainability, which was proven effective by case studies. Our explainable models can effectively predict the probability of an LGA birth for women with overweight and obesity, and are anticipated to be useful to support clinical decision-making and for the development of early pregnancy intervention strategies to reduce pregnancy complications related to LGA.

Analytical Comparison of Risk Prediction Models for the Onset of Macrosomia Based on Three Statistical Methods. Dis Markers 2022;2022:9073043. [PMID: 36124028 PMCID: PMC9482546 DOI: 10.1155/2022/9073043] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/26/2022] [Revised: 08/29/2022] [Accepted: 09/01/2022] [Indexed: 11/18/2022]

Abstract

Background and Purpose

Fetal overgrowth can pose a serious threat to the safety of mother and child. Early identification of high-risk pregnant women, together with timely intervention and guidance during pregnancy, is of great value in preventing macrosomia and improving adverse maternal and infant outcomes. Current clinical methods for predicting macrosomia rely mainly on obstetric examination and imaging, but their accuracy is controversial, and there is no accepted method for accurately predicting macrosomia. We investigated the risk factors influencing the occurrence of macrosomia and established a prediction model to provide a reference basis for interventions to prevent macrosomia.

Method

The study group comprised 93 women, selected retrospectively, who were hospitalized in our hospital from March 2019 to May 2022 with a singleton pregnancy and delivered a macrosomic infant at term. The control group comprised 356 women who delivered a normal-size baby during the same period. Variables associated with the onset of macrosomia were screened from maternal medical records. Logistic regression, random forest, and CART decision tree models were developed using the screened variables as inputs and the presence of macrosomia as the outcome. The performance of the three models was evaluated by accuracy, precision, recall, F1 score, and the receiver operating characteristic (ROC) curve.

Result

Three risk prediction models for the onset of macrosomia (logistic regression, random forest, and decision tree) were developed, with accuracies of 0.904, 1.000, and 0.901 in the training set and 0.926, 0.582, and 0.852 in the validation set, respectively. The AUCs in the training set were 0.898, 1.000, and 0.789, and in the validation set were 0.906, 0.913, and 0.731, respectively. In general, the logistic regression model had the highest diagnostic efficiency, followed by the random forest model.

Conclusion

Logistic regression models have high application value in assessing the risk of macrosomia. The authors suggest that future studies and applications combine the advantages of logistic regression and random forest models to improve prediction of the risk of macrosomia.

Bai X, Zhou Z, Su M, Li Y, Yang L, Liu K, Yang H, Zhu H, Chen S, Pan H. Predictive models for small-for-gestational-age births in women exposed to pesticides before pregnancy based on multiple machine learning algorithms. Front Public Health 2022;10:940182. [PMID: 36003638 PMCID: PMC9394741 DOI: 10.3389/fpubh.2022.940182] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 05/10/2022] [Accepted: 07/19/2022] [Indexed: 11/25/2022] Open

Affiliation(s)

Xi Bai Key Laboratory of Endocrinology of National Health Commission, Department of Endocrinology, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Chinese Academy of Medical Science and Peking Union Medical College, Beijing, China
Zhibo Zhou Key Laboratory of Endocrinology of National Health Commission, Department of Endocrinology, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Chinese Academy of Medical Science and Peking Union Medical College, Beijing, China
Mingliang Su DHC Mediway Technology Co., Ltd, Beijing, China
Yansheng Li DHC Mediway Technology Co., Ltd, Beijing, China
Liuqing Yang DHC Mediway Technology Co., Ltd, Beijing, China
Kejia Liu DHC Mediway Technology Co., Ltd, Beijing, China
Hongbo Yang Key Laboratory of Endocrinology of National Health Commission, Department of Endocrinology, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Chinese Academy of Medical Science and Peking Union Medical College, Beijing, China
Huijuan Zhu Key Laboratory of Endocrinology of National Health Commission, Department of Endocrinology, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Chinese Academy of Medical Science and Peking Union Medical College, Beijing, China
Shi Chen Key Laboratory of Endocrinology of National Health Commission, Department of Endocrinology, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Chinese Academy of Medical Science and Peking Union Medical College, Beijing, China
Hui Pan Key Laboratory of Endocrinology of National Health Commission, Department of Endocrinology, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Chinese Academy of Medical Science and Peking Union Medical College, Beijing, China
*Correspondence: Hui Pan; Shi Chen

Infant birth weight estimation and low birth weight classification in United Arab Emirates using machine learning algorithms. Sci Rep 2022;12:12110. [PMID: 35840605 PMCID: PMC9287292 DOI: 10.1038/s41598-022-14393-6] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Received: 12/15/2021] [Accepted: 06/06/2022] [Indexed: 11/08/2022] Open

Abstract

Accurate prediction of a newborn’s birth weight (BW) is a crucial determinant to evaluate the newborn’s health and safety. Infants with low BW (LBW) are at a higher risk of serious short- and long-term health outcomes. Over the past decade, machine learning (ML) techniques have shown a successful breakthrough in the field of medical diagnostics. Various automated systems have been proposed that use maternal features for LBW prediction. However, each proposed system uses different maternal features for LBW classification and estimation. Therefore, this paper provides a detailed setup for BW estimation and LBW classification. Multiple subsets of features were combined to perform predictions with and without feature selection techniques. Furthermore, the synthetic minority oversampling technique was employed to oversample the minority class. The performance of 30 ML algorithms was evaluated for both infant BW estimation and LBW classification. Experiments were performed on a self-created dataset with 88 features. The dataset was obtained from 821 women from three hospitals in the United Arab Emirates. Different performance metrics, such as mean absolute error and mean absolute percent error, were used for BW estimation. Accuracy, precision, recall, F-scores, and confusion matrices were used for LBW classification. Extensive experiments performed using five-folds cross validation show that the best weight estimation was obtained using Random Forest algorithm with mean absolute error of 294.53 g while the best classification performance was obtained using Logistic Regression with SMOTE oversampling techniques that achieved accuracy, precision, recall and F1 score of 90.24%, 87.6%, 90.2% and 0.89, respectively. The results also suggest that features such as diabetes, hypertension, and gestational age, play a vital role in LBW classification.
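
The abstract above reports birth weight estimation error as mean absolute error (MAE) and mean absolute percent error. A minimal sketch of both metrics follows; it is not the authors' implementation, and the function names and the birth weights in grams are hypothetical values chosen only for illustration:

```python
def mean_absolute_error(actual, predicted):
    """Average absolute difference, in the same unit as the inputs (here grams)."""
    if len(actual) != len(predicted) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def mean_absolute_percent_error(actual, predicted):
    """Average absolute difference relative to the true value, as a percentage."""
    if len(actual) != len(predicted) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    if any(a == 0 for a in actual):
        raise ValueError("percent error is undefined when a true value is zero")
    ratios = (abs(a - p) / abs(a) for a, p in zip(actual, predicted))
    return 100.0 * sum(ratios) / len(actual)


if __name__ == "__main__":
    # Hypothetical true and estimated birth weights in grams.
    true_bw = [3000, 2500, 4000]
    estimated_bw = [3300, 2250, 4000]
    print(mean_absolute_error(true_bw, estimated_bw))
    print(mean_absolute_percent_error(true_bw, estimated_bw))
```

MAE is expressed in grams and so is directly comparable with the 294.53 g figure quoted above, while the percent error normalizes each miss by the infant's true weight.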

Peng L, Peng C, Yang F, Wang J, Zuo W, Cheng C, Mao Z, Jin Z, Li W. Machine learning approach for the prediction of 30-day mortality in patients with sepsis-associated encephalopathy. BMC Med Res Methodol 2022;22:183. [PMID: 35787248 PMCID: PMC9252033 DOI: 10.1186/s12874-022-01664-z] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Received: 02/08/2022] [Accepted: 06/23/2022] [Indexed: 11/10/2022] Open

Abstract

Objective

Our study aimed to identify predictors as well as develop machine learning (ML) models to predict the risk of 30-day mortality in patients with sepsis-associated encephalopathy (SAE).

Materials and methods

ML models were developed and validated using the public Medical Information Mart for Intensive Care (MIMIC)-IV database. Models were compared by the area under the curve (AUC), accuracy, sensitivity, specificity, positive and negative predictive values, and the Hosmer–Lemeshow goodness-of-fit test.

Results

Of 6994 patients in MIMIC-IV included in the final cohort, a total of 1232 (17.62%) patients died following SAE. Recursive feature elimination (RFE) selected 15 variables, including acute physiology score III (APSIII), Glasgow coma score (GCS), sepsis related organ failure assessment (SOFA), Charlson comorbidity index (CCI), red blood cell volume distribution width (RDW), blood urea nitrogen (BUN), age, respiratory rate, PaO₂, temperature, lactate, creatinine (CRE), malignant cancer, metastatic solid tumor, and platelet (PLT). In the validation cohort, all ML approaches had higher discriminative ability than the bagged trees (BT) model, although the difference was not statistically significant. Furthermore, in terms of calibration performance, the artificial neural network (NNET), logistic regression (LR), and adaptive boosting (Ada) models had good calibration, namely a high accuracy of prediction, with P-values of 0.831, 0.119, and 0.129, respectively.

Conclusions

The ML models, as demonstrated by our study, can be used to evaluate the prognosis of SAE patients in the intensive care unit (ICU). An online calculator could facilitate the sharing of predictive models.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12874-022-01664-z.

Exploring the predictive capability of machine learning models in identifying foot and mouth disease outbreak occurrences in cattle farms in an endemic setting of Thailand. Prev Vet Med 2022;207:105706. [DOI: 10.1016/j.prevetmed.2022.105706] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/10/2022] [Revised: 06/09/2022] [Accepted: 07/01/2022] [Indexed: 11/20/2022]

Iwama N, Obara T, Ishikuro M, Murakami K, Ueno F, Noda A, Onuma T, Matsuzaki F, Hoshiai T, Saito M, Metoki H, Sugawara J, Yaegashi N, Kuriyama S. Risk scores for predicting small for gestational age infants in Japan: The TMM birthree cohort study. Sci Rep 2022;12:8921. [PMID: 35618764 PMCID: PMC9135745 DOI: 10.1038/s41598-022-12892-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/27/2021] [Accepted: 05/16/2022] [Indexed: 11/30/2022] Open

Affiliation(s)

Noriyuki Iwama Department of Obstetrics and Gynecology, Tohoku University Hospital, 1-1, Seiryomachi, Sendai, Miyagi, 980-8574, Japan.,Division of Molecular Epidemiology, Department of Preventive Medicine and Epidemiology, Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
Taku Obara Division of Molecular Epidemiology, Department of Preventive Medicine and Epidemiology, Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan.,Division of Molecular Epidemiology, Graduate School of Medicine, Tohoku University, Sendai, Japan.,Department of Pharmaceutical Sciences, Tohoku University Hospital, Sendai, Miyagi, Japan
Mami Ishikuro Division of Molecular Epidemiology, Department of Preventive Medicine and Epidemiology, Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan.,Division of Molecular Epidemiology, Graduate School of Medicine, Tohoku University, Sendai, Japan
Keiko Murakami Division of Molecular Epidemiology, Department of Preventive Medicine and Epidemiology, Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan.,Division of Molecular Epidemiology, Graduate School of Medicine, Tohoku University, Sendai, Japan
Fumihiko Ueno Division of Molecular Epidemiology, Department of Preventive Medicine and Epidemiology, Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan.,Division of Molecular Epidemiology, Graduate School of Medicine, Tohoku University, Sendai, Japan
Aoi Noda Division of Molecular Epidemiology, Department of Preventive Medicine and Epidemiology, Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan.,Division of Molecular Epidemiology, Graduate School of Medicine, Tohoku University, Sendai, Japan.,Department of Pharmaceutical Sciences, Tohoku University Hospital, Sendai, Miyagi, Japan
Tomomi Onuma Division of Molecular Epidemiology, Department of Preventive Medicine and Epidemiology, Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan.,Division of Molecular Epidemiology, Graduate School of Medicine, Tohoku University, Sendai, Japan
Fumiko Matsuzaki Division of Molecular Epidemiology, Department of Preventive Medicine and Epidemiology, Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
Tetsuro Hoshiai Department of Obstetrics and Gynecology, Tohoku University Hospital, 1-1, Seiryomachi, Sendai, Miyagi, 980-8574, Japan
Masatoshi Saito Department of Obstetrics and Gynecology, Tohoku University Hospital, 1-1, Seiryomachi, Sendai, Miyagi, 980-8574, Japan.,Department of Maternal and Fetal Therapeutics, Tohoku University Graduate School of Medicine, Sendai, Miyagi, Japan
Hirohito Metoki Tohoku Medical Megabank Organization, Tohoku University, Sendai, Miyagi, Japan.,Division of Public Health, Hygiene and Epidemiology, Tohoku Medical Pharmaceutical University, Sendai, Miyagi, Japan
Junichi Sugawara Department of Obstetrics and Gynecology, Tohoku University Hospital, 1-1, Seiryomachi, Sendai, Miyagi, 980-8574, Japan.,Tohoku Medical Megabank Organization, Tohoku University, Sendai, Miyagi, Japan.,Environment and Genome Research Center, Tohoku University Graduate School of Medicine, Sendai, Miyagi, Japan
Nobuo Yaegashi Department of Obstetrics and Gynecology, Tohoku University Hospital, 1-1, Seiryomachi, Sendai, Miyagi, 980-8574, Japan.,Tohoku Medical Megabank Organization, Tohoku University, Sendai, Miyagi, Japan.,Environment and Genome Research Center, Tohoku University Graduate School of Medicine, Sendai, Miyagi, Japan.,Department of Obstetrics and Gynecology, Tohoku University Graduate School of Medicine, Sendai, Miyagi, Japan
Shinichi Kuriyama Division of Molecular Epidemiology, Department of Preventive Medicine and Epidemiology, Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan.,Division of Molecular Epidemiology, Graduate School of Medicine, Tohoku University, Sendai, Japan.,International Research Institute of Disaster Science, Tohoku University, Sendai, Miyagi, Japan

Development and Evaluation of a Machine Learning Prediction Model for Small-for-Gestational-Age Births in Women Exposed to Radiation before Pregnancy. J Pers Med 2022;12:jpm12040550. [PMID: 35455666 PMCID: PMC9031835 DOI: 10.3390/jpm12040550] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2022] [Revised: 03/15/2022] [Accepted: 03/16/2022] [Indexed: 12/22/2022] Open

Perichart-Perera O, Avila-Sosa V, Solis-Paredes JM, Montoya-Estrada A, Reyes-Muñoz E, Rodríguez-Cano AM, González-Leyva CP, Sánchez-Martínez M, Estrada-Gutierrez G, Irles C. Vitamin D Deficiency, Excessive Gestational Weight Gain, and Oxidative Stress Predict Small for Gestational Age Newborns Using an Artificial Neural Network Model. Antioxidants (Basel) 2022;11:574. [PMID: 35326224 PMCID: PMC8944993 DOI: 10.3390/antiox11030574] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2022] [Revised: 03/10/2022] [Accepted: 03/14/2022] [Indexed: 02/04/2023] Open

Dhombres F, Bonnard J, Bailly K, Maurice P, Papageorghiou A, Jouannic JM. Contributions of artificial intelligence reported in Obstetrics and Gynecology journals: a systematic review. J Med Internet Res 2022;24:e35465. [PMID: 35297766 PMCID: PMC9069308 DOI: 10.2196/35465] [Citation(s) in RCA: 22] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2021] [Revised: 02/11/2022] [Accepted: 03/15/2022] [Indexed: 11/13/2022] Open

Abstract

Background

The applications of artificial intelligence (AI) processes have grown significantly in all medical disciplines during the last decades. Two main types of AI have been applied in medicine: symbolic AI (eg, knowledge base and ontologies) and nonsymbolic AI (eg, machine learning and artificial neural networks). Consequently, AI has also been applied across most obstetrics and gynecology (OB/GYN) domains, including general obstetrics, gynecology surgery, fetal ultrasound, and assisted reproductive medicine, among others.

Objective

The aim of this study was to provide a systematic review to establish the actual contributions of AI reported in OB/GYN discipline journals.

Methods

The PubMed database was searched for citations indexed with “artificial intelligence” and at least one of the following medical subject heading (MeSH) terms between January 1, 2000, and April 30, 2020: “obstetrics”; “gynecology”; “reproductive techniques, assisted”; or “pregnancy.” All publications in OB/GYN core disciplines journals were considered. The selection of journals was based on disciplines defined in Web of Science. The publications were excluded if no AI process was used in the study. Review, editorial, and commentary articles were also excluded. The study analysis comprised (1) classification of publications into OB/GYN domains, (2) description of AI methods, (3) description of AI algorithms, (4) description of data sets, (5) description of AI contributions, and (6) description of the validation of the AI process.

Results

The PubMed search retrieved 579 citations and 66 publications met the selection criteria. All OB/GYN subdomains were covered: obstetrics (41%, 27/66), gynecology (3%, 2/66), assisted reproductive medicine (33%, 22/66), early pregnancy (2%, 1/66), and fetal medicine (21%, 14/66). Both machine learning methods (39/66) and knowledge base methods (25/66) were represented. Machine learning used imaging, numerical, and clinical data sets. Knowledge base methods used mostly omics data sets. The actual contributions of AI were method/algorithm development (53%, 35/66), hypothesis generation (42%, 28/66), or software development (3%, 2/66). Validation was performed on one data set (86%, 57/66) and no external validation was reported. We observed a general rising trend in publications related to AI in OB/GYN over the last two decades. Most of these publications (82%, 54/66) remain out of the scope of the usual OB/GYN journals.

Conclusions

In OB/GYN discipline journals, mostly preliminary work (eg, proof-of-concept algorithm or method) in AI applied to this discipline is reported, and clinical validation remains an unmet prerequisite. Improvement driven by new AI research guidelines is expected. However, these guidelines cover only a part of the AI approaches (nonsymbolic) reported in this review; hence, updates need to be considered.

Ayat M, Kim B, Kang CW. A new data mining-based framework to predict the success of private participation in infrastructure projects. INTERNATIONAL JOURNAL OF CONSTRUCTION MANAGEMENT 2022. [DOI: 10.1080/15623599.2022.2045862] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]

Hu MH, Yen HK, Chen IH, Wu CH, Chen CW, Yang JJ, Wang ZY, Yen MH, Yang SH, Lin WH. Decreased psoas muscle area is a prognosticator for 90-day and 1-year survival in patients undergoing surgical treatment for spinal metastasis. Clin Nutr 2022;41:620-629. [PMID: 35124469 DOI: 10.1016/j.clnu.2022.01.011] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2021] [Revised: 01/06/2022] [Accepted: 01/12/2022] [Indexed: 11/03/2022]

Abstract

BACKGROUND AND AIMS

Survival estimation for patients with spinal metastasis is crucial to treatment decisions. Psoas muscle area (PMA), a surrogate for total muscle mass, has been proposed as a useful survival prognosticator. However, few studies have validated the predictive value of decreased PMA in an Asian cohort or its predictive value after controlling for existing preoperative scoring systems (PSSs). In this study, we aim to answer: (1) Is PMA associated with survival in Han Chinese patients with spinal metastasis? (2) Is PMA a good prognosticator according to concordance index (c-index) and decision curve analysis (DCA) after controlling for six existing and commonly used PSSs?

METHODS

This study included 180 adult (≥18 years old) Taiwanese patients with a mean age of 58.3 years (range: 22-85) undergoing surgical treatment for spinal metastasis. A patient's PMA was classified as decreased, medium, or large if it fell into the lower (0-33%), middle (33-67%), or upper (67-100%) third of the study cohort, respectively. We used logistic and Cox proportional-hazards regressions to assess whether PMA was associated with 90-day, 1-year, and overall survival. The model performance before and after addition of PMA to six commonly used PSSs, including Tomita score, original Tokuhashi score, revised Tokuhashi score, modified Bauer score, New England Spinal Metastasis Score, and Skeletal Oncology Research Group machine learning algorithms (SORG-MLAs), was compared by c-index and DCA to determine if PMA was a useful survival prognosticator.

RESULTS

A larger PMA is associated with better 90-day, but not 1-year, survival. The model performance of 90-day survival prediction improved after PMA was incorporated into all PSSs except SORG-MLAs. PMA barely improved the discriminatory ability (c-index, 0.74; 95% confidence interval [CI], 0.67-0.82 vs. c-index, 0.74; 95% CI, 0.66-0.81) and provided little gain of clinical net benefit on DCA for SORG-MLAs' 90-day survival prediction.

CONCLUSIONS

PMA is a prognosticator for 90-day survival and improves the discriminatory ability of earlier-proposed PSSs in our Asian cohort. However, incorporating PMA into more modern PSSs such as SORG-MLAs did not significantly improve its prediction performance.
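
The tertile grouping described in the Methods of this abstract (lower, middle, and upper third of the cohort) can be illustrated with a minimal Python sketch. The function name `classify_tertiles` and the sample values are hypothetical, and the cut points use the default method of Python's `statistics.quantiles`, which may differ from the method the study authors applied. How values exactly on a cut point are assigned is also an assumption.

```python
from statistics import quantiles


def classify_tertiles(values):
    """Label each value by the cohort tertile it falls into.

    Hypothetical helper: cut points are computed from the cohort itself,
    and a value equal to a cut point is assigned to the lower group.
    """
    low_cut, high_cut = quantiles(values, n=3)  # two cut points for three groups
    labels = []
    for v in values:
        if v <= low_cut:
            labels.append("decreased")
        elif v <= high_cut:
            labels.append("medium")
        else:
            labels.append("large")
    return labels


# Illustrative (made-up) muscle-area values, not study data.
example_labels = classify_tertiles([9, 1, 5])
```

Because the cut points depend on the cohort, the same measurement could be labelled differently in another study population.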

Côté M, Osseni MA, Brassard D, Carbonneau É, Robitaille J, Vohl MC, Lemieux S, Laviolette F, Lamarche B. Are Machine Learning Algorithms More Accurate in Predicting Vegetable and Fruit Consumption Than Traditional Statistical Models? An Exploratory Analysis. Front Nutr 2022;9:740898. [PMID: 35252288 PMCID: PMC8891134 DOI: 10.3389/fnut.2022.740898] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2021] [Accepted: 01/25/2022] [Indexed: 12/23/2022] Open

Abstract

Machine learning (ML) algorithms may help better understand the complex interactions among factors that influence dietary choices and behaviors. The aim of this study was to explore whether ML algorithms are more accurate than traditional statistical models in predicting vegetable and fruit (VF) consumption. A large array of features (2,452 features from 525 variables) encompassing individual and environmental information related to dietary habits and food choices in a sample of 1,147 French-speaking adult men and women was used for the purpose of this study. Adequate VF consumption, which was defined as 5 servings/d or more, was measured by averaging data from three web-based 24 h recalls and used as the outcome to predict. Nine classification ML algorithms were compared to two traditional statistical predictive models, logistic regression and penalized regression (Lasso). The performance of the predictive ML algorithms was tested after the implementation of adjustments, including normalizing the data, as well as in a series of sensitivity analyses such as using VF consumption obtained from a web-based food frequency questionnaire (wFFQ) and applying a feature selection algorithm in an attempt to reduce overfitting. Logistic regression and Lasso predicted adequate VF consumption with an accuracy of 0.64 (95% confidence interval [CI]: 0.58–0.70) and 0.64 (95%CI: 0.60–0.68) respectively. Among the ML algorithms tested, the most accurate algorithms to predict adequate VF consumption were the support vector machine (SVM) with either a radial basis kernel or a sigmoid kernel, both with an accuracy of 0.65 (95%CI: 0.59–0.71). The least accurate ML algorithm was the SVM with a linear kernel with an accuracy of 0.55 (95%CI: 0.49–0.61). Using dietary intake data from the wFFQ and applying a feature selection algorithm had little to no impact on the performance of the algorithms. 
In summary, ML algorithms and traditional statistical models predicted adequate VF consumption with similar accuracies among adults. These results suggest that additional research is needed to explore further the true potential of ML in predicting dietary behaviours that are determined by complex interactions among several individual, social and environmental factors.

Affiliation(s)

Mélina Côté Centre Nutrition, santé et société (NUTRISS), Institut sur la nutrition et les aliments fonctionnels de l'Université Laval (INAF), Université Laval, Québec, QC, Canada École de nutrition, Université Laval, Québec, QC, Canada
Mazid Abiodoun Osseni Centre de recherche en données massives (CRDM), Université Laval, Québec, QC, Canada Groupe de recherche en apprentissage automatique de l'Université Laval (GRAAL), Université Laval, Québec, QC, Canada
Didier Brassard Centre Nutrition, santé et société (NUTRISS), Institut sur la nutrition et les aliments fonctionnels de l'Université Laval (INAF), Université Laval, Québec, QC, Canada École de nutrition, Université Laval, Québec, QC, Canada
Élise Carbonneau Centre Nutrition, santé et société (NUTRISS), Institut sur la nutrition et les aliments fonctionnels de l'Université Laval (INAF), Université Laval, Québec, QC, Canada École de nutrition, Université Laval, Québec, QC, Canada
Julie Robitaille Centre Nutrition, santé et société (NUTRISS), Institut sur la nutrition et les aliments fonctionnels de l'Université Laval (INAF), Université Laval, Québec, QC, Canada École de nutrition, Université Laval, Québec, QC, Canada
Marie-Claude Vohl Centre Nutrition, santé et société (NUTRISS), Institut sur la nutrition et les aliments fonctionnels de l'Université Laval (INAF), Université Laval, Québec, QC, Canada École de nutrition, Université Laval, Québec, QC, Canada
Simone Lemieux Centre Nutrition, santé et société (NUTRISS), Institut sur la nutrition et les aliments fonctionnels de l'Université Laval (INAF), Université Laval, Québec, QC, Canada École de nutrition, Université Laval, Québec, QC, Canada
François Laviolette Centre Nutrition, santé et société (NUTRISS), Institut sur la nutrition et les aliments fonctionnels de l'Université Laval (INAF), Université Laval, Québec, QC, Canada Centre de recherche en données massives (CRDM), Université Laval, Québec, QC, Canada Groupe de recherche en apprentissage automatique de l'Université Laval (GRAAL), Université Laval, Québec, QC, Canada
Benoît Lamarche Centre Nutrition, santé et société (NUTRISS), Institut sur la nutrition et les aliments fonctionnels de l'Université Laval (INAF), Université Laval, Québec, QC, Canada École de nutrition, Université Laval, Québec, QC, Canada *Correspondence: Benoît Lamarche

Ochola D, Boekelo B, van de Ven GWJ, Taulya G, Kubiriba J, van Asten PJA, Giller KE. Mapping spatial distribution and geographic shifts of East African highland banana (Musa spp.) in Uganda. PLoS One 2022;17:e0263439. [PMID: 35176065 PMCID: PMC8853547 DOI: 10.1371/journal.pone.0263439] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2021] [Accepted: 01/20/2022] [Indexed: 11/24/2022] Open

Abstract

East African highland banana (Musa acuminata genome group AAA-EA; hereafter referred to as banana) is critical for Uganda's food supply, hence our aim to map current distribution and to understand changes in banana production areas over the past five decades. We collected banana presence/absence data through an online survey based on high-resolution satellite images and coupled this data with independent covariates as inputs for ensemble machine learning prediction of current banana distribution. We assessed geographic shifts of production areas using spatially explicit differences between the 1958 and 2016 banana distribution maps. The biophysical factors associated with banana spatial distribution and geographic shift were determined using a logistic regression model and classification and regression tree, respectively. Ensemble models were superior (AUC = 0.895; 0.907) compared to their constituent algorithms trained with 12 and 17 covariates, respectively: random forests (AUC = 0.883; 0.901), gradient boosting machines (AUC = 0.878; 0.903), and neural networks (AUC = 0.870; 0.890). The logistic regression model (AUC = 0.879) performance was similar to that for the ensemble model and its constituent algorithms. In 2016, banana cultivation was concentrated in the western (44%) and central (36%) regions, while only a small proportion was in the eastern (18%) and northern (2%) regions. About 60% of increased cultivation since 1958 was in the western region; 50% of decreased cultivation in the eastern region; and 44% of continued cultivation in the central region. Soil organic carbon, soil pH, annual precipitation, slope gradient, bulk density and blue reflectance were associated with increased banana cultivation while precipitation seasonality and mean annual temperature were associated with decreased banana cultivation over the past 50 years. 
The maps of spatial distribution and geographic shift of banana can support targeting of context-specific intensification options and policy advocacy to avert agriculture driven environmental degradation.

Bertini A, Salas R, Chabert S, Sobrevia L, Pardo F. Using Machine Learning to Predict Complications in Pregnancy: A Systematic Review. Front Bioeng Biotechnol 2022;9:780389. [PMID: 35127665 PMCID: PMC8807522 DOI: 10.3389/fbioe.2021.780389] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2021] [Accepted: 12/10/2021] [Indexed: 12/11/2022] Open

Abstract

Introduction: Artificial intelligence is widely used in the medical field, and machine learning has been increasingly used in health care, prediction, and diagnosis and as a method of determining priority. Machine learning methods have been features of several tools in the fields of obstetrics and childcare. This present review aims to summarize the machine learning techniques to predict perinatal complications.

Objective: To identify the applicability and performance of machine learning methods used to identify pregnancy complications.

Methods: A total of 98 articles were obtained with the keywords “machine learning,” “deep learning,” “artificial intelligence,” and accordingly as they related to perinatal complications (“complications in pregnancy,” “pregnancy complications”) from three scientific databases: PubMed, Scopus, and Web of Science. These were managed on the Mendeley platform and classified using the PRISMA method.

Results: A total of 31 articles were selected after elimination according to inclusion and exclusion criteria. The features used to predict perinatal complications were primarily electronic medical records (48%), medical images (29%), and biological markers (19%), while 4% were based on other types of features, such as sensors and fetal heart rate. The main perinatal complications considered in the application of machine learning thus far are pre-eclampsia and prematurity. In the 31 studies, a total of sixteen complications were predicted. The main precision metric used is the AUC. The machine learning methods with the best results were the prediction of prematurity from medical images using the support vector machine technique, with an accuracy of 95.7%, and the prediction of neonatal mortality with the XGBoost technique, with 99.7% accuracy.

Conclusion: It is important to continue promoting this area of research and promote solutions with multicenter clinical applicability through machine learning to reduce perinatal complications. This systematic review contributes significantly to the specialized literature on artificial intelligence and women’s health.

Affiliation(s)

Ayleen Bertini Metabolic Diseases Research Laboratory (MDRL), Interdisciplinary Center for Research in Territorial Health of the Aconcagua Valley (CIISTe Aconcagua), Center for Biomedical Research (CIB), Universidad de Valparaíso, Valparaiso, Chile PhD Program Doctorado en Ciencias e Ingeniería para La Salud, Faculty of Medicine, Universidad de Valparaíso, Valparaiso, Chile
Rodrigo Salas School of Biomedical Engineering, Faculty of Engineering, Universidad de Valparaíso, Valparaiso, Chile Centro de Investigación y Desarrollo en INGeniería en Salud – CINGS, Universidad de Valparaíso, Valparaiso, Chile Instituto Milenio Intelligent Healthcare Engineering, Valparaíso, Chile
Steren Chabert School of Biomedical Engineering, Faculty of Engineering, Universidad de Valparaíso, Valparaiso, Chile Centro de Investigación y Desarrollo en INGeniería en Salud – CINGS, Universidad de Valparaíso, Valparaiso, Chile Instituto Milenio Intelligent Healthcare Engineering, Valparaíso, Chile
Luis Sobrevia Cellular and Molecular Physiology Laboratory (CMPL), Division of Obstetrics and Gynaecology, School of Medicine, Faculty of Medicine, Pontificia Universidad Católica de Chile, Santiago, Chile Department of Physiology, Faculty of Pharmacy, Universidad de Sevilla, Seville, Spain University of Queensland Centre for Clinical Research (UQCCR), Faculty of Medicine and Biomedical Sciences, University of Queensland, Herston, QLD, Australia Department of Pathology and Medical Biology, University of Groningen, University Medical Center Groningen, Groningen, Netherlands Medical School (Faculty of Medicine), São Paulo State University (UNESP), São Paulo, Brazil Tecnologico de Monterrey, Eutra, The Institute for Obesity Research, School of Medicine and Health Sciences, Monterrey, Mexico
Fabián Pardo Metabolic Diseases Research Laboratory (MDRL), Interdisciplinary Center for Research in Territorial Health of the Aconcagua Valley (CIISTe Aconcagua), Center for Biomedical Research (CIB), Universidad de Valparaíso, Valparaiso, Chile Cellular and Molecular Physiology Laboratory (CMPL), Division of Obstetrics and Gynaecology, School of Medicine, Faculty of Medicine, Pontificia Universidad Católica de Chile, Santiago, Chile School of Medicine, Campus San Felipe, Faculty of Medicine, Universidad de Valparaíso, San Felipe, Chile *Correspondence: Fabián Pardo,

Liu Z, Han N, Su T, Ji Y, Bao H, Zhou S, Luo S, Wang H, Liu J, Wang HJ. Interpretable machine learning to identify important predictors of birth weight: A prospective cohort study. Front Pediatr 2022;10:899954. [PMID: 36440327 PMCID: PMC9691849 DOI: 10.3389/fped.2022.899954] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/29/2022] [Accepted: 10/24/2022] [Indexed: 11/29/2022] Open

Abstract

BACKGROUND

Predicting birth weight and identifying its risk factors are clinically important. This study aims to use interpretable machine learning to predict birth weight and identify important predictors.

METHODS

This prospective cohort study was conducted in Tongzhou Maternal and Child Health Care Hospital of Beijing, China, recruiting pregnant women between June 2018 and February 2019. We used 24 features to predict infant birth weight, including gestational age, mother's age, parity, history of macrosomia delivery, pre-pregnancy body mass index (BMI), height, father's BMI, lifestyle (diet, physical activity, smoking), and biomarker (fasting glucose and lipids) features. Study outcome was birth weight of infant. We used 8 supervised learning models including 4 individual [linear regression, ridge regression, lasso regression, support vector machines regression (SVR)], and 4 ensemble estimators (random forest, AdaBoost, gradient boosted trees, and voting ensemble for regression) to predict birth weight. Model accuracy was measured by root mean squared error (RMSE) of 10-fold cross validation on the training set and RMSE of prediction on the test set. We used permutation importance algorithm to understand the prediction from the models and what affected them.

RESULT

This study included 4,754 mother-child dyads. RMSEs were lower in voting ensemble for regression, linear regression, and SVR than random forest, AdaBoost, and gradient boosted tree. The 5 most important predictors for infant birth weight were gestational age, fetal sex, preterm birth, mother's height, and pre-pregnancy BMI. After adding ultrasound-measured indicators of fetal growth into predictors, mother's height and pre-pregnancy BMI remained the most important predictors in predicting the outcome.

CONCLUSION

Mother's height and pre-pregnancy BMI were identified as important predictors for infant birth weight. Interpretable machine learning is a promising tool in the prediction of birth weight.
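
The accuracy measure named in the Methods of this abstract, root mean squared error (RMSE) averaged over k-fold cross-validation, can be sketched in plain Python. This is only an illustration under stated assumptions: the helper names (`rmse`, `kfold_indices`, `cv_rmse_mean_predictor`) are hypothetical, the folds are contiguous and unshuffled, and a trivial mean predictor stands in for the regression models the study actually compared.

```python
import math


def rmse(y_true, y_pred):
    """Root mean squared error between two equal-length sequences."""
    squared = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared) / len(squared))


def kfold_indices(n, k):
    """Yield (train_idx, test_idx) pairs for k contiguous folds over n items."""
    fold_sizes = [n // k + (1 if i < n % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        stop = start + size
        test = list(range(start, stop))
        train = [i for i in range(n) if i < start or i >= stop]
        yield train, test
        start = stop


def cv_rmse_mean_predictor(y, k=10):
    """Average RMSE across folds for a stand-in model.

    The stand-in predicts the training-fold mean for every held-out item;
    a real analysis would fit an actual regression model here instead.
    """
    scores = []
    for train, test in kfold_indices(len(y), k):
        train_mean = sum(y[i] for i in train) / len(train)
        held_out = [y[i] for i in test]
        scores.append(rmse(held_out, [train_mean] * len(held_out)))
    return sum(scores) / len(scores)
```

A mean predictor of this kind is sometimes used as a baseline: a model whose cross-validated RMSE is not clearly lower has learned little from its features.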

Vaulet T, Al-Memar M, Fourie H, Bobdiwala S, Saso S, Pipi M, Stalder C, Bennett P, Timmerman D, Bourne T, De Moor B. Gradient boosted trees with individual explanations: An alternative to logistic regression for viability prediction in the first trimester of pregnancy. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2022;213:106520. [PMID: 34808532 PMCID: PMC8674730 DOI: 10.1016/j.cmpb.2021.106520] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/19/2021] [Accepted: 11/02/2021] [Indexed: 06/13/2023]

Abstract

BACKGROUND

Clinical models to predict first trimester viability are traditionally based on multivariable logistic regression (LR) which is not directly interpretable for non-statistical experts like physicians. Furthermore, LR requires complete datasets and pre-established variables specifications. In this study, we leveraged the internal non-linearity, feature selection and missing values handling mechanisms of machine learning algorithms, along with a post-hoc interpretability strategy, as potential advantages over LR for clinical modeling.

METHODS

The dataset included 1154 patients with 2377 individual scans and was obtained from a prospective observational cohort study conducted at a hospital in London, UK, from March 2014 to May 2019. The data were split into a training (70%) and a test set (30%). Parsimonious and complete multivariable models were developed from two algorithms to predict first trimester viability at 11-14 weeks gestational age (GA): LR and light gradient boosted machine (LGBM). Missing values were handled by multiple imputation where appropriate. The SHapley Additive exPlanations (SHAP) framework was applied to derive individual explanations of the models.

RESULTS

The parsimonious LGBM model had similar discriminative and calibration performance as the parsimonious LR (AUC 0.885 vs 0.860; calibration slope: 1.19 vs 1.18). The complete models did not outperform the parsimonious models. LGBM was robust to the presence of missing values and did not require multiple imputation unlike LR. Decision path plots and feature importance analysis revealed different algorithm behaviors despite similar predictive performance. The main driving variable from the LR model was the pre-specified interaction between fetal heart presence and mean sac diameter. The crown-rump length variable and a proxy variable reflecting the difference in GA between expected and observed GA were the two most important variables of LGBM. Finally, while variable interactions must be specified upfront with LR, several interactions were ranked by the SHAP framework among the most important features learned automatically by the LGBM algorithm.

CONCLUSIONS

Gradient boosted algorithms performed similarly to carefully crafted LR models in terms of discrimination and calibration for first trimester viability prediction. By handling multi-collinearity, missing values, feature selection and variable interactions internally, the gradient boosted trees algorithm, combined with SHAP, offers a serious alternative to traditional LR models.

Affiliation(s)

Thibaut Vaulet ESAT-STADIUS, Stadius Centre for Dynamical Systems, Signal Processing and Data Analytics (STADIUS), Leuven (Arenberg) Kasteelpark Arenberg 10 - box 2446, Leuven 3001, Belgium.
Maya Al-Memar Tommy's National Early Miscarriage Research Centre, Queen Charlotte's and Chelsea Hospital, Imperial College, Du Cane Road, London W12 0HS, United Kingdom
Hanine Fourie Tommy's National Early Miscarriage Research Centre, Queen Charlotte's and Chelsea Hospital, Imperial College, Du Cane Road, London W12 0HS, United Kingdom
Shabnam Bobdiwala Tommy's National Early Miscarriage Research Centre, Queen Charlotte's and Chelsea Hospital, Imperial College, Du Cane Road, London W12 0HS, United Kingdom
Srdjan Saso Tommy's National Early Miscarriage Research Centre, Queen Charlotte's and Chelsea Hospital, Imperial College, Du Cane Road, London W12 0HS, United Kingdom
Maria Pipi Tommy's National Early Miscarriage Research Centre, Queen Charlotte's and Chelsea Hospital, Imperial College, Du Cane Road, London W12 0HS, United Kingdom
Catriona Stalder Tommy's National Early Miscarriage Research Centre, Queen Charlotte's and Chelsea Hospital, Imperial College, Du Cane Road, London W12 0HS, United Kingdom
Phillip Bennett Tommy's National Early Miscarriage Research Centre, Queen Charlotte's and Chelsea Hospital, Imperial College, Du Cane Road, London W12 0HS, United Kingdom
Dirk Timmerman Department of Development and Regeneration, KU Leuven, Leuven, Belgium; Department of Obstetrics and Gynecology, University Hospitals Leuven, Leuven, Belgium
Tom Bourne Tommy's National Early Miscarriage Research Centre, Queen Charlotte's and Chelsea Hospital, Imperial College, Du Cane Road, London W12 0HS, United Kingdom; Department of Development and Regeneration, KU Leuven, Leuven, Belgium; Department of Obstetrics and Gynecology, University Hospitals Leuven, Leuven, Belgium
Bart De Moor ESAT-STADIUS, Stadius Centre for Dynamical Systems, Signal Processing and Data Analytics (STADIUS), Leuven (Arenberg) Kasteelpark Arenberg 10 - box 2446, Leuven 3001, Belgium

Wang Y, Liu H, Wang J, Hu X, Wang A, Nie Z, Xu H, Li J, Xin H, Zhang J, Zhang H, Wang Y, Lyu Z. Development and validation of a new predictive model for macrosomia at late-term pregnancy: A prospective study. Front Endocrinol (Lausanne) 2022;13:1019234. [PMID: 36465629 PMCID: PMC9713232 DOI: 10.3389/fendo.2022.1019234] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/14/2022] [Accepted: 11/01/2022] [Indexed: 11/18/2022] Open

Abstract

OBJECTIVE

Fetal macrosomia is defined as a birth weight more than 4,000 g and is associated with maternal and fetal complications. This early metabolic disease may influence the entire life of the infant. Currently, macrosomia is predicted by using the estimated fetal weight (EFW). However, the EFW is inaccurate when the gestational week is gradually increasing. To assess precisely the risk of macrosomia, we developed a new predictive model to estimate the risk of macrosomia.

METHODS

We continuously collected data on 655 subjects who attended regular antenatal visits and delivered at the Second Hospital of Hebei Medical University (Shijiazhuang, China) from November 2020 to September 2021. A total of 17 maternal features and 2 fetal ultrasonographic features were included at late-term pregnancy. The 655 subjects were divided into a model training set and an internal validation set. Then, 450 pregnant women were recruited from Handan Central Hospital (Handan, China) from November 2021 to March 2022 as the external validation set. The least absolute shrinkage and selection operator method was used to select the most appropriate predictive features and optimize them via 10-fold cross-validation. Multivariate logistic regression was used to build the predictive model. Receiver operating characteristic (ROC) curves, C-indices, and calibration plots were obtained to assess model discrimination and accuracy. The model's clinical utility was evaluated via decision curve analysis (DCA).

RESULTS

Four predictors were finally included to develop this new model: prepregnancy obesity (prepregnancy body mass index ≥ 30 kg/m²), hypertriglyceridemia, gestational diabetes mellitus, and fetal abdominal circumference. This model afforded moderate predictive power [area under the ROC curve 0.788 (95% confidence interval [CI] 0.736, 0.840) for the training set, 0.819 (95% CI 0.744, 0.894) for the internal validation set, and 0.773 (95% CI 0.713, 0.833) for the external validation set]. On DCA, the model evidenced a good fit with, and positive net benefits for, both the internal and external validation sets.

CONCLUSIONS

We developed a predictive model for macrosomia and performed external validation in other regions to further prove the discrimination and accuracy of this predictive model. This novel model will aid clinicians in easily identifying those at high risk of macrosomia and assist obstetricians to plan accordingly.
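
The area under the ROC curve reported in this abstract has a simple interpretation: the probability that a randomly chosen positive case receives a higher predicted risk than a randomly chosen negative case, with ties counted as one half. The sketch below computes it by direct pairwise comparison. The function name `auc` and the example labels and scores are hypothetical and are not taken from the study.

```python
def auc(labels, scores):
    """Area under the ROC curve via pairwise comparison.

    labels: 1 for a positive case, 0 for a negative case.
    scores: predicted risks, higher meaning more likely positive.
    """
    pos = [s for label, s in zip(labels, scores) if label == 1]
    neg = [s for label, s in zip(labels, scores) if label == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative case")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0   # positive ranked above negative
            elif p == n:
                wins += 0.5   # tie counts as half
    return wins / (len(pos) * len(neg))


# Made-up example: three of the four positive/negative pairs are ordered correctly.
example_auc = auc([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8])  # 0.75
```

The pairwise loop is quadratic in the number of cases, which is fine for illustration; rank-based formulas give the same value more efficiently on large cohorts.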

Tarimo CS, Bhuyan SS, Li Q, Mahande MJJ, Wu J, Fu X. Validating machine learning models for the prediction of labour induction intervention using routine data: a registry-based retrospective cohort study at a tertiary hospital in northern Tanzania. BMJ Open 2021;11:e051925. [PMID: 34857568 PMCID: PMC8647548 DOI: 10.1136/bmjopen-2021-051925] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/04/2022] Open

C1431T Variant of PPARγ Is Associated with Preeclampsia in Pregnant Women. Life (Basel) 2021;11:life11101052. [PMID: 34685423 PMCID: PMC8540421 DOI: 10.3390/life11101052] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 08/13/2021] [Revised: 10/01/2021] [Accepted: 10/02/2021] [Indexed: 12/16/2022] Open

Abstract

Peroxisome proliferator-activated receptor γ (PPARγ) is essential for placental development, whose SNPs have shown increased susceptibility to pregnancy-related diseases, such as preeclampsia. Our aim was to investigate the association between preeclampsia and three PPARγ SNPs (Pro12Ala, C1431T, and C681G), which together with nine clinical factors were used to build a pragmatic model for preeclampsia prediction. Data were collected from 1648 women from the EDEN cohort, of which 35 women had preeclamptic pregnancies, and the remaining 1613 women had normal pregnancies. Univariate analysis comparing preeclamptic patients to the control resulted in the SNP C1431T being the only factor significantly associated with preeclampsia (p < 0.05), with a confidence interval of 95% and odds ratio ranging from 4.90 to 8.75. On the other hand, three methods of multivariate feature selection highlighted seven features that could be potential predictors of preeclampsia: maternal C1431T and C681G variants, obesity, body mass index, number of pregnancies, primiparity, cigarette use, and education. These seven features were further used as input into eight different machine-learning algorithms to create predictive models, whose performances were evaluated based on metrics of accuracy and the area under the receiver operating characteristic curve (AUC). The boost tree-based model performed the best, with respective accuracy and AUC values of 0.971 ± 0.002 and 0.991 ± 0.001 in the training set and 0.951 and 0.701 in the testing set. A flowchart based on the boost tree model was constructed to depict the procedure for preeclampsia prediction. This final decision tree showed that the C1431T variant of PPARγ is significantly associated with susceptibility to preeclampsia. We believe that this final decision tree could be applied in the clinical prediction of preeclampsia in the very early stages of pregnancy.
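
The gap between testing-set accuracy (0.951) and AUC (0.701) is easier to interpret alongside the class balance. As a rough check using only the cohort counts given above, the arithmetic below computes the accuracy of a rule that always predicts a normal pregnancy. It assumes the testing set has roughly the same prevalence as the full cohort, which the abstract does not state.

```python
# Counts taken from the abstract: 1648 women, 35 with preeclampsia.
n_total = 1648
n_cases = 35
n_controls = n_total - n_cases

prevalence = n_cases / n_total
# Accuracy of a trivial rule that labels every pregnancy as normal.
always_negative_accuracy = n_controls / n_total

print(f"prevalence of preeclampsia: {prevalence:.3f}")
print(f"accuracy of always predicting 'normal': {always_negative_accuracy:.3f}")
```

With so few cases, accuracy alone says little about how well cases are identified, which is one reason AUC is reported next to it.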

Khan O, Badhiwala JH, Witiw CD, Wilson JR, Fehlings MG. Machine learning algorithms for prediction of health-related quality-of-life after surgery for mild degenerative cervical myelopathy. Spine J 2021;21:1659-1669. [PMID: 32045708 DOI: 10.1016/j.spinee.2020.02.003] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Received: 09/26/2019] [Revised: 02/03/2020] [Accepted: 02/03/2020] [Indexed: 02/03/2023]

Abstract

BACKGROUND

Degenerative cervical myelopathy (DCM) is the most common cause of spinal cord dysfunction worldwide. Current guidelines recommend management based on the severity of myelopathy, measured by the modified Japanese Orthopedic Association (mJOA) score. Patients with moderate to severe myelopathy, defined by an mJOA below 15, are recommended to undergo surgery. However, the management for mild myelopathy (mJOA between 15 and 17) is controversial since the response to surgery is more heterogeneous.

PURPOSE

To develop machine learning algorithms predicting phenotypes of mild myelopathy patients that would benefit most from surgery.

STUDY DESIGN

Retrospective subgroup analysis of prospectively collected data.

PATIENT SAMPLES

Data were obtained from 193 mild DCM patients who underwent surgical decompression and were enrolled in the multicenter AOSpine CSM clinical trials.

OUTCOME MEASURES

The mJOA score, an assessment of functional status, was used to isolate patients with mild DCM. The primary outcome measures were change from baseline for the Short Form-36 (SF-36) mental component summary (MCS) and physical component summary (PCS) at 1-year postsurgery. These changes were dichotomized according to whether they exceeded the minimal clinically important difference.

METHODS

The data were split into training (75%) and testing (25%) sets. Model predictors included baseline demographic variables and clinical presentation. Seven machine learning algorithms and a logistic regression model were trained and optimized using the training set, and their performances were evaluated using the testing set. For each outcome (improvement in MCS or PCS), the machine learning algorithm with the greatest area under the curve (AUC) on the training set was selected for further analysis.
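
The split-then-select procedure described above can be sketched as follows. This is a minimal illustration: the helper names, the random seed, and the AUC values are hypothetical placeholders, not the study's code or results.

```python
import random


def train_test_split(rows, test_fraction=0.25, seed=0):
    """Shuffle the rows reproducibly and hold out a fraction for testing."""
    rng = random.Random(seed)
    shuffled = list(rows)
    rng.shuffle(shuffled)
    n_test = round(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


def select_best(auc_by_model):
    """Return the name of the model with the greatest AUC."""
    return max(auc_by_model, key=auc_by_model.get)


# 193 patient indices, matching the sample size stated in the abstract.
patients = list(range(193))
train, test = train_test_split(patients)
print(len(train), len(test))

# Hypothetical training-set AUCs, for illustration only.
training_auc = {"logistic_regression": 0.70, "gbm": 0.75, "earth": 0.72}
print(select_best(training_auc))
```

Selecting on the training set keeps the testing set untouched, so the testing-set AUC remains an honest estimate for the chosen model.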

RESULTS

The generalized boosted model (GBM) and earth models performed well in the prediction of significant improvement in MCS and PCS, respectively, with AUCs of 0.72 to 0.78 on the training set. This performance was replicated on the testing set, in which the GBM and earth models showed AUCs of 0.77 and 0.78, respectively, as well as fair to good calibration across the predicted range of probabilities. Female patients with a low initial MCS were less likely to experience significant improvement in MCS than males. The presence of certain signs and symptoms (eg, lower limb spasticity, clumsy hands) was also predictive of worse outcome.

CONCLUSIONS

Machine learning models showed good predictive power and provided information about the phenotypes of mild DCM patients most likely to benefit from surgical intervention. Overall, machine learning may be a useful tool for management of mild DCM, though external validation and prospective analysis should be performed to better solidify its role.

Côté M, Lamarche B. Artificial intelligence in nutrition research: perspectives on current and future applications. Appl Physiol Nutr Metab 2021;47:1-8. [PMID: 34525321 DOI: 10.1139/apnm-2021-0448] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Indexed: 11/22/2022]

Froud R, Hansen SH, Ruud HK, Foss J, Ferguson L, Fredriksen PM. Relative Performance of Machine Learning and Linear Regression in Predicting Quality of Life and Academic Performance of School Children in Norway: Data Analysis of a Quasi-Experimental Study. J Med Internet Res 2021;23:e22021. [PMID: 34009128 PMCID: PMC8325075 DOI: 10.2196/22021] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 07/03/2020] [Revised: 10/26/2020] [Accepted: 05/17/2021] [Indexed: 12/14/2022] Open

Abstract

BACKGROUND

Machine learning techniques are increasingly being applied in health research. It is not clear how useful these approaches are for modeling continuous outcomes. Child quality of life is associated with parental socioeconomic status and physical activity and may be associated with aerobic fitness and strength. It is unclear whether diet or academic performance is associated with quality of life.

OBJECTIVE

The purpose of this study was to compare the predictive performance of machine learning techniques with that of linear regression in examining the extent to which continuous outcomes (physical activity, aerobic fitness, muscular strength, diet, and parental education) are predictive of academic performance and quality of life and whether academic performance and quality of life are associated.

METHODS

We modeled data from children attending 9 schools in a quasi-experimental study. We split data randomly into training and validation sets. Curvilinear, nonlinear, and heteroscedastic variables were simulated to examine the performance of machine learning techniques compared to that of linear models, with and without imputation.

RESULTS

We included data for 1711 children. Regression models explained 24% of academic performance variance in the real complete-case validation set, and up to 15% in quality of life. While machine learning techniques explained high proportions of variance in training sets, in validation, machine learning techniques explained approximately 0% of academic performance and 3% to 8% of quality of life. With imputation, machine learning techniques improved to 15% for academic performance. Machine learning outperformed regression for simulated nonlinear and heteroscedastic variables. The best predictors of academic performance in adjusted models were the child's mother having a master-level education (P<.001; β=1.98, 95% CI 0.25 to 3.71), increased television and computer use (P=.03; β=1.19, 95% CI 0.25 to 3.71), and dichotomized self-reported exercise (P=.001; β=2.47, 95% CI 1.08 to 3.87). For quality of life, self-reported exercise (P<.001; β=1.09, 95% CI 0.53 to 1.66) and increased television and computer use (P=.002; β=-0.95, 95% CI -1.55 to -0.36) were the best predictors. Adjusted academic performance was associated with quality of life (P=.02; β=0.12, 95% CI 0.02 to 0.22).
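
The "proportion of variance explained" figures above are presumably coefficients of determination (R²). That reading is an assumption, as the abstract does not name the statistic. A minimal sketch with hypothetical toy values shows why a value near 0% on validation data means the predictions are no better than the validation mean.

```python
def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - residual SS / total SS."""
    mean_y = sum(y_true) / len(y_true)
    ss_tot = sum((y - mean_y) ** 2 for y in y_true)
    ss_res = sum((y - p) ** 2 for y, p in zip(y_true, y_pred))
    if ss_tot == 0:
        raise ValueError("y_true has zero variance")
    return 1 - ss_res / ss_tot


# Toy validation outcomes and two sets of predictions (hypothetical values).
y_val = [1.0, 2.0, 3.0, 4.0, 5.0]
mean_only = [3.0] * 5                   # always predict the mean
close_fit = [1.1, 1.9, 3.2, 3.8, 5.0]   # predictions that track the outcome

print(f"mean-only R^2: {r_squared(y_val, mean_only):.2f}")
print(f"close-fit R^2: {r_squared(y_val, close_fit):.2f}")
```

A model that fits the training set well but scores near zero here has memorized noise rather than learned a transferable relationship, which is the overfitting pattern the results describe.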

CONCLUSIONS

Linear regression was less prone to overfitting and outperformed commonly used machine learning techniques. Imputation improved the performance of machine learning, but not sufficiently to outperform regression. Machine learning techniques outperformed linear regression for modeling nonlinear and heteroscedastic relationships and may be of use in such cases. Regression with splines performed almost as well in nonlinear modeling. Lifestyle variables, including physical exercise, television and computer use, and parental education are predictive of academic performance or quality of life. Academic performance is associated with quality of life after adjusting for lifestyle variables and may offer another promising intervention target to improve quality of life in children.

Wu CX, Suresh E, Phng FWL, Tai KP, Pakdeethai J, D'Souza JLA, Tan WS, Phan P, Lew KSM, Tan GYH, Chua GSW, Hwang CH. Effect of a Real-Time Risk Score on 30-day Readmission Reduction in Singapore. Appl Clin Inform 2021;12:372-382. [PMID: 34010978 DOI: 10.1055/s-0041-1726422] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Indexed: 12/14/2022] Open

Khan O, Badhiwala JH, Akbar MA, Fehlings MG. Prediction of Worse Functional Status After Surgery for Degenerative Cervical Myelopathy: A Machine Learning Approach. Neurosurgery 2021;88:584-591. [PMID: 33289519 DOI: 10.1093/neuros/nyaa477] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Received: 02/20/2020] [Accepted: 08/12/2020] [Indexed: 11/14/2022] Open