Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Zhang L, Wang Y, Niu M, Wang C, Wang Z. Machine learning for characterizing risk of type 2 diabetes mellitus in a rural Chinese population: the Henan Rural Cohort Study. Sci Rep 2020;10:4406. [PMID: 32157171 PMCID: PMC7064542 DOI: 10.1038/s41598-020-61123-x] [Citation(s) in RCA: 40] [Impact Index Per Article: 10.0] [Received: 09/11/2019] [Accepted: 02/19/2020] [Indexed: 01/19/2023] Open

Cited by Other Article(s)

Jabara M, Kose O, Perlman G, Corcos S, Pelletier MA, Possik E, Tsoukas M, Sharma A. Artificial Intelligence-Based Digital Biomarkers for Type 2 Diabetes: A Review. Can J Cardiol 2024;40:1922-1933. [PMID: 39111729 DOI: 10.1016/j.cjca.2024.07.028] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 03/11/2024] [Revised: 07/27/2024] [Accepted: 07/29/2024] [Indexed: 09/10/2024] Open

Ayub H, Khan MA, Shehryar Ali Naqvi S, Faseeh M, Kim J, Mehmood A, Kim YJ. Unraveling the Potential of Attentive Bi-LSTM for Accurate Obesity Prognosis: Advancing Public Health towards Sustainable Cities. Bioengineering (Basel) 2024;11:533. [PMID: 38927769 PMCID: PMC11200407 DOI: 10.3390/bioengineering11060533] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/09/2024] [Revised: 05/13/2024] [Accepted: 05/19/2024] [Indexed: 06/28/2024] Open

Zhang H, Zeng T, Zhang J, Zheng J, Min J, Peng M, Liu G, Zhong X, Wang Y, Qiu K, Tian S, Liu X, Huang H, Surmach M, Wang P, Hu X, Chen L. Development and validation of machine learning-augmented algorithm for insulin sensitivity assessment in the community and primary care settings: a population-based study in China. Front Endocrinol (Lausanne) 2024;15:1292346. [PMID: 38332892 PMCID: PMC10850228 DOI: 10.3389/fendo.2024.1292346] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/11/2023] [Accepted: 01/11/2024] [Indexed: 02/10/2024] Open

Abstract

Objective

Insulin plays a central role in the regulation of energy and glucose homeostasis, and insulin resistance (IR) is widely considered as the "common soil" of a cluster of cardiometabolic disorders. Assessment of insulin sensitivity is very important in preventing and treating IR-related disease. This study aims to develop and validate machine learning (ML)-augmented algorithms for insulin sensitivity assessment in the community and primary care settings.

Methods

We analyzed the data of 9358 participants over 40 years old who participated in the population-based cohort of the Hubei center of the REACTION study (Risk Evaluation of Cancers in Chinese Diabetic Individuals). Three non-ensemble algorithms and four ensemble algorithms were used to develop the models with 70 non-laboratory variables for the community and 87 (70 non-laboratory and 17 laboratory) variables for the primary care settings to screen the classifier of the state-of-the-art. The models with the best performance were further streamlined using top-ranked 5, 8, 10, 13, 15, and 20 features. Performances of these ML models were evaluated using the area under the receiver operating characteristic curve (AUROC), the area under the precision-recall curve (AUPR), and the Brier score. The Shapley additive explanation (SHAP) analysis was employed to evaluate the importance of features and interpret the models.

Results

The LightGBM models developed for the community (AUROC 0.794, AUPR 0.575, Brier score 0.145) and primary care settings (AUROC 0.867, AUPR 0.705, Brier score 0.119) achieved higher performance than the models constructed by the other six algorithms. The streamlined LightGBM models for the community (AUROC 0.791, AUPR 0.563, Brier score 0.146) and primary care settings (AUROC 0.863, AUPR 0.692, Brier score 0.124) using the 20 top-ranked variables also showed excellent performance. SHAP analysis indicated that the top-ranked features included fasting plasma glucose (FPG), waist circumference (WC), body mass index (BMI), triglycerides (TG), gender, waist-to-height ratio (WHtR), the number of daughters born, resting pulse rate (RPR), etc.

Conclusion

The ML models using the LightGBM algorithm are efficient to predict insulin sensitivity in the community and primary care settings accurately and might potentially become an efficient and practical tool for insulin sensitivity assessment in these settings.
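
For readers unfamiliar with two of the metrics reported in this abstract, the following is a minimal sketch of how the Brier score and AUROC can be computed from predicted probabilities. It is not the authors' code; the function names and the toy labels and probabilities are invented for illustration, and the AUROC here uses the pairwise-comparison definition rather than any particular library routine.

```python
def brier_score(y_true, y_prob):
    """Mean squared difference between predicted probability and the 0/1 outcome.

    Lower is better; 0.0 is a perfect, fully confident prediction.
    """
    return sum((p - y) ** 2 for y, p in zip(y_true, y_prob)) / len(y_true)


def auroc(y_true, y_prob):
    """Probability that a random positive is scored above a random negative.

    Ties between a positive and a negative count as half a win.
    """
    pos = [p for y, p in zip(y_true, y_prob) if y == 1]
    neg = [p for y, p in zip(y_true, y_prob) if y == 0]
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


# Toy data, invented for illustration only (not from the study).
y_true = [0, 0, 1, 1]
y_prob = [0.1, 0.4, 0.35, 0.8]

print("Brier score:", brier_score(y_true, y_prob))
print("AUROC:", auroc(y_true, y_prob))
```

The pairwise AUROC is quadratic in the sample size, so for a cohort of the size described above a rank-based or library implementation would normally be preferred.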

Affiliation(s)

Hao Zhang Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China Hubei Provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
Tianshu Zeng Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China Hubei Provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
Jiaoyue Zhang Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China Hubei Provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
Juan Zheng Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China Hubei Provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
Jie Min Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China Hubei Provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
Miaomiao Peng Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China Hubei Provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
Geng Liu Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China Hubei Provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
Xueyu Zhong Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China Hubei Provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
Ying Wang Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China Hubei Provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
Kangli Qiu Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China Hubei Provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
Shenghua Tian Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China Hubei Provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
Xiaohuan Liu Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China Hubei Provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
Hantao Huang Department of Emergency Medicine, Yichang Yiling Hospital, Yichang, China
Marina Surmach Department of Public Health and Health Services, Grodno State Medical University, Grodno, Belarus
Ping Wang Precision Health Program, Department of Radiology, College of Human Medicine, Michigan State University, East Lansing, MI, United States
Xiang Hu Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China Hubei Provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
Lulu Chen Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China Hubei Provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China

Shojaee-Mend H, Velayati F, Tayefi B, Babaee E. Prediction of Diabetes Using Data Mining and Machine Learning Algorithms: A Cross-Sectional Study. Healthc Inform Res 2024;30:73-82. [PMID: 38359851 PMCID: PMC10879823 DOI: 10.4258/hir.2024.30.1.73] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/17/2023] [Revised: 01/24/2024] [Accepted: 01/24/2024] [Indexed: 02/17/2024] Open

He Y, Matsunaga M, Li Y, Kishi T, Tanihara S, Iwata N, Tabuchi T, Ota A. Classifying Schizophrenia Cases by Artificial Neural Network Using Japanese Web-Based Survey Data: Case-Control Study. JMIR Form Res 2023;7:e50193. [PMID: 37966882 PMCID: PMC10687680 DOI: 10.2196/50193] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 06/26/2023] [Revised: 09/18/2023] [Accepted: 10/08/2023] [Indexed: 11/16/2023] Open

Abstract

BACKGROUND

In Japan, challenges were reported in accurately estimating the prevalence of schizophrenia among the general population. Retrieving previous studies, we investigated that patients with schizophrenia were more likely to experience poor subjective well-being and various physical, psychiatric, and social comorbidities. These factors might have great potential for precisely classifying schizophrenia cases in order to estimate the prevalence. Machine learning has shown a positive impact on many fields, including epidemiology, due to its high-precision modeling capability. It has been applied in research on mental disorders. However, few studies have applied machine learning technology to the precise classification of schizophrenia cases by variables of demographic and health-related backgrounds, especially using large-scale web-based surveys.

OBJECTIVE

The aim of the study is to construct an artificial neural network (ANN) model that can accurately classify schizophrenia cases from large-scale Japanese web-based survey data and to verify the generalizability of the model.

METHODS

Data were obtained from a large Japanese internet research pooled panel (Rakuten Insight, Inc) in 2021. A total of 223 individuals, aged 20-75 years, having schizophrenia, and 1776 healthy controls were included. Answers to the questions in a web-based survey were formatted as 1 response variable (self-report diagnosed with schizophrenia) and multiple feature variables (demographic, health-related backgrounds, physical comorbidities, psychiatric comorbidities, and social comorbidities). An ANN was applied to construct a model for classifying schizophrenia cases. Logistic regression (LR) was used as a reference. The performances of the models and algorithms were then compared.

RESULTS

The model trained by the ANN performed better than LR in terms of area under the receiver operating characteristic curve (0.86 vs 0.78), accuracy (0.93 vs 0.91), and specificity (0.96 vs 0.94), while the model trained by LR showed better sensitivity (0.63 vs 0.56). Comparing the performances of the ANN and LR, the ANN was better in terms of area under the receiver operating characteristic curve (bootstrapping: 0.847 vs 0.773 and cross-validation: 0.81 vs 0.72), while LR performed better in terms of accuracy (0.894 vs 0.856). Sleep medication use, age, household income, and employment type were the top 4 variables in terms of importance.

CONCLUSIONS

This study constructed an ANN model to classify schizophrenia cases using web-based survey data. Our model showed a high internal validity. The findings are expected to provide evidence for estimating the prevalence of schizophrenia in the Japanese population and informing future epidemiological studies.
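
The pattern in the results above (accuracy above 0.9 alongside sensitivity near 0.6) is typical when controls heavily outnumber cases. A minimal sketch of how these three metrics follow from a confusion matrix is given below. The function name and the counts are hypothetical and chosen only to illustrate the effect of class imbalance; they are not the study's confusion matrix.

```python
def confusion_metrics(tp, fp, tn, fn):
    """Sensitivity, specificity and accuracy from confusion-matrix counts."""
    return {
        "sensitivity": tp / (tp + fn),   # share of true cases that were detected
        "specificity": tn / (tn + fp),   # share of controls correctly ruled out
        "accuracy": (tp + tn) / (tp + fp + tn + fn),
    }


# Hypothetical counts: 100 cases and 900 controls (invented, not study data).
metrics = confusion_metrics(tp=56, fp=36, tn=864, fn=44)
for name, value in metrics.items():
    print(f"{name}: {value:.2f}")
```

Because controls dominate the denominator, accuracy tracks specificity closely, which is why sensitivity is worth reading separately when the goal is case finding.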

Chellappan D, Rajaguru H. Enhancement of Classifier Performance Using Swarm Intelligence in Detection of Diabetes from Pancreatic Microarray Gene Data. Biomimetics (Basel) 2023;8:503. [PMID: 37887634 PMCID: PMC10604158 DOI: 10.3390/biomimetics8060503] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/29/2023] [Revised: 10/08/2023] [Accepted: 10/20/2023] [Indexed: 10/28/2023] Open

Li S, Chen Y, Zhang L, Li R, Kang N, Hou J, Wang J, Bao Y, Jiang F, Zhu R, Wang C, Zhang L. An environment-wide association study for the identification of non-invasive factors for type 2 diabetes mellitus: Analysis based on the Henan Rural Cohort study. Diabetes Res Clin Pract 2023;204:110917. [PMID: 37748711 DOI: 10.1016/j.diabres.2023.110917] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/12/2023] [Revised: 09/16/2023] [Accepted: 09/21/2023] [Indexed: 09/27/2023]

Affiliation(s)

Shuoyi Li Department of Epidemiology and Biostatistics, College of Public Health, Zhengzhou University, Zhengzhou, Henan 450001, PR China
Ying Chen Department of Epidemiology and Biostatistics, College of Public Health, Zhengzhou University, Zhengzhou, Henan 450001, PR China
Liying Zhang Department of Epidemiology and Biostatistics, College of Public Health, Zhengzhou University, Zhengzhou, Henan 450001, PR China
Ruiying Li Department of Epidemiology and Biostatistics, College of Public Health, Zhengzhou University, Zhengzhou, Henan 450001, PR China
Ning Kang Department of Epidemiology and Biostatistics, College of Public Health, Zhengzhou University, Zhengzhou, Henan 450001, PR China
Jian Hou Department of Epidemiology and Biostatistics, College of Public Health, Zhengzhou University, Zhengzhou, Henan 450001, PR China
Jing Wang China-Australia Joint Research Center for Infectious Diseases, School of Public Health, Xi'an Jiaotong University Health Science Center, Xi'an, Shaanxi 710061, PR China
Yining Bao China-Australia Joint Research Center for Infectious Diseases, School of Public Health, Xi'an Jiaotong University Health Science Center, Xi'an, Shaanxi 710061, PR China
Feng Jiang Department of Epidemiology and Biostatistics, College of Public Health, Zhengzhou University, Zhengzhou, Henan 450001, PR China
Ruifang Zhu Department of Epidemiology and Biostatistics, College of Public Health, Zhengzhou University, Zhengzhou, Henan 450001, PR China
Chongjian Wang Department of Epidemiology and Biostatistics, College of Public Health, Zhengzhou University, Zhengzhou, Henan 450001, PR China.
Lei Zhang China-Australia Joint Research Center for Infectious Diseases, School of Public Health, Xi'an Jiaotong University Health Science Center, Xi'an, Shaanxi 710061, PR China; Artificial Intelligence and Modelling in Epidemiology Program, Melbourne Sexual Health Centre, Alfred Health, Melbourne, Australia; Central Clinical School, Faculty of Medicine, Monash University, Melbourne, Australia.

Lee Y, Seo J. Suggestion of statistical validation on feature importance of machine learning. Annu Int Conf IEEE Eng Med Biol Soc 2023;2023:1-4. [PMID: 38083557 DOI: 10.1109/embc40787.2023.10340208] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 12/18/2023]

Dong C, Nemet G, Gao X, Barbose G, Sigrin B, O'Shaughnessy E. Machine learning reduces soft costs for residential solar photovoltaics. Sci Rep 2023;13:7213. [PMID: 37137971 PMCID: PMC10156750 DOI: 10.1038/s41598-023-33014-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/05/2022] [Accepted: 04/05/2023] [Indexed: 05/05/2023] Open

Cheng YL, Wu YR, Lin KD, Lin CHR, Lin IM. Using Machine Learning for the Risk Factors Classification of Glycemic Control in Type 2 Diabetes Mellitus. Healthcare (Basel) 2023;11:1141. [PMID: 37107975 PMCID: PMC10138388 DOI: 10.3390/healthcare11081141] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/06/2023] [Revised: 04/05/2023] [Accepted: 04/13/2023] [Indexed: 04/29/2023] Open

Liu X, Huang X, Zhao J, Su Y, Shen L, Duan Y, Gong J, Zhang Z, Piao S, Zhu Q, Rong X, Guo J. Application of machine learning in Chinese medicine differentiation of dampness-heat pattern in patients with type 2 diabetes mellitus. Heliyon 2023;9:e13289. [PMID: 36873141 PMCID: PMC9975099 DOI: 10.1016/j.heliyon.2023.e13289] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/17/2022] [Revised: 01/26/2023] [Accepted: 01/27/2023] [Indexed: 02/15/2023] Open

Abstract

Background

China has become the country with the largest number of people with type 2 diabetes mellitus (T2DM), and Chinese medicine (CM) has unique advantages in preventing and treating T2DM, while accurate pattern differentiation is the guarantee for proper treatment.

Objective

The establishment of the CM pattern differentiation model of T2DM is helpful to the pattern diagnosis of the disease. At present, there are few studies on dampness-heat pattern differentiation models of T2DM. Therefore, we establish a machine learning model, hoping to provide an efficient tool for the pattern diagnosis of CM for T2DM in the future.

Methods

A total of 1021 effective samples of T2DM patients from ten CM hospitals or clinics were collected by a questionnaire including patients' demographic and dampness-heat-related symptoms and signs. All information and the diagnosis of the dampness-heat pattern of patients were completed by experienced CM physicians at each visit. We applied six machine learning algorithms (Artificial Neural Network [ANN], K-Nearest Neighbor [KNN], Naïve Bayes [NB], Support Vector Machine [SVM], Extreme Gradient Boosting [XGBoost] and Random Forest [RF]) and compared their performance. And then we also utilized Shapley additive explanation (SHAP) method to explain the best performance model.

Results

The XGBoost model had the highest AUC (0.951, 95% CI 0.925-0.978) among the six models, with the best sensitivity, accuracy, F1 score, negative predictive value, and excellent specificity, precision, and positive predictive value. The SHAP method based on XGBoost showed that slimy yellow tongue fur was the most important sign in dampness-heat pattern diagnosis. The slippery pulse or rapid-slippery pulse, sticky stool with ungratifying defecation also performed an important role in this diagnostic model. Furthermore, the red tongue acted as an important tongue sign for the dampness-heat pattern.

Conclusion

This study constructed a dampness-heat pattern differentiation model of T2DM based on machine learning. The XGBoost model is a tool with the potential to help CM practitioners make quick diagnosis decisions and contribute to the standardization and international application of CM patterns.

Affiliation(s)

Xinyu Liu Guangdong Metabolic Diseases Research Center of Integrated Chinese and Western Medicine, Guangdong Pharmaceutical University, Guangzhou, 510006, China.,Key Laboratory of Glucolipid Metabolic Disorder, Ministry of Education of China, Guangdong Pharmaceutical University, Guangzhou, 510006, China.,Guangdong TCM Key Laboratory for Metabolic Diseases, Guangdong Pharmaceutical University, Guangzhou, 510006, China.,Institute of Chinese Medicine, Guangdong Pharmaceutical University, Guangzhou, 510006, China
Xiaoqiang Huang Science and Technology Innovation Center, Guangzhou University of Chinese Medicine, Guangzhou, 510006, China
Jindong Zhao The First Affiliated Hospital of Anhui University of Chinese, Hefei, 230031, China
Yanjin Su Shaanxi University of Chinese Medicine, Xi'an, 712046, China
Lu Shen Shaanxi Provincial Hospital of Traditional Chinese Medicine, Xi'an, 710003, China
Yuhong Duan Affiliated Hospital of Shannxi University of Chinese Medicine, Xi'an, 712000, China
Jing Gong Department of Integrated Traditional Chinese and Western Medicine, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430030, China
Zhihai Zhang The First Affiliated Hospital of Xiamen University, Xiamen, 361003, China
Shenghua Piao Guangdong Metabolic Diseases Research Center of Integrated Chinese and Western Medicine, Guangdong Pharmaceutical University, Guangzhou, 510006, China.,Key Laboratory of Glucolipid Metabolic Disorder, Ministry of Education of China, Guangdong Pharmaceutical University, Guangzhou, 510006, China.,Guangdong TCM Key Laboratory for Metabolic Diseases, Guangdong Pharmaceutical University, Guangzhou, 510006, China.,Institute of Chinese Medicine, Guangdong Pharmaceutical University, Guangzhou, 510006, China
Qing Zhu Guangdong Metabolic Diseases Research Center of Integrated Chinese and Western Medicine, Guangdong Pharmaceutical University, Guangzhou, 510006, China.,Key Laboratory of Glucolipid Metabolic Disorder, Ministry of Education of China, Guangdong Pharmaceutical University, Guangzhou, 510006, China.,Guangdong TCM Key Laboratory for Metabolic Diseases, Guangdong Pharmaceutical University, Guangzhou, 510006, China.,Institute of Chinese Medicine, Guangdong Pharmaceutical University, Guangzhou, 510006, China
Xianglu Rong Guangdong Metabolic Diseases Research Center of Integrated Chinese and Western Medicine, Guangdong Pharmaceutical University, Guangzhou, 510006, China.,Key Laboratory of Glucolipid Metabolic Disorder, Ministry of Education of China, Guangdong Pharmaceutical University, Guangzhou, 510006, China.,Guangdong TCM Key Laboratory for Metabolic Diseases, Guangdong Pharmaceutical University, Guangzhou, 510006, China.,Institute of Chinese Medicine, Guangdong Pharmaceutical University, Guangzhou, 510006, China
Jiao Guo Guangdong Metabolic Diseases Research Center of Integrated Chinese and Western Medicine, Guangdong Pharmaceutical University, Guangzhou, 510006, China.,Key Laboratory of Glucolipid Metabolic Disorder, Ministry of Education of China, Guangdong Pharmaceutical University, Guangzhou, 510006, China.,Guangdong TCM Key Laboratory for Metabolic Diseases, Guangdong Pharmaceutical University, Guangzhou, 510006, China.,Institute of Chinese Medicine, Guangdong Pharmaceutical University, Guangzhou, 510006, China

Smail HO, Mohamad DA. Identification of DNA methylation of CAPN10 gene changes in the patients with type 2 diabetes mellitus as a predictive biomarker instead of HbA1c, random blood sugar, lipid profile, kidney function test, and some risk factors. Endocr Regul 2023;57:221-234. [PMID: 37823570 DOI: 10.2478/enr-2023-0025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/13/2023] Open

Afsaneh E, Sharifdini A, Ghazzaghi H, Ghobadi MZ. Recent applications of machine learning and deep learning models in the prediction, diagnosis, and management of diabetes: a comprehensive review. Diabetol Metab Syndr 2022;14:196. [PMID: 36572938 PMCID: PMC9793536 DOI: 10.1186/s13098-022-00969-9] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Received: 10/06/2022] [Accepted: 12/16/2022] [Indexed: 12/28/2022] Open

Srinivasu PN, Shafi J, Krishna TB, Sujatha CN, Praveen SP, Ijaz MF. Using Recurrent Neural Networks for Predicting Type-2 Diabetes from Genomic and Tabular Data. Diagnostics (Basel) 2022;12:3067. [PMID: 36553074 PMCID: PMC9776641 DOI: 10.3390/diagnostics12123067] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Received: 11/04/2022] [Revised: 12/01/2022] [Accepted: 12/04/2022] [Indexed: 12/12/2022] Open

Abstract

The development of genomic technology for smart diagnosis and therapies for various diseases has lately been the most demanding area for computer-aided diagnostic and treatment research. Exponential breakthroughs in artificial intelligence and machine intelligence technologies could pave the way for identifying challenges afflicting the healthcare industry. Genomics is paving the way for predicting future illnesses, including cancer, Alzheimer's disease, and diabetes. Machine learning advancements have expedited the pace of biomedical informatics research and inspired new branches of computational biology. Furthermore, knowing gene relationships has resulted in developing more accurate models that can effectively detect patterns in vast volumes of data, making classification models important in various domains. Recurrent Neural Network models have a memory that allows them to quickly remember knowledge from previous cycles and process genetic data. The present work focuses on type 2 diabetes prediction using gene sequences derived from genomic DNA fragments through automated feature selection and feature extraction procedures for matching gene patterns with training data. The suggested model was tested using tabular data to predict type 2 diabetes based on several parameters. The performance of neural networks incorporating Recurrent Neural Network (RNN) components, Long Short-Term Memory (LSTM), and Gated Recurrent Units (GRU) was tested in this research. The model's efficiency is assessed using the evaluation metrics such as Sensitivity, Specificity, Accuracy, F1-Score, and Matthews Correlation Coefficient (MCC). The suggested technique predicted future illnesses with fair Accuracy. Furthermore, our research showed that the suggested model could be used in real-world scenarios and that input risk variables from an end-user Android application could be kept and evaluated on a secure remote server.
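
Of the metrics listed in this abstract, the correlation coefficient named at the end of the list (MCC) is the least familiar. A minimal sketch of its standard definition from confusion-matrix counts follows. The function name is an assumption made for illustration and this is not the authors' implementation; returning 0.0 when the denominator is zero is a common convention, not something stated in the source.

```python
import math


def matthews_corrcoef(tp, fp, tn, fn):
    """Correlation-style score in [-1, 1] between predicted and true binary labels.

    1 means perfect agreement, 0 means no better than chance,
    -1 means every prediction is wrong.
    """
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        # Convention: undefined cases (an empty row or column) are reported as 0.
        return 0.0
    return (tp * tn - fp * fn) / denom


# Hypothetical counts, invented for illustration only.
print(matthews_corrcoef(tp=40, fp=10, tn=45, fn=5))
```

Unlike accuracy, this score uses all four cells of the confusion matrix, so it stays informative when one class is much rarer than the other.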

Kanda E, Suzuki A, Makino M, Tsubota H, Kanemata S, Shirakawa K, Yajima T. Machine learning models for prediction of HF and CKD development in early-stage type 2 diabetes patients. Sci Rep 2022;12:20012. [PMID: 36411366 PMCID: PMC9678863 DOI: 10.1038/s41598-022-24562-2] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Received: 09/14/2022] [Accepted: 11/17/2022] [Indexed: 11/23/2022] Open

Mao Y, Zhu Z, Pan S, Lin W, Liang J, Huang H, Li L, Wen J, Chen G. Value of machine learning algorithms for predicting diabetes risk: A subset analysis from a real-world retrospective cohort study. J Diabetes Investig 2022;14:309-320. [PMID: 36345236 PMCID: PMC9889616 DOI: 10.1111/jdi.13937] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/01/2022] [Revised: 10/04/2022] [Accepted: 10/16/2022] [Indexed: 11/11/2022] Open

Genis-Mendoza AD, González-Castro TB, Tovilla-Vidal G, Juárez-Rojop IE, Castillo-Avila RG, López-Narváez ML, Tovilla-Zárate CA, Sánchez-de la Cruz JP, Fresán A, Nicolini H. Increased Levels of HbA1c in Individuals with Type 2 Diabetes and Depression: A Meta-Analysis of 34 Studies with 68,398 Participants. Biomedicines 2022;10:1919. [PMID: 36009468 PMCID: PMC9405837 DOI: 10.3390/biomedicines10081919] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Received: 05/27/2022] [Revised: 07/19/2022] [Accepted: 07/23/2022] [Indexed: 01/10/2023] Open

Affiliation(s)

Alma Delia Genis-Mendoza Laboratorio de Genómica de Enfermedades Psiquiátricas y Neurodegenerativas, Instituto Nacional de Medicina Genómica, Ciudad de México 14610, Mexico
Thelma Beatriz González-Castro División Académica Multidisciplinaria de Jalpa de Méndez, Universidad Juárez Autónoma de Tabasco, Jalpa de Méndez 86040, Tabasco, Mexico
Gisselle Tovilla-Vidal División Académica de Ciencias de la Salud, Universidad Juárez Autónoma de Tabasco, Villahermosa 86100, Tabasco, Mexico
Isela Esther Juárez-Rojop División Académica de Ciencias de la Salud, Universidad Juárez Autónoma de Tabasco, Villahermosa 86100, Tabasco, Mexico
Rosa Giannina Castillo-Avila División Académica de Ciencias de la Salud, Universidad Juárez Autónoma de Tabasco, Villahermosa 86100, Tabasco, Mexico
María Lilia López-Narváez Hospital Chiapas Nos Une “Dr. Gilberto Gómez Maza”, Secretaría de Salud de Chiapas, Tuxtla Gutiérrez 29045, Chiapas, Mexico
Carlos Alfonso Tovilla-Zárate División Académica Multidisciplinaria de Comalcalco, Universidad Juárez Autónoma de Tabasco, Comalcalco 86040, Tabasco, Mexico Correspondence: (C.A.T.-Z.); (H.N.); Tel.: +52-993-358-1500 (ext. 6901) (C.A.T.-Z.); +52-5350-1900 (ext. 1197) (H.N.)
Juan Pablo Sánchez-de la Cruz División Académica Multidisciplinaria de Comalcalco, Universidad Juárez Autónoma de Tabasco, Comalcalco 86040, Tabasco, Mexico
Ana Fresán Subdirección de Investigaciones Clínicas, Instituto Nacional de Psiquiatría Ramón de la Fuente Muñíz, Ciudad de México 14370, Mexico
Humberto Nicolini Laboratorio de Genómica de Enfermedades Psiquiátricas y Neurodegenerativas, Instituto Nacional de Medicina Genómica, Ciudad de México 14610, Mexico Correspondence: (C.A.T.-Z.); (H.N.); Tel.: +52-993-358-1500 (ext. 6901) (C.A.T.-Z.); +52-5350-1900 (ext. 1197) (H.N.)

Motaib I, Aitlahbib F, Fadil A, Z Rhmari Tlemcani F, Elamari S, Laidi S, Chadli A. Predicting poor glycemic control during Ramadan among non-fasting patients with diabetes using artificial intelligence based machine learning models. Diabetes Res Clin Pract 2022;190:109982. [PMID: 35803316 DOI: 10.1016/j.diabres.2022.109982] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/10/2022] [Revised: 06/17/2022] [Accepted: 07/04/2022] [Indexed: 11/30/2022]

Liu Q, Zhou Q, He Y, Zou J, Guo Y, Yan Y. Predicting the 2-Year Risk of Progression from Prediabetes to Diabetes Using Machine Learning among Chinese Elderly Adults. J Pers Med 2022;12:jpm12071055. [PMID: 35887552 PMCID: PMC9324396 DOI: 10.3390/jpm12071055] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2022] [Revised: 06/06/2022] [Accepted: 06/23/2022] [Indexed: 11/18/2022] Open

Liu Q, Zhang M, He Y, Zhang L, Zou J, Yan Y, Guo Y. Predicting the Risk of Incident Type 2 Diabetes Mellitus in Chinese Elderly Using Machine Learning Techniques. J Pers Med 2022;12:jpm12060905. [PMID: 35743691 PMCID: PMC9224915 DOI: 10.3390/jpm12060905] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2022] [Revised: 05/21/2022] [Accepted: 05/27/2022] [Indexed: 02/04/2023] Open

Abstract

Early identification of individuals at high risk of diabetes is crucial for implementing early intervention strategies. However, algorithms specific to elderly Chinese adults are lacking. The aim of this study was to build effective machine learning (ML)-based prediction models for the risk of type 2 diabetes mellitus (T2DM) in elderly Chinese adults. A retrospective cohort study was conducted using the health screening data of adults older than 65 years in Wuhan, China from 2018 to 2020. After strict data filtering, 127,031 records from eligible participants were used. Overall, 8298 participants were diagnosed with incident T2DM during the 2-year follow-up (2019–2020). The dataset was randomly split into a training set (n = 101,625) and a test set (n = 25,406). We developed prediction models based on four ML algorithms: logistic regression (LR), decision tree (DT), random forest (RF), and extreme gradient boosting (XGBoost). Using LASSO regression, 21 prediction features were selected. Random under-sampling (RUS) was applied to address class imbalance, and Shapley Additive Explanations (SHAP) were used to calculate and visualize feature importance. Model performance was evaluated by the area under the receiver operating characteristic curve (AUC), sensitivity, specificity, and accuracy. The XGBoost model achieved the best performance (AUC = 0.7805, sensitivity = 0.6452, specificity = 0.7577, accuracy = 0.7503). Fasting plasma glucose (FPG), education, exercise, gender, and waist circumference (WC) were the five most important predictors. This study showed that the XGBoost model can be applied to screen individuals at high risk of T2DM in the early phase, which has strong potential for intelligent prevention and control of diabetes. The key features could also be useful for developing targeted diabetes prevention interventions.
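
As an illustration of the random under-sampling step mentioned in this abstract, the following is a minimal sketch using only the Python standard library. It is not the authors' code: the function name `random_under_sample`, the seed, and the toy glucose-like values are assumptions made for the example, and the abstract does not state which sampling ratio was used (a 1:1 ratio is assumed here).

```python
import random

def random_under_sample(records, labels, seed=0):
    """Randomly drop majority-class records until both classes have equal size.

    Hypothetical helper for illustration; assumes binary labels (0/1)
    and a 1:1 target ratio.
    """
    rng = random.Random(seed)
    pos = [i for i, y in enumerate(labels) if y == 1]
    neg = [i for i, y in enumerate(labels) if y == 0]
    # Identify which class is smaller; it is kept in full.
    minority, majority = (pos, neg) if len(pos) <= len(neg) else (neg, pos)
    # Sample without replacement from the majority class.
    kept = minority + rng.sample(majority, len(minority))
    rng.shuffle(kept)
    return [records[i] for i in kept], [labels[i] for i in kept]

# Toy data (made-up values): 2 positives among 10 records.
X = [[5.1], [6.9], [5.4], [7.2], [5.0], [5.3], [5.6], [5.2], [5.8], [5.5]]
y = [0, 1, 0, 1, 0, 0, 0, 0, 0, 0]
X_bal, y_bal = random_under_sample(X, y)
print(len(y_bal), sum(y_bal))  # 4 records remain, 2 of them positive
```

Under-sampling is normally applied to the training set only, so that the test set keeps the original class distribution.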

Research Progress in the Early Warning of Chicken Diseases by Monitoring Clinical Symptoms. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12115601] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]

Zhang L, Niu M, Zhang H, Wang Y, Zhang H, Mao Z, Zhang X, He M, Wu T, Wang Z, Wang C. Nonlaboratory-based risk assessment model for coronary heart disease screening: Model development and validation. Int J Med Inform 2022;162:104746. [PMID: 35325662 DOI: 10.1016/j.ijmedinf.2022.104746] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Revised: 03/14/2022] [Accepted: 03/15/2022] [Indexed: 12/11/2022]

Abstract

BACKGROUND

Identifying groups at high risk of coronary heart disease (CHD) is important to reduce mortality due to CHD. Although machine learning methods have been introduced, many require laboratory or imaging parameters, which are not always readily available; thus, their wide applications are limited.

OBJECTIVE

The aim of this study was to develop and validate a simple, efficient, and joint machine learning model for identifying individuals at high risk of CHD using easily obtainable nonlaboratory parameters.

METHODS

This prospective study used data from the Henan Rural Cohort Study, which was conducted in rural areas of Henan Province, China, between July 2015 and September 2017. A joint machine learning model was developed by selecting and combining four base machine learning algorithms, including logistic regression (LR), artificial neural network (ANN), random forest (RF), and gradient boosting machine (GBM). We used readily accessible variables, including demographics, medical and family history, lifestyle and dietary factors, and anthropometric data, to inform the model. The model was also externally validated by a cohort of individuals from the Dongfeng-Tongji cohort study. Model discrimination was assessed by using the area under the receiver operating characteristic curve (AUC), and calibration was measured by using the Brier score (BS).
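
The two evaluation measures named above can be sketched in a few lines of plain Python. This is an illustrative sketch rather than the study's implementation: the function names and the four toy predictions are assumptions. The Brier score is the mean squared difference between predicted probability and observed outcome, and the AUC is computed here through its rank interpretation (the probability that a randomly chosen case receives a higher score than a randomly chosen non-case, with ties counted as one half).

```python
def brier_score(y_true, y_prob):
    """Mean squared error between predicted probabilities and 0/1 outcomes."""
    return sum((p - y) ** 2 for y, p in zip(y_true, y_prob)) / len(y_true)

def auc(y_true, y_prob):
    """AUC as the fraction of (case, non-case) pairs ranked correctly."""
    pos = [p for y, p in zip(y_true, y_prob) if y == 1]
    neg = [p for y, p in zip(y_true, y_prob) if y == 0]
    # A tie between a case and a non-case counts as half a correct ranking.
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

# Toy predictions (made-up values, not from the study).
y_true = [0, 0, 1, 1]
y_prob = [0.1, 0.4, 0.35, 0.8]
print(auc(y_true, y_prob))          # 0.75
print(brier_score(y_true, y_prob))  # approximately 0.158
```

Lower Brier scores indicate better calibration, while higher AUC values indicate better discrimination. The pairwise AUC loop is quadratic in sample size, so a rank-based routine would be preferable for cohorts of this scale.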

RESULTS

A total of 38,716 participants (mean [SD] age, 55.64 [12.19] years; 23,449 [60.6%] female) from the Henan Rural Cohort Study and 17,958 subjects (mean [SD] age, 62.74 [7.59] years; 10,076 [56.1%] female) from the Dongfeng-Tongji cohort study were included in the analysis. Age, waist circumference, pulse pressure, heart rate, family history of CHD, education level, family history of type 2 diabetes mellitus (T2DM), and family history of dyslipidaemia were strongly associated with the development of CHD. In internal validation, the model demonstrated good discrimination (AUC, 0.844 [95% CI 0.828-0.860]) and acceptable calibration (BS, 0.066). In external validation, the model performed well, with clearly useful discrimination (AUC, 0.792 [95% CI 0.774-0.810]) and robust calibration (BS, 0.069).

CONCLUSIONS

In this study, a novel, simple machine learning-based model comprising readily accessible variables accurately identified individuals at high risk of CHD. This model has the potential to be widely applied for large-scale CHD screening, especially in medical resource-constrained settings.

TRIAL REGISTRATION

The Henan Rural Cohort Study has been registered at the Chinese Clinical Trial Register. (Trial registration: ChiCTR-OOC-15006699. Registered 6 July 2015 - Retrospectively registered) http://www.chictr.org.cn/showproj.aspx?proj=11375.

Affiliation(s)

Liying Zhang School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou, Henan, PR China; Department of Epidemiology and Biostatistics, College of Public Health, Zhengzhou University, Zhengzhou, Henan, PR China
Miaomiao Niu Department of Epidemiology and Biostatistics, College of Public Health, Zhengzhou University, Zhengzhou, Henan, PR China
Haiyang Zhang School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou, Henan, PR China
Yikang Wang Department of Epidemiology and Biostatistics, College of Public Health, Zhengzhou University, Zhengzhou, Henan, PR China
Haiqing Zhang Department of Occupational and Environmental Health, Key Laboratory of Environment and Health, Ministry of Education and State Key Laboratory of Environmental Health (Incubating) School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, PR China
Zhenxing Mao Department of Epidemiology and Biostatistics, College of Public Health, Zhengzhou University, Zhengzhou, Henan, PR China
Xiaomin Zhang Department of Occupational and Environmental Health, Key Laboratory of Environment and Health, Ministry of Education and State Key Laboratory of Environmental Health (Incubating) School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, PR China
Meian He Department of Occupational and Environmental Health, Key Laboratory of Environment and Health, Ministry of Education and State Key Laboratory of Environmental Health (Incubating) School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, PR China
Tangchun Wu Department of Occupational and Environmental Health, Key Laboratory of Environment and Health, Ministry of Education and State Key Laboratory of Environmental Health (Incubating) School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, PR China
Zhenfei Wang School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou, Henan, PR China.
Chongjian Wang Department of Epidemiology and Biostatistics, College of Public Health, Zhengzhou University, Zhengzhou, Henan, PR China.

Karaglani M, Panagopoulou M, Cheimonidi C, Tsamardinos I, Maltezos E, Papanas N, Papazoglou D, Mastorakos G, Chatzaki E. Liquid Biopsy in Type 2 Diabetes Mellitus Management: Building Specific Biosignatures via Machine Learning. J Clin Med 2022;11:1045. [PMID: 35207316 PMCID: PMC8876363 DOI: 10.3390/jcm11041045] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2022] [Revised: 02/09/2022] [Accepted: 02/15/2022] [Indexed: 02/05/2023] Open

Machine learning-based diagnosis and risk factor analysis of cardiocerebrovascular disease based on KNHANES. Sci Rep 2022;12:2250. [PMID: 35145205 PMCID: PMC8831514 DOI: 10.1038/s41598-022-06333-1] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2021] [Accepted: 01/25/2022] [Indexed: 12/31/2022] Open

Haneef R, Tijhuis M, Thiébaut R, Májek O, Pristaš I, Tolenan H, Gallay A. Methodological guidelines to estimate population-based health indicators using linked data and/or machine learning techniques. Arch Public Health 2022;80:9. [PMID: 34983651 PMCID: PMC8725299 DOI: 10.1186/s13690-021-00770-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2021] [Accepted: 12/17/2021] [Indexed: 12/23/2022] Open

Abstract

BACKGROUND

The capacity to use data linkage and artificial intelligence to estimate and predict health indicators varies across European countries. However, estimating health indicators from linked administrative data is challenging for several reasons: variability in data sources and data collection methods, which reduces interoperability at various levels and timeliness; the availability of a large number of variables; and a lack of skills and capacity to link and analyze big data. The main objective of this study is to develop methodological guidelines for calculating population-based health indicators, to guide European countries using linked data and/or machine learning (ML) techniques with new methods.

METHOD

We have performed the following step-wise approach systematically to develop the methodological guidelines: i. Scientific literature review, ii. Identification of inspiring examples from European countries, and iii. Developing the checklist of guidelines contents.

RESULTS

We have developed the methodological guidelines, which provide a systematic approach for studies using linked data and/or ML-techniques to produce population-based health indicators. These guidelines include a detailed checklist of the following items: rationale and objective of the study (i.e., research question), study design, linked data sources, study population/sample size, study outcomes, data preparation, data analysis (i.e., statistical techniques, sensitivity analysis and potential issues during data analysis) and study limitations.

CONCLUSIONS

This is the first study to develop the methodological guidelines for studies focused on population health using linked data and/or machine learning techniques. These guidelines would support researchers to adopt and develop a systematic approach for high-quality research methods. There is a need for high-quality research methodologies using more linked data and ML-techniques to develop a structured cross-disciplinary approach for improving the population health information and thereby the population health.

AIM in Endocrinology. Artif Intell Med 2022. [DOI: 10.1007/978-3-030-64573-1_328] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/07/2022]

Liu X, Zhang W, Zhang Q, Chen L, Zeng T, Zhang J, Min J, Tian S, Zhang H, Huang H, Wang P, Hu X, Chen L. Development and validation of a machine learning-augmented algorithm for diabetes screening in community and primary care settings: A population-based study. Front Endocrinol (Lausanne) 2022;13:1043919. [PMID: 36518245 PMCID: PMC9742532 DOI: 10.3389/fendo.2022.1043919] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/14/2022] [Accepted: 11/11/2022] [Indexed: 11/29/2022] Open

Affiliation(s)

XiaoHuan Liu Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China; Hubei provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
Weiyue Zhang Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China; Hubei provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
Qiao Zhang Department of Cardiovascular Surgery, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
Long Chen Department of Computer Science and Technology, Tsinghua University, Beijing, China
TianShu Zeng Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China; Hubei provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
JiaoYue Zhang Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China; Hubei provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
Jie Min Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China; Hubei provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
ShengHua Tian Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China; Hubei provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
Hao Zhang Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China; Hubei provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
Hantao Huang Yiling Hospital, Yichang, China
Ping Wang Precision Health Program, Department of Radiology, College of Human Medicine, Michigan State University, East Lansing, MI, United States
Xiang Hu Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China; Hubei provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China. *Correspondence: LuLu Chen; Xiang Hu
LuLu Chen Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China; Hubei provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China. *Correspondence: LuLu Chen; Xiang Hu

Fregoso-Aparicio L, Noguez J, Montesinos L, García-García JA. Machine learning and deep learning predictive models for type 2 diabetes: a systematic review. Diabetol Metab Syndr 2021;13:148. [PMID: 34930452 PMCID: PMC8686642 DOI: 10.1186/s13098-021-00767-9] [Citation(s) in RCA: 31] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/06/2021] [Accepted: 12/07/2021] [Indexed: 12/12/2022] Open

Nomura A, Noguchi M, Kometani M, Furukawa K, Yoneda T. Artificial Intelligence in Current Diabetes Management and Prediction. Curr Diab Rep 2021;21:61. [PMID: 34902070 PMCID: PMC8668843 DOI: 10.1007/s11892-021-01423-2] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 07/13/2021] [Indexed: 10/28/2022]

Makino K, Lee S, Bae S, Chiba I, Harada K, Katayama O, Tomida K, Morikawa M, Shimada H. Simplified Decision-Tree Algorithm to Predict Falls for Community-Dwelling Older Adults. J Clin Med 2021;10:jcm10215184. [PMID: 34768703 PMCID: PMC8585075 DOI: 10.3390/jcm10215184] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2021] [Revised: 10/26/2021] [Accepted: 11/03/2021] [Indexed: 11/16/2022] Open

Affiliation(s)

Keitaro Makino Department of Preventive Gerontology, Center for Gerontology and Social Science, National Center for Geriatrics and Gerontology, 7-430 Morioka-cho, Obu City 474-8511, Japan; Research Fellowship for Young Scientists, Japan Society for the Promotion of Science, Chiyoda-ku, Tokyo 102-0083, Japan. Correspondence: Tel.: +81-562-44-5651
Sangyoon Lee Department of Preventive Gerontology, Center for Gerontology and Social Science, National Center for Geriatrics and Gerontology, 7-430 Morioka-cho, Obu City 474-8511, Japan
Seongryu Bae Department of Preventive Gerontology, Center for Gerontology and Social Science, National Center for Geriatrics and Gerontology, 7-430 Morioka-cho, Obu City 474-8511, Japan
Ippei Chiba Department of Preventive Gerontology, Center for Gerontology and Social Science, National Center for Geriatrics and Gerontology, 7-430 Morioka-cho, Obu City 474-8511, Japan
Kenji Harada Department of Preventive Gerontology, Center for Gerontology and Social Science, National Center for Geriatrics and Gerontology, 7-430 Morioka-cho, Obu City 474-8511, Japan
Osamu Katayama Department of Preventive Gerontology, Center for Gerontology and Social Science, National Center for Geriatrics and Gerontology, 7-430 Morioka-cho, Obu City 474-8511, Japan
Kouki Tomida Department of Preventive Gerontology, Center for Gerontology and Social Science, National Center for Geriatrics and Gerontology, 7-430 Morioka-cho, Obu City 474-8511, Japan
Masanori Morikawa Department of Preventive Gerontology, Center for Gerontology and Social Science, National Center for Geriatrics and Gerontology, 7-430 Morioka-cho, Obu City 474-8511, Japan
Hiroyuki Shimada Center for Gerontology and Social Science, National Center for Geriatrics and Gerontology, 7-430 Morioka-cho, Obu City 474-8511, Japan

Makino K, Lee S, Bae S, Chiba I, Harada K, Katayama O, Shinkai Y, Shimada H. Development and validation of new screening tool for predicting dementia risk in community-dwelling older Japanese adults. J Transl Med 2021;19:448. [PMID: 34702306 PMCID: PMC8549197 DOI: 10.1186/s12967-021-03121-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2021] [Accepted: 10/16/2021] [Indexed: 12/02/2022] Open

Abstract

BACKGROUND

Established clinical assessments for detecting dementia risk often require time, cost, and face-to-face meetings. We aimed to develop a Simplified Telephone Assessment for Dementia risk (STAD) (a new screening tool utilizing telephonic interviews to predict dementia risk) and examine the predictive validity of the STAD for the incidence of dementia.

METHODS

We developed STAD based on a combination of literature review, statistical analysis, and expert opinion. We selected 12 binary questions on subjective cognitive complaints, depressive symptoms, and lifestyle activities. In the validation study, we used STAD for 4298 community-dwelling older adults and observed the incidence of dementia during the 24-month follow-up period. The total score of STAD ranging from 0 to 12 was calculated, and the cut-off point for dementia incidence was determined using the Youden index. The survival rate of dementia incidence according to the cut-off points was determined. Furthermore, we used a decision-tree model (classification and regression tree, CART) to enhance the predictive ability of STAD for dementia risk screening.
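
The cut-off selection described above can be illustrated with a short sketch. The Youden index is J = sensitivity + specificity - 1, and the chosen cut-off is the score threshold that maximizes J. The code below is an assumption-laden example, not the study's analysis: the function name `youden_cutoff`, the rule "score >= cut-off counts as screen-positive", and the eight toy scores and outcomes are all hypothetical.

```python
def youden_cutoff(scores, outcomes):
    """Return (cut-off, J) maximizing Youden's J = sensitivity + specificity - 1.

    A participant is treated as screen-positive when score >= cut-off.
    On ties in J, the lowest cut-off is kept.
    """
    n_pos = sum(outcomes)
    n_neg = len(outcomes) - n_pos
    best_cut, best_j = None, -1.0
    for cut in sorted(set(scores)):
        tp = sum(1 for s, o in zip(scores, outcomes) if s >= cut and o == 1)
        tn = sum(1 for s, o in zip(scores, outcomes) if s < cut and o == 0)
        j = tp / n_pos + tn / n_neg - 1
        if j > best_j:
            best_cut, best_j = cut, j
    return best_cut, best_j

# Toy screening scores and outcomes (made-up values, not STAD data).
scores = [1, 2, 3, 4, 5, 6, 7, 8]
outcomes = [0, 0, 0, 1, 0, 1, 1, 1]
cut, j = youden_cutoff(scores, outcomes)
print(cut, j)  # 4 0.75
```

In this toy example a cut-off of 4 means scores of 4 or more are flagged, which corresponds to a "3/4" split in the notation used for questionnaire cut-off points.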

RESULTS

The cut-off point of STAD was set at 4/5. Participants scoring ≥ 5 points showed a significantly higher risk of dementia than those scoring ≤ 4 points, even after adjusting for covariates (hazard ratio [95% confidence interval], 2.67 [1.40-5.08]). A decision tree model using the CART algorithm was constructed using 12 nodes with three STAD items. It showed better performance for dementia prediction in terms of accuracy and specificity as compared to the logistic regression model, although its sensitivity was worse than the logistic regression model.

CONCLUSIONS

We developed a 12-item questionnaire, STAD, as a screening tool to predict dementia risk utilizing telephonic interviews and confirmed its predictive validity. Our findings might provide useful information for early screening of dementia risk and enable bridging between community and clinical settings. Additionally, STAD could be employed without face-to-face meetings in a short time; therefore, it may be a suitable screening tool for community-dwelling older adults who have negative attitudes toward clinical examination or are non-adherent to follow-up assessments in clinical trials.

Nguyen P, Ohnmacht AJ, Galhoz A, Büttner M, Theis F, Menden MP. Künstliche Intelligenz und maschinelles Lernen in der Diabetesforschung. DIABETOLOGE 2021. [DOI: 10.1007/s11428-021-00817-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Niu M, Wang Y, Zhang L, Tu R, Liu X, Hou J, Huo W, Mao Z, Wang C, Bie R. Identifying the predictive effectiveness of a genetic risk score for incident hypertension using machine learning methods among populations in rural China. Hypertens Res 2021;44:1483-1491. [PMID: 34480134 DOI: 10.1038/s41440-021-00738-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2021] [Revised: 07/31/2021] [Accepted: 08/04/2021] [Indexed: 12/17/2022]

Abstract

Current studies have shown the controversial effect of genetic risk scores (GRSs) in hypertension prediction. Machine learning methods are used extensively in the medical field but rarely in the mining of genetic information. This study aims to determine whether genetic information can improve the prediction of incident hypertension using machine learning approaches in a prospective study. The study recruited 4592 subjects without hypertension at baseline from a cohort study conducted in rural China. A polygenic risk score (PGGRS) was calculated using 13 SNPs. According to a ratio of 7:3, subjects were randomly allocated to the train and test datasets. Models with and without the PGGRS were established using the train dataset with Cox regression, artificial neural network (ANN), random forest (RF), and gradient boosting machine (GBM) methods. The discrimination and reclassification of models were estimated using the test dataset. The PGGRS showed a significant association with the risk of incident hypertension (HR (95% CI), 1.046 (1.004, 1.090), P = 0.031) irrespective of baseline blood pressure. Models that did not include the PGGRS achieved AUCs (95% CI) of 0.785 (0.763, 0.807), 0.790 (0.768, 0.811), 0.838 (0.817, 0.857), and 0.854 (0.835, 0.873) for the Cox, ANN, RF, and GBM methods, respectively. The addition of the PGGRS led to the improvement of the AUC by 0.001, 0.008, 0.023, and 0.017; IDI by 1.39%, 2.86%, 4.73%, and 4.68%; and NRI by 25.05%, 13.01%, 44.87%, and 22.94%, respectively. Incident hypertension risk was better predicted by the traditional+PGGRS model, especially when machine learning approaches were used, suggesting that genetic information may have the potential to identify new hypertension cases using machine learning methods in resource-limited areas. CLINICAL TRIAL REGISTRATION: The Henan Rural Cohort Study has been registered at the Chinese Clinical Trial Register (Registration number: ChiCTR-OOC-15006699). 
http://www.chictr.org.cn/showproj.aspx?proj=11375.

Lee S, Zhou J, Leung KSK, Wu WKK, Wong WT, Liu T, Wong ICK, Jeevaratnam K, Zhang Q, Tse G. Development of a predictive risk model for all-cause mortality in patients with diabetes in Hong Kong. BMJ Open Diabetes Res Care 2021;9:9/1/e001950. [PMID: 34117050 PMCID: PMC8201981 DOI: 10.1136/bmjdrc-2020-001950] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/15/2020] [Accepted: 05/09/2021] [Indexed: 01/14/2023] Open

Abstract

INTRODUCTION

Patients with diabetes mellitus are at risk of premature death. In this study, we developed a machine learning-driven predictive risk model for all-cause mortality among patients with type 2 diabetes mellitus using a multiparametric approach with data from different domains.

RESEARCH DESIGN AND METHODS

This study used territory-wide data of patients with type 2 diabetes attending public hospitals or their associated ambulatory/outpatient facilities in Hong Kong between January 1, 2009 and December 31, 2009. The primary outcome is all-cause mortality. The association of risk variables and all-cause mortality was assessed using Cox proportional hazards models. Machine and deep learning approaches were used to improve overall survival prediction and were evaluated with fivefold cross validation method.

RESULTS

A total of 273 678 patients (mean age: 65.4±12.7 years, male: 48.2%, median follow-up: 142 (IQR=106-142) months) were included, with 91 155 deaths occurring on follow-up (33.3%; annualized mortality rate: 3.4%/year; 2.7 million patient-years). Multivariate Cox regression found the following significant predictors of all-cause mortality: age, male gender, baseline comorbidities, anemia, mean values of neutrophil-to-lymphocyte ratio, high-density lipoprotein-cholesterol, total cholesterol, triglyceride, HbA1c and fasting blood glucose (FBG), measures of variability of both HbA1c and FBG. The above parameters were incorporated into a score-based predictive risk model that had a c-statistic of 0.73 (95% CI 0.66 to 0.77), which was improved to 0.86 (0.81 to 0.90) and 0.87 (0.84 to 0.91) using random survival forests and deep survival learning models, respectively.

CONCLUSIONS

A multiparametric model incorporating variables from different domains predicted all-cause mortality accurately in type 2 diabetes mellitus. The predictive and modeling capabilities of machine/deep learning survival analysis achieved more accurate predictions.

Liao Q, Zhang Q, Feng X, Huang H, Xu H, Tian B, Liu J, Yu Q, Guo N, Liu Q, Huang B, Ma D, Ai J, Xu S, Li K. Development of deep learning algorithms for predicting blastocyst formation and quality by time-lapse monitoring. Commun Biol 2021;4:415. [PMID: 33772211 PMCID: PMC7998018 DOI: 10.1038/s42003-021-01937-1] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2020] [Accepted: 02/24/2021] [Indexed: 12/24/2022] Open

Affiliation(s)

Qiuyue Liao Department of Gynecology and Obstetrics, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, China
Qi Zhang Shanghai Institute for Advanced Communication and Data Science, Shanghai University, Shanghai, China
Xue Feng Department of Gynecology and Obstetrics, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, China
Haibo Huang Shanghai Institute for Advanced Communication and Data Science, Shanghai University, Shanghai, China
Haohao Xu Shanghai Institute for Advanced Communication and Data Science, Shanghai University, Shanghai, China
Baoyuan Tian Shanghai Institute for Advanced Communication and Data Science, Shanghai University, Shanghai, China
Jihao Liu Shanghai Institute for Advanced Communication and Data Science, Shanghai University, Shanghai, China
Qihui Yu Shanghai Institute for Advanced Communication and Data Science, Shanghai University, Shanghai, China
Na Guo Department of Gynecology and Obstetrics, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, China
Qun Liu Department of Gynecology and Obstetrics, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, China
Bo Huang Department of Gynecology and Obstetrics, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, China
Ding Ma Department of Gynecology and Obstetrics, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, China
Jihui Ai Department of Gynecology and Obstetrics, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, China.
Shugong Xu Shanghai Institute for Advanced Communication and Data Science, Shanghai University, Shanghai, China.
Kezhen Li Department of Gynecology and Obstetrics, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, China.

Wang Y, Zhang L, Niu M, Li R, Tu R, Liu X, Hou J, Mao Z, Wang Z, Wang C. Genetic Risk Score Increased Discriminant Efficiency of Predictive Models for Type 2 Diabetes Mellitus Using Machine Learning: Cohort Study. Front Public Health 2021;9:606711. [PMID: 33681127 PMCID: PMC7925839 DOI: 10.3389/fpubh.2021.606711] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2020] [Accepted: 01/25/2021] [Indexed: 11/13/2022] Open

Abstract

Background: Previous studies have constructed prediction models for type 2 diabetes mellitus (T2DM), but machine learning was rarely used and few focused on genetic prediction. This study aimed to establish an effective T2DM prediction tool and to further explore the potential of genetic risk scores (GRS) via various classifiers among rural adults.

Methods: In this prospective study, the GRS was calculated for a total of 5,712 participants from the Henan Rural Cohort Study. Cox proportional hazards (CPH) regression was used to analyze the associations between GRS and T2DM. CPH, artificial neural network (ANN), random forest (RF), and gradient boosting machine (GBM) classifiers were used to establish prediction models. The area under the receiver operating characteristic curve (AUC) and net reclassification index (NRI) were used to assess the discrimination ability of the models. A decision curve was plotted to determine the clinical utility of the prediction models.

Results: Compared with individuals in the lowest quintile of the GRS, the HR (95% CI) was 2.06 (1.40 to 3.03) for those in the highest quintile of the GRS (P_trend < 0.05). Based on conventional predictors, the AUCs of the prediction models were 0.815, 0.816, 0.843, and 0.851 via CPH, ANN, RF, and GBM, respectively. The changes in AUC after integrating the GRS were 0.001, 0.002, 0.018, and 0.033 for CPH, ANN, RF, and GBM, respectively. Reclassification improved significantly for all classifiers when the GRS was added (NRI: 41.2% for CPH; 41.0% for ANN; 46.4% for RF; 45.1% for GBM). Decision curve analysis indicated a clinical benefit of the model combined with the GRS.

Conclusion: The prediction model combined with the GRS may provide incremental predictive performance beyond conventional factors for T2DM, which demonstrates the potential clinical use of genetic markers to screen vulnerable populations.

Clinical Trial Registration: The Henan Rural Cohort Study is registered in the Chinese Clinical Trial Register (Registration number: ChiCTR-OOC-15006699). http://www.chictr.org.cn/showproj.aspx?proj=11375.
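
The abstract above compares models by AUC and NRI. The following is a minimal sketch of how those two quantities can be computed, not the authors' code. It assumes the category-free (continuous) form of the NRI, which the abstract does not specify, and the toy risks and labels are invented purely for illustration.

```python
def auc(scores, labels):
    """AUC as the probability that a random case outranks a random non-case.

    Ties between a case and a non-case count as half a win.
    """
    cases = [s for s, y in zip(scores, labels) if y == 1]
    controls = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((c > n) + 0.5 * (c == n) for c in cases for n in controls)
    return wins / (len(cases) * len(controls))


def continuous_nri(old_risk, new_risk, labels):
    """Category-free NRI (an assumed variant; the abstract does not say which was used).

    Net share of cases whose predicted risk rises, plus net share of
    non-cases whose predicted risk falls, when moving from old to new model.
    """
    case_diffs = [n - o for o, n, y in zip(old_risk, new_risk, labels) if y == 1]
    ctrl_diffs = [n - o for o, n, y in zip(old_risk, new_risk, labels) if y == 0]
    nri_cases = (sum(d > 0 for d in case_diffs) - sum(d < 0 for d in case_diffs)) / len(case_diffs)
    nri_ctrls = (sum(d < 0 for d in ctrl_diffs) - sum(d > 0 for d in ctrl_diffs)) / len(ctrl_diffs)
    return nri_cases + nri_ctrls


# Hypothetical predicted risks: conventional model vs. conventional + GRS model.
labels = [1, 1, 1, 0, 0, 0, 0]
conventional = [0.6, 0.4, 0.3, 0.5, 0.2, 0.3, 0.1]
with_grs = [0.7, 0.5, 0.2, 0.4, 0.2, 0.2, 0.1]

delta_auc = auc(with_grs, labels) - auc(conventional, labels)
nri = continuous_nri(conventional, with_grs, labels)
```

A small change in AUC can coexist with a large NRI, because the NRI counts the direction of every individual risk change rather than changes in rank order alone.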

Niu M, Zhang L, Wang Y, Tu R, Liu X, Hou J, Huo W, Mao Z, Wang Z, Wang C. Genetic factors increase the identification efficiency of predictive models for dyslipidaemia: a prospective cohort study. Lipids Health Dis 2021;20:11. [PMID: 33579296 PMCID: PMC7881493 DOI: 10.1186/s12944-021-01439-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 11/15/2020] [Accepted: 01/27/2021] [Indexed: 11/10/2022]

Abstract

Background

Few studies have developed risk models for dyslipidaemia, especially for rural populations. Furthermore, the performance of genetic factors in predicting dyslipidaemia has not been explored. The purpose of this study is to develop and evaluate prediction models with and without genetic factors for dyslipidaemia in rural populations.

Methods

A total of 3596 individuals from the Henan Rural Cohort Study were included in this study. All individuals were divided into a training set and a testing set at a ratio of 7:3. The conventional models and conventional+GRS (genetic risk score) models were developed with Cox regression, artificial neural network (ANN), random forest (RF), and gradient boosting machine (GBM) classifiers in the training set. The area under the receiver operating characteristic curve (AUC), net reclassification index (NRI), and integrated discrimination index (IDI) were used to assess the discrimination ability of the models, and the calibration curve was used to show calibration ability in the testing set.

Results

Compared to the lowest quartile of GRS, the hazard ratio (HR) (95% confidence interval (CI)) of individuals in the highest quartile of GRS was 1.23 (1.07, 1.41) in the total population. Age, family history of diabetes, physical activity, body mass index (BMI), triglycerides (TGs), high-density lipoprotein cholesterol (HDL-C), and low-density lipoprotein cholesterol (LDL-C) were used to develop the conventional models, and the AUCs of the Cox, ANN, RF, and GBM classifiers were 0.702 (0.673, 0.729), 0.736 (0.708, 0.762), 0.787 (0.762, 0.811), and 0.816 (0.792, 0.839), respectively. After adding GRS, the AUCs increased by 0.005, 0.018, 0.023, and 0.015 with the Cox, ANN, RF, and GBM classifiers, respectively. The corresponding NRIs were 25.6%, 7.8%, 14.1%, and 18.1%, and the corresponding IDIs were 2.3%, 1.0%, 2.5%, and 1.8%, respectively.

Conclusion

Genetic factors could improve the predictive ability of the dyslipidaemia risk model, suggesting that genetic information could be provided as a potential predictor to screen for clinical dyslipidaemia.

Trial registration

The Henan Rural Cohort Study has been registered at the Chinese Clinical Trial Register. (Trial registration: ChiCTR-OOC-15006699. Registered 6 July 2015 - Retrospectively registered).

Supplementary Information

The online version contains supplementary material available at 10.1186/s12944-021-01439-3.
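
The abstract above describes a 7:3 split into training and testing sets and reports the IDI alongside the NRI. The sketch below illustrates both steps under stated assumptions: the split is a simple seeded random shuffle (the abstract does not say how the split was drawn), the function names `split_7_3` and `idi` are hypothetical, and the toy risks are invented.

```python
import random
from statistics import mean


def split_7_3(ids, seed=0):
    """Randomly split identifiers into training (70%) and testing (30%) sets.

    The seeded shuffle is an assumption made for reproducibility of the sketch.
    """
    shuffled = list(ids)
    random.Random(seed).shuffle(shuffled)
    cut = round(0.7 * len(shuffled))
    return shuffled[:cut], shuffled[cut:]


def idi(old_risk, new_risk, labels):
    """Integrated discrimination index (improvement).

    Difference between models in the "discrimination slope", i.e. mean
    predicted risk among cases minus mean predicted risk among non-cases.
    """
    def slope(risk):
        cases = [r for r, y in zip(risk, labels) if y == 1]
        controls = [r for r, y in zip(risk, labels) if y == 0]
        return mean(cases) - mean(controls)

    return slope(new_risk) - slope(old_risk)


# 3596 is the cohort size reported in the abstract; the IDs are placeholders.
train_ids, test_ids = split_7_3(range(3596))

# Hypothetical risks: the old model cannot separate the groups, the new one can.
toy_labels = [1, 1, 0, 0]
toy_old = [0.5, 0.5, 0.5, 0.5]
toy_new = [0.75, 0.75, 0.25, 0.25]
toy_idi = idi(toy_old, toy_new, toy_labels)
```

Metrics such as the IDI would then be computed on `test_ids` only, so that the reported improvement is not inflated by fitting and evaluating on the same individuals.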

Wu Y, Hu H, Cai J, Chen R, Zuo X, Cheng H, Yan D. Machine Learning for Predicting the 3-Year Risk of Incident Diabetes in Chinese Adults. Front Public Health 2021;9:626331. [PMID: 34268283 PMCID: PMC8275929 DOI: 10.3389/fpubh.2021.626331] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Received: 11/16/2020] [Accepted: 05/21/2021] [Indexed: 02/05/2023]

Abstract

Purpose: We aimed to establish and validate a risk assessment system that combines demographic and clinical variables to predict the 3-year risk of incident diabetes in Chinese adults. Methods: A 3-year cohort study was performed on 15,928 Chinese adults without diabetes at baseline. All participants were randomly divided into a training set (n = 7,940) and a validation set (n = 7,988). XGBoost, a machine learning technique, was used to select the most important variables from the candidate variables. We then established a stepwise model based on the predictors chosen by the XGBoost model. The area under the receiver operating characteristic curve (AUC), decision curve analysis, and calibration analysis were used to assess the discrimination, clinical use, and calibration of the model, respectively. External validation was performed on a cohort of 11,113 Japanese participants. Result: In the training and validation sets, 148 and 145 incident diabetes cases occurred, respectively. XGBoost selected the 10 most important variables from 15 candidate variables. Fasting plasma glucose (FPG), body mass index (BMI), and age were the top 3 variables. We then established a stepwise model and a prediction nomogram. The AUCs of the stepwise model were 0.933 and 0.910 in the training and validation sets, respectively. The Hosmer-Lemeshow test showed no significant lack of fit between the predicted and observed diabetes risk (p = 0.068 for the training set, p = 0.165 for the validation set). Decision curve analysis supported the clinical use of the stepwise model across a wide range of threshold probabilities. There were almost no interactions between these predictors (most P-values for interaction > 0.05).
Furthermore, the AUC for the external validation set was 0.830, and the Hosmer-Lemeshow test for the external validation set showed no statistically significant difference between the predicted diabetes risk and observed diabetes risk (P = 0.824). Conclusion: We established and validated a risk assessment system for characterizing the 3-year risk of incident diabetes.
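
Calibration in the abstract above is judged with the Hosmer-Lemeshow test. As a rough illustration only, the sketch below computes the Hosmer-Lemeshow statistic by sorting individuals by predicted risk, cutting them into equal-sized groups, and comparing observed with expected events in each group. The function name, the default of 10 groups, and the toy data are assumptions; the p-value, which would come from a chi-square distribution with (groups - 2) degrees of freedom, is left out to keep the example free of third-party libraries.

```python
def hosmer_lemeshow_statistic(probs, labels, groups=10):
    """Hosmer-Lemeshow chi-square statistic over risk-sorted groups.

    For each group g: (O_g - E_g)^2 / (n_g * p_g * (1 - p_g)), where O_g is
    the observed number of events, E_g the sum of predicted risks, and p_g
    the mean predicted risk. Groups with p_g of exactly 0 or 1 are skipped
    to avoid division by zero (a simplification made for this sketch).
    """
    pairs = sorted(zip(probs, labels), key=lambda pair: pair[0])
    n = len(pairs)
    statistic = 0.0
    for g in range(groups):
        chunk = pairs[g * n // groups:(g + 1) * n // groups]
        if not chunk:
            continue
        size = len(chunk)
        observed = sum(y for _, y in chunk)
        expected = sum(p for p, _ in chunk)
        mean_risk = expected / size
        if mean_risk in (0.0, 1.0):
            continue
        statistic += (observed - expected) ** 2 / (size * mean_risk * (1 - mean_risk))
    return statistic


# Hypothetical, well-calibrated toy case: half of those given risk 0.5 have the event.
calibrated = hosmer_lemeshow_statistic([0.5, 0.5, 0.5, 0.5], [1, 0, 1, 0], groups=2)

# Hypothetical, slightly under-confident model: risks 0.2 and 0.8, outcomes 0 and 1.
underconfident = hosmer_lemeshow_statistic([0.2, 0.2, 0.8, 0.8], [0, 0, 1, 1], groups=2)
```

A larger statistic means a larger gap between predicted and observed risk. A non-significant result, as reported in the abstract, indicates that no lack of fit was detected, which is weaker than proof of a perfect fit.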

Hong N, Park Y, You SC, Rhee Y. AIM in Endocrinology. Artif Intell Med 2021. [DOI: 10.1007/978-3-030-58080-3_328-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 12/07/2022]

Artificial Neural Networks Model for Predicting Type 2 Diabetes Mellitus Based on VDR Gene FokI Polymorphism, Lipid Profile and Demographic Data. BIOLOGY 2020;9:biology9080222. [PMID: 32823649 PMCID: PMC7465516 DOI: 10.3390/biology9080222] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Received: 07/12/2020] [Revised: 08/04/2020] [Accepted: 08/10/2020] [Indexed: 01/06/2023]

Reed RA, Morgan AS, Zeitlin J, Jarreau PH, Torchin H, Pierrat V, Ancel PY, Khoshnood B. Machine-Learning vs. Expert-Opinion Driven Logistic Regression Modelling for Predicting 30-Day Unplanned Rehospitalisation in Preterm Babies: A Prospective, Population-Based Study (EPIPAGE 2). Front Pediatr 2020;8:585868. [PMID: 33614539 PMCID: PMC7886676 DOI: 10.3389/fped.2020.585868] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/21/2020] [Accepted: 12/29/2020] [Indexed: 11/28/2022]

Abstract

Introduction: Preterm babies are a vulnerable population that experiences significant short- and long-term morbidity. Rehospitalisations constitute an important, potentially modifiable adverse event in this population. Improving the ability of clinicians to identify those patients at the greatest risk of rehospitalisation has the potential to improve outcomes and reduce costs. Machine-learning algorithms can provide potentially advantageous methods of prediction compared to conventional approaches like logistic regression. Objective: To compare two machine-learning methods (least absolute shrinkage and selection operator (LASSO) and random forest) to expert-opinion driven logistic regression modelling for predicting unplanned rehospitalisation within 30 days in a large French cohort of preterm babies. Design, Setting and Participants: This study used data derived exclusively from the population-based prospective cohort study of French preterm babies, EPIPAGE 2. Only those babies discharged home alive and whose parents completed the 1-year survey were eligible for inclusion in our study. All predictive models used a binary outcome, denoting a baby's status for an unplanned rehospitalisation within 30 days of discharge. Predictors included those quantifying clinical, treatment, maternal and socio-demographic factors. The predictive abilities of models constructed using LASSO and random forest algorithms were compared with a traditional logistic regression model. The logistic regression model comprised 10 predictors, selected by expert clinicians, while the LASSO and random forest included 75 predictors. Performance measures were derived using 10-fold cross-validation. Performance was quantified using the area under the receiver operating characteristic curve (AUROC), sensitivity, specificity, Tjur's coefficient of determination and calibration measures.
Results: The rate of 30-day unplanned rehospitalisation in the eligible population used to construct the models was 9.1% (95% CI 8.2-10.1) (350/3,841). The random forest model demonstrated both an improved AUROC (0.65; 95% CI 0.59-0.70; p = 0.03) and specificity vs. logistic regression (AUROC 0.57; 95% CI 0.51-0.62, p = 0.04). The LASSO performed similarly (AUROC 0.59; 95% CI 0.53-0.65; p = 0.68) to logistic regression. Conclusions: Compared to an expert-specified logistic regression model, random forest offered improved prediction of 30-day unplanned rehospitalisation in preterm babies. However, all models offered relatively low levels of predictive ability, regardless of modelling method.
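
The rehospitalisation rate in the abstract above is reported as 350/3,841 with a 95% confidence interval. The arithmetic can be checked with a short worked example. The sketch below uses the Wilson score interval, which is an assumption: the abstract does not state which interval method the authors used.

```python
from math import sqrt
from statistics import NormalDist


def wilson_interval(events, total, level=0.95):
    """Wilson score confidence interval for a binomial proportion.

    The choice of the Wilson method is an assumption for illustration.
    """
    z = NormalDist().inv_cdf(0.5 + level / 2)  # about 1.96 for a 95% interval
    p = events / total
    denom = 1 + z * z / total
    centre = (p + z * z / (2 * total)) / denom
    half_width = z * sqrt(p * (1 - p) / total + z * z / (4 * total * total)) / denom
    return centre - half_width, centre + half_width


rate = 350 / 3841
low, high = wilson_interval(350, 3841)
summary = f"{rate:.1%} ({low:.1%} to {high:.1%})"
print(summary)  # prints "9.1% (8.2% to 10.1%)"
```

Under this assumed method the interval rounds to the figures given in the abstract, so the reported rate and its bounds are internally consistent.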
