1
|
Olusanya MO, Ogunsakin RE, Ghai M, Adeleke MA. Accuracy of Machine Learning Classification Models for the Prediction of Type 2 Diabetes Mellitus: A Systematic Survey and Meta-Analysis Approach. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2022; 19:ijerph192114280. [PMID: 36361161 PMCID: PMC9655196 DOI: 10.3390/ijerph192114280] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Revised: 10/22/2022] [Accepted: 10/25/2022] [Indexed: 05/13/2023]
Abstract
Soft-computing and statistical learning models have gained substantial momentum in predicting type 2 diabetes mellitus (T2DM) disease. This paper reviews recent soft-computing and statistical learning models in T2DM using a meta-analysis approach. We searched for papers using soft-computing and statistical learning models focused on T2DM published between 2010 and 2021 on three different search engines. Of 1215 studies identified, 34 with 136952 patients met our inclusion criteria. The pooled algorithm's performance was able to predict T2DM with an overall accuracy of 0.86 (95% confidence interval [CI] of [0.82, 0.89]). The classification of diabetes prediction was significantly greater in models with a screening and diagnosis (pooled proportion [95% CI] = 0.91 [0.74, 0.97]) when compared to models with nephropathy (pooled proportion = 0.48 [0.76, 0.89] to 0.88 [0.83, 0.91]). For the prediction of T2DM, the decision trees (DT) models had a pooled accuracy of 0.88 [95% CI: 0.82, 0.92], and the neural network (NN) models had a pooled accuracy of 0.85 [95% CI: 0.79, 0.89]. Meta-regression did not provide any statistically significant findings for the heterogeneous accuracy in studies with different diabetes predictions, sample sizes, and impact factors. Additionally, ML models showed high accuracy for the prediction of T2DM. The predictive accuracy of ML algorithms in T2DM is promising, mainly through DT and NN models. However, there is heterogeneity among ML models. We compared the results and models and concluded that this evidence might help clinicians interpret data and implement optimum models for their dataset for T2DM prediction.
Collapse
Affiliation(s)
- Micheal O. Olusanya
- Department of Computer Science and Information Technology, Sol Plaatje University, Kimberley 8300, South Africa
- Correspondence:
| | - Ropo Ebenezer Ogunsakin
- Biostatistics Unit, Discipline of Public Health Medicine, School of Nursing & Public Health, College of Health Sciences, University of KwaZulu-Natal, Durban 4000, South Africa
| | - Meenu Ghai
- Discipline of Genetics, School of Life Sciences, University of KwaZulu-Natal, Durban 4000, South Africa
| | - Matthew Adekunle Adeleke
- Discipline of Genetics, School of Life Sciences, University of KwaZulu-Natal, Durban 4000, South Africa
| |
Collapse
|
2
|
Kosaka T, Ono T, Kida M, Fushida S, Nokubi T, Kokubo Y, Watanabe M, Higashiyama A, Miyamoto Y, Ikebe K. A prediction model of masticatory performance change in 50- to 70-year-old Japanese: The Suita study. J Dent 2020; 104:103535. [PMID: 33207241 DOI: 10.1016/j.jdent.2020.103535] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2020] [Revised: 10/26/2020] [Accepted: 11/12/2020] [Indexed: 01/22/2023] Open
Abstract
OBJECTIVES Declines in masticatory performance might be a risk factor for worsening nutritional intake and result in general frailty. The present study constructed and investigated a method to predict the extent to which objective masticatory performance declines with age in cases with poor oral health status. METHODS Participants comprised 1201 participants in the Suita study with dental checkup at both baseline and follow-up (500 men and 701 women; age at baseline, 65.6 ± 7.8 years; mean follow-up, 5.1 ± 1.1 years). First, multiple linear regression analysis was performed with masticatory performance at follow-up as the dependent variable and sex as well as baseline age, number of functional teeth, maximum bite force, occlusal support, periodontal status, salivary flow rate, and masticatory performance as independent variables. Scores were assigned to each factor based on the standardized partial regression coefficient obtained from multiple linear regression analysis. Participants were divided into quintile groups (Q1-Q5) based on total scores for factors, and rates of masticatory performance change for each group were calculated and compared. RESULTS Mean rates of masticatory performance change in groups Q1-Q5 from the model to predict declining masticatory performance were: Q1, -9.7%; Q2, -12.7%; Q3, -18.0%; Q4, -19.9%; and Q5, -29.8%.Thus there was a trend for masticatory performance to decrease with decreasing score. CONCLUSIONS The model developed in this study quantitatively predicted declines in masticatory performance after approximately 5 years. CLINICAL SIGNIFICANCE We developed a model for predicting the extent to which masticatory performance will change over the next 5 years. This model may offer a useful tool when taking measures to prevent declines in masticatory performance with aging.
Collapse
Affiliation(s)
- Takayuki Kosaka
- Department of Prosthodontics, Gerodontology and Oral Rehabilitation, Osaka University Graduate School of Dentistry, 1-8 Yamadaoka, Suita, Osaka, 565-0871, Japan.
| | - Takahiro Ono
- Department of Prosthodontics, Gerodontology and Oral Rehabilitation, Osaka University Graduate School of Dentistry, 1-8 Yamadaoka, Suita, Osaka, 565-0871, Japan; Division of Comprehensive Prosthodontics, Faculty of Dentistry and Graduate School of Medical and Dental Sciences, Niigata University, 2-5274, Gakkocho-dori, Niigata, 951-8514, Japan
| | - Momoyo Kida
- Department of Prosthodontics, Gerodontology and Oral Rehabilitation, Osaka University Graduate School of Dentistry, 1-8 Yamadaoka, Suita, Osaka, 565-0871, Japan
| | - Shuri Fushida
- Department of Prosthodontics, Gerodontology and Oral Rehabilitation, Osaka University Graduate School of Dentistry, 1-8 Yamadaoka, Suita, Osaka, 565-0871, Japan
| | - Takashi Nokubi
- Osaka University, 1-1 Yamadaoka, Suita, Osaka, 565-0871, Japan
| | - Yoshihiro Kokubo
- Department of Preventive Cardiology, National Cerebral and Cardiovascular Center, 6-1, Kishibe-Shimmachi, Suita, Osaka, 564-8565, Japan
| | - Makoto Watanabe
- Department of Preventive Cardiology, National Cerebral and Cardiovascular Center, 6-1, Kishibe-Shimmachi, Suita, Osaka, 564-8565, Japan
| | - Aya Higashiyama
- Department of Preventive Cardiology, National Cerebral and Cardiovascular Center, 6-1, Kishibe-Shimmachi, Suita, Osaka, 564-8565, Japan
| | - Yoshihiro Miyamoto
- Open Innovation Center, National Cerebral and Cardiovascular Center, 6-1, Kishibe-Shimmachi, Suita, Osaka, 564-8565, Japan
| | - Kazunori Ikebe
- Department of Prosthodontics, Gerodontology and Oral Rehabilitation, Osaka University Graduate School of Dentistry, 1-8 Yamadaoka, Suita, Osaka, 565-0871, Japan
| |
Collapse
|
3
|
Maeta K, Nishiyama Y, Fujibayashi K, Gunji T, Sasabe N, Iijima K, Naito T. Prediction of Glucose Metabolism Disorder Risk Using a Machine Learning Algorithm: Pilot Study. JMIR Diabetes 2018; 3:e10212. [PMID: 30478026 PMCID: PMC6288596 DOI: 10.2196/10212] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2018] [Revised: 08/16/2018] [Accepted: 10/17/2018] [Indexed: 01/10/2023] Open
Abstract
Background A 75-g oral glucose tolerance test (OGTT) provides important information about glucose metabolism, although the test is expensive and invasive. Complete OGTT information, such as 1-hour and 2-hour postloading plasma glucose and immunoreactive insulin levels, may be useful for predicting the future risk of diabetes or glucose metabolism disorders (GMD), which includes both diabetes and prediabetes. Objective We trained several classification models for predicting the risk of developing diabetes or GMD using data from thousands of OGTTs and a machine learning technique (XGBoost). The receiver operating characteristic (ROC) curves and their area under the curve (AUC) values for the trained classification models are reported, along with the sensitivity and specificity determined by the cutoff values of the Youden index. We compared the performance of the machine learning techniques with logistic regressions (LR), which are traditionally used in medical research studies. Methods Data were collected from subjects who underwent multiple OGTTs during comprehensive check-up medical examinations conducted at a single facility in Tokyo, Japan, from May 2006 to April 2017. For each examination, a subject was diagnosed with diabetes or prediabetes according to the American Diabetes Association guidelines. Given the data, 2 studies were conducted: predicting the risk of developing diabetes (study 1) or GMD (study 2). For each study, to apply supervised machine learning methods, the required label data was prepared. If a subject was diagnosed with diabetes or GMD at least once during the period, then that subject’s data obtained in previous trials were classified into the risk group (y=1). After data processing, 13,581 and 6760 OGTTs were analyzed for study 1 and study 2, respectively. For each study, a randomly chosen subset representing 80% of the data was used for training 9 classification models and the remaining 20% was used for evaluating the models. Three classification models, A to C, used XGBoost with various input variables, some including OGTT data. The other 6 classification models, D to I, used LR for comparison. Results For study 1, the AUC values ranged from 0.78 to 0.93. For study 2, the AUC values ranged from 0.63 to 0.78. The machine learning approach using XGBoost showed better performance compared with traditional LR methods. The AUC values increased when the full OGTT variables were included. In our analysis using a particular setting of input variables, XGBoost showed that the OGTT variables were more important than fasting plasma glucose or glycated hemoglobin. Conclusions A machine learning approach, XGBoost, showed better prediction accuracy compared with LR, suggesting that advanced machine learning methods are useful for detecting the early signs of diabetes or GMD. The prediction accuracy increased when all OGTT variables were added. This indicates that complete OGTT information is important for predicting the future risk of diabetes and GMD accurately.
Collapse
Affiliation(s)
- Katsutoshi Maeta
- Faculty of Informatics and Engineering, The University of Electro-Communications, Tokyo, Japan
| | - Yu Nishiyama
- Faculty of Informatics and Engineering, The University of Electro-Communications, Tokyo, Japan
| | - Kazutoshi Fujibayashi
- Department of General Medicine, School of Medicine, Juntendo University, Tokyo, Japan
| | - Toshiaki Gunji
- Center for Preventive Medicine, NTT Medical Center Tokyo, Tokyo, Japan
| | - Noriko Sasabe
- Center for Preventive Medicine, NTT Medical Center Tokyo, Tokyo, Japan
| | - Kimiko Iijima
- Center for Preventive Medicine, NTT Medical Center Tokyo, Tokyo, Japan
| | - Toshio Naito
- Department of General Medicine, School of Medicine, Juntendo University, Tokyo, Japan
| |
Collapse
|