Reference Citation Analysis

For: Olivera AR, Roesler V, Iochpe C, Schmidt MI, Vigo Á, Barreto SM, Duncan BB. Comparison of machine-learning algorithms to build a predictive model for detecting undiagnosed diabetes - ELSA-Brasil: accuracy study. Sao Paulo Med J 2017;135:234-246. [PMID: 28746659 PMCID: PMC10019841 DOI: 10.1590/1516-3180.2016.0309010217] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.6] [Received: 01/19/2017] [Accepted: 02/01/2017] [Indexed: 01/23/2023] Open

Cited by Other Article(s)

Paula DP, Camacho M, Barbosa O, Marques L, Harter Griep R, da Fonseca MJM, Barreto S, Lekadir K. Sex and population differences in the cardiometabolic continuum: a machine learning study using the UK Biobank and ELSA-Brasil cohorts. BMC Public Health 2024;24:2131. [PMID: 39107721 PMCID: PMC11304673 DOI: 10.1186/s12889-024-19395-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/10/2023] [Accepted: 04/08/2024] [Indexed: 08/10/2024] Open

Abstract

BACKGROUND

The temporal relationships across cardiometabolic diseases (CMDs) were recently conceptualized as the cardiometabolic continuum (CMC), a sequence of cardiovascular events that stem from gene-environmental interactions, unhealthy lifestyle influences, and metabolic diseases such as diabetes and hypertension. While the physiological pathways linking metabolic and cardiovascular diseases have been investigated, sex and population differences in the CMC have not yet been described.

METHODS

We present a machine learning approach to model the CMC and investigate sex and population differences in two distinct cohorts: the UK Biobank (17,700 participants) and the Brazilian Longitudinal Study of Adult Health (ELSA-Brasil) (7,162 participants). We consider the following CMDs: hypertension (Hyp), diabetes (DM), heart diseases (HD: angina, myocardial infarction, or heart failure), and stroke (STK). For the identification of the CMC patterns, individual trajectories with the time of disease occurrence were clustered using k-means. Based on clinical, sociodemographic, and lifestyle characteristics, we built multiclass random forest classifiers and used the SHAP methodology to evaluate feature importance.

RESULTS

Five CMC patterns were identified across both sexes and cohorts: EarlyHyp, FirstDM, FirstHD, Healthy, and LateHyp, named according to prevalence and disease occurrence time that depicted around 95%, 78%, 75%, 88% and 99% of individuals, respectively. Within the UK Biobank, more women were classified in the Healthy cluster and more men in all others. In the EarlyHyp and LateHyp clusters, isolated hypertension occurred earlier among women. Smoking habits and education had high importance and clear directionality for both sexes. For ELSA-Brasil, more men were classified in the Healthy cluster and more women in the FirstDM. The diabetes occurrence time when followed by hypertension was lower among women. Education and ethnicity had high importance and clear directionality for women, while for men these features were smoking, alcohol, and coffee consumption.

CONCLUSIONS

There are clear sex differences in the CMC that varied across the UK and Brazilian cohorts. In particular, disadvantages regarding incidence and the time to onset of diseases were more pronounced in Brazil, to the detriment of women. The results show the need to strengthen public health policies to prevent and control the time course of CMD, with an emphasis on women.
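
The clustering step named in the Methods above (k-means over individual disease-onset trajectories) can be illustrated with a minimal sketch. This is not the authors' code: the two-feature encoding (age at hypertension onset, age at diabetes onset), the toy values, and the names `kmeans`, `early_hyp`, and `first_dm` are all assumptions made for illustration only.

```python
import random


def kmeans(points, k, iters=50, seed=0):
    """Plain Lloyd's k-means on equal-length numeric tuples."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # initial centroids drawn from the data
    labels = [0] * len(points)
    for _ in range(iters):
        # Assignment step: nearest centroid by squared Euclidean distance.
        labels = [
            min(
                range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centroids[c])),
            )
            for p in points
        ]
        # Update step: mean of each cluster (keep the old centroid if empty).
        new_centroids = []
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                new_centroids.append(
                    tuple(sum(col) / len(members) for col in zip(*members))
                )
            else:
                new_centroids.append(centroids[c])
        if new_centroids == centroids:
            break
        centroids = new_centroids
    return labels, centroids


# Hypothetical trajectories: (age at hypertension onset, age at diabetes onset).
early_hyp = [(38, 60), (40, 62), (42, 58)]
first_dm = [(65, 41), (63, 39), (66, 43)]
labels, centroids = kmeans(early_hyp + first_dm, k=2)
```

On this toy input the two groups are far apart, so the first three trajectories should share one label and the last three the other. Real trajectories would need a decision on how to encode diseases that never occur, which the abstract does not specify.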

Anteneh LM, Lokonon BE, Kakaï RG. Modelling techniques in cholera epidemiology: A systematic and critical review. Math Biosci 2024;373:109210. [PMID: 38777029 DOI: 10.1016/j.mbs.2024.109210] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/21/2023] [Revised: 05/09/2024] [Accepted: 05/13/2024] [Indexed: 05/25/2024]

Abstract

Diverse modelling techniques in cholera epidemiology have been developed and used to (1) study its transmission dynamics, (2) predict and manage cholera outbreaks, and (3) assess the impact of various control and mitigation measures. In this study, we carry out a critical and systematic review of various approaches used for modelling the dynamics of cholera. We also discuss the strengths and weaknesses of each modelling approach. A systematic search of articles was conducted in Google Scholar, PubMed, Science Direct, and Taylor & Francis. Eligible studies were those concerned with the dynamics of cholera, excluding studies focused on models for cholera transmission in animals, socio-economic factors, and genetic and molecular studies. A total of 476 peer-reviewed articles met the inclusion criteria, with about 40% of the studies carried out in Asia and about 32% in Africa. About 52%, 21%, and 9% of the studies were based on compartmental (e.g., SIRB), statistical (time series and regression), and spatial (spatiotemporal clustering) models, respectively, while the rest of the analysed studies used other modelling approaches such as network, machine learning and artificial intelligence, Bayesian, and agent-based approaches. Cholera modelling studies that incorporate vector/housefly transmission of the pathogen are scarce, and only a small portion of researchers (3.99%) consider the estimation of key epidemiological parameters. A vaccination-only platform was utilized as a control measure in more than half (58%) of the studies. Research productivity in cholera epidemiological modelling has increased in recent years, but authors have used a diverse range of models. Future models should consider incorporating vector/housefly transmission of the pathogen and the estimation of key epidemiological parameters for cholera transmission dynamics.

Massago M, Massago M, Iora PH, Tavares Gurgel SJ, Conegero CI, Carolino IDR, Mushi MM, Chaves Forato GA, de Souza JVP, Hernandes Rocha TA, Bonfim S, Staton CA, Nihei OK, Vissoci JRN, de Andrade L. Applicability of machine learning algorithm to predict the therapeutic intervention success in Brazilian smokers. PLoS One 2024;19:e0295970. [PMID: 38437221 PMCID: PMC10911606 DOI: 10.1371/journal.pone.0295970] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/09/2023] [Accepted: 12/02/2023] [Indexed: 03/06/2024] Open

Affiliation(s)

Miyoko Massago: PhD Student in the Postgraduate Program in Health Sciences, State University of Maringa, Maringa, Parana, Brazil
Mamoru Massago: Master in Computer Sciences, State University of Maringa, Maringa, Parana, Brazil
Pedro Henrique Iora: Professor in the Morphological Sciences Department, State University of Maringa, Maringa, Parana, Brazil
Sanderland José Tavares Gurgel: Professor in the Morphological Sciences Department, State University of Maringa, Maringa, Parana, Brazil
Celso Ivam Conegero: Professor in the Department of Medicine, State University of Maringa, Maringa, Parana, Brazil
Idalina Diair Regla Carolino: Professor in the Morphological Sciences Department, State University of Maringa, Maringa, Parana, Brazil
Maria Muzanila Mushi: Global Emergency Medicine Innovation and Implementation Research Center, Duke University School of Medicine, Duke Global Health Institute, Durham, North Carolina, United States of America
Giane Aparecida Chaves Forato: Master Student in the Postgraduate Program in Health Sciences, State University of Maringa, Maringa, Parana, Brazil
João Vitor Perez de Souza: Assistant Professor of Emergency Medicine and Global Health, Duke Global Health Institute, Department of Emergency Medicine, Duke University School of Medicine, Durham, North Carolina, United States of America
Thiago Augusto Hernandes Rocha: Assistant Professor of Emergency Medicine and Global Health, Duke Global Health Institute, Department of Emergency Medicine, Duke University School of Medicine, Durham, North Carolina, United States of America
Samile Bonfim: PhD Student in the Postgraduate Program in Health Sciences, State University of Maringa, Maringa, Parana, Brazil
Catherine Ann Staton: Assistant Professor of Emergency Medicine and Global Health, Duke Global Health Institute, Department of Emergency Medicine, Duke University School of Medicine, Durham, North Carolina, United States of America
Oscar Kenji Nihei: Professor in the Center of Education, Literature and Health, Western Parana State University, Foz do Iguaçu, Parana, Brazil
João Ricardo Nickenig Vissoci: Assistant Professor of Emergency Medicine and Global Health, Duke Global Health Institute, Department of Emergency Medicine, Duke University School of Medicine, Durham, North Carolina, United States of America
Luciano de Andrade: Professor in the Postgraduate Program in Health Sciences, State University of Maringa, Maringa, Parana, Brazil

Budhathoki N, Bhandari R, Bashyal S, Lee C. Predicting asthma using imbalanced data modeling techniques: Evidence from 2019 Michigan BRFSS data. PLoS One 2023;18:e0295427. [PMID: 38060576 PMCID: PMC10703315 DOI: 10.1371/journal.pone.0295427] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/13/2022] [Accepted: 11/10/2023] [Indexed: 12/18/2023] Open

Abstract

Studies in the past have examined asthma prevalence and the associated risk factors in the United States using data from national surveys. However, the findings of these studies may not be relevant to specific states because environmental and socioeconomic factors vary across regions. The 2019 Behavioral Risk Factor Surveillance System (BRFSS) showed that Michigan had higher asthma prevalence rates than the national average. In this regard, we employ various modern machine learning techniques to predict asthma and identify risk factors associated with asthma among Michigan adults using the 2019 BRFSS data. After data cleaning, a sample of 10,337 individuals was selected for analysis, out of which 1,118 individuals (10.8%) reported having asthma during the survey period. Typical machine learning techniques often perform poorly due to imbalanced data issues. To address this challenge, we employed two synthetic data generation techniques, namely Random Over-Sampling Examples (ROSE) and the Synthetic Minority Over-Sampling Technique (SMOTE), and compared their performance. The overall performance of machine learning algorithms was improved using both methods, with ROSE performing better than SMOTE. Among the ROSE-adjusted models, we found that logistic regression, partial least squares, gradient boosting, LASSO, and elastic net had comparable performance, with sensitivity at around 50% and area under the curve (AUC) at around 63%. Due to its ease of interpretability, logistic regression was chosen for further exploration of risk factors. Presence of chronic obstructive pulmonary disease, lower income, female sex, inability to see a doctor due to cost, having received a flu shot/spray in the past 12 months, the 18-24 age group, the Black, non-Hispanic group, and presence of diabetes were identified as asthma risk factors.
This study demonstrates the potential of machine learning coupled with imbalanced data modeling approaches for predicting asthma from a large survey dataset. We conclude that the findings could guide early screening of at-risk asthma patients and the design of appropriate interventions to improve care practices.
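
To make the oversampling idea in this abstract concrete, here is a simplified SMOTE-style sketch: each synthetic minority record is an interpolation between a minority record and one of its nearest minority neighbours. It is an illustration under stated assumptions, not the authors' pipeline or a full SMOTE implementation; the function name `smote_like`, the parameter `k`, and the toy feature values are hypothetical.

```python
import random


def smote_like(minority, n_new, k=3, seed=0):
    """Create n_new synthetic points by interpolating between minority points.

    Simplified SMOTE-style sketch; requires at least two minority points.
    """
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # k nearest minority neighbours of the base point (excluding itself).
        neighbours = sorted(
            (p for p in minority if p is not base),
            key=lambda p: sum((a - b) ** 2 for a, b in zip(p, base)),
        )[:k]
        other = rng.choice(neighbours)
        t = rng.random()  # interpolation fraction in [0, 1)
        synthetic.append(tuple(a + t * (b - a) for a, b in zip(base, other)))
    return synthetic


# Hypothetical minority-class records with two numeric features.
minority = [(1.0, 2.0), (1.5, 1.8), (2.0, 2.2), (1.2, 2.5)]
synthetic = smote_like(minority, n_new=6)
```

Because every synthetic point lies on a segment between two real minority points, the new records stay inside the range spanned by the minority class. Categorical survey variables, which BRFSS data contain, would need separate handling that this sketch does not attempt.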

Chen K, Abtahi F, Carrero JJ, Fernandez-Llatas C, Seoane F. Process mining and data mining applications in the domain of chronic diseases: A systematic review. Artif Intell Med 2023;144:102645. [PMID: 37783545 DOI: 10.1016/j.artmed.2023.102645] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/02/2023] [Revised: 08/24/2023] [Accepted: 08/28/2023] [Indexed: 10/04/2023]

Terabe ML, Massago M, Iora PH, Hernandes Rocha TA, de Souza JVP, Huo L, Massago M, Senda DM, Kobayashi EM, Vissoci JR, Staton CA, de Andrade L. Applicability of machine learning technique in the screening of patients with mild traumatic brain injury. PLoS One 2023;18:e0290721. [PMID: 37616279 PMCID: PMC10449130 DOI: 10.1371/journal.pone.0290721] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 12/14/2022] [Accepted: 08/14/2023] [Indexed: 08/26/2023] Open

Affiliation(s)

Miriam Leiko Terabe: Postgraduate Program in Management, Technology and Innovation in Urgency and Emergency, State University of Maringa, Maringa, Parana, Brazil
Miyoko Massago: Postgraduate Program in Health Sciences, State University of Maringa, Maringa, Parana, Brazil
Pedro Henrique Iora: Department of Medicine, State University of Maringa, Maringa, Parana, Brazil
Thiago Augusto Hernandes Rocha: Duke Global Health Institute, Duke University Medical Center, Durham, North Carolina, United States of America
João Vitor Perez de Souza: Postgraduate Program in Biosciences and Physiopathology, State University of Maringa, Maringa, Parana, Brazil
Lily Huo: Duke Global Health Institute, Duke University Medical Center, Durham, North Carolina, United States of America
Mamoru Massago: Postgraduate Program in Computer Sciences, State University of Maringa, Maringa, Parana, Brazil
Dalton Makoto Senda: Postgraduate Program in Health Sciences, State University of Maringa, Maringa, Parana, Brazil
Elisabete Mitiko Kobayashi: Department of Medicine, State University of Maringa, Maringa, Parana, Brazil
João Ricardo Vissoci: Postgraduate Program in Health Sciences, State University of Maringa, Maringa, Parana, Brazil; Duke Global Health Institute, Duke University Medical Center, Durham, North Carolina, United States of America
Catherine Ann Staton: Postgraduate Program in Health Sciences, State University of Maringa, Maringa, Parana, Brazil; Duke Global Health Institute, Duke University Medical Center, Durham, North Carolina, United States of America
Luciano de Andrade: Postgraduate Program in Management, Technology and Innovation in Urgency and Emergency, State University of Maringa, Maringa, Parana, Brazil; Postgraduate Program in Health Sciences, State University of Maringa, Maringa, Parana, Brazil; Department of Medicine, State University of Maringa, Maringa, Parana, Brazil

Chellappan D, Rajaguru H. Detection of Diabetes through Microarray Genes with Enhancement of Classifiers Performance. Diagnostics (Basel) 2023;13:2654. [PMID: 37627916 PMCID: PMC10453776 DOI: 10.3390/diagnostics13162654] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/30/2023] [Revised: 08/06/2023] [Accepted: 08/07/2023] [Indexed: 08/27/2023] Open

Shamsutdinova D, Das-Munshi J, Ashworth M, Roberts A, Stahl D. Predicting type 2 diabetes prevalence for people with severe mental illness in a multi-ethnic East London population. Int J Med Inform 2023;172:105019. [PMID: 36787689 DOI: 10.1016/j.ijmedinf.2023.105019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/19/2022] [Revised: 01/20/2023] [Accepted: 02/03/2023] [Indexed: 02/10/2023]

Abstract

BACKGROUND AND AIMS

The prevalence of type 2 diabetes mellitus (T2DM) in people with severe mental illness (SMI) is 2-3 times higher than in the general population. Predictive modelling has advanced greatly in the past decade, and it is important to apply cutting-edge methods to vulnerable groups. However, few T2DM prediction models account for the presence of mental illness, and none seems to have been developed specifically for people with SMI. Therefore, we aimed to develop and internally validate a T2DM prevalence model for people with SMI.

METHODS

We utilised a large cross-sectional sample representative of a multi-ethnic population from London (674,000 adults); 10,159 people with SMI formed our analytical sample (1,513 T2DM cases). We fitted a linear logistic regression and XGBoost as stand-alone models and as a stacked ensemble. Age, sex, body mass index, ethnicity, area-based deprivation, past hypertension, cardiovascular diseases, prescribed antipsychotics, and SMI illness were the predictors.

RESULTS

Logistic regression performed well in detecting T2DM presence for people with SMI: area under the receiver operator curve (ROC-AUC) was 0.83 (95% CI 0.79-0.87). XGBoost and the LR-XGBoost ensemble performed equally well, ROC-AUC 0.83 (95% CI 0.79-0.87), indicating a negligible contribution of non-linear terms to predictive power. Ethnicity was the most important predictor after age. We demonstrated how the derived models can be utilised and estimated a 2.14% (95% CI 2.03%-2.24%) increase in T2DM prevalence in the East London SMI population in 20 years' time, driven by the projected demographic changes.

CONCLUSIONS

Primary care data, the setting where prediction models could be most fruitfully used, provide enough information for well-performing T2DM prevalence models for people with SMI. We demonstrated how thorough internal cross-validation of an ensemble of a linear and machine-learning model can quantify the predictive value of non-linearity in the data.

Mistry S, Riches NO, Gouripeddi R, Facelli JC. Environmental exposures in machine learning and data mining approaches to diabetes etiology: A scoping review. Artif Intell Med 2023;135:102461. [PMID: 36628796 PMCID: PMC9834645 DOI: 10.1016/j.artmed.2022.102461] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 04/05/2022] [Revised: 10/06/2022] [Accepted: 11/23/2022] [Indexed: 12/03/2022]

Abstract

BACKGROUND

Environmental exposures are implicated in diabetes etiology, but are poorly understood due to disease heterogeneity, complexity of exposures, and analytical challenges. Machine learning and data mining are artificial intelligence methods that can address these limitations. Despite their increasing adoption in etiology and prediction of diabetes research, the types of methods and exposures analyzed have not been thoroughly reviewed.

OBJECTIVE

We aimed to review articles that implemented machine learning and data mining methods to understand environmental exposures in diabetes etiology and disease prediction.

METHODS

We queried PubMed and Scopus databases for machine learning and data mining studies that used environmental exposures to understand diabetes etiology on September 19th, 2022. Exposures were classified into specific external, general external, or internal exposures. We reviewed machine learning and data mining methods and characterized the scope of environmental exposures studied in the etiology of general diabetes, type 1 diabetes, type 2 diabetes, and other types of diabetes.

RESULTS

We identified 44 articles for inclusion. Specific external exposures were the most common exposures studied, and supervised models were the most common methods used. Well-established specific external exposures of low physical activity, high cholesterol, and high triglycerides were predictive of general diabetes, type 2 diabetes, and prediabetes, while novel metabolic and gut microbiome biomarkers were implicated in type 1 diabetes.

DISCUSSION

The use of machine learning and data mining methods to elucidate environmental triggers of diabetes was largely limited to well-established risk factors identified using easily explainable and interpretable models. Future studies should seek to leverage machine learning and data mining to explore the temporality and co-occurrence of multiple exposures and further evaluate the role of general external and internal exposures in diabetes etiology.

Sinha K, Uddin Z, Kawsar H, Islam S, Deen M, Howlader M. Analyzing chronic disease biomarkers using electrochemical sensors and artificial neural networks. Trends Analyt Chem 2023. [DOI: 10.1016/j.trac.2022.116861] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 12/12/2022]

Olusanya MO, Ogunsakin RE, Ghai M, Adeleke MA. Accuracy of Machine Learning Classification Models for the Prediction of Type 2 Diabetes Mellitus: A Systematic Survey and Meta-Analysis Approach. International Journal of Environmental Research and Public Health 2022;19:ijerph192114280. [PMID: 36361161 PMCID: PMC9655196 DOI: 10.3390/ijerph192114280] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 09/27/2022] [Revised: 10/22/2022] [Accepted: 10/25/2022] [Indexed: 05/13/2023]

Polessa Paula D, Barbosa Aguiar O, Pruner Marques L, Bensenor I, Suemoto CK, Mendes da Fonseca MDJ, Griep RH. Comparing machine learning algorithms for multimorbidity prediction: An example from the Elsa-Brasil study. PLoS One 2022;17:e0275619. [PMID: 36206287 PMCID: PMC9543987 DOI: 10.1371/journal.pone.0275619] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 07/21/2022] [Accepted: 09/20/2022] [Indexed: 11/18/2022] Open

Suzuki Y, Suzuki H, Ishikawa T, Yamada Y, Yatoh S, Sugano Y, Iwasaki H, Sekiya M, Yahagi N, Hada Y, Shimano H. Exploratory analysis using machine learning of predictive factors for falls in type 2 diabetes. Sci Rep 2022;12:11965. [PMID: 35831378 PMCID: PMC9279484 DOI: 10.1038/s41598-022-15224-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/27/2021] [Accepted: 06/21/2022] [Indexed: 11/09/2022] Open

Affiliation(s)

Yasuhiro Suzuki: Department of Rehabilitation Medicine, University of Tsukuba Hospital, Tsukuba, Ibaraki, 305-8576, Japan
Hiroaki Suzuki: Department of Internal Medicine (Endocrinology and Metabolism), Faculty of Medicine, University of Tsukuba, Tsukuba, Ibaraki, 305-8575, Japan
Tatsuya Ishikawa: IBM Research, Tokyo, 103-8510, Japan
Yasunori Yamada: IBM Research, Tokyo, 103-8510, Japan
Shigeru Yatoh: Department of Internal Medicine (Endocrinology and Metabolism), Faculty of Medicine, University of Tsukuba, Tsukuba, Ibaraki, 305-8575, Japan
Yoko Sugano: Department of Internal Medicine (Endocrinology and Metabolism), Faculty of Medicine, University of Tsukuba, Tsukuba, Ibaraki, 305-8575, Japan
Hitoshi Iwasaki: Department of Internal Medicine (Endocrinology and Metabolism), Faculty of Medicine, University of Tsukuba, Tsukuba, Ibaraki, 305-8575, Japan
Motohiro Sekiya: Department of Internal Medicine (Endocrinology and Metabolism), Faculty of Medicine, University of Tsukuba, Tsukuba, Ibaraki, 305-8575, Japan
Naoya Yahagi: Department of Internal Medicine (Endocrinology and Metabolism), Faculty of Medicine, University of Tsukuba, Tsukuba, Ibaraki, 305-8575, Japan
Yasushi Hada: Department of Rehabilitation Medicine, University of Tsukuba Hospital, Tsukuba, Ibaraki, 305-8576, Japan
Hitoshi Shimano: Department of Internal Medicine (Endocrinology and Metabolism), Faculty of Medicine, University of Tsukuba, Tsukuba, Ibaraki, 305-8575, Japan; International Institute for Integrative Sleep Medicine (WPI-IIIS), University of Tsukuba, Tsukuba, Ibaraki, 305-8575, Japan; Life Science Center of Tsukuba Advanced Research Alliance (TARA), University of Tsukuba, Tsukuba, Ibaraki, 305-8577, Japan; Japan Agency for Medical Research and Development-Core Research for Evolutional Science and Technology (AMED-CREST), Chiyoda-ku, Tokyo, 100-0004, Japan

Tuppad A, Patil SD. Machine learning for diabetes clinical decision support: a review. Advances in Computational Intelligence 2022;2:22. [PMID: 35434723 PMCID: PMC9006199 DOI: 10.1007/s43674-022-00034-y] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Received: 10/17/2021] [Revised: 02/27/2022] [Accepted: 03/03/2022] [Indexed: 12/14/2022]

Abstract

Type 2 diabetes has recently acquired the status of an epidemic silent killer, though it is non-communicable. There are two main reasons behind this perception of the disease. First, a gradual but exponential growth in disease prevalence has been witnessed irrespective of age group, geography, or gender. Second, the disease dynamics are very complex in terms of the multifactorial risks involved, the initial asymptomatic period, the different short-term and long-term complications that pose serious health threats, and related co-morbidities. The majority of its risk factors are lifestyle habits such as physical inactivity, lack of exercise, high body mass index (BMI), poor diet, and smoking, apart from some unavoidable ones such as family history of diabetes, ethnic predisposition, and ageing. Machine learning (ML) is increasingly being applied to alleviate the health burden of diabetes, and many research works have been proposed in the literature to offer clinical decision support in different application areas. In this paper, we present a review of such efforts for the prevention and management of type 2 diabetes. First, we present the medical gaps in the diabetes knowledge base, guidelines, and medical practice identified from relevant articles and highlight those that can be addressed by ML. Further, we review the ML research works in three application areas: (1) risk assessment (statistical risk scores and ML-based risk models), (2) diagnosis (using non-invasive and invasive features), and (3) prognosis (from normoglycemia/prior morbidity to incident diabetes, and from incident diabetes to related complications). We discuss and summarize the shortcomings or gaps in the existing ML methodologies for diabetes to be addressed in future. This review provides the breadth of ML predictive modeling applications for diabetes while highlighting the medical and technological gaps as well as various aspects involved in ML-based diabetes clinical decision support.

Use of Machine Learning and Routine Laboratory Tests for Diabetes Mellitus Screening. BioMed Research International 2022;2022:8114049. [PMID: 35392258 PMCID: PMC8983182 DOI: 10.1155/2022/8114049] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 10/21/2021] [Revised: 02/18/2022] [Accepted: 03/10/2022] [Indexed: 12/28/2022]

Delpino F, Costa Â, Farias S, Chiavegatto Filho A, Arcêncio R, Nunes B. Machine learning for predicting chronic diseases: a systematic review. Public Health 2022;205:14-25. [DOI: 10.1016/j.puhe.2022.01.007] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 05/27/2021] [Revised: 10/26/2021] [Accepted: 01/11/2022] [Indexed: 12/12/2022]

Fregoso-Aparicio L, Noguez J, Montesinos L, García-García JA. Machine learning and deep learning predictive models for type 2 diabetes: a systematic review. Diabetol Metab Syndr 2021;13:148. [PMID: 34930452 PMCID: PMC8686642 DOI: 10.1186/s13098-021-00767-9] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Received: 07/06/2021] [Accepted: 12/07/2021] [Indexed: 12/12/2022] Open

Nagpal MS, Barbaric A, Sherifali D, Morita PP, Cafazzo JA. Patient-Generated Data Analytics of Health Behaviors of People Living With Type 2 Diabetes: Scoping Review. JMIR Diabetes 2021;6:e29027. [PMID: 34783668 PMCID: PMC8726031 DOI: 10.2196/29027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/23/2021] [Revised: 08/01/2021] [Accepted: 10/31/2021] [Indexed: 11/13/2022] Open

Abstract

Background

Complications due to type 2 diabetes (T2D) can be mitigated through proper self-management that can positively change health behaviors. Technological tools are available to help people living with, or at risk of developing, T2D to manage their condition, and such tools provide a large repository of patient-generated health data (PGHD). Analytics can provide insights into the health behaviors of people living with T2D.

Objective

The aim of this review is to investigate what can be learned about the health behaviors of those living with, or at risk of developing, T2D through analytics from PGHD.

Methods

A scoping review using the Arksey and O’Malley framework was conducted in which a comprehensive search of the literature was conducted by 2 reviewers. In all, 3 electronic databases (PubMed, IEEE Xplore, and ACM Digital Library) were searched using keywords associated with diabetes, behaviors, and analytics. Several rounds of screening using predetermined inclusion and exclusion criteria were conducted, after which studies were selected. Critical examination took place through a descriptive-analytical narrative method, and data extracted from the studies were classified into thematic categories. These categories reflect the findings of this study as per our objective.

Results

We identified 43 studies that met the inclusion criteria for this review. Although 70% (30/43) of the studies examined PGHD independently, 30% (13/43) combined PGHD with other data sources. Most of these studies used machine learning algorithms to perform their analysis. The themes identified through this review include predicting diabetes or obesity, deriving factors that contribute to diabetes or obesity, obtaining insights from social media or web-based forums, predicting glycemia, improving adherence and outcomes, analyzing sedentary behaviors, deriving behavior patterns, discovering clinical correlations from behaviors, and developing design principles.

Conclusions

The increased volume and availability of PGHD have the potential to derive analytical insights into the health behaviors of people living with T2D. From the literature, we determined that analytics can predict outcomes and identify granular behavior patterns from PGHD. This review determined the broad range of insights that can be examined through PGHD, which constitutes a unique source of data for these applications that would not be possible through the use of other data sources.

Application of Data Mining Algorithms for Dementia in People with HIV/AIDS. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2021;2021:4602465. [PMID: 34335861 PMCID: PMC8286188 DOI: 10.1155/2021/4602465] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Received: 06/02/2020] [Accepted: 06/21/2021] [Indexed: 11/30/2022]

Dogan O, Tiwari S, Jabbar MA, Guggari S. A systematic review on AI/ML approaches against COVID-19 outbreak. COMPLEX INTELL SYST 2021;7:2655-2678. [PMID: 34777970 PMCID: PMC8256231 DOI: 10.1007/s40747-021-00424-8] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Received: 12/09/2020] [Accepted: 06/05/2021] [Indexed: 12/24/2022]

Channa R, Wolf R, Abramoff MD. Autonomous Artificial Intelligence in Diabetic Retinopathy: From Algorithm to Clinical Application. J Diabetes Sci Technol 2021;15:695-698. [PMID: 32126819 PMCID: PMC8120059 DOI: 10.1177/1932296820909900] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Indexed: 12/12/2022]

Avilés-Santa ML, Monroig-Rivera A, Soto-Soto A, Lindberg NM. Current State of Diabetes Mellitus Prevalence, Awareness, Treatment, and Control in Latin America: Challenges and Innovative Solutions to Improve Health Outcomes Across the Continent. Curr Diab Rep 2020;20:62. [PMID: 33037442 PMCID: PMC7546937 DOI: 10.1007/s11892-020-01341-9] [Citation(s) in RCA: 51] [Impact Index Per Article: 12.8] [Accepted: 09/10/2020] [Indexed: 02/06/2023]

Kopitar L, Kocbek P, Cilar L, Sheikh A, Stiglic G. Early detection of type 2 diabetes mellitus using machine learning-based prediction models. Sci Rep 2020;10:11981. [PMID: 32686721 PMCID: PMC7371679 DOI: 10.1038/s41598-020-68771-z] [Citation(s) in RCA: 77] [Impact Index Per Article: 19.3] [Received: 12/23/2019] [Accepted: 06/30/2020] [Indexed: 02/07/2023]

Musacchio N, Giancaterini A, Guaita G, Ozzello A, Pellegrini MA, Ponzani P, Russo GT, Zilich R, de Micheli A. Artificial Intelligence and Big Data in Diabetes Care: A Position Statement of the Italian Association of Medical Diabetologists. J Med Internet Res 2020;22:e16922. [PMID: 32568088 PMCID: PMC7338925 DOI: 10.2196/16922] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Received: 11/05/2019] [Revised: 03/09/2020] [Accepted: 04/12/2020] [Indexed: 12/24/2022]

Abstract

Over the last decade, most of our daily activities have become digital. Digital health takes into account the ever-increasing synergy between advanced medical technologies, innovation, and digital communication. Thanks to machine learning, we are no longer limited to a descriptive analysis of the data, as we can obtain greater value by identifying and predicting patterns resulting from inductive reasoning. Machine learning software programs that disclose the reasoning behind a prediction allow for “what-if” models by which it is possible to understand if and how, by changing certain factors, one may improve the outcomes, thereby identifying the optimal behavior. Currently, diabetes care is facing several challenges: the decreasing number of diabetologists, the increasing number of patients, the reduced time allowed for medical visits, the growing complexity of the disease from the standpoints of both clinical and patient care, the difficulty of achieving the relevant clinical targets, the growing burden of disease management for both the health care professional and the patient, and health care accessibility and sustainability. In this context, new digital technologies and the use of artificial intelligence are certainly a great opportunity. Herein, we report the results of a careful analysis of the current literature and present the vision of the Italian Association of Medical Diabetologists (AMD) on this controversial topic that, if well used, may be the key for a great scientific innovation. AMD believes that the use of artificial intelligence will enable the conversion of data (descriptive) into knowledge of the factors that “affect” the behavior and correlations (predictive), thereby identifying the key aspects that may establish an improvement of the expected results (prescriptive).
Artificial intelligence can therefore become a tool of great technical support to help diabetologists become fully responsible for the individual patient, thereby assuring customized and precise medicine. This, in turn, will allow for comprehensive therapies to be built in accordance with the evidence criteria that should always be the ground for any therapeutic choice.

Zhang Y, Zhang Q, Li L, Thomas R, Li SZ, He MG, Wang NL. Establishment and Comparison of Algorithms for Detection of Primary Angle Closure Suspect Based on Static and Dynamic Anterior Segment Parameters. Transl Vis Sci Technol 2020;9:16. [PMID: 32821488 PMCID: PMC7401939 DOI: 10.1167/tvst.9.5.16] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Received: 10/02/2019] [Accepted: 02/12/2020] [Indexed: 12/14/2022]

Battineni G, Sagaro GG, Chinatalapudi N, Amenta F. Applications of Machine Learning Predictive Models in the Chronic Disease Diagnosis. J Pers Med 2020;10:jpm10020021. [PMID: 32244292 PMCID: PMC7354442 DOI: 10.3390/jpm10020021] [Citation(s) in RCA: 79] [Impact Index Per Article: 19.8] [Received: 02/10/2020] [Revised: 03/09/2020] [Accepted: 03/23/2020] [Indexed: 02/07/2023]

Ambriola Oku AY, Zimeo Morais GA, Arantes Bueno AP, Fujita A, Sato JR. Potential Confounders in the Analysis of Brazilian Adolescent's Health: A Combination of Machine Learning and Graph Theory. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2019;17:ijerph17010090. [PMID: 31877700 PMCID: PMC6981403 DOI: 10.3390/ijerph17010090] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Received: 10/30/2019] [Revised: 12/09/2019] [Accepted: 12/16/2019] [Indexed: 12/20/2022]

Santos HGD, Nascimento CFD, Izbicki R, Duarte YADO, Porto Chiavegatto Filho AD. [Machine learning for predictive analyses in health: an example of an application to predict death in the elderly in São Paulo, Brazil]. CAD SAUDE PUBLICA 2019;35:e00050818. [PMID: 31365698 DOI: 10.1590/0102-311x00050818] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Received: 03/13/2018] [Accepted: 05/20/2019] [Indexed: 01/15/2023]

Machine Learning Model for Imbalanced Cholera Dataset in Tanzania. ScientificWorldJournal 2019;2019:9397578. [PMID: 31427903 PMCID: PMC6683776 DOI: 10.1155/2019/9397578] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Received: 02/24/2019] [Revised: 05/15/2019] [Accepted: 06/09/2019] [Indexed: 11/28/2022]

Abstract

Cholera epidemics have remained a public threat throughout history, affecting vulnerable populations living with unreliable water and substandard sanitary conditions. Various studies have observed that the occurrence of cholera is strongly linked with environmental factors such as climate change and geographical location. Climate change has been strongly linked to the seasonal occurrence and spread of cholera through the creation of weather patterns that favor the disease's transmission, infection, and the growth of Vibrio cholerae, which causes the disease. Over the past decades, there have been great achievements in developing epidemic models for the proper prediction of cholera. However, the integration of weather variables and the use of machine learning techniques have not been explicitly deployed in modeling cholera epidemics in Tanzania, owing to challenges in its datasets such as imbalanced data and missing information. This paper explores the use of machine learning techniques to model cholera epidemics with linkage to seasonal weather changes while overcoming the data imbalance problem. The Adaptive Synthetic Sampling Approach (ADASYN) and Principal Component Analysis (PCA) were used to restore the sampling balance and reduce the dimensionality of the dataset. In addition, sensitivity, specificity, and balanced-accuracy metrics were used to evaluate the performance of the seven models. Based on the results of the Wilcoxon signed-rank test and the features of the models, the XGBoost classifier was selected as the best model for the study. Overall, the results improved our understanding of the significant roles of machine learning strategies in health-care data. However, the study could not be treated as a time series problem due to data collection bias. The study recommends a review of health-care systems in order to facilitate quality data collection and the deployment of machine learning techniques.
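
The three evaluation metrics named in this abstract can be illustrated with a minimal sketch. The confusion-matrix counts below are hypothetical and chosen only to show an imbalanced case; they are not taken from the cited study, and the function name `binary_metrics` is likewise an assumption made for illustration.

```python
def binary_metrics(tp, fn, tn, fp):
    """Compute sensitivity, specificity, balanced accuracy, and plain accuracy
    from the four cells of a binary confusion matrix."""
    sensitivity = tp / (tp + fn)          # true positive rate
    specificity = tn / (tn + fp)          # true negative rate
    balanced_accuracy = (sensitivity + specificity) / 2
    accuracy = (tp + tn) / (tp + fn + tn + fp)
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "balanced_accuracy": balanced_accuracy,
        "accuracy": accuracy,
    }


# Hypothetical imbalanced example: 10 positive cases, 990 negative cases.
metrics = binary_metrics(tp=8, fn=2, tn=900, fp=90)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

With counts like these, plain accuracy is dominated by the majority class, whereas balanced accuracy weights both classes equally, which is presumably why it suits an imbalanced dataset.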

Christodoulou E, Ma J, Collins GS, Steyerberg EW, Verbakel JY, Van Calster B. A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models. J Clin Epidemiol 2019;110:12-22. [PMID: 30763612 DOI: 10.1016/j.jclinepi.2019.02.004] [Citation(s) in RCA: 793] [Impact Index Per Article: 158.6] [Received: 12/05/2018] [Revised: 01/18/2019] [Accepted: 02/05/2019] [Indexed: 02/06/2023]

Becker A. Artificial intelligence in medicine: What is it doing for us today? HEALTH POLICY AND TECHNOLOGY 2019. [DOI: 10.1016/j.hlpt.2019.03.004] [Citation(s) in RCA: 46] [Impact Index Per Article: 9.2] [Indexed: 02/06/2023]

An Accurate Clinical Implication Assessment for Diabetes Mellitus Prevalence Based on a Study from Nigeria. Processes (Basel) 2019. [DOI: 10.3390/pr7050289] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Indexed: 11/16/2022]

Aranda A, Valencia A. Computational study on the rupture risk in real cerebral aneurysms with geometrical and fluid-mechanical parameters using FSI simulations and machine learning algorithms. J MECH MED BIOL 2019. [DOI: 10.1142/s0219519419500143] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Indexed: 11/18/2022]

Pei D, Gong Y, Kang H, Zhang C, Guo Q. Accurate and rapid screening model for potential diabetes mellitus. BMC Med Inform Decis Mak 2019;19:41. [PMID: 30866905 PMCID: PMC6416888 DOI: 10.1186/s12911-019-0790-3] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Received: 02/28/2018] [Accepted: 03/03/2019] [Indexed: 11/26/2022]

Abstract

Background

Prediction or early diagnosis of diabetes is crucial for populations with high risk of diabetes.

Methods

In this study, we assessed the ability of five popular classifiers (J48, AdaboostM1, SMO, Bayes Net, and Naïve Bayes) to identify individuals with diabetes based on nine non-invasive and easily obtained clinical features, including age, gender, body mass index (BMI), hypertension, history of cardiovascular disease or stroke, family history of diabetes, physical activity, work stress, and salty food preference. A total of 4205 data entries were obtained from annual physical examination reports for adults in the Shengjing Hospital of China Medical University during January–April 2017. Weka data mining software was used to identify the best algorithm for diabetes classification.

Results

The results indicate that decision tree classifier J48 has the best performance (accuracy = 0.9503, precision = 0.950, recall = 0.950, F-measure = 0.948, and AUC = 0.964). The decision tree structure shows that age is the most significant feature, followed by family history of diabetes, work stress, BMI, salty food preference, physical activity, hypertension, gender, and history of cardiovascular disease or stroke.
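
As a quick illustration of how the F-measure relates to the precision and recall reported above, the sketch below computes their harmonic mean. This is only a worked check under the usual textbook definition; the function name `f_measure` is an assumption, and the abstract does not say how its own F-measure was aggregated across classes.

```python
def f_measure(precision, recall):
    """Harmonic mean of precision and recall (the F1 score)."""
    if precision + recall == 0:
        return 0.0  # avoid division by zero when both are zero
    return 2 * precision * recall / (precision + recall)


# Precision and recall as reported in the abstract for the J48 classifier.
f1 = f_measure(0.950, 0.950)
print(f"F-measure from reported precision and recall: {f1:.3f}")
```

The harmonic mean of the two reported values differs slightly from the reported F-measure of 0.948. One possible explanation, offered here only as an assumption, is that the reported figures are averages over classes, in which case the averaged F-measure need not equal the harmonic mean of the averaged precision and recall.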

Conclusions

Our study shows that decision tree analyses can be applied to screen individuals for early diabetes risk without the need for invasive tests. This procedure will be particularly useful in developing regions with high epidemiological risk and poor socioeconomic status, and enable clinical practitioners to rapidly screen patients for increased risk of diabetes. The key features in the tree structure could further facilitate diabetes prevention through targeted community interventions, which can potentially improve early diabetes diagnosis and reduce burdens on the healthcare system.

Pei D, Zhang C, Quan Y, Guo Q. Identification of Potential Type II Diabetes in a Chinese Population with a Sensitive Decision Tree Approach. J Diabetes Res 2019;2019:4248218. [PMID: 30805372 PMCID: PMC6362481 DOI: 10.1155/2019/4248218] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Received: 07/19/2018] [Revised: 11/20/2018] [Accepted: 12/18/2018] [Indexed: 12/17/2022]

Abstract

BACKGROUND

Diabetes mellitus is a chronic disease with a steady increase in prevalence. Because of its chronic course combined with devastating complications, the disorder can easily carry a financial burden. The early diagnosis of diabetes remains one of the major challenges facing medical providers, and satisfactory screening tools or methods are still required, especially population- or community-based ones.

METHODS

This is a retrospective cross-sectional study involving 15,323 subjects who underwent the annual check-up in the Department of Family Medicine of Shengjing Hospital of China Medical University from January 2017 to June 2017. After strict data filtering, 10,436 records from eligible participants were used to develop a prediction model using the J48 decision tree algorithm. Nine variables, including age, gender, body mass index (BMI), hypertension, history of cardiovascular disease or stroke, family history of diabetes, physical activity, work-related stress, and salty food preference, were considered.

RESULTS

The accuracy, precision, recall, and area under the receiver operating characteristic curve (AUC) value for identifying potential diabetes were 94.2%, 94.0%, 94.2%, and 94.8%, respectively. The structure of the decision tree shows that age is the most significant feature. Among participants aged ≤ 49, 5497 (97%) were identified as nondiabetic, whereas among those aged > 49, 771 (50%) were identified as nondiabetic. In the subgroup with 34 < age ≤ 49 and BMI ≥ 25, 89 (92%) of the 97 individuals with a positive family history of diabetes were identified as diabetic, whereas 576 (58%) of those without a family history of diabetes were identified as nondiabetic. Work-related stress was identified as being associated with diabetes. Among individuals with 34 < age ≤ 49 and BMI ≥ 25 and without a family history of diabetes, 22 (51%) of those with high work-related stress were identified as nondiabetic, compared with 349 (88%) of those with low or moderate work-related stress.

CONCLUSIONS

We proposed a decision tree classifier that uses nine easily obtained, noninvasive patient features as predictor variables to identify potential cases of diabetes. The classifier indicates that decision tree analysis can be successfully applied to screen for diabetes, which will support clinical practitioners in rapid diabetes identification. The model provides a means to target the prevention of diabetes, which could reduce the burden on the health system through effective case management.
