151
|
De Silva K, Enticott J, Barton C, Forbes A, Saha S, Nikam R. Use and performance of machine learning models for type 2 diabetes prediction in clinical and community care settings: Protocol for a systematic review and meta-analysis of predictive modeling studies. Digit Health 2021; 7:20552076211047390. [PMID: 34868616 PMCID: PMC8642048 DOI: 10.1177/20552076211047390] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2019] [Accepted: 09/01/2021] [Indexed: 12/23/2022] Open
Abstract
Objective Machine learning involves the use of algorithms without explicit
instructions. Of late, machine learning models have been widely applied for
the prediction of type 2 diabetes. However, no evidence synthesis of the
performance of these prediction models of type 2 diabetes is available. We
aim to identify machine learning prediction models for type 2 diabetes in
clinical and community care settings and determine their predictive
performance. Methods The systematic review of English language machine learning predictive
modeling studies in 12 databases will be conducted. Studies predicting type
2 diabetes in predefined clinical or community settings are eligible.
Standard CHARMS and TRIPOD guidelines will guide data extraction.
Methodological quality will be assessed using a predefined risk of bias
assessment tool. The extent of validation will be categorized by
Reilly–Evans levels. Primary outcomes include model performance metrics of
discrimination ability, calibration, and classification accuracy. Secondary
outcomes include candidate predictors, algorithms used, level of validation,
and intended use of models. The random-effects meta-analysis of c-indices
will be performed to evaluate discrimination abilities. The c-indices will
be pooled per prediction model, per model type, and per algorithm.
Publication bias will be assessed through funnel plots and regression tests.
Sensitivity analysis will be conducted to estimate the effects of study
quality and missing data on primary outcome. The sources of heterogeneity
will be assessed through meta-regression. Subgroup analyses will be
performed for primary outcomes. Ethics and dissemination No ethics approval is required, as no primary or personal data are collected.
Findings will be disseminated through scientific sessions and peer-reviewed
journals. PROSPERO registration number CRD42019130886
Collapse
Affiliation(s)
- Kushan De Silva
- Monash Centre for Health Research and Implementation, School of Public Health and Preventive Medicine, Faculty of Medicine, Nursing, and Health Sciences, Monash University, Australia
| | - Joanne Enticott
- Monash Centre for Health Research and Implementation, School of Public Health and Preventive Medicine, Faculty of Medicine, Nursing, and Health Sciences, Monash University, Australia
| | - Christopher Barton
- Department of General Practice, School of Public Health and Preventive Medicine, Faculty of Medicine, Nursing, and Health Sciences, Monash University, Australia
| | - Andrew Forbes
- Biostatistics Unit, Division of Research Methodology, School of Public Health and Preventive Medicine, Faculty of Medicine, Nursing, and Health Sciences, Monash University, Australia
| | - Sajal Saha
- Department of General Practice, School of Public Health and Preventive Medicine, Faculty of Medicine, Nursing, and Health Sciences, Monash University, Australia
| | - Rujuta Nikam
- Department of General Practice, School of Public Health and Preventive Medicine, Faculty of Medicine, Nursing, and Health Sciences, Monash University, Australia
| |
Collapse
|
152
|
The Role of New Technological Opportunities and the Need to Evaluate the Activities Performed in the Prevention of Diabetic Foot with Exercise Therapy. MEDICINES 2021; 8:medicines8120076. [PMID: 34940288 PMCID: PMC8706849 DOI: 10.3390/medicines8120076] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 08/24/2021] [Revised: 11/22/2021] [Accepted: 11/28/2021] [Indexed: 01/22/2023]
Abstract
The diabetic foot (DF) is one of the most feared conditions among chronic complications of diabetes, which affects a growing number of patients. Although exercise therapy (ET) has always been considered a pillar in the treatment of patients at risk of DF it is not usually used. Several causes can contribute to hindering both the organization of ET protocols for Diabetes Units and the participation in ET programs for patients at different levels of risk of foot ulceration. The risk of favoring the occurrence of ulcers and the absence of clear evidence on the role played by ET in the prevention of ulcers could be considered among the most important causes leading to the low application of ET. The increased availability of new technologies and in particular of systems and devices equipped with sensors can enable the remote monitoring and management of physical activity performed by patients. Consequently, they can become an opportunity for introducing the systematic use of ET for the treatment of patients at risk. Considering the complexity of the clinical conditions that patients at risk or with diabetic foot ulcer can show, the evaluation of how patients perform the ET proposed can consequently be very important. All this can contribute to improving the treatment of patients and avoiding possible adverse effects. The aim of this brief review was to describe that the use of new technologies and the assessment of the execution of the ET proposed allows an important step forward in the management of patients at risk.
Collapse
|
153
|
Zhang C, Chen X, Wang S, Hu J, Wang C, Liu X. Using CatBoost algorithm to identify middle-aged and elderly depression, national health and nutrition examination survey 2011-2018. Psychiatry Res 2021; 306:114261. [PMID: 34781111 DOI: 10.1016/j.psychres.2021.114261] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/04/2021] [Revised: 10/10/2021] [Accepted: 10/30/2021] [Indexed: 12/16/2022]
Abstract
Depression is one of the most common mental health problems in middle-aged and elderly people. The establishment of risk factor-based depression risk assessment model is conducive to early detection and early treatment of high-risk groups of depression. Five machine learning models (logistic regression (LR); back propagation (BP); random forest (RF); support vector machines (SVM); category boosting (CatBoost) were used to evaluate the depression among 8374 middle-aged people and 4636 elderly people in the NHANES database from 2011 to 2018. In the 2011-2018 cycle, the estimated prevalence of depression was 8.97% in the middle-aged participants and 8.02% in the elderly participants. Among the middle-aged and elderly participants, CatBoost was the best model to identify depression, and its area under the working characteristic curve (AUC) reaches the highest. The second is LR model and SVM model, while the performance of BP and RF model was slightly worse. The primary influencing factor of depression in middle-aged male is alanine aminotransferase. All five machine learning models can identify the occurrence of depression in the NHANES data set through social demographics, lifestyle, laboratory data and other data of middle-aged and elderly people, and among five models, the CatBoost model performed best.
Collapse
Affiliation(s)
- Chenyang Zhang
- Department of Epidemiology and Statistics, School of Public Health, Jilin University, Changchun, Jilin 130021, China
| | - Xiaofei Chen
- Department of Epidemiology and Statistics, School of Public Health, Jilin University, Changchun, Jilin 130021, China
| | - Song Wang
- Department of Epidemiology and Statistics, School of Public Health, Jilin University, Changchun, Jilin 130021, China
| | - Junjun Hu
- Department of Epidemiology and Statistics, School of Public Health, Jilin University, Changchun, Jilin 130021, China
| | - Chunpeng Wang
- School of Mathematics and Statistics, Northeast Normal University, Changchun 130000, China.
| | - Xin Liu
- Department of Epidemiology and Statistics, School of Public Health, Jilin University, Changchun, Jilin 130021, China.
| |
Collapse
|
154
|
Li W, Song Y, Chen K, Ying J, Zheng Z, Qiao S, Yang M, Zhang M, Zhang Y. Predictive model and risk analysis for diabetic retinopathy using machine learning: a retrospective cohort study in China. BMJ Open 2021; 11:e050989. [PMID: 34836899 PMCID: PMC8628336 DOI: 10.1136/bmjopen-2021-050989] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
OBJECTIVE Aiming to investigate diabetic retinopathy (DR) risk factors and predictive models by machine learning using a large sample dataset. DESIGN Retrospective study based on a large sample and a high dimensional database. SETTING A Chinese central tertiary hospital in Beijing. PARTICIPANTS Information on 32 452 inpatients with type-2 diabetes mellitus (T2DM) were retrieved from the electronic medical record system from 1 January 2013 to 31 December 2017. METHODS Sixty variables (including demography information, physical and laboratory measurements, system diseases and insulin treatments) were retained for baseline analysis. The optimal 17 variables were selected by recursive feature elimination. The prediction model was built based on XGBoost algorithm, and it was compared with three other popular machine learning techniques: logistic regression, random forest and support vector machine. In order to explain the results of XGBoost model more visually, the Shapley Additive exPlanation (SHAP) method was used. RESULTS DR occurred in 2038 (6.28%) T2DM patients. The XGBoost model was identified as the best prediction model with the highest AUC (area under the curve value, 0.90) and showed that an HbA1c value greater than 8%, nephropathy, a serum creatinine value greater than 100 µmol/L, insulin treatment and diabetic lower extremity arterial disease were associated with an increased risk of DR. A patient's age over 65 was associated with a decreased risk of DR. CONCLUSIONS With better comprehensive performance, XGBoost model had high reliability to assess risk indicators of DR. The most critical risk factors of DR and the cut-off of risk factors can be found by SHAP method to render the output of the XGBoost model clinically interpretable.
Collapse
Affiliation(s)
- Wanyue Li
- Medical School of Chinese PLA, Beijing, China
- Department of Ophthalmology, Chinese PLA General Hospital, Beijing, China
| | - Yanan Song
- Medical Big Data Research Center, Medical Innovation Research Division of Chinese PLA General Hospital, Beijing, China
| | - Kang Chen
- Department of Endocrinology, Chinese PLA General Hospital, Beijing, China
| | - Jun Ying
- Information Management Department, Chinese PLA General Hospital, Beijing, China
| | - Zhong Zheng
- Information Center, Logistics Support Department, Central Military Commission, Beijing, China
| | - Shen Qiao
- Medical Big Data Research Center, Medical Innovation Research Division of Chinese PLA General Hospital, Beijing, China
| | - Ming Yang
- Medical Big Data Research Center, Medical Innovation Research Division of Chinese PLA General Hospital, Beijing, China
| | - Maonian Zhang
- Medical School of Chinese PLA, Beijing, China
- Department of Ophthalmology, Chinese PLA General Hospital, Beijing, China
| | - Ying Zhang
- Department of Ophthalmology, Chinese PLA General Hospital, Beijing, China
| |
Collapse
|
155
|
Machine learning algorithms for diabetes detection: a comparative evaluation of performance of algorithms. EVOLUTIONARY INTELLIGENCE 2021. [DOI: 10.1007/s12065-021-00685-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
156
|
Tsichlaki S, Koumakis L, Tsiknakis M. A Systematic Review of T1D Hypoglycemia Prediction Algorithms (Preprint). JMIR Diabetes 2021; 7:e34699. [PMID: 35862181 PMCID: PMC9353679 DOI: 10.2196/34699] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2021] [Revised: 04/02/2022] [Accepted: 04/20/2022] [Indexed: 11/13/2022] Open
Affiliation(s)
- Stella Tsichlaki
- Department of Electrical & Computer Engineering, Hellenic Mediterranean University, Heraklion, Greece
| | - Lefteris Koumakis
- Institute of Computer Science, Foundation for Research and Technology-Hellas, Heraklion, Greece
| | - Manolis Tsiknakis
- Department of Electrical & Computer Engineering, Hellenic Mediterranean University, Heraklion, Greece
- Institute of Computer Science, Foundation for Research and Technology-Hellas, Heraklion, Greece
| |
Collapse
|
157
|
Broda MD, Bogenschutz M, Dinora P, Prohn SM, Lineberry S, Ross E. Using Machine Learning to Predict Patterns of Employment and Day Program Participation. AMERICAN JOURNAL ON INTELLECTUAL AND DEVELOPMENTAL DISABILITIES 2021; 126:477-491. [PMID: 34700349 DOI: 10.1352/1944-7558-126.6.477] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/24/2020] [Accepted: 03/16/2021] [Indexed: 06/13/2023]
Abstract
In this article, we demonstrate the potential of machine learning approaches as inductive analytic tools for expanding our current evidence base for policy making and practice that affects people with intellectual and developmental disabilities (IDD). Using data from the National Core Indicators In-Person Survey (NCI-IPS), a nationally validated annual survey of more than 20,000 nationally representative people with IDD, we fit a series of classification tree and random forest models to predict individuals' employment status and day activity participation as a function of their responses to all other items on the 2017-2018 NCI-IPS. The most accurate model, a random forest classifier, predicted employment outcomes of adults with IDD with an accuracy of 89 percent on the testing sample, and 80 percent on the holdout sample. The most important variable in this prediction was whether or not community employment was a goal in this person's service plan. These results suggest the potential machine learning tools to examine other valued outcomes used in evidence-based policy making to support people with IDD.
Collapse
Affiliation(s)
- Michael D Broda
- Michael D. Broda, Matthew Bogenschutz, Parthenia Dinora, Seb M. Prohn, Sarah Lineberry, and Erica Ross, Virginia Commonwealth University
| | - Matthew Bogenschutz
- Michael D. Broda, Matthew Bogenschutz, Parthenia Dinora, Seb M. Prohn, Sarah Lineberry, and Erica Ross, Virginia Commonwealth University
| | - Parthenia Dinora
- Michael D. Broda, Matthew Bogenschutz, Parthenia Dinora, Seb M. Prohn, Sarah Lineberry, and Erica Ross, Virginia Commonwealth University
| | - Seb M Prohn
- Michael D. Broda, Matthew Bogenschutz, Parthenia Dinora, Seb M. Prohn, Sarah Lineberry, and Erica Ross, Virginia Commonwealth University
| | - Sarah Lineberry
- Michael D. Broda, Matthew Bogenschutz, Parthenia Dinora, Seb M. Prohn, Sarah Lineberry, and Erica Ross, Virginia Commonwealth University
| | - Erica Ross
- Michael D. Broda, Matthew Bogenschutz, Parthenia Dinora, Seb M. Prohn, Sarah Lineberry, and Erica Ross, Virginia Commonwealth University
| |
Collapse
|
158
|
Kuo YY, Huang ST, Chiu HW. Applying artificial neural network for early detection of sepsis with intentionally preserved highly missing real-world data for simulating clinical situation. BMC Med Inform Decis Mak 2021; 21:290. [PMID: 34686163 PMCID: PMC8539833 DOI: 10.1186/s12911-021-01653-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2021] [Accepted: 10/12/2021] [Indexed: 12/12/2022] Open
Abstract
Purpose Some predictive systems using machine learning models have been developed to predict sepsis; however, they were mostly built with a low percent of missing values, which does not correspond with the actual clinical situation. In this study, we developed a machine learning model with a high rate of missing and erroneous data to enable prediction under missing, noisy, and erroneous inputs, as in the actual clinical situation. Materials and methods The proposed artificial neural network model was implemented using the MATLAB ANN toolbox, based on stochastic gradient descent. The dataset was collected over the past decade with approval from the appropriate institutional review boards, and the sepsis status was identified and labeled using Sepsis-3 clinical criteria. The imputation method was built by last observation carried forward and mean value, aimed to simulate clinical situation. Results The mean area under the receiver operating characteristic (ROC) curve (AUC) of classifying sepsis and nonsepsis patients was 0.82 and 0.786 at 0 h and 40 h prior to onset, respectively. The highest model performance was found for one-hourly data, demonstrating that our ANN model can perform adequately with limited hourly data provided. Conclusions Our model has the moderate ability to predict sepsis up to 40 h in advance under simulated clinical situation with real-world data.
Collapse
Affiliation(s)
- Yao-Yi Kuo
- School of Medicine, College of Medicine, Taipei Medical University, Taipei, Taiwan
| | - Shu-Tien Huang
- Department of Emergency Medicine, Mackay Memorial Hospital, Taipei, Taiwan
| | - Hung-Wen Chiu
- Graduate Institute of Biomedical Informatics, College of Medical Science and Technology, Taipei Medical University, Taipei, Taiwan.
| |
Collapse
|
159
|
Hatmal MM, Alshaer W, Mahmoud IS, Al-Hatamleh MAI, Al-Ameer HJ, Abuyaman O, Zihlif M, Mohamud R, Darras M, Al Shhab M, Abu-Raideh R, Ismail H, Al-Hamadi A, Abdelhay A. Investigating the association of CD36 gene polymorphisms (rs1761667 and rs1527483) with T2DM and dyslipidemia: Statistical analysis, machine learning based prediction, and meta-analysis. PLoS One 2021; 16:e0257857. [PMID: 34648514 PMCID: PMC8516279 DOI: 10.1371/journal.pone.0257857] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2021] [Accepted: 09/11/2021] [Indexed: 12/15/2022] Open
Abstract
CD36 (cluster of differentiation 36) is a membrane protein involved in lipid metabolism and has been linked to pathological conditions associated with metabolic disorders, such as diabetes and dyslipidemia. A case-control study was conducted and included 177 patients with type-2 diabetes mellitus (T2DM) and 173 control subjects to study the involvement of CD36 gene rs1761667 (G>A) and rs1527483 (C>T) polymorphisms in the pathogenesis of T2DM and dyslipidemia among Jordanian population. Lipid profile, blood sugar, gender and age were measured and recorded. Also, genotyping analysis for both polymorphisms was performed. Following statistical analysis, 10 different neural networks and machine learning (ML) tools were used to predict subjects with diabetes or dyslipidemia. Towards further understanding of the role of CD36 protein and gene in T2DM and dyslipidemia, a protein-protein interaction network and meta-analysis were carried out. For both polymorphisms, the genotypic frequencies were not significantly different between the two groups (p > 0.05). On the other hand, some ML tools like multilayer perceptron gave high prediction accuracy (≥ 0.75) and Cohen's kappa (κ) (≥ 0.5). Interestingly, in K-star tool, the accuracy and Cohen's κ values were enhanced by including the genotyping results as inputs (0.73 and 0.46, respectively, compared to 0.67 and 0.34 without including them). This study confirmed, for the first time, that there is no association between CD36 polymorphisms and T2DM or dyslipidemia among Jordanian population. Prediction of T2DM and dyslipidemia, using these extensive ML tools and based on such input data, is a promising approach for developing diagnostic and prognostic prediction models for a wide spectrum of diseases, especially based on large medical databases.
Collapse
Affiliation(s)
- Ma’mon M. Hatmal
- Department of Medical Laboratory Sciences, Faculty of Applied Medical Sciences, The Hashemite University, Zarqa, Jordan
- * E-mail:
| | - Walhan Alshaer
- Cell Therapy Centre, The University of Jordan, Amman, Jordan
| | - Ismail S. Mahmoud
- Department of Medical Laboratory Sciences, Faculty of Applied Medical Sciences, The Hashemite University, Zarqa, Jordan
| | - Mohammad A. I. Al-Hatamleh
- Department of Immunology, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian, Kelantan, Malaysia
| | - Hamzeh J. Al-Ameer
- Department of Biology and Biotechnology, American University of Madaba, Madaba, Jordan
- Department of Pharmacology, Faculty of Medicine, The University of Jordan, Amman, Jordan
| | - Omar Abuyaman
- Department of Medical Laboratory Sciences, Faculty of Applied Medical Sciences, The Hashemite University, Zarqa, Jordan
| | - Malek Zihlif
- Department of Pharmacology, Faculty of Medicine, The University of Jordan, Amman, Jordan
| | - Rohimah Mohamud
- Department of Immunology, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian, Kelantan, Malaysia
| | - Mais Darras
- Department of Medical Laboratory Sciences, Faculty of Applied Medical Sciences, The Hashemite University, Zarqa, Jordan
| | - Mohammad Al Shhab
- Department of Pharmacology, Faculty of Medicine, The University of Jordan, Amman, Jordan
| | - Rand Abu-Raideh
- Department of Medical Laboratory Sciences, Faculty of Applied Medical Sciences, The Hashemite University, Zarqa, Jordan
| | - Hilweh Ismail
- Department of Medical Laboratory Sciences, Faculty of Applied Medical Sciences, The Hashemite University, Zarqa, Jordan
| | - Ali Al-Hamadi
- Department of Medical Laboratory Sciences, Faculty of Applied Medical Sciences, The Hashemite University, Zarqa, Jordan
| | - Ali Abdelhay
- Department of Pharmacology, Faculty of Medicine, The University of Jordan, Amman, Jordan
| |
Collapse
|
160
|
Maurya MR, Riyaz NUSS, Reddy MSB, Yalcin HC, Ouakad HM, Bahadur I, Al-Maadeed S, Sadasivuni KK. A review of smart sensors coupled with Internet of Things and Artificial Intelligence approach for heart failure monitoring. Med Biol Eng Comput 2021; 59:2185-2203. [PMID: 34611787 DOI: 10.1007/s11517-021-02447-2] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2021] [Accepted: 09/01/2021] [Indexed: 02/07/2023]
Abstract
Over the last decade, there has been a huge demand for health care technologies such as sensors-based prediction using digital health. With the continuous rise in the human population, these technologies showed to be potentially effective solutions to life-threatening diseases such as heart failure (HF). Besides being a potential for early death, HF has a significantly reduced quality of life (QoL). Heart failure has no cure. However, treatment can help you live a longer and more active life with fewer symptoms. Thus, it is essential to develop technological aid solutions allowing early diagnosis and consequently, effective treatment with possibly delayed mortality. Commonly, forecasts of HF are based on the generation of vast volumes of data usually collected from an individual patient by different components of the family history, physical examination, basic laboratory results, and other medical records. Though, these data are not effectively useful for predicting this failure, nevertheless, with the aid of advanced medical technology such as interconnected multi-sensory-based devices, and based on several medical history characteristics, the broad data provided machine learning algorithms to predict risk factors for heart disease of an individual is beneficial. There will be many challenges for the next decade of advancements in HF care: exploiting an increasingly growing repertoire of interconnected internal and external sensors for the benefit of patients and processing large, multimodal datasets with new Artificial Intelligence (AI) software. Various methods for predicting heart failure and, primarily the significance of invasive and non-invasive sensors along with different strategies for machine learning to predict heart failure are presented and summarized in the present study.
Collapse
Affiliation(s)
- Muni Raj Maurya
- Center for Advanced Materials, Qatar University, P.O. Box 2713, Doha, Qatar
- Department of Mechanical and Industrial Engineering, Qatar University, P.O. Box 2713, Doha, Qatar
| | | | - M Sai Bhargava Reddy
- Center for Nanoscience and Technology, Institute of Science and Technology, Jawaharlal Nehru Technological University, Hyderabad, Telangana State, 500085, India
| | | | - Hassen M Ouakad
- Mechanical and Industrial Engineering Department, College of Engineering, Sultan Qaboos University, Al-Khoudh, 123, PO-BOX 33, Muscat, Oman.
| | - Issam Bahadur
- Mechanical and Industrial Engineering Department, College of Engineering, Sultan Qaboos University, Al-Khoudh, 123, PO-BOX 33, Muscat, Oman
| | - Somaya Al-Maadeed
- Department of Computer Engineering, Qatar University, P.O. Box 2713, Doha, Qatar
| | | |
Collapse
|
161
|
Nguyen P, Ohnmacht AJ, Galhoz A, Büttner M, Theis F, Menden MP. Künstliche Intelligenz und maschinelles Lernen in der Diabetesforschung. DIABETOLOGE 2021. [DOI: 10.1007/s11428-021-00817-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
162
|
Prediction of Diabetic Sensorimotor Polyneuropathy Using Machine Learning Techniques. J Clin Med 2021; 10:jcm10194576. [PMID: 34640594 PMCID: PMC8509372 DOI: 10.3390/jcm10194576] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2021] [Revised: 09/28/2021] [Accepted: 09/30/2021] [Indexed: 02/07/2023] Open
Abstract
Diabetic sensorimotor polyneuropathy (DSPN) is a major complication in patients with diabetes mellitus (DM), and early detection or prediction of DSPN is important for preventing or managing neuropathic pain and foot ulcer. Our aim is to delineate whether machine learning techniques are more useful than traditional statistical methods for predicting DSPN in DM patients. Four hundred seventy DM patients were classified into four groups (normal, possible, probable, and confirmed) based on clinical and electrophysiological findings of suspected DSPN. Three ML methods, XGBoost (XGB), support vector machine (SVM), and random forest (RF), and their combinations were used for analysis. RF showed the best area under the receiver operator characteristic curve (AUC, 0.8250) for differentiating between two categories—criteria by clinical findings (normal, possible, and probable groups) and those by electrophysiological findings (confirmed group)—and the result was superior to that of linear regression analysis (AUC = 0.6620). Average values of serum glucose, International Federation of Clinical Chemistry (IFCC), HbA1c, and albumin levels were identified as the four most important predictors of DSPN. In conclusion, machine learning techniques, especially RF, can predict DSPN in DM patients effectively, and electrophysiological analysis is important for identifying DSPN.
Collapse
|
163
|
Ramazi R, Perndorfer C, Soriano EC, Laurenceau JP, Beheshti R. Predicting Progression Patterns of Type 2 Diabetes using Multi-sensor Measurements. ACTA ACUST UNITED AC 2021; 21. [PMID: 34568534 DOI: 10.1016/j.smhl.2021.100206] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Type 2 diabetes - a prevalent chronic disease worldwide - increases risk for serious health consequences including heart and kidney disease. Forecasting diabetes progression can inform disease management strategies, thereby potentially reducing the likelihood or severity of its consequences. We use continuous glucose monitoring and actigraphy data from 54 individuals with Type 2 diabetes to predict their future hemoglobin A1c, HDL cholesterol, LDL cholesterol, and triglyceride levels one year later. We use a combination of convolutional and recurrent neural networks to develop a deep neural network architecture that can learn the dynamic patterns in different sensors' data and combine those patterns with additional demographic and lab data. To further demonstrate the generalizability of our models, we also evaluate their performance using an independent public dataset of individuals with Type 1 diabetes. In addition to diabetes, our approach could be useful for other serious and chronic physical illness, where dynamic (e.g., from multiple sensors) and static (e.g., demographic) data are used for creating predictive models.
Collapse
Affiliation(s)
- Ramin Ramazi
- Department of Computer & Informational Sciences, University of Delaware, Newark, DE, USA
| | - Christine Perndorfer
- Department of Psychological & Brain Sciences, University of Delaware, Newark, DE, USA
| | - Emily C Soriano
- Department of Psychological & Brain Sciences, University of Delaware, Newark, DE, USA
| | | | - Rahmatollah Beheshti
- Department of Computer & Informational Sciences, University of Delaware, Newark, DE, USA
| |
Collapse
|
164
|
Haneef R, Kab S, Hrzic R, Fuentes S, Fosse-Edorh S, Cosson E, Gallay A. Use of artificial intelligence for public health surveillance: a case study to develop a machine Learning-algorithm to estimate the incidence of diabetes mellitus in France. ACTA ACUST UNITED AC 2021; 79:168. [PMID: 34551816 PMCID: PMC8456679 DOI: 10.1186/s13690-021-00687-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2020] [Accepted: 09/02/2021] [Indexed: 11/10/2022]
Abstract
BACKGROUND The use of machine learning techniques is increasing in healthcare which allows to estimate and predict health outcomes from large administrative data sets more efficiently. The main objective of this study was to develop a generic machine learning (ML) algorithm to estimate the incidence of diabetes based on the number of reimbursements over the last 2 years. METHODS We selected a final data set from a population-based epidemiological cohort (i.e., CONSTANCES) linked with French National Health Database (i.e., SNDS). To develop this algorithm, we adopted a supervised ML approach. Following steps were performed: i. selection of final data set, ii. target definition, iii. Coding variables for a given window of time, iv. split final data into training and test data sets, v. variables selection, vi. training model, vii. Validation of model with test data set and viii. Selection of the model. We used the area under the receiver operating characteristic curve (AUC) to select the best algorithm. RESULTS The final data set used to develop the algorithm included 44,659 participants from CONSTANCES. Out of 3468 variables from SNDS linked to CONSTANCES cohort were coded, 23 variables were selected to train different algorithms. The final algorithm to estimate the incidence of diabetes was a Linear Discriminant Analysis model based on number of reimbursements of selected variables related to biological tests, drugs, medical acts and hospitalization without a procedure over the last 2 years. This algorithm has a sensitivity of 62%, a specificity of 67% and an accuracy of 67% [95% CI: 0.66-0.68]. CONCLUSIONS Supervised ML is an innovative tool for the development of new methods to exploit large health administrative databases. In context of InfAct project, we have developed and applied the first time a generic ML-algorithm to estimate the incidence of diabetes for public health surveillance. The ML-algorithm we have developed, has a moderate performance. The next step is to apply this algorithm on SNDS to estimate the incidence of type 2 diabetes cases. More research is needed to apply various MLTs to estimate the incidence of various health conditions.
Collapse
Affiliation(s)
- Romana Haneef
- Department of Non-Communicable Diseases and Injuries, Santé Publique France, 12 rue du Val d'Onse, 94415, Saint-Maurice, France.
| | - Sofiane Kab
- Population-Based Epidemiological Cohorts Unit, INSERM UMS 011, Villejuif, France
| | - Rok Hrzic
- Department of International Health, Care and Public Health Research Institute - CAPHRI, University of Maastricht University, Maastricht, The Netherlands
| | - Sonsoles Fuentes
- Department of Non-Communicable Diseases and Injuries, Santé Publique France, 12 rue du Val d'Onse, 94415, Saint-Maurice, France
| | - Sandrine Fosse-Edorh
- Department of Non-Communicable Diseases and Injuries, Santé Publique France, 12 rue du Val d'Onse, 94415, Saint-Maurice, France
| | - Emmanuel Cosson
- Department of Endocrinology-Diabetology-Nutrition, AP-HP, Avicenne Hospital, Paris 13 University, Sorbonne Paris Cité, CRNH-IdF, CINFO, Bobigny, France.,Sorbonne Paris Cité, UMR U1153 Inserm/U1125 Inra/Cnam/Université Paris 13, Bobigny, France
| | - Anne Gallay
- Department of Non-Communicable Diseases and Injuries, Santé Publique France, 12 rue du Val d'Onse, 94415, Saint-Maurice, France
| |
Collapse
|
165
|
Cichosz SL, Kronborg T, Jensen MH, Hejlesen O. Penalty weighted glucose prediction models could lead to better clinically usage. Comput Biol Med 2021; 138:104865. [PMID: 34543891 DOI: 10.1016/j.compbiomed.2021.104865] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2021] [Revised: 08/27/2021] [Accepted: 09/10/2021] [Indexed: 12/23/2022]
Abstract
BACKGROUND AND OBJECTIVE Numerous attempts to predict glucose value from continuous glucose monitors (CGM) have been published. However, there is a lack of proper analysis and modeling of penalty for errors in different glycemic ranges. The aim of this study was to investigate the potential for developing glucose prediction models with focus on the clinical aspects. METHODS We developed and compared six different models to test which approach were best suited for predicting glucose levels at different lead times between 10 and 60 min. The models were: last observation carried forward, linear extrapolation, ensemble methods using LSBoost and bagging, neural networks, one without error-weights and one with error-weights. The modeling and test were based on 225 type 1 diabetes patients with 315,000 h of CGM data. RESULTS Results show that it is possible to predict glucose levels based on CGM with a reasonable accuracy and precision with a 30-min prediction lead time. A comparison of different methods shows that there are improvements on performance gained from using more advanced machine learning algorithms (MARD 10.26-10.79 @ 30-min lead time) compared to a simple modeling (MARD 10.75-12.97 @ 30-min lead time). Moreover, the proposed use of error weights could lead to better clinical performance of these models, which is an important factor for real usage. E.g., the percentages in the C-zone of the consensus error grid without error-weights (0.57-0.68%) vs including error-weights (0.28%). CONCLUSIONS The results point toward that using error weighting in the training of the models could lead to better clinical performance.
Collapse
Affiliation(s)
| | - Thomas Kronborg
- Department of Health Science and Technology, Aalborg University, Denmark
| | - Morten Hasselstrøm Jensen
- Department of Health Science and Technology, Aalborg University, Denmark; Steno Diabetes Center North Denmark, Aalborg University Hospital, Aalborg, Denmark
| | - Ole Hejlesen
- Department of Health Science and Technology, Aalborg University, Denmark
| |
Collapse
|
166
|
Alex SA, Nayahi JJV, Shine H, Gopirekha V. Deep convolutional neural network for diabetes mellitus prediction. Neural Comput Appl 2021. [DOI: 10.1007/s00521-021-06431-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
|
167
|
Yang YC, Islam SU, Noor A, Khan S, Afsar W, Nazir S. Influential Usage of Big Data and Artificial Intelligence in Healthcare. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2021; 2021:5812499. [PMID: 34527076 PMCID: PMC8437645 DOI: 10.1155/2021/5812499] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/08/2021] [Accepted: 08/09/2021] [Indexed: 01/07/2023]
Abstract
Artificial intelligence (AI) is making computer systems capable of executing human brain tasks in many fields in all aspects of daily life. The enhancement in information and communications technology (ICT) has indisputably improved the quality of people's lives around the globe. Especially, ICT has led to a very needy and tremendous improvement in the health sector which is commonly known as electronic health (eHealth) and medical health (mHealth). Deep machine learning and AI approaches are commonly presented in many applications using big data, which consists of all relevant data about the medical health and diseases which a model can access at the time of execution or diagnosis of diseases. For example, cardiovascular imaging has now accurate imaging combined with big data from the eHealth record and pathology to better characterize the disease and personalized therapy. In clinical work and imaging, cancer care is getting improved by knowing the tumor biology and helping in the implementation of precision medicine. The Markov model is used to extract new approaches for leveraging cancer. In this paper, we have reviewed existing research relevant to eHealth and mHealth where various models are discussed which uses big data for the diagnosis and healthcare system. This paper summarizes the recent promising applications of AI and big data in medical health and electronic health, which have potentially added value to diagnosis and patient care.
Collapse
Affiliation(s)
- Yan Cheng Yang
- Foreign Language Department, Luoyang Institute of Science and Technology, Luoyang, Henan, China
- Foreign Language Department/Language and Cognition Center, Hunan University, Changsha, Hunan, China
| | - Saad Ul Islam
- Department of Computer Science, University of Swabi, Swabi, Pakistan
| | - Asra Noor
- Department of Computer Science, University of Swabi, Swabi, Pakistan
| | - Sadia Khan
- Department of Computer Science, University of Swabi, Swabi, Pakistan
| | - Waseem Afsar
- Department of Computer Science, University of Swabi, Swabi, Pakistan
| | - Shah Nazir
- Department of Computer Science, University of Swabi, Swabi, Pakistan
| |
Collapse
|
168
|
Foo LL, Ng WY, Lim GYS, Tan TE, Ang M, Ting DSW. Artificial intelligence in myopia: current and future trends. Curr Opin Ophthalmol 2021; 32:413-424. [PMID: 34310401 DOI: 10.1097/icu.0000000000000791] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
PURPOSE OF REVIEW Myopia is one of the leading causes of visual impairment, with a projected increase in prevalence globally. One potential approach to address myopia and its complications is early detection and treatment. However, current healthcare systems may not be able to cope with the growing burden. Digital technological solutions such as artificial intelligence (AI) have emerged as a potential adjunct for myopia management. RECENT FINDINGS There are currently four significant domains of AI in myopia, including machine learning (ML), deep learning (DL), genetics and natural language processing (NLP). ML has been demonstrated to be a useful adjunctive for myopia prediction and biometry for cataract surgery in highly myopic individuals. DL techniques, particularly convoluted neural networks, have been applied to various image-related diagnostic and predictive solutions. Applications of AI in genomics and NLP appear to be at a nascent stage. SUMMARY Current AI research is mainly focused on disease classification and prediction in myopia. Through greater collaborative research, we envision AI will play an increasingly critical role in big data analysis by aggregating a greater variety of parameters including genomics and environmental factors. This may enable the development of generalizable adjunctive DL systems that could help realize predictive and individualized precision medicine for myopic patients.
Collapse
Affiliation(s)
- Li Lian Foo
- Singapore National Eye Centre, Singapore Eye Research Institute
- Duke-NUS Medical School, National University of Singapore, Singapore, Singapore
| | - Wei Yan Ng
- Singapore National Eye Centre, Singapore Eye Research Institute
- Duke-NUS Medical School, National University of Singapore, Singapore, Singapore
| | | | - Tien-En Tan
- Singapore National Eye Centre, Singapore Eye Research Institute
| | - Marcus Ang
- Singapore National Eye Centre, Singapore Eye Research Institute
- Duke-NUS Medical School, National University of Singapore, Singapore, Singapore
| | - Daniel Shu Wei Ting
- Singapore National Eye Centre, Singapore Eye Research Institute
- Duke-NUS Medical School, National University of Singapore, Singapore, Singapore
| |
Collapse
|
169
|
Xu Z, Shen D, Nie T, Kou Y, Yin N, Han X. A cluster-based oversampling algorithm combining SMOTE and k-means for imbalanced medical data. Inf Sci (N Y) 2021. [DOI: 10.1016/j.ins.2021.02.056] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
170
|
Chen Z, Zhang J, Zhang Y, Huang Z. Traffic Accident Data Generation Based on Improved Generative Adversarial Networks. SENSORS 2021; 21:s21175767. [PMID: 34502657 PMCID: PMC8434573 DOI: 10.3390/s21175767] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/31/2021] [Revised: 08/19/2021] [Accepted: 08/24/2021] [Indexed: 11/18/2022]
Abstract
For urban traffic, traffic accidents are the most direct and serious risk to people’s lives, and rapid recognition and warning of traffic accidents is an important remedy to reduce their harmful effects. However, research scholars are often confronted with the problem of scarce and difficult-to-collect accident data resources for traffic accident scenarios. Therefore, in this paper, a traffic data generation model based on Generative Adversarial Networks (GAN) is developed. To make GAN applicable to non-graphical data, we improve the generator network structure of the model and used the generated model to resample the original data to obtain new traffic accident data. By constructing an adversarial neural network model, we generate a large number of data samples that are similar to the original traffic accident data. Results of the statistical test indicate that the generated samples are not significantly different from the original data. Furthermore, the experiments of traffic accident recognition with several representative classifiers demonstrate that the augmented data can effectively enhance the performance of accident recognition, with a maximum increase in accuracy of 3.05% and a maximum decrease in the false positive rate of 2.95%. Experimental results verify that the proposed method can provide reliable mass data support for the recognition of traffic accidents and road traffic safety.
Collapse
Affiliation(s)
- Zhijun Chen
- Intelligent Transportation Systems Research Center, Wuhan University of Technology, Wuhan 430070, China; (Z.C.); (J.Z.)
- School of Computer and Artificial Intelligence, Wuhan University of Technology, Wuhan 430070, China
| | - Jingming Zhang
- Intelligent Transportation Systems Research Center, Wuhan University of Technology, Wuhan 430070, China; (Z.C.); (J.Z.)
- School of Computer and Artificial Intelligence, Wuhan University of Technology, Wuhan 430070, China
| | - Yishi Zhang
- School of Management, Wuhan University of Technology, Wuhan 430070, China;
| | - Zihao Huang
- School of Civil Engineering and Transportation, South China University of Technology, Guangzhou 510640, China
- Correspondence: ; Tel.: +86-134-1951-6612
| |
Collapse
|
171
|
Gouda P, Zheng S, Peters T, Fudim M, Randhawa VK, Ezekowitz J, Mavrakanas TA, Giannetti N, Tsoukas M, Lopes R, Sharma A. Clinical Phenotypes in Patients With Type 2 Diabetes Mellitus: Characteristics, Cardiovascular Outcomes and Treatment Strategies. Curr Heart Fail Rep 2021; 18:253-263. [PMID: 34427881 DOI: 10.1007/s11897-021-00527-w] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 07/22/2021] [Indexed: 02/06/2023]
Abstract
PURPOSE OF REVIEW With recent advances in the pharmacological management of type 2 diabetes mellitus (T2DM), there is a growing need to understand which patients optimally benefit from these novel therapies. Various clinical clustering methodologies have emerged that utilise data-agnostic strategies to categorise patients that have similar clinical characteristics and outcomes; broadly, this characterisation is termed phenotyping. In patients with T2DM, we aimed to describe patient characteristics from phenotype studies, their cardiovascular risk profiles and the impact of antihyperglycemic treatment. RECENT FINDINGS Numerous phenotypic studies have been undertaken that have utilised a combination of clinical, biochemical, imaging and genetic variables. Each of these has produced phenotypes that display a spectrum of cardiovascular risk. Studies that aimed to describe pathophysiological phenotypes generally identified five phenotypes: autoimmune phenotype, insulin-related phenotypes (including permutations of insulin deficiency and resistance), obesity phenotype, ageing phenotype, and a sex-related phenotype. Studies examining risk profiles have demonstrated that across such phenotypes there is a spectrum of risk for diabetic complications. Few studies have examined treatment effects across these phenotypes, and thus provide little insights towards making phenotype-guided treatment decisions Clustering analyses in patients with T2DM have identified distinct phenotypes with unique risk profiles. Further studies are needed that harness the use of clinical, biochemical, imaging and genetic data to explore therapeutic heterogeneity and response to antihyperglycemic treatment across the spectrum of patient phenotypes.
Collapse
Affiliation(s)
- Pishoy Gouda
- Department of Medicine, Division of Cardiology, University of Alberta, Edmonton, AB, Canada
| | - Sijia Zheng
- Department of Medicine, McGill University Health Centre, 1001 Decarie Blvd, Montreal, QC, H4A3J1, Canada
| | - Tricia Peters
- Centre for Clinical Epidemiology, Lady Davis Institute for Medical Research, Montreal, QC, Canada.,Division of Endocrinology, Department of Medicine, The Jewish General Hospital, McGill University, Montreal, QC, Canada
| | - Marat Fudim
- Department of Medicine, Duke Clinical Research Institute, Duke University Medical Center, Durham, NC, USA
| | - Varinder Kaur Randhawa
- Department of Cardiovascular Medicine, Kaufman Center for Heart Failure and Recovery, Cleveland Clinic Foundation, Cleveland, OH, 44195, USA
| | - Justin Ezekowitz
- Department of Medicine, Division of Cardiology, University of Alberta, Edmonton, AB, Canada.,Canadian VIGOUR Centre, Alberta, Canada
| | - Thomas A Mavrakanas
- Division of Nephrology, Department of Medicine, McGill University Health Centre and Research Institute, Montreal, Canada
| | - Nadia Giannetti
- Department of Medicine, McGill University Health Centre, 1001 Decarie Blvd, Montreal, QC, H4A3J1, Canada
| | - Michael Tsoukas
- Division of Endocrinology, McGill University Health Centre, Montreal, Canada
| | - Renato Lopes
- Department of Medicine, Duke Clinical Research Institute, Duke University Medical Center, Durham, NC, USA
| | - Abhinav Sharma
- Department of Medicine, McGill University Health Centre, 1001 Decarie Blvd, Montreal, QC, H4A3J1, Canada.
| |
Collapse
|
172
|
Abdeldayem M, Aldulaimi S. Entrepreneurial finance and crowdfunding in the Middle East. INTERNATIONAL JOURNAL OF ORGANIZATIONAL ANALYSIS 2021. [DOI: 10.1108/ijoa-03-2021-2684] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Purpose
The purpose of this study is to analyze crowdfunding (CF) as new entrepreneurial finance (EF) tool and to predict the success of CF projects in the Middle East region.
Design/methodology/approach
This study was conducted in seven Middle Eastern countries (i.e. Turkey, Egypt, Iraq, Saudi Arabia, Bahrain, Kuwait and UAE) in addition to serval CF platforms that are commonly used by crowd funders in this region (such as Kickstarter, GoFundMe, Beehive and Zoomal) with total members (195,193). A pilot sample of 20 units was used to validate and verify the research instrument of the study. The research sample consists of 1,910 respondents from the seven countries included in the study. The study emphasizes the partners, micro-structures, administrative conditions and CF advancement in the Middle East.
Findings
The findings reveal that CF’s presence positively impacts fundraising success and that CF platforms are an effective financial technology (Fintech) tool for financing entrepreneurs in the Middle East. The study shows that the success of CF projects in the Middle East can be anticipated by estimating and breaking down enormous information of web-based and social media movement, human resources of funders and online venture introduction. The authors conclude with recommendations for future EF and CF research.
Originality/value
This study aims to analyze the CF and EF principles in the Middle East region as the CF experience and practice in this part of the world tend to be unexplored in terms of research. Presently a very few numbers published research on CF exists. Moreover, to the best of the knowledge, there is no single study investigating CF as an alternative financing source in the Middle East. In particular, the study.
Collapse
|
173
|
Dekamin A, Wahab MIM, Guergachi A, Keshavjee K. FIUS: Fixed partitioning undersampling method. Clin Chim Acta 2021; 522:174-183. [PMID: 34425104 DOI: 10.1016/j.cca.2021.08.023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2021] [Revised: 07/30/2021] [Accepted: 08/18/2021] [Indexed: 11/28/2022]
Abstract
BACKGROUND AND OBJECTIVE In the medical field, data techniques for prediction and finding patterns of prevalent diseases are of increasing interest. Classification is one of the methods used to provide insight into predicting the future onset of type 2 diabetes of those at high risk of progression from pre-diabetes to diabetes. When applying classification techniques to real-world datasets, imbalanced class distribution has been one of the most significant limitations that leads to patients' misclassification. In this paper, we propose a novel balancing method to improve the prediction performance of type 2 diabetes mellitus in imbalanced electronic medical records (EMR). METHODS A novel undersampling method is proposed by utilizing a fixed partitioning distribution scheme in a regular grid. The proposed approach retains valuable information when balancing methods are applied to datasets. RESULTS The best AUC of 80% compared to other classifiers was obtained from the logistic regression (LR) classifier for EMR by applying our proposed undersampling method to balance the data. The new method improved the performance of the LR classifier compared to existing undersampling methods used in the balancing stage. CONCLUSION The results demonstrate the effectiveness and high performance of the proposed method for predicting diabetes in a Canadian imbalanced dataset. Our methodology can be used in other areas to overcome the limitations of imbalanced class distributions.
Collapse
Affiliation(s)
- Azam Dekamin
- Department of Mechanical and Industrial Engineering, Ryerson University, 350 Victoria Street, Toronto, ON M5B 2K3, Canada.
| | - M I M Wahab
- Department of Mechanical and Industrial Engineering, Ryerson University, 350 Victoria Street, Toronto, ON M5B 2K3, Canada
| | - Aziz Guergachi
- Ted Rogers, School of Information Technology Management, Ryerson University, 350 Victoria Street, Toronto, ON M5B 2K3, Canada
| | - Karim Keshavjee
- Institute of Health Policy, Management and Evaluation, University of Toronto, Toronto, ON M5T 3M6, Canada
| |
Collapse
|
174
|
Alarcón-Narváez D, Hernández-Torruco J, Hernández-Ocaña B, Chávez-Bosquez O, Marchi J, Méndez-Castillo JJ. Toward a machine learning model for a primary diagnosis of Guillain-Barré syndrome subtypes. Health Informatics J 2021; 27:14604582211021471. [PMID: 34405722 DOI: 10.1177/14604582211021471] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Guillain-Barré Syndrome (GBS) is a neurological disorder affecting people of any age and sex, mainly damaging the peripheral nervous system. GBS is divided into several subtypes, in which only four are the most common, demanding different treatments. Identifying the subtype is an expensive and time-consuming task. Early GBS detection is crucial to save the patient's life and not aggravate the disease. This work aims to provide a primary screening tool for GBS subtypes fast and efficiently without complementary invasive methods, based only on clinical variables prospected in consultation, taken from clinical history, and based on risk factors. We conducted experiments with four classifiers with different approaches, five different filters for feature selection, six wrappers, and One versus All (OvA) classification. For the experiments, we used a data set that includes 129 records of Mexican patients and 26 clinical representative variables. Random Forest filter obtained the best results in each classifier for the diagnosis of the four subtypes, in the same way, this filter with the SVM classifier achieved the best result (0.6840). OvA with SVM classifier reached a balanced accuracy of 0.8884 for the Miller-Fisher (MF) subtype.
Collapse
|
175
|
Jangir SK, Joshi N, Kumar M, Choubey DK, Singh S, Verma M. Functional link convolutional neural network for the classification of diabetes mellitus. INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN BIOMEDICAL ENGINEERING 2021; 37:e3496. [PMID: 33964103 DOI: 10.1002/cnm.3496] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/23/2020] [Revised: 04/27/2021] [Accepted: 05/03/2021] [Indexed: 06/12/2023]
Abstract
Diabetes is a faction of metabolic ailments distinguished by hyperglycemia which is the consequence of a defect, in the action of insulin, insulin secretion, or both and producing various abnormalities in the human body. In recent years, the utilization of intelligent systems has been expanded in disease classification and numerous researches have been proposed. In this research article, a variant of Convolutional Neural Network (CNN) that is, Functional Link Convolutional Neural Network (FLCNN) is proposed for the diabetes classification. The main goal of this article is to find the potential of a computationally less complex deep learning network like FLCNN and applied the proposed technique on a real dataset of diabetes for classification. This article also presents the comparative studies where various other machine learning techniques are implemented and outcomes are compared with the proposed FLCNN network. The performance of each classification techniques have been evaluated based on standard measures and also validated with a non-parametric statistical test such as Friedman. Data for modeling diabetes classification is collected from Bombay Medical Hall, Upper Bazar, Ranchi, India. Accuracy achieve by the proposed classifier is more than 90% which is closer to the other state-of-the-art implemented classifiers.
Collapse
Affiliation(s)
- Sunil Kumar Jangir
- Department of Computer Science & Engineering, Mody University of Science and Technology, Sikar, India
| | | | - Manish Kumar
- Department of Biomedical Engineering, Mody University of Science and Technology, Sikar, India
| | - Dilip Kumar Choubey
- Department of Computer Science & Engineering, Indian Institute of Information Technology Bhagalpur, Bhagalpur, India
| | - Shatakshi Singh
- Department of Computer Science & Engineering, Mody University of Science and Technology, Sikar, India
| | - Madhushi Verma
- Department of Computer Science and Engineering, Bennett University, Noida, India
| |
Collapse
|
176
|
Wu C, Entezari A, Zheng K, Fang J, Zreiqat H, Steven GP, Swain MV, Li Q. A machine learning-based multiscale model to predict bone formation in scaffolds. NATURE COMPUTATIONAL SCIENCE 2021; 1:532-541. [PMID: 38217252 DOI: 10.1038/s43588-021-00115-x] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/01/2021] [Accepted: 07/19/2021] [Indexed: 01/15/2024]
Abstract
Computational modeling methods combined with non-invasive imaging technologies have exhibited great potential and unique opportunities to model new bone formation in scaffold tissue engineering, offering an effective alternate and viable complement to laborious and time-consuming in vivo studies. However, existing numerical approaches are still highly demanding computationally in such multiscale problems. To tackle this challenge, we propose a machine learning (ML)-based approach to predict bone ingrowth outcomes in bulk tissue scaffolds. The proposed in silico procedure is developed by correlating with a dedicated longitudinal (12-month) animal study on scaffold treatment of a major segmental defect in sheep tibia. Comparison of the ML-based time-dependent prediction of bone ingrowth with the conventional multilevel finite element (FE2) model demonstrates satisfactory accuracy and efficiency. The ML-based modeling approach provides an effective means for predicting in vivo bone tissue regeneration in a subject-specific scaffolding system.
Collapse
Affiliation(s)
- Chi Wu
- School of Aerospace, Mechanical and Mechatronic Engineering, The University of Sydney, Sydney, New South Wales, Australia
| | - Ali Entezari
- School of Biomedical Engineering, The University of Sydney, Sydney, New South Wales, Australia
| | - Keke Zheng
- School of Aerospace, Mechanical and Mechatronic Engineering, The University of Sydney, Sydney, New South Wales, Australia
| | - Jianguang Fang
- School of Civil and Environmental Engineering, University of Technology Sydney, Sydney, New South Wales, Australia
| | - Hala Zreiqat
- School of Biomedical Engineering, The University of Sydney, Sydney, New South Wales, Australia
| | - Grant P Steven
- School of Aerospace, Mechanical and Mechatronic Engineering, The University of Sydney, Sydney, New South Wales, Australia
| | - Michael V Swain
- School of Aerospace, Mechanical and Mechatronic Engineering, The University of Sydney, Sydney, New South Wales, Australia
| | - Qing Li
- School of Aerospace, Mechanical and Mechatronic Engineering, The University of Sydney, Sydney, New South Wales, Australia.
| |
Collapse
|
177
|
A Scoping Review of Artificial Intelligence and Machine Learning in Bariatric and Metabolic Surgery: Current Status and Future Perspectives. Obes Surg 2021; 31:4555-4563. [PMID: 34264433 DOI: 10.1007/s11695-021-05548-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2021] [Revised: 06/12/2021] [Accepted: 06/17/2021] [Indexed: 01/01/2023]
Abstract
Artificial intelligence (AI) is a revolution in data analysis with emerging roles in various specialties and with various applications. The objective of this scoping review was to retrieve current literature on the fields of AI that have been applied to metabolic bariatric surgery (MBS) and to investigate potential applications of AI as a decision-making tool of the bariatric surgeon. Initial search yielded 3260 studies published from January 2000 until March 2021. After screening, 49 unique articles were included in the final analysis. Studies were grouped into categories, and the frequency of appearing algorithms, dataset types, and metrics were documented. The heterogeneity of current studies showed that meticulous validation, strict reporting systems, and reliable benchmarking are mandatory for ensuring the clinical validity of future research.
Collapse
|
178
|
Deng Y, Lu L, Aponte L, Angelidi AM, Novak V, Karniadakis GE, Mantzoros CS. Deep transfer learning and data augmentation improve glucose levels prediction in type 2 diabetes patients. NPJ Digit Med 2021; 4:109. [PMID: 34262114 PMCID: PMC8280162 DOI: 10.1038/s41746-021-00480-x] [Citation(s) in RCA: 38] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2021] [Accepted: 06/21/2021] [Indexed: 12/12/2022] Open
Abstract
Accurate prediction of blood glucose variations in type 2 diabetes (T2D) will facilitate better glycemic control and decrease the occurrence of hypoglycemic episodes as well as the morbidity and mortality associated with T2D, hence increasing the quality of life of patients. Owing to the complexity of the blood glucose dynamics, it is difficult to design accurate predictive models in every circumstance, i.e., hypo/normo/hyperglycemic events. We developed deep-learning methods to predict patient-specific blood glucose during various time horizons in the immediate future using patient-specific every 30-min long glucose measurements by the continuous glucose monitoring (CGM) to predict future glucose levels in 5 min to 1 h. In general, the major challenges to address are (1) the dataset of each patient is often too small to train a patient-specific deep-learning model, and (2) the dataset is usually highly imbalanced given that hypo- and hyperglycemic episodes are usually much less common than normoglycemia. We tackle these two challenges using transfer learning and data augmentation, respectively. We systematically examined three neural network architectures, different loss functions, four transfer-learning strategies, and four data augmentation techniques, including mixup and generative models. Taken together, utilizing these methodologies we achieved over 95% prediction accuracy and 90% sensitivity for a time period within the clinically useful 1 h prediction horizon that would allow a patient to react and correct either hypoglycemia and/or hyperglycemia. We have also demonstrated that the same network architecture and transfer-learning methods perform well for the type 1 diabetes OhioT1DM public dataset.
Collapse
Affiliation(s)
- Yixiang Deng
- School of Engineering, Brown University, Providence, RI, 02912, USA
| | - Lu Lu
- Department of Chemical and Biomolecular Engineering, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Laura Aponte
- Department of Medicine, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, 02215, USA
| | - Angeliki M Angelidi
- Department of Medicine, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, 02215, USA
| | - Vera Novak
- Department of Medicine, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, 02215, USA
| | - George Em Karniadakis
- School of Engineering, Brown University, Providence, RI, 02912, USA.
- Division of Applied Mathematics, Brown University, Providence, RI, 02912, USA.
| | - Christos S Mantzoros
- Department of Medicine, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, 02215, USA.
- VA Boston Healthcare System, Harvard Medical School, Boston, MA, 02215, USA.
| |
Collapse
|
179
|
Predicting Type 2 Diabetes Using Logistic Regression and Machine Learning Approaches. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2021; 18:ijerph18147346. [PMID: 34299797 PMCID: PMC8306487 DOI: 10.3390/ijerph18147346] [Citation(s) in RCA: 37] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/27/2021] [Revised: 07/02/2021] [Accepted: 07/05/2021] [Indexed: 12/27/2022]
Abstract
Diabetes mellitus is one of the most common human diseases worldwide and may cause several health-related complications. It is responsible for considerable morbidity, mortality, and economic loss. A timely diagnosis and prediction of this disease could provide patients with an opportunity to take the appropriate preventive and treatment strategies. To improve the understanding of risk factors, we predict type 2 diabetes for Pima Indian women utilizing a logistic regression model and decision tree—a machine learning algorithm. Our analysis finds five main predictors of type 2 diabetes: glucose, pregnancy, body mass index (BMI), diabetes pedigree function, and age. We further explore a classification tree to complement and validate our analysis. The six-fold classification tree indicates glucose, BMI, and age are important factors, while the ten-node tree implies glucose, BMI, pregnancy, diabetes pedigree function, and age as the significant predictors. Our preferred specification yields a prediction accuracy of 78.26% and a cross-validation error rate of 21.74%. We argue that our model can be applied to make a reasonable prediction of type 2 diabetes, and could potentially be used to complement existing preventive measures to curb the incidence of diabetes and reduce associated costs.
Collapse
|
180
|
Swan BP, Mayorga ME, Ivy JS. The SMART Framework: Selection of Machine learning Algorithms with ReplicaTions - a Case Study on the Microvascular Complications of Diabetes. IEEE J Biomed Health Inform 2021; 26:809-817. [PMID: 34232896 DOI: 10.1109/jbhi.2021.3094777] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Over 34 million people in the US have diabetes, a major cause of blindness, renal failure, and amputations. Machine learning (ML) models can predict high-risk patients to help prevent adverse outcomes. Selecting the 'best' prediction model for a given disease, population, and clinical application is challenging due to the hundreds of health-related ML models in the literature and the increasing availability of ML methodologies. To support this decision process, we developed the Selection of Machine-learning Algorithms with ReplicaTions (SMART) Framework that integrates building and selecting ML models with decision theory. We build ML models and estimate performance for multiple plausible future populations with a replicated nested cross-validation technique. We rank ML models by simulating decision-maker priorities, using a range of accuracy measures (e.g., AUC) and robustness metrics from decision theory (e.g., minimax Regret). We present the SMART Framework through a case study on the microvascular complications of diabetes using data from the ACCORD clinical trial. We compare selections made by risk-averse, -neutral, and -seeking decision-makers, finding agreement in 80% of the risk-averse and risk-neutral selections, with the risk-averse selections showing consistency for a given complication. We also found that the models that best predicted outcomes in the validation set were those with low performance variance on the testing set, indicating a risk-averse approach in model selection is ideal when there is a potential for high population feature variability. The SMART Framework is a powerful, interactive tool that incorporates various ML algorithms and stakeholder preferences, generalizable to new data and technological advancements.
Collapse
|
181
|
Reel PS, Reel S, Pearson E, Trucco E, Jefferson E. Using machine learning approaches for multi-omics data analysis: A review. Biotechnol Adv 2021; 49:107739. [PMID: 33794304 DOI: 10.1016/j.biotechadv.2021.107739] [Citation(s) in RCA: 287] [Impact Index Per Article: 95.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2020] [Revised: 03/01/2021] [Accepted: 03/25/2021] [Indexed: 02/06/2023]
Abstract
With the development of modern high-throughput omic measurement platforms, it has become essential for biomedical studies to undertake an integrative (combined) approach to fully utilise these data to gain insights into biological systems. Data from various omics sources such as genetics, proteomics, and metabolomics can be integrated to unravel the intricate working of systems biology using machine learning-based predictive algorithms. Machine learning methods offer novel techniques to integrate and analyse the various omics data enabling the discovery of new biomarkers. These biomarkers have the potential to help in accurate disease prediction, patient stratification and delivery of precision medicine. This review paper explores different integrative machine learning methods which have been used to provide an in-depth understanding of biological systems during normal physiological functioning and in the presence of a disease. It provides insight and recommendations for interdisciplinary professionals who envisage employing machine learning skills in multi-omics studies.
Collapse
Affiliation(s)
- Parminder S Reel
- Division of Population Health and Genomics, School of Medicine, University of Dundee, Dundee, United Kingdom
| | - Smarti Reel
- Division of Population Health and Genomics, School of Medicine, University of Dundee, Dundee, United Kingdom
| | - Ewan Pearson
- Division of Population Health and Genomics, School of Medicine, University of Dundee, Dundee, United Kingdom
| | - Emanuele Trucco
- VAMPIRE project, Computing, School of Science and Engineering, University of Dundee, Dundee, United Kingdom
| | - Emily Jefferson
- Division of Population Health and Genomics, School of Medicine, University of Dundee, Dundee, United Kingdom.
| |
Collapse
|
182
|
Fan Y, Long E, Cai L, Cao Q, Wu X, Tong R. Machine Learning Approaches to Predict Risks of Diabetic Complications and Poor Glycemic Control in Nonadherent Type 2 Diabetes. Front Pharmacol 2021; 12:665951. [PMID: 34239440 PMCID: PMC8258097 DOI: 10.3389/fphar.2021.665951] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2021] [Accepted: 06/01/2021] [Indexed: 12/20/2022] Open
Abstract
Purpose: The objective of this study was to evaluate the efficacy of machine learning algorithms in predicting risks of complications and poor glycemic control in nonadherent type 2 diabetes (T2D). Materials and Methods: This study was a real-world study of the complications and blood glucose prognosis of nonadherent T2D patients. Data of inpatients in Sichuan Provincial People's Hospital from January 2010 to December 2015 were collected. The T2D patients who had neither been monitored for glycosylated hemoglobin A nor had changed their hyperglycemia treatment regimens within the last 12 months were the object of this study. Seven types of machine learning algorithms were used to develop 18 prediction models. The predictive performance was mainly assessed using the area under the curve of the testing set. Results: Of 800 T2D patients, 165 (20.6%) met the inclusion criteria, of which 129 (78.2%) had poor glycemic control (defined as glycosylated hemoglobin A ≥7%). The highest area under the curves of the testing set for diabetic nephropathy, diabetic peripheral neuropathy, diabetic angiopathy, diabetic eye disease, and glycosylated hemoglobin A were 0.902 ± 0.040, 0.859 ± 0.050, 0.889 ± 0.059, 0.832 ± 0.086, and 0.825 ± 0.092, respectively. Conclusion: Both univariate analysis and machine learning methods reached the same conclusion. The duration of T2D and the duration of unadjusted hypoglycemic treatment were the key risk factors of diabetic complications, and the number of hypoglycemic drugs was the key risk factor of glycemic control of nonadherent T2D. This was the first study to use machine learning algorithms to explore the potential adverse outcomes of nonadherent T2D. The performances of the final prediction models we developed were acceptable; our prediction performances outperformed most other previous studies in most evaluation measures. Those models have potential clinical applicability in improving T2D care.
Collapse
Affiliation(s)
- Yuting Fan
- Personalized Drug Therapy Key Laboratory of Sichuan Province, School of Medicine, University of Electronic Science and Technology of China, Chengdu, China
| | - Enwu Long
- Personalized Drug Therapy Key Laboratory of Sichuan Province, School of Medicine, University of Electronic Science and Technology of China, Chengdu, China
- Department of Pharmacy, Sichuan Academy of Medical Sciences and Sichuan Provincial People’s Hospital, Chengdu, China
| | - Lulu Cai
- Personalized Drug Therapy Key Laboratory of Sichuan Province, School of Medicine, University of Electronic Science and Technology of China, Chengdu, China
- Department of Pharmacy, Sichuan Academy of Medical Sciences and Sichuan Provincial People’s Hospital, Chengdu, China
| | - Qiyuan Cao
- West China Medical College of Sichuan University, Chengdu, China
| | - Xingwei Wu
- Personalized Drug Therapy Key Laboratory of Sichuan Province, School of Medicine, University of Electronic Science and Technology of China, Chengdu, China
- Department of Pharmacy, Sichuan Academy of Medical Sciences and Sichuan Provincial People’s Hospital, Chengdu, China
| | - Rongsheng Tong
- Personalized Drug Therapy Key Laboratory of Sichuan Province, School of Medicine, University of Electronic Science and Technology of China, Chengdu, China
- Department of Pharmacy, Sichuan Academy of Medical Sciences and Sichuan Provincial People’s Hospital, Chengdu, China
| |
Collapse
|
183
|
Jones LK, Walters N, Brangan A, Ahmed CD, Gatusky M, Campbell-Salome G, Ladd IG, Sheldon A, Gidding SS, McGowan MP, Rahm AK, Sturm AC. Acceptability, Appropriateness, and Feasibility of Automated Screening Approaches and Family Communication Methods for Identification of Familial Hypercholesterolemia: Stakeholder Engagement Results from the IMPACT-FH Study. J Pers Med 2021; 11:587. [PMID: 34205662 PMCID: PMC8234213 DOI: 10.3390/jpm11060587] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2021] [Revised: 06/08/2021] [Accepted: 06/15/2021] [Indexed: 12/18/2022] Open
Abstract
Guided by the Conceptual Model of Implementation Research, we explored the acceptability, appropriateness, and feasibility of: (1) automated screening approaches utilizing existing health data to identify those who require subsequent diagnostic evaluation for familial hypercholesterolemia (FH) and (2) family communication methods including chatbots and direct contact to communicate information about inherited risk for FH. Focus groups were conducted with 22 individuals with FH (2 groups) and 20 clinicians (3 groups). These were recorded, transcribed, and analyzed using deductive (coded to implementation outcomes) and inductive (themes based on focus group discussions) methods. All stakeholders described these initiatives as: (1) acceptable and appropriate to identify individuals with FH and communicate risk with at-risk relatives; and (2) feasible to implement in current practice. Stakeholders cited current initiatives, outside of FH (e.g., pneumonia protocols, colon cancer and breast cancer screenings), that gave them confidence for successful implementation. Stakeholders described perceived obstacles, such as nonfamiliarity with FH, that could hinder implementation and potential solutions to improve systematic uptake of these initiatives. Automated health data screening, chatbots, and direct contact approaches may be useful for patients and clinicians to improve FH diagnosis and cascade screening.
Collapse
Affiliation(s)
- Laney K. Jones
- Genomic Medicine Institute, Geisinger, Danville, PA 17822, USA; (N.W.); (A.B.); (M.G.); (G.C.-S.); (I.G.L.); (S.S.G.); (A.K.R.); (A.C.S.)
| | - Nicole Walters
- Genomic Medicine Institute, Geisinger, Danville, PA 17822, USA; (N.W.); (A.B.); (M.G.); (G.C.-S.); (I.G.L.); (S.S.G.); (A.K.R.); (A.C.S.)
| | - Andrew Brangan
- Genomic Medicine Institute, Geisinger, Danville, PA 17822, USA; (N.W.); (A.B.); (M.G.); (G.C.-S.); (I.G.L.); (S.S.G.); (A.K.R.); (A.C.S.)
| | | | - Michael Gatusky
- Genomic Medicine Institute, Geisinger, Danville, PA 17822, USA; (N.W.); (A.B.); (M.G.); (G.C.-S.); (I.G.L.); (S.S.G.); (A.K.R.); (A.C.S.)
| | - Gemme Campbell-Salome
- Genomic Medicine Institute, Geisinger, Danville, PA 17822, USA; (N.W.); (A.B.); (M.G.); (G.C.-S.); (I.G.L.); (S.S.G.); (A.K.R.); (A.C.S.)
- Department of Advertising, University of Florida, Gainesville, FL 33620, USA
| | - Ilene G. Ladd
- Genomic Medicine Institute, Geisinger, Danville, PA 17822, USA; (N.W.); (A.B.); (M.G.); (G.C.-S.); (I.G.L.); (S.S.G.); (A.K.R.); (A.C.S.)
| | - Amanda Sheldon
- The FH Foundation, Pasadena, CA 91101, USA; (C.D.A.); (A.S.); (M.P.M.)
| | - Samuel S. Gidding
- Genomic Medicine Institute, Geisinger, Danville, PA 17822, USA; (N.W.); (A.B.); (M.G.); (G.C.-S.); (I.G.L.); (S.S.G.); (A.K.R.); (A.C.S.)
| | - Mary P. McGowan
- The FH Foundation, Pasadena, CA 91101, USA; (C.D.A.); (A.S.); (M.P.M.)
- Dartmouth Hitchcock Medical Center, Lebanon, NH 03756, USA
| | - Alanna K. Rahm
- Genomic Medicine Institute, Geisinger, Danville, PA 17822, USA; (N.W.); (A.B.); (M.G.); (G.C.-S.); (I.G.L.); (S.S.G.); (A.K.R.); (A.C.S.)
| | - Amy C. Sturm
- Genomic Medicine Institute, Geisinger, Danville, PA 17822, USA; (N.W.); (A.B.); (M.G.); (G.C.-S.); (I.G.L.); (S.S.G.); (A.K.R.); (A.C.S.)
| |
Collapse
|
184
|
A patient network-based machine learning model for disease prediction: The case of type 2 diabetes mellitus. APPL INTELL 2021. [DOI: 10.1007/s10489-021-02533-w] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
|
185
|
Bajaj NS, Pardeshi SS, Patange AD, Khade HS, Mate K. A comparative study of modified SIR and logistic predictors using local level database of COVID-19 in India. INFORMATION DISCOVERY AND DELIVERY 2021. [DOI: 10.1108/idd-09-2020-0112] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Purpose
Several national- and state-level studies have been predicting the course of the COVID-19 pandemic using supervised machine learning algorithms. However, the comparison of such models has not been discussed before. This is the first-ever research wherein the two leading models, susceptible-infected-recovered (SIR) and logistic are compared. The purpose of this study is to observe their utility, at both the National and Municipal Corporation level in India.
Design/methodology/approach
The modified SIR and the logistic were deployed for analysis of the COVID-19 patients’ database of India and three Municipal Corporations, namely, Akola, Kalyan-Dombivli and Mira-Bhayander. The data for the study was collected from the official notifications for COVID-19 released by respective government websites.
Findings
This study provides evidence to show the superiority of the modified SIR over the logistic model. The models give accurate predictions for a period up to 14 days. The prediction accuracy of the models is limited due to change in government policies. This can be observed by the drastic increase in the COVID-19 cases after Unlock 1.0 in India. The models have proven that they can effectively predict for both National and Municipal Corporation level database.
Practical implications
The modified SIR model can be used to signify the practicality and effectiveness of the decisions taken by the authorities to contain the spread of coronavirus.
Originality/value
This study modifies the SIR model and also identifies and fulfills the need to find a more accurate prediction algorithm to help curb the pandemic.
Collapse
|
186
|
Wang K, Tian J, Zheng C, Yang H, Ren J, Li C, Han Q, Zhang Y. Improving Risk Identification of Adverse Outcomes in Chronic Heart Failure Using SMOTE+ENN and Machine Learning. Risk Manag Healthc Policy 2021; 14:2453-2463. [PMID: 34149290 PMCID: PMC8206455 DOI: 10.2147/rmhp.s310295] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2021] [Accepted: 05/24/2021] [Indexed: 01/14/2023] Open
Abstract
PURPOSE This study sought to develop models with good identification for adverse outcomes in patients with heart failure (HF) and find strong factors that affect prognosis. PATIENTS AND METHODS A total of 5004 qualifying cases were selected, among which 498 cases had adverse outcomes and 4506 cases were discharged after improvement. The study subjects were hospitalized patients diagnosed with HF from a regional cardiovascular hospital and the cardiology department of a medical university hospital in Shanxi Province of China between January 2014 and June 2019. Synthesizing minority oversampling technology combined with edited nearest neighbors (SMOTE+ENN) was used to pre-process unbalanced data. Traditional logistic regression (LR), k-nearest neighbor (KNN), support vector machine (SVM), random forest (RF), and extreme gradient boosting (XGBoost) were used to build risk identification models, and each model was repeated 100 times. Model discrimination and calibration were estimated using F1-score, the area under the receiver-operating characteristic curve (AUROC), and Brier score. The best performing of the five models was used to identify the risk of adverse outcomes and evaluate the influencing factors. RESULTS The SME-XGBoost was the best performing model with means of F1-score (0.3673, 95% confidence interval [CI]: 0.3633-0.3712), AUC (0.8010, CI: 0.7974-0.8046), and Brier score (0.1769, CI: 0.1748-0.1789). Age, N-terminal pronatriuretic peptide, pulmonary disease, etc. were the most significant factors of adverse outcomes in patients with HF. CONCLUSION The combination of SMOTE+ENN and advanced machine learning methods effectively improved the discrimination efficacy of adverse outcomes in HF patients, accurately stratified patients at risk of adverse outcomes, and found the top factors of adverse outcomes. These models and factors emphasize the importance of health status data in determining adverse outcomes in patients with HF.
Collapse
Affiliation(s)
- Ke Wang
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, People’s Republic of China
- Department of Epidemiology and Biostatistics, Xuzhou Medical University, Xuzhou, People’s Republic of China
- Shanxi Provincial Key Laboratory of Major Diseases Risk Assessment, Shanxi Medical University, Taiyuan, People's Republic of China
| | - Jing Tian
- Department of Cardiology, The First Affiliated Hospital of Shanxi Medical University, Taiyuan, People’s Republic of China
| | - Chu Zheng
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, People’s Republic of China
- Shanxi Provincial Key Laboratory of Major Diseases Risk Assessment, Shanxi Medical University, Taiyuan, People's Republic of China
| | - Hong Yang
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, People’s Republic of China
- Shanxi Provincial Key Laboratory of Major Diseases Risk Assessment, Shanxi Medical University, Taiyuan, People's Republic of China
| | - Jia Ren
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, People’s Republic of China
| | - Chenhao Li
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, People’s Republic of China
- Shanxi Provincial Key Laboratory of Major Diseases Risk Assessment, Shanxi Medical University, Taiyuan, People's Republic of China
| | - Qinghua Han
- Department of Cardiology, The First Affiliated Hospital of Shanxi Medical University, Taiyuan, People’s Republic of China
| | - Yanbo Zhang
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, People’s Republic of China
- Shanxi Provincial Key Laboratory of Major Diseases Risk Assessment, Shanxi Medical University, Taiyuan, People's Republic of China
| |
Collapse
|
187
|
Recenti M, Ricciardi C, Edmunds KJ, Gislason MK, Sigurdsson S, Carraro U, Gargiulo P. Healthy Aging Within an Image: Using Muscle Radiodensitometry and Lifestyle Factors to Predict Diabetes and Hypertension. IEEE J Biomed Health Inform 2021; 25:2103-2112. [PMID: 33306475 DOI: 10.1109/jbhi.2020.3044158] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
The strong age dependency of many deleterious health outcomes likely reflects the cumulative effects from a variety of risk and protective factors that occur over one's life course. This notion has become increasingly explored in the etiology of chronic disease and associated comorbidities in aging. Our recent work has shown the robust classification of individuals at risk for cardiovascular pathophysiology using CT-based soft tissue radiodensity parameters obtained from nonlinear trimodal regression analysis (NTRA). Past and present lifestyle influences the incidence of comorbidities like hypertension (HTN), diabetes (DM) and cardiac diseases. 2,943 elderly subjects from the AGES-Reykjavik study were sorted into a three-level binary-tree structure defined by: 1) lifestyle factors (smoking and self-reported physical activity level), 2) comorbid HTN or DM, and 3) cardiac pathophysiology. NTRA parameters were extracted from mid-thigh CT cross-sections to quantify radiodensitometric changes in three tissue types: lean muscle, fat, and loose-connective tissue. Between-group differences were assessed at each binary-tree level, which were then used in tree-based machine learning (ML) models to classify subjects with DM or HTN. Classification scores for detecting HTN or DM based on lifestyle factors were excellent (AUCROC: 0.978 and 0.990, respectively). Finally, tissue importance analysis underlined the comparatively-high significance of connective tissue parameters in ML classification, while predictive models of DM onset from five-year longitudinal data gave a classification accuracy of 94.9%. Altogether, this work serves as an important milestone toward the construction of predictive tools for assessing the impact of lifestyle factors and healthy aging based on a single image.
Collapse
|
188
|
De Silva K, Lim S, Mousa A, Teede H, Forbes A, Demmer RT, Jönsson D, Enticott J. Nutritional markers of undiagnosed type 2 diabetes in adults: Findings of a machine learning analysis with external validation and benchmarking. PLoS One 2021; 16:e0250832. [PMID: 33951067 PMCID: PMC8099133 DOI: 10.1371/journal.pone.0250832] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2020] [Accepted: 04/14/2021] [Indexed: 02/07/2023] Open
Abstract
OBJECTIVES Using a nationally-representative, cross-sectional cohort, we examined nutritional markers of undiagnosed type 2 diabetes in adults via machine learning. METHODS A total of 16429 men and non-pregnant women ≥ 20 years of age were analysed from five consecutive cycles of the National Health and Nutrition Examination Survey. Cohorts from years 2013-2016 (n = 6673) was used for external validation. Undiagnosed type 2 diabetes was determined by a negative response to the question "Have you ever been told by a doctor that you have diabetes?" and a positive glycaemic response to one or more of the three diagnostic tests (HbA1c > 6.4% or FPG >125 mg/dl or 2-hr post-OGTT glucose > 200mg/dl). Following comprehensive literature search, 114 potential nutritional markers were modelled with 13 behavioural and 12 socio-economic variables. We tested three machine learning algorithms on original and resampled training datasets built using three resampling methods. From this, the derived 12 predictive models were validated on internal- and external validation cohorts. Magnitudes of associations were gauged through odds ratios in logistic models and variable importance in others. Models were benchmarked against the ADA diabetes risk test. RESULTS The prevalence of undiagnosed type 2 diabetes was 5.26%. Four best-performing models (AUROC range: 74.9%-75.7%) classified 39 markers of undiagnosed type 2 diabetes; 28 via one or more of the three best-performing non-linear/ensemble models and 11 uniquely by the logistic model. They comprised 14 nutrient-based, 12 anthropometry-based, 9 socio-behavioural, and 4 diet-associated markers. AUROC of all models were on a par with ADA diabetes risk test on both internal and external validation cohorts (p>0.05). CONCLUSIONS Models performed comparably to the chosen benchmark. Novel behavioural markers such as the number of meals not prepared from home were revealed. This approach may be useful in nutritional epidemiology to unravel new associations with type 2 diabetes.
Collapse
Affiliation(s)
- Kushan De Silva
- Monash Centre for Health Research and Implementation, School of Public Health and Preventive Medicine, Faculty of Medicine, Nursing, and Health Sciences, Monash University, Clayton, Australia
| | - Siew Lim
- Monash Centre for Health Research and Implementation, School of Public Health and Preventive Medicine, Faculty of Medicine, Nursing, and Health Sciences, Monash University, Clayton, Australia
| | - Aya Mousa
- Monash Centre for Health Research and Implementation, School of Public Health and Preventive Medicine, Faculty of Medicine, Nursing, and Health Sciences, Monash University, Clayton, Australia
| | - Helena Teede
- Monash Centre for Health Research and Implementation, School of Public Health and Preventive Medicine, Faculty of Medicine, Nursing, and Health Sciences, Monash University, Clayton, Australia
| | - Andrew Forbes
- Biostatistics Unit, Division of Research Methodology, School of Public Health and Preventive Medicine, Faculty of Medicine, Nursing, and Health Sciences, Monash University, Melbourne, Australia
| | - Ryan T Demmer
- Division of Epidemiology and Community Health, School of Public Health, University of Minnesota, Minneapolis, Minnesota, United States of America.,Mailman School of Public Health, Columbia University, New York, New York, United States of America
| | - Daniel Jönsson
- Department of Periodontology, Faculty of Odontology, Malmö University, Malmö, Sweden.,Swedish Dental Service of Skane, Lund, Sweden
| | - Joanne Enticott
- Monash Centre for Health Research and Implementation, School of Public Health and Preventive Medicine, Faculty of Medicine, Nursing, and Health Sciences, Monash University, Clayton, Australia
| |
Collapse
|
189
|
An intelligent hybrid classification algorithm integrating fuzzy rule-based extraction and harmony search optimization: Medical diagnosis applications. Knowl Based Syst 2021. [DOI: 10.1016/j.knosys.2021.106943] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
|
190
|
Umar Ibrahim A, Ozsoz M, Serte S, Al‐Turjman F, Habeeb Kolapo S. Convolutional neural network for diagnosis of viral pneumonia and COVID-19 alike diseases. EXPERT SYSTEMS 2021; 39:e12705. [PMID: 34177037 PMCID: PMC8209916 DOI: 10.1111/exsy.12705] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/05/2021] [Accepted: 04/03/2021] [Indexed: 05/09/2023]
Abstract
Reverse-Transcription Polymerase Chain Reaction (RT-PCR) method is currently the gold standard method for detection of viral strains in human samples, but this technique is very expensive, take time and often leads to misdiagnosis. The recent outbreak of COVID-19 has led scientists to explore other options such as the use of artificial intelligence driven tools as an alternative or a confirmatory approach for detection of viral pneumonia. In this paper, we utilized a Convolutional Neural Network (CNN) approach to detect viral pneumonia in x-ray images using a pretrained AlexNet model thereby adopting a transfer learning approach. The dataset used for the study was obtained in the form of optical Coherence Tomography and chest X-ray images made available by Kermany et al. (2018, https://doi.org/10.17632/rscbjbr9sj.3) with a total number of 5853 pneumonia (positive) and normal (negative) images. To evaluate the average efficiency of the model, the dataset was split into on 50:50, 60:40, 70:30, 80:20 and 90:10 for training and testing respectively. To evaluate the performance of the model, 10 K Cross-validation was carried out. The performance of the model using overall dataset was compared with the means of cross-validation and the currents state of arts. The classification model has shown high performance in terms of accuracy, sensitivity and specificity. 70:30 split performed better compare to other splits with accuracy of 98.73%, sensitivity of 98.59% and specificity of 99.84%.
Collapse
Affiliation(s)
| | - Mehmet Ozsoz
- Department of Biomedical EngineeringNear East UniversityNicosiaMersin 10Turkey
| | - Sertan Serte
- Department of Electrical EngineeringNear East UniversityNicosiaMersin 10Turkey
| | - Fadi Al‐Turjman
- Department of Artificial Intelligence, Research Center for AI and IoTNear East UniversityNicosiaMersin 10Turkey
| | | |
Collapse
|
191
|
Borges do Nascimento IJ, Marcolino MS, Abdulazeem HM, Weerasekara I, Azzopardi-Muscat N, Gonçalves MA, Novillo-Ortiz D. Impact of Big Data Analytics on People's Health: Overview of Systematic Reviews and Recommendations for Future Studies. J Med Internet Res 2021; 23:e27275. [PMID: 33847586 PMCID: PMC8080139 DOI: 10.2196/27275] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2021] [Revised: 02/19/2021] [Accepted: 03/24/2021] [Indexed: 12/17/2022] Open
Abstract
Background Although the potential of big data analytics for health care is well recognized, evidence is lacking on its effects on public health. Objective The aim of this study was to assess the impact of the use of big data analytics on people’s health based on the health indicators and core priorities in the World Health Organization (WHO) General Programme of Work 2019/2023 and the European Programme of Work (EPW), approved and adopted by its Member States, in addition to SARS-CoV-2–related studies. Furthermore, we sought to identify the most relevant challenges and opportunities of these tools with respect to people’s health. Methods Six databases (MEDLINE, Embase, Cochrane Database of Systematic Reviews via Cochrane Library, Web of Science, Scopus, and Epistemonikos) were searched from the inception date to September 21, 2020. Systematic reviews assessing the effects of big data analytics on health indicators were included. Two authors independently performed screening, selection, data extraction, and quality assessment using the AMSTAR-2 (A Measurement Tool to Assess Systematic Reviews 2) checklist. Results The literature search initially yielded 185 records, 35 of which met the inclusion criteria, involving more than 5,000,000 patients. Most of the included studies used patient data collected from electronic health records, hospital information systems, private patient databases, and imaging datasets, and involved the use of big data analytics for noncommunicable diseases. “Probability of dying from any of cardiovascular, cancer, diabetes or chronic renal disease” and “suicide mortality rate” were the most commonly assessed health indicators and core priorities within the WHO General Programme of Work 2019/2023 and the EPW 2020/2025. Big data analytics have shown moderate to high accuracy for the diagnosis and prediction of complications of diabetes mellitus as well as for the diagnosis and classification of mental disorders; prediction of suicide attempts and behaviors; and the diagnosis, treatment, and prediction of important clinical outcomes of several chronic diseases. Confidence in the results was rated as “critically low” for 25 reviews, as “low” for 7 reviews, and as “moderate” for 3 reviews. The most frequently identified challenges were establishment of a well-designed and structured data source, and a secure, transparent, and standardized database for patient data. Conclusions Although the overall quality of included studies was limited, big data analytics has shown moderate to high accuracy for the diagnosis of certain diseases, improvement in managing chronic diseases, and support for prompt and real-time analyses of large sets of varied input data to diagnose and predict disease outcomes. Trial Registration International Prospective Register of Systematic Reviews (PROSPERO) CRD42020214048; https://www.crd.york.ac.uk/prospero/display_record.php?RecordID=214048
Collapse
Affiliation(s)
- Israel Júnior Borges do Nascimento
- School of Medicine, Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais, Brazil.,Department of Medicine, School of Medicine, Medical College of Wisconsin, Wauwatosa, WI, United States
| | - Milena Soriano Marcolino
- Department of Internal Medicine, University Hospital, Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais, Brazil.,School of Medicine and Telehealth Center, Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais, Brazil
| | | | - Ishanka Weerasekara
- School of Health Sciences, Faculty of Health and Medicine, The University of Newcastle, Callaghan, Australia.,Department of Physiotherapy, Faculty of Allied Health Sciences, University of Peradeniya, Peradeniya, Sri Lanka
| | - Natasha Azzopardi-Muscat
- Division of Country Health Policies and Systems, World Health Organization, Regional Office for Europe, Copenhagen, Denmark
| | - Marcos André Gonçalves
- Department of Computer Science, Institute of Exact Sciences, Federal University of Minas Gerais, Belo Horizonte, Minas Gerais, Brazil
| | - David Novillo-Ortiz
- Division of Country Health Policies and Systems, World Health Organization, Regional Office for Europe, Copenhagen, Denmark
| |
Collapse
|
192
|
A Non-Invasive Photonics-Based Device for Monitoring of Diabetic Foot Ulcers: Architectural/Sensorial Components & Technical Specifications. INVENTIONS 2021. [DOI: 10.3390/inventions6020027] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
This paper proposes a new photonic-based non-invasive device for managing of Diabetic Foot Ulcers (DFUs) for people suffering from diabetes. DFUs are one of the main severe complications of diabetes, which may lead to major disabilities, such as foot amputation, or even to the death. The proposed device exploits hyperspectral (HSI) and thermal imaging to measure the status of an ulcer, in contrast to the current practice where invasive biopsies are often applied. In particular, these two photonic-based imaging techniques can estimate the biomarkers of oxyhaemoglobin (HbO2) and deoxyhaemoglobin (Hb), through which the Peripheral Oxygen Saturation (SpO2) and Tissue Oxygen Saturation (StO2) is computed. These factors are very important for the early prediction and prognosis of a DFU. The device is implemented at two editions: the in-home edition suitable for patients and the PRO (professional) edition for the medical staff. The latter is equipped with active photonic tools, such as tuneable diodes, to permit detailed diagnosis and treatment of an ulcer and its progress. The device is enriched with embedding signal processing tools for noise removal and enhancing pixel accuracy using super resolution schemes. In addition, a machine learning framework is adopted, through deep learning structures, to assist the doctors and the patients in understanding the effect of the biomarkers on DFU. The device is to be validated at large scales at three European hospitals (Charité–University Hospital in Berlin, Germany; Attikon in Athens, Greece, and Victor Babes in Timisoara, Romania) for its efficiency and performance.
Collapse
|
193
|
Development of ensemble learning classification with density peak decomposition-based evolutionary multi-objective optimization. INT J MACH LEARN CYB 2021. [DOI: 10.1007/s13042-020-01271-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
|
194
|
Alcalá-Rmz V, Galván-Tejada CE, García-Hernández A, Valladares-Salgado A, Cruz M, Galván-Tejada JI, Celaya-Padilla JM, Luna-Garcia H, Gamboa-Rosales H. Identification of People with Diabetes Treatment through Lipids Profile Using Machine Learning Algorithms. Healthcare (Basel) 2021; 9:422. [PMID: 33917300 PMCID: PMC8067355 DOI: 10.3390/healthcare9040422] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2020] [Revised: 03/02/2021] [Accepted: 03/08/2021] [Indexed: 11/16/2022] Open
Abstract
Diabetes incidence has been a problem, because according with the World Health Organization and the International Diabetes Federation, the number of people with this disease is increasing very fast all over the world. Diabetic treatment is important to prevent the development of several complications, also lipid profile monitoring is important. For that reason the aim of this work is the implementation of machine learning algorithms that are able to classify cases, that corresponds to patients diagnosed with diabetes that have diabetes treatment, and controls that refers to subjects who do not have diabetes treatment but some of them have diabetes, bases on lipids profile levels. Logistic regression, K-nearest neighbor, decision trees and random forest were implemented, all of them were evaluated with accuracy, sensitivity, specificity and AUC-ROC curve metrics. Artificial neural network obtain an acurracy of 0.685 and an AUC value of 0.750, logistic regression achieve an accuracy of 0.729 and an AUC value of 0.795, K-nearest neighbor gets an accuracy of 0.669 and an AUC value of 0.709, on the other hand, decision tree reached an accuracy pg 0.691 and a AUC value of 0.683, finally random forest achieve an accuracy of 0.704 and an AUC curve of 0.776. The performance of all models was statistically significant, but the best performance model for this problem corresponds to logistic regression.
Collapse
Affiliation(s)
- Vanessa Alcalá-Rmz
- Unidad Académica de Ingeniería Eléctrica, Universidad Autónoma de Zacatecas, Jardín Juárez 147, Centro, Zacatecas 98000, Mexico; (V.A.-R.); (A.G.-H.); (J.I.G.-T.); (J.M.C.-P.); (H.L.-G.); (H.G.-R.)
| | - Carlos E. Galván-Tejada
- Unidad Académica de Ingeniería Eléctrica, Universidad Autónoma de Zacatecas, Jardín Juárez 147, Centro, Zacatecas 98000, Mexico; (V.A.-R.); (A.G.-H.); (J.I.G.-T.); (J.M.C.-P.); (H.L.-G.); (H.G.-R.)
| | - Alejandra García-Hernández
- Unidad Académica de Ingeniería Eléctrica, Universidad Autónoma de Zacatecas, Jardín Juárez 147, Centro, Zacatecas 98000, Mexico; (V.A.-R.); (A.G.-H.); (J.I.G.-T.); (J.M.C.-P.); (H.L.-G.); (H.G.-R.)
| | - Adan Valladares-Salgado
- Unidad de Investigación Médica en Bioquímica, Hospital de Especialidades, Centro Médico Nacional Siglo XXI, Instituto Mexicano del Seguro Social, Av. Cuauhtémoc 330, Col. Doctores, Del. Cuauhtémoc, Mexico City 06720, Mexico; (A.V.-S.); (M.C.)
| | - Miguel Cruz
- Unidad de Investigación Médica en Bioquímica, Hospital de Especialidades, Centro Médico Nacional Siglo XXI, Instituto Mexicano del Seguro Social, Av. Cuauhtémoc 330, Col. Doctores, Del. Cuauhtémoc, Mexico City 06720, Mexico; (A.V.-S.); (M.C.)
| | - Jorge I. Galván-Tejada
- Unidad Académica de Ingeniería Eléctrica, Universidad Autónoma de Zacatecas, Jardín Juárez 147, Centro, Zacatecas 98000, Mexico; (V.A.-R.); (A.G.-H.); (J.I.G.-T.); (J.M.C.-P.); (H.L.-G.); (H.G.-R.)
| | - Jose M. Celaya-Padilla
- Unidad Académica de Ingeniería Eléctrica, Universidad Autónoma de Zacatecas, Jardín Juárez 147, Centro, Zacatecas 98000, Mexico; (V.A.-R.); (A.G.-H.); (J.I.G.-T.); (J.M.C.-P.); (H.L.-G.); (H.G.-R.)
| | - Huizilopoztli Luna-Garcia
- Unidad Académica de Ingeniería Eléctrica, Universidad Autónoma de Zacatecas, Jardín Juárez 147, Centro, Zacatecas 98000, Mexico; (V.A.-R.); (A.G.-H.); (J.I.G.-T.); (J.M.C.-P.); (H.L.-G.); (H.G.-R.)
| | - Hamurabi Gamboa-Rosales
- Unidad Académica de Ingeniería Eléctrica, Universidad Autónoma de Zacatecas, Jardín Juárez 147, Centro, Zacatecas 98000, Mexico; (V.A.-R.); (A.G.-H.); (J.I.G.-T.); (J.M.C.-P.); (H.L.-G.); (H.G.-R.)
| |
Collapse
|
195
|
Kim M, Choi HJ. Digital Therapeutics for Obesity and Eating-Related Problems. Endocrinol Metab (Seoul) 2021; 36:220-228. [PMID: 33761233 PMCID: PMC8090472 DOI: 10.3803/enm.2021.107] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/26/2021] [Accepted: 02/17/2021] [Indexed: 12/21/2022] Open
Abstract
In recent years, digital technologies have rapidly advanced and are being applied to remedy medical problems. These technologies allow us to monitor and manage our physical and mental health in our daily lives. Since lifestyle modification is the cornerstone of the management of obesity and eating behavior problems, digital therapeutics (DTx) represent a powerful and easily accessible treatment modality. This review discusses the critical issues to consider for enhancing the efficacy of DTx in future development initiatives. To competently adapt and expand public access to DTx, it is important for various stakeholders, including health professionals, patients, and guardians, to collaborate with other industry partners and policy-makers in the ecosystem.
Collapse
Affiliation(s)
- Meelim Kim
- Department of Biomedical Sciences, Seoul National University College of Medicine, Seoul,
Korea
- BK21 Plus Biomedical Science Project Team, Seoul National University College of Medicine, Seoul,
Korea
| | - Hyung Jin Choi
- Department of Biomedical Sciences, Seoul National University College of Medicine, Seoul,
Korea
- BK21 Plus Biomedical Science Project Team, Seoul National University College of Medicine, Seoul,
Korea
| |
Collapse
|
196
|
Aizawa T. Dynamic variation in receptive vocabulary acquisitions: Further evidence from the Young Lives study. COGNITIVE DEVELOPMENT 2021. [DOI: 10.1016/j.cogdev.2021.101031] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
197
|
Artificial Intelligence in Differential Diagnostics of Meningitis: A Nationwide Study. Diagnostics (Basel) 2021; 11:diagnostics11040602. [PMID: 33800653 PMCID: PMC8065596 DOI: 10.3390/diagnostics11040602] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2021] [Revised: 03/23/2021] [Accepted: 03/26/2021] [Indexed: 11/17/2022] Open
Abstract
Differential diagnosis between bacterial and viral meningitis is crucial. In our study, to differentiate bacterial vs. viral meningitis, three machine learning (ML) algorithms (multiple logistic regression (MLR), random forest (RF), and naïve-Bayes (NB)) were applied for the two age groups (0-14 and >14 years) of patients with meningitis by both conventional (culture) and molecular (PCR) methods. Cerebrospinal fluid (CSF) neutrophils, CSF lymphocytes, neutrophil-to-lymphocyte ratio (NLR), blood albumin, blood C-reactive protein (CRP), glucose, blood soluble urokinase-type plasminogen activator receptor (suPAR), and CSF lymphocytes-to-blood CRP ratio (LCR) were used as predictors for the ML algorithms. The performance of the ML algorithms was evaluated through a cross-validation procedure, and optimal predictions of the type of meningitis were above 95% for viral and 78% for bacterial meningitis. Overall, MLR and RF yielded the best performance when using CSF neutrophils, CSF lymphocytes, NLR, albumin, glucose, gender, and CRP. Also, our results reconfirm the high diagnostic accuracy of NLR in the differential diagnosis between bacterial and viral meningitis.
Collapse
|
198
|
Deberneh HM, Kim I. Prediction of Type 2 Diabetes Based on Machine Learning Algorithm. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2021; 18:3317. [PMID: 33806973 PMCID: PMC8004981 DOI: 10.3390/ijerph18063317] [Citation(s) in RCA: 43] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/02/2021] [Revised: 03/15/2021] [Accepted: 03/17/2021] [Indexed: 12/17/2022]
Abstract
Prediction of type 2 diabetes (T2D) occurrence allows a person at risk to take actions that can prevent onset or delay the progression of the disease. In this study, we developed a machine learning (ML) model to predict T2D occurrence in the following year (Y + 1) using variables in the current year (Y). The dataset for this study was collected at a private medical institute as electronic health records from 2013 to 2018. To construct the prediction model, key features were first selected using ANOVA tests, chi-squared tests, and recursive feature elimination methods. The resultant features were fasting plasma glucose (FPG), HbA1c, triglycerides, BMI, gamma-GTP, age, uric acid, sex, smoking, drinking, physical activity, and family history. We then employed logistic regression, random forest, support vector machine, XGBoost, and ensemble machine learning algorithms based on these variables to predict the outcome as normal (non-diabetic), prediabetes, or diabetes. Based on the experimental results, the performance of the prediction model proved to be reasonably good at forecasting the occurrence of T2D in the Korean population. The model can provide clinicians and patients with valuable predictive information on the likelihood of developing T2D. The cross-validation (CV) results showed that the ensemble models had a superior performance to that of the single models. The CV performance of the prediction models was improved by incorporating more medical history from the dataset.
Collapse
Affiliation(s)
| | - Intaek Kim
- Department of Information and Communications Engineering, Myongji University, 116 Myongji-ro, Yongin, Gyeonggi 17058, Korea;
| |
Collapse
|
199
|
Kolling ML, Furstenau LB, Sott MK, Rabaioli B, Ulmi PH, Bragazzi NL, Tedesco LPC. Data Mining in Healthcare: Applying Strategic Intelligence Techniques to Depict 25 Years of Research Development. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2021; 18:ijerph18063099. [PMID: 33802880 PMCID: PMC8002654 DOI: 10.3390/ijerph18063099] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/29/2021] [Revised: 03/12/2021] [Accepted: 03/15/2021] [Indexed: 12/15/2022]
Abstract
In order to identify the strategic topics and the thematic evolution structure of data mining applied to healthcare, in this paper, a bibliometric performance and network analysis (BPNA) was conducted. For this purpose, 6138 articles were sourced from the Web of Science covering the period from 1995 to July 2020 and the SciMAT software was used. Our results present a strategic diagram composed of 19 themes, of which the 8 motor themes ('NEURAL-NETWORKS', 'CANCER', 'ELETRONIC-HEALTH-RECORDS', 'DIABETES-MELLITUS', 'ALZHEIMER'S-DISEASE', 'BREAST-CANCER', 'DEPRESSION', and 'RANDOM-FOREST') are depicted in a thematic network. An in-depth analysis was carried out in order to find hidden patterns and to provide a general perspective of the field. The thematic network structure is arranged thusly that its subjects are organized into two different areas, (i) practices and techniques related to data mining in healthcare, and (ii) health concepts and disease supported by data mining, embodying, respectively, the hotspots related to the data mining and medical scopes, hence demonstrating the field's evolution over time. Such results make it possible to form the basis for future research and facilitate decision-making by researchers and practitioners, institutions, and governments interested in data mining in healthcare.
Collapse
Affiliation(s)
- Maikel Luis Kolling
- Graduate Program of Industrial Systems and Processes, University of Santa Cruz do Sul, Santa Cruz do Sul 96816-501, Brazil; (M.L.K.); (M.K.S.)
| | - Leonardo B. Furstenau
- Department of Industrial Engineering, Federal University of Rio Grande do Sul, Porto Alegre 90035-190, Brazil;
| | - Michele Kremer Sott
- Graduate Program of Industrial Systems and Processes, University of Santa Cruz do Sul, Santa Cruz do Sul 96816-501, Brazil; (M.L.K.); (M.K.S.)
| | - Bruna Rabaioli
- Department of Medicine, University of Santa Cruz do Sul, Santa Cruz do Sul 96816-501, Brazil;
| | - Pedro Henrique Ulmi
- Department of Computer Science, University of Santa Cruz do Sul, Santa Cruz do Sul 96816-501, Brazil;
| | - Nicola Luigi Bragazzi
- Laboratory for Industrial and Applied Mathematics (LIAM), Department of Mathematics and Statistics, York University, Toronto, ON M3J 1P3, Canada
- Correspondence: (N.L.B.); (L.P.C.T.)
| | - Leonel Pablo Carvalho Tedesco
- Graduate Program of Industrial Systems and Processes, University of Santa Cruz do Sul, Santa Cruz do Sul 96816-501, Brazil; (M.L.K.); (M.K.S.)
- Department of Computer Science, University of Santa Cruz do Sul, Santa Cruz do Sul 96816-501, Brazil;
- Correspondence: (N.L.B.); (L.P.C.T.)
| |
Collapse
|
200
|
Liu Y, Zhang D, Tang Y, Zhang Y, Chang Y, Zheng J. Machine Learning-Enabled Design and Prediction of Protein Resistance on Self-Assembled Monolayers and Beyond. ACS APPLIED MATERIALS & INTERFACES 2021; 13:11306-11319. [PMID: 33635641 DOI: 10.1021/acsami.1c00642] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
The rational design of highly antifouling materials is crucial for a wide range of fundamental research and practical applications. The immense variety and complexity of the intrinsic physicochemical properties of materials (i.e., chemical structure, hydrophobicity, charge distribution, and molecular weight) and their surface coating properties (i.e., packing density, film thickness and roughness, and chain conformation) make it challenging to rationally design antifouling materials and reveal their fundamental structure-property relationships. In this work, we developed a data-driven machine learning model, a combination of factor analysis of functional group (FAFG), Pearson analysis, random forest (RF) and artificial neural network (ANN) algorithms, and Bayesian statistics, to computationally extract structure/chemical/surface features in correlation with the antifouling activity of self-assembled monolayers (SAMs) from a self-construction data set. The resultant model demonstrates the robustness of QCV2 = 0.90 and RMSECV = 0.21 and the predictive ability of Qext2 = 0.84 and RMSEext = 0.28, determines key descriptors and functional groups important for the antifouling activity, and enables to design original antifouling SAMs using the predicted antifouling functional groups. Three computationally designed molecules were further coated onto the surfaces in different forms of SAMs and polymer brushes. The resultant coatings with negative fouling indexes exhibited strong surface resistance to protein adsorption from undiluted blood serum and plasma, validating the model predictions. The data-driven machine learning model demonstrates their design and predictive capacity for next-generation antifouling materials and surfaces, which hopefully help to accelerate the discovery and understanding of functional materials.
Collapse
Affiliation(s)
- Yonglan Liu
- Department of Chemical, Biomolecular, and Corrosion Engineering, The University of Akron, Ohio 44325, United States
| | - Dong Zhang
- Department of Chemical, Biomolecular, and Corrosion Engineering, The University of Akron, Ohio 44325, United States
| | - Yijing Tang
- Department of Chemical, Biomolecular, and Corrosion Engineering, The University of Akron, Ohio 44325, United States
| | - Yanxian Zhang
- Department of Chemical, Biomolecular, and Corrosion Engineering, The University of Akron, Ohio 44325, United States
| | - Yung Chang
- Department of Chemical Engineering, R&D Center for Membrane Technology, Chung Yuan Christian University, Taoyuan 32023, Taiwan
| | - Jie Zheng
- Department of Chemical, Biomolecular, and Corrosion Engineering, The University of Akron, Ohio 44325, United States
| |
Collapse
|