
For: Chen P, Pan C. Diabetes classification model based on boosting algorithms. BMC Bioinformatics 2018;19:109. [PMID: 29587624 PMCID: PMC5872396 DOI: 10.1186/s12859-018-2090-9] [Citation(s) in RCA: 31] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2017] [Accepted: 02/28/2018] [Indexed: 12/17/2022] Open



Cited by Other Article(s)

Zhou Y, Zhang Z, Li Q, Mao G, Zhou Z. Construction and validation of machine learning algorithm for predicting depression among home-quarantined individuals during the large-scale COVID-19 outbreak: based on Adaboost model. BMC Psychol 2024;12:230. [PMID: 38659077 PMCID: PMC11044386 DOI: 10.1186/s40359-024-01696-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2024] [Accepted: 03/29/2024] [Indexed: 04/26/2024] Open

Abstract

OBJECTIVES

COVID-19 epidemics often lead to elevated levels of depression. To accurately identify and predict depression levels in home-quarantined individuals during a COVID-19 epidemic, this study constructed a depression prediction model based on multiple machine learning algorithms and validated its effectiveness.

METHODS

A cross-sectional online survey was used to examine the depression status of individuals quarantined at home during the epidemic. Characteristics included sociodemographic variables, COVID-19 and its prevention and control measures, the impact on life, work, health and economy after the city was sealed off, and PHQ-9 scale scores. The home-quarantined subjects were randomly divided into a training set and a validation set at a ratio of 7:3. The performance of different machine learning models was compared by 10-fold cross-validation, and the best-performing algorithm among 15 models was selected to construct and validate the depression prediction model for home-quarantined subjects. The validity of the models was compared based on accuracy, precision, the receiver operating characteristic (ROC) curve, and the area under the ROC curve (AUC), and the model best suited to the data framework of this study was identified.

RESULTS

The prevalence of depression among home-quarantined individuals during the epidemic was 31.66% (202/638), and the constructed Adaboost depression prediction model had an ACC of 0.7917, an accuracy of 0.7180, and an AUC of 0.7803, which was better than the other 15 models on the combination of various performance measures. In the validation sets, the AUC was greater than 0.83.

CONCLUSIONS

The Adaboost machine learning algorithm developed in this study can be used to construct a depression prediction model for home-quarantined individuals that has better machine learning performance, as well as high effectiveness, robustness, and generalizability.


Talari P, N B, Kaur G, Alshahrani H, Al Reshan MS, Sulaiman A, Shaikh A. Hybrid feature selection and classification technique for early prediction and severity of diabetes type 2. PLoS One 2024;19:e0292100. [PMID: 38236900 PMCID: PMC10796060 DOI: 10.1371/journal.pone.0292100] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2023] [Accepted: 09/12/2023] [Indexed: 01/22/2024] Open

Kuang A, Kouznetsova VL, Kesari S, Tsigelny IF. Diagnostics of Thyroid Cancer Using Machine Learning and Metabolomics. Metabolites 2023;14:11. [PMID: 38248814 PMCID: PMC10818630 DOI: 10.3390/metabo14010011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2023] [Revised: 12/14/2023] [Accepted: 12/18/2023] [Indexed: 01/23/2024] Open

Abnoosian K, Farnoosh R, Behzadi MH. Prediction of diabetes disease using an ensemble of machine learning multi-classifier models. BMC Bioinformatics 2023;24:337. [PMID: 37697283 PMCID: PMC10496262 DOI: 10.1186/s12859-023-05465-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2023] [Accepted: 09/04/2023] [Indexed: 09/13/2023] Open

Abstract

BACKGROUND AND OBJECTIVE

Diabetes is a life-threatening chronic disease with a growing global prevalence, necessitating early diagnosis and treatment to prevent severe complications. Machine learning has emerged as a promising approach for diabetes diagnosis, but challenges such as limited labeled data, frequent missing values, and dataset imbalance hinder the development of accurate prediction models. Therefore, a novel framework is required to address these challenges and improve performance.

METHODS

In this study, we propose an innovative pipeline-based multi-classification framework to predict diabetes in three classes: diabetic, non-diabetic, and prediabetes, using the imbalanced Iraqi Patient Dataset of Diabetes. Our framework incorporates various pre-processing techniques, including duplicate sample removal, attribute conversion, missing value imputation, data normalization and standardization, feature selection, and k-fold cross-validation. Furthermore, we implement multiple machine learning models, such as k-NN, SVM, DT, RF, AdaBoost, and GNB, and introduce a weighted ensemble approach based on the Area Under the Receiver Operating Characteristic Curve (AUC) to address dataset imbalance. Performance optimization is achieved through grid search and Bayesian optimization for hyper-parameter tuning.

RESULTS

Our proposed model outperforms other machine learning models, including k-NN, SVM, DT, RF, AdaBoost, and GNB, in predicting diabetes. The model achieves high average accuracy, precision, recall, F1-score, and AUC values of 0.9887, 0.9861, 0.9792, 0.9851, and 0.999, respectively.

CONCLUSION

Our pipeline-based multi-classification framework demonstrates promising results in accurately predicting diabetes using an imbalanced dataset of Iraqi diabetic patients. The proposed framework addresses the challenges associated with limited labeled data, missing values, and dataset imbalance, leading to improved prediction performance. This study highlights the potential of machine learning techniques in diabetes diagnosis and management, and the proposed framework can serve as a valuable tool for accurate prediction and improved patient care. Further research can build upon our work to refine and optimize the framework and explore its applicability in diverse datasets and populations.
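The AUC-weighted ensemble idea mentioned in the abstract above can be sketched roughly as follows. This is only an illustration under stated assumptions: the abstract does not give the weighting formula, so weights here are simply each model's AUC divided by the sum of AUCs, the task is reduced to two classes (the study uses three), and the model names and scores are made-up toy values, not data from the paper.

```python
from typing import Dict, List, Sequence


def auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Binary AUC: probability that a positive outranks a negative (ties count 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def auc_weights(labels: Sequence[int],
                model_scores: Dict[str, Sequence[float]]) -> Dict[str, float]:
    """Assumed weighting rule: each model's AUC normalized by the sum of all AUCs."""
    aucs = {name: auc(labels, s) for name, s in model_scores.items()}
    total = sum(aucs.values())
    return {name: a / total for name, a in aucs.items()}


def weighted_ensemble(model_scores: Dict[str, Sequence[float]],
                      weights: Dict[str, float]) -> List[float]:
    """Weighted average of the per-model scores for every sample."""
    n = len(next(iter(model_scores.values())))
    return [sum(weights[m] * model_scores[m][i] for m in model_scores)
            for i in range(n)]


# Toy validation data (hypothetical, for illustration only).
labels = [1, 0, 1, 0]
scores = {
    "knn": [0.9, 0.2, 0.6, 0.4],
    "svm": [0.6, 0.7, 0.8, 0.3],
}
weights = auc_weights(labels, scores)
combined = weighted_ensemble(scores, weights)
```

In practice the weights would be estimated on held-out folds rather than on the samples being scored, so that a model that overfits does not receive an inflated weight.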


Zhou H, Zhang PY, Zou X, Liu J, Wang WJ. Chronic disease diagnosis model based on convolutional neural network and ensemble learning method. Digit Health 2023;9:20552076231198643. [PMID: 37667686 PMCID: PMC10475259 DOI: 10.1177/20552076231198643] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2022] [Accepted: 08/15/2023] [Indexed: 09/06/2023] Open

Zhang X, Gavaldà R, Baixeries J. Interpretable prediction of mortality in liver transplant recipients based on machine learning. Comput Biol Med 2022;151:106188. [PMID: 36306583 DOI: 10.1016/j.compbiomed.2022.106188] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2022] [Revised: 09/24/2022] [Accepted: 10/08/2022] [Indexed: 12/27/2022]

Abstract

BACKGROUND

Accurate prediction of mortality after liver transplantation is an important but challenging task. It relates to optimizing organ allocation and estimating the risk of possible dysfunction. Existing risk scoring models, such as the Balance of Risk (BAR) score and the Survival Outcomes Following Liver Transplantation (SOFT) score, do not predict post-transplant mortality with sufficient accuracy. In this study, we evaluate the performance of machine learning models and establish an explainable machine learning model for predicting mortality in liver transplant recipients.

METHOD

The optimal feature set for mortality prediction was selected by a wrapper method based on binary particle swarm optimization (BPSO). With the selected feature set, seven machine learning models were applied to predict mortality over different time windows. The best-performing model, identified through comprehensive comparison and evaluation, was used to predict mortality. An interpretable approach based on machine learning and SHapley Additive exPlanations (SHAP) was used to explicitly explain the model's decisions and make new discoveries.

RESULTS

With regard to predictive power, our results demonstrated that the feature set selected by BPSO outperformed both the feature set in the existing risk score model (BAR score, SOFT score) and the feature set processed by principal component analysis (PCA). The best-performing model, extreme gradient boosting (XGBoost), was found to improve the Area Under a Curve (AUC) values for mortality prediction by 6.7%, 11.6%, and 17.4% at 3 months, 3 years, and 10 years, respectively, compared to the SOFT score. The main predictors of mortality and their impact were discussed for different age groups and different follow-up periods.

CONCLUSIONS

Our analysis demonstrates that XGBoost can be an ideal method to assess the mortality risk in liver transplantation. In combination with the SHAP approach, the proposed framework provides a more intuitive and comprehensive interpretation of the predictive model, thereby allowing the clinician to better understand the decision-making process of the model and the impact of factors associated with mortality risk in liver transplantation.


Lee J, Wanyan T, Chen Q, Keenan TDL, Glicksberg BS, Chew EY, Lu Z, Wang F, Peng Y. Predicting Age-related Macular Degeneration Progression with Longitudinal Fundus Images Using Deep Learning. MACHINE LEARNING IN MEDICAL IMAGING. MLMI (WORKSHOP) 2022;13583:11-20. [PMID: 36656604 PMCID: PMC9842432 DOI: 10.1007/978-3-031-21014-3_2] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]

Wanyan T, Lin M, Klang E, Menon KM, Gulamali FF, Azad A, Zhang Y, Ding Y, Wang Z, Wang F, Glicksberg B, Peng Y. Supervised Pretraining through Contrastive Categorical Positive Samplings to Improve COVID-19 Mortality Prediction. ACM-BCB ... ... : THE ... ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY AND BIOMEDICINE. ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY AND BIOMEDICINE 2022;2022:9. [PMID: 35960866 PMCID: PMC9365529 DOI: 10.1145/3535508.3545541] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]

Vision for Improving Pregnancy Health: Innovation and the Future of Pregnancy Research. Reprod Sci 2022;29:2908-2920. [PMID: 35534766 PMCID: PMC9537127 DOI: 10.1007/s43032-022-00951-w] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2022] [Accepted: 04/15/2022] [Indexed: 10/25/2022]

A Comprehensive Review of Computation-Based Metal-Binding Prediction Approaches at the Residue Level. BIOMED RESEARCH INTERNATIONAL 2022;2022:8965712. [PMID: 35402609 PMCID: PMC8989566 DOI: 10.1155/2022/8965712] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/02/2022] [Accepted: 03/04/2022] [Indexed: 12/29/2022]

Reported Adverse Effects and Attitudes among Arab Populations Following COVID-19 Vaccination: A Large-Scale Multinational Study Implementing Machine Learning Tools in Predicting Post-Vaccination Adverse Effects Based on Predisposing Factors. Vaccines (Basel) 2022;10:vaccines10030366. [PMID: 35334998 PMCID: PMC8955470 DOI: 10.3390/vaccines10030366] [Citation(s) in RCA: 32] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2022] [Revised: 02/23/2022] [Accepted: 02/24/2022] [Indexed: 02/04/2023] Open

Abstract

Background: The unprecedented global spread of coronavirus disease 2019 (COVID-19) has imposed huge challenges on healthcare facilities and impacted every aspect of life. This has led to the development of several vaccines against COVID-19 within one year. This study aimed to assess the attitudes and the side effects among Arab communities after receiving a COVID-19 vaccine and to use machine learning (ML) tools to predict post-vaccination side effects based on predisposing factors.

Methods: An online-based multinational survey was carried out via social media platforms from 14 June to 31 August 2021, targeting individuals who received at least one dose of a COVID-19 vaccine from 22 Arab countries. Descriptive statistics, correlation, and chi-square tests were used to analyze the data. Moreover, extensive ML tools were utilized to predict 30 post-vaccination adverse effects and their severity based on 15 predisposing factors. The importance of distinct predisposing factors in predicting particular side effects was determined using global feature importance employing gradient boost as AutoML.

Results: A total of 10,064 participants from 19 Arab countries were included in this study. Around 56% were female and 59% were aged from 20 to 39 years old. A high rate of vaccine hesitancy (51%) was reported among participants. Almost 88% of the participants were vaccinated with one of three COVID-19 vaccines, including Pfizer-BioNTech (52.8%), AstraZeneca (20.7%), and Sinopharm (14.2%). About 72% of participants experienced post-vaccination side effects. This study reports statistically significant associations (p < 0.01) between various predisposing factors and post-vaccination side effects. In terms of predicting post-vaccination side effects, gradient boost, random forest, and XGBoost outperformed other ML methods. The most important predisposing factors for predicting certain side effects (i.e., tiredness, fever, headache, injection site pain and swelling, myalgia, and sleepiness and laziness) were revealed to be the number of doses, gender, type of vaccine, age, and hesitancy to receive a COVID-19 vaccine.

Conclusions: The reported side effects following COVID-19 vaccination among Arab populations are usually non-life-threatening: flu-like symptoms and injection site pain. Certain predisposing factors have greater weight and importance as input data in predicting post-vaccination side effects. Based on the most significant input data, ML can also be used to predict these side effects; people with certain predicted side effects may require additional medical attention, or possibly hospitalization.


Zemmal N, Benzebouchi NE, Azizi N, Schwab D, Belhaouari SB. Unbalanced Learning for Diabetes Diagnosis Based on Enhanced Resampling and Stacking Classifier. INTERNATIONAL JOURNAL OF INTELLIGENT INFORMATION TECHNOLOGIES 2022. [DOI: 10.4018/ijiit.309583] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Miller A, Panneerselvam J, Liu L. A review of regression and classification techniques for analysis of common and rare variants and gene-environmental factors. Neurocomputing 2021. [DOI: 10.1016/j.neucom.2021.08.150] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]

Beinecke J, Heider D. Gaussian noise up-sampling is better suited than SMOTE and ADASYN for clinical decision making. BioData Min 2021;14:49. [PMID: 34844620 PMCID: PMC8628399 DOI: 10.1186/s13040-021-00283-6] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2021] [Accepted: 11/10/2021] [Indexed: 02/08/2023] Open

Gandouz M, Holzmann H, Heider D. Machine learning with asymmetric abstention for biomedical decision-making. BMC Med Inform Decis Mak 2021;21:294. [PMID: 34702225 PMCID: PMC8549182 DOI: 10.1186/s12911-021-01655-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2021] [Accepted: 10/13/2021] [Indexed: 02/08/2023] Open

Hatmal MM, Alshaer W, Mahmoud IS, Al-Hatamleh MAI, Al-Ameer HJ, Abuyaman O, Zihlif M, Mohamud R, Darras M, Al Shhab M, Abu-Raideh R, Ismail H, Al-Hamadi A, Abdelhay A. Investigating the association of CD36 gene polymorphisms (rs1761667 and rs1527483) with T2DM and dyslipidemia: Statistical analysis, machine learning based prediction, and meta-analysis. PLoS One 2021;16:e0257857. [PMID: 34648514 PMCID: PMC8516279 DOI: 10.1371/journal.pone.0257857] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2021] [Accepted: 09/11/2021] [Indexed: 12/15/2022] Open

Ylenia C, Lauri Chiara D, Giovanni I, Lucia R, Donatella V, Tiziana S, Vincenzo G, Ciro V, Stefania S. A Clinical Decision Support System based on fuzzy rules and classification algorithms for monitoring the physiological parameters of type-2 diabetic patients. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2021;18:2653-2674. [PMID: 33892565 DOI: 10.3934/mbe.2021135] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]

Zhou Y, Ma XL, Zhang T, Wang J, Zhang T, Tian R. Use of radiomics based on ¹⁸F-FDG PET/CT and machine learning methods to aid clinical decision-making in the classification of solitary pulmonary lesions: an innovative approach. Eur J Nucl Med Mol Imaging 2021;48:2904-2913. [PMID: 33547553 DOI: 10.1007/s00259-021-05220-7] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2020] [Accepted: 01/25/2021] [Indexed: 02/06/2023]

Abstract

PURPOSE

This study was designed and performed to assess the ability of ¹⁸F-fluorodeoxyglucose (FDG) positron emission tomography (PET) and computed tomography (CT) radiomics features combined with machine learning methods to differentiate between primary and metastatic lung lesions and to classify histological subtypes. Moreover, we identified the optimal machine learning method.

METHODS

A total of 769 patients pathologically diagnosed with primary or metastatic lung cancers were enrolled. We used the LIFEx package to extract radiological features from semiautomatically segmented PET and CT images within the same volume of interest. Patients were randomly distributed in training and validation sets. Through the evaluation of five feature selection methods and nine classification methods, discriminant models were established. The robustness of the procedure was controlled by tenfold cross-validation. The model's performance was evaluated using the area under the receiver operating characteristic curve (AUC).

RESULTS

Based on the radiomics features extracted from PET and CT images, forty-five discriminative models were established. Combined with appropriate feature selection methods, most classifiers showed excellent discriminative ability with AUCs greater than 0.75. In the differentiation between primary and metastatic lung lesions, the feature selection method gradient boosting decision tree (GBDT) combined with the classifier GBDT achieved the highest classification AUC of 0.983 in the PET dataset. In contrast, the feature selection method eXtreme gradient boosting combined with the classifier random forest (RF) achieved the highest AUC of 0.828 in the CT dataset. In the discrimination between squamous cell carcinoma and adenocarcinoma, the combination of GBDT feature selection method with GBDT classification had the highest AUC of 0.897 in the PET dataset. In contrast, the combination of the GBDT feature selection method with the RF classification had the highest AUC of 0.839 in the CT dataset. Most of the decision tree (DT)-based models were overfitted, suggesting that the classification method was not appropriate for practical application.

CONCLUSION

¹⁸F-FDG PET/CT radiomics features combined with machine learning methods can distinguish between primary and metastatic lung lesions and identify histological subtypes in lung cancer. GBDT and RF were considered optimal classification methods for the PET and CT datasets, respectively, and GBDT was considered the optimal feature selection method in our analysis.


Comparison of machine learning tools for the prediction of AMD based on genetic, age, and diabetes-related variables in the Chinese population. Regen Ther 2021;15:180-186. [PMID: 33426217 PMCID: PMC7770346 DOI: 10.1016/j.reth.2020.09.001] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2020] [Revised: 09/01/2020] [Accepted: 09/09/2020] [Indexed: 11/23/2022] Open

Liberda EN, Zuk AM, Martin ID, Tsuji LJS. Fisher's Linear Discriminant Function Analysis and its Potential Utility as a Tool for the Assessment of Health-and-Wellness Programs in Indigenous Communities. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2020;17:ijerph17217894. [PMID: 33126498 PMCID: PMC7663610 DOI: 10.3390/ijerph17217894] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/28/2020] [Revised: 10/22/2020] [Accepted: 10/25/2020] [Indexed: 11/16/2022]

Liu J, Wang L, Zhang L, Zhang Z, Zhang S. Predictive analytics for blood glucose concentration: an empirical study using the tree-based ensemble approach. LIBRARY HI TECH 2020. [DOI: 10.1108/lht-08-2019-0171] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

Abstract

PURPOSE

The primary objective of this study was to recognize critical indicators in predicting blood glucose (BG) through data-driven methods and to compare the prediction performance of four tree-based ensemble models, i.e. bagging with tree regressors (bagging-decision tree [Bagging-DT]), AdaBoost with tree regressors (Adaboost-DT), random forest (RF) and gradient boosting decision tree (GBDT).

DESIGN/METHODOLOGY/APPROACH

This study proposed a majority voting feature selection method by combining lasso regression with the Akaike information criterion (AIC) (LR-AIC), lasso regression with the Bayesian information criterion (BIC) (LR-BIC) and RF to select indicators with excellent predictive performance from an initial 38 indicators in 5,642 samples. The selected features were deployed to build the tree-based ensemble models. The 10-fold cross-validation (CV) method was used to evaluate the performance of each ensemble model.

FINDINGS

The results of feature selection indicated that age, corpuscular hemoglobin concentration (CHC), red blood cell volume distribution width (RBCVDW), red blood cell volume and leucocyte count are the five most important clinical/physical indicators in BG prediction. Furthermore, this study also found that the GBDT ensemble model combined with the proposed majority voting feature selection method is better than the other three models with respect to prediction performance and stability.

PRACTICAL IMPLICATIONS

This study proposed a novel BG prediction framework for better predictive analytics in health care.

SOCIAL IMPLICATIONS

This study incorporated medical background and machine learning technology to reduce diabetes morbidity and formulate precise medical schemes.

ORIGINALITY/VALUE

The majority voting feature selection method combined with the GBDT ensemble model provides an effective decision-making tool for predicting BG and detecting diabetes risk in advance.
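The majority voting feature selection described in this abstract can be illustrated with a short sketch. The assumptions are loud ones: the abstract does not state the voting threshold, so "selected by at least two of the three selectors" is a guess, the function name `majority_vote_features` is hypothetical, and the feature names and per-selector picks are placeholders rather than results from the study.

```python
from collections import Counter
from typing import Iterable, List, Set


def majority_vote_features(selections: Iterable[Set[str]],
                           min_votes: int = 2) -> List[str]:
    """Keep every feature chosen by at least `min_votes` of the selectors."""
    votes: Counter = Counter()
    for chosen in selections:
        votes.update(set(chosen))  # each selector casts one vote per feature
    return sorted(f for f, v in votes.items() if v >= min_votes)


# Placeholder outputs of three selectors (stand-ins for LR-AIC, LR-BIC and RF).
lr_aic = {"f1", "f2", "f3"}
lr_bic = {"f1", "f3", "f4"}
rf_top = {"f3", "f4", "f5"}

selected = majority_vote_features([lr_aic, lr_bic, rf_top])
unanimous = majority_vote_features([lr_aic, lr_bic, rf_top], min_votes=3)
```

Raising `min_votes` to the number of selectors turns the rule into an intersection, which yields a smaller and more conservative feature set.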

Anand PK, Shin DR, Memon ML. Adaptive Boosting Based Personalized Glucose Monitoring System (PGMS) for Non-Invasive Blood Glucose Prediction with Improved Accuracy. Diagnostics (Basel) 2020;10:diagnostics10050285. [PMID: 32392841 PMCID: PMC7278000 DOI: 10.3390/diagnostics10050285] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2020] [Revised: 04/28/2020] [Accepted: 05/04/2020] [Indexed: 12/13/2022] Open

Zou Y, Wang D, Liu L. Research on Human Movement Target Recognition Algorithm in Complex Traffic Environment. INT J PATTERN RECOGN 2020. [DOI: 10.1142/s0218001420500123] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]

Abstract

With the increase in the total population of the society and the continuous increase in the number of trips, the traffic pressures faced by people are increasing. With the development and advancement of computer technology, the emergence of intelligent transportation provides a better way to effectively alleviate traffic pressure and reduce the incidence of traffic accidents. In recent years, intelligent traffic monitoring systems, as one of the important branches in the field of intelligent transportation, have also received more and more attention. Among them, video-based moving target recognition technology involves theoretical knowledge in various fields such as artificial intelligence, image processing, pattern recognition and computer vision. It is an important means to realize “safe city” and “smart city” and a key technology for intelligent monitoring. Therefore, the research on human motion target recognition algorithms in complex traffic environments has important theoretical and practical value. In the field of intelligent traffic monitoring, the moving target detection and recognition effect of video images will have a certain influence on the classification and behavior understanding of subsequent moving targets. In this paper, the commonly used moving target detection methods are studied first, and the convergence problem of the traditional Adaboost algorithm is addressed. An Adaboost algorithm based on adaptive weight update is proposed, and then the support vector machine (SVM) algorithm is used to identify the detected moving target. Finally, through simulation experiments on the acquired video images, the results show that the proposed human motion target recognition algorithm based on adaptive weight update Adaboost and SVM has good feasibility and rationality.

Construction of cascaded depth model based on boosting feature selection and classification. EVOLUTIONARY INTELLIGENCE 2020. [DOI: 10.1007/s12065-020-00413-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]

Choubey DK, Kumar M, Shukla V, Tripathi S, Dhandhania VK. Comparative Analysis of Classification Methods with PCA and LDA for Diabetes. Curr Diabetes Rev 2020;16:833-850. [PMID: 31971112 DOI: 10.2174/1573399816666200123124008] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/27/2019] [Revised: 09/30/2019] [Accepted: 11/11/2019] [Indexed: 12/20/2022]

Abstract

BACKGROUND

Modern society is prone to many life-threatening diseases, which can be easily controlled as well as cured if diagnosed at an early stage. The development and implementation of disease diagnostic systems have gained huge popularity over the years. In the current scenario, factors such as environment, sedentary lifestyle, and genetics (heredity) are the major causes of life-threatening diseases such as diabetes. Moreover, diabetes has become the leading chronic disease of modern times. One of the prime needs of this generation is therefore a state-of-the-art expert system that can predict diabetes at a very early stage with minimal complexity and in an expedited manner. The primary objective of this work is to develop an indigenous and efficient diagnostic technique for the detection of diabetes.

METHOD & DISCUSSION

The proposed methodology comprises two phases. In the first phase, the Pima Indian Diabetes Dataset (PIDD) was collected from the UCI machine learning repository and the Localized Diabetes Dataset (LDD) was gathered from Bombay Medical Hall, Upper Bazar Ranchi, Jharkhand, India. In the second phase, the datasets were processed through two different approaches. The first approach entails classification through Adaboost, Classification via Regression (CVR), Radial Basis Function Network (RBFN), and K-Nearest Neighbor (KNN) on the Pima Indian Diabetes Dataset and the Localized Diabetes Dataset. In the second approach, Principal Component Analysis (PCA) and Linear Discriminant Analysis (LDA) were applied as feature reduction methods, followed by the same set of classification methods used in the first approach. Among all of the implemented classification methods, PCA_CVR achieves the maximum performance for both of the above-mentioned datasets.

CONCLUSION

In this article, a comparative analysis of the outcomes obtained with and without PCA and LDA, for the same set of classification methods, has been carried out with respect to performance assessment. It has been concluded that both PCA and LDA are useful for removing insignificant features, decreasing expense and computation time while improving the ROC and accuracy. The methodology may similarly be applied to other diseases.


Performance enhanced Boosted SVM for Imbalanced datasets. Appl Soft Comput 2019. [DOI: 10.1016/j.asoc.2019.105601] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Indexed: 11/23/2022]

Comparison of Machine Learning Techniques for Prediction of Hospitalization in Heart Failure Patients. J Clin Med 2019;8:jcm8091298. [PMID: 31450546 PMCID: PMC6780582 DOI: 10.3390/jcm8091298] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Received: 07/11/2019] [Revised: 08/20/2019] [Accepted: 08/22/2019] [Indexed: 12/23/2022] Open

Spänig S, Emberger-Klein A, Sowa JP, Canbay A, Menrad K, Heider D. The virtual doctor: An interactive clinical-decision-support system based on deep learning for non-invasive prediction of diabetes. Artif Intell Med 2019;100:101706. [PMID: 31607340 DOI: 10.1016/j.artmed.2019.101706] [Citation(s) in RCA: 48] [Impact Index Per Article: 9.6] [Received: 02/19/2019] [Revised: 06/27/2019] [Accepted: 08/18/2019] [Indexed: 11/16/2022]

A hybrid Forecast Cost Benefit Classification of diabetes mellitus prevalence based on epidemiological study on Real-life patient's data. Sci Rep 2019;9:10103. [PMID: 31300715 PMCID: PMC6626127 DOI: 10.1038/s41598-019-46631-9] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Received: 09/12/2018] [Accepted: 07/01/2019] [Indexed: 12/12/2022] Open

Yoo TK, Ryu IH, Lee G, Kim Y, Kim JK, Lee IS, Kim JS, Rim TH. Adopting machine learning to automatically identify candidate patients for corneal refractive surgery. NPJ Digit Med 2019;2:59. [PMID: 31304405 PMCID: PMC6586803 DOI: 10.1038/s41746-019-0135-8] [Citation(s) in RCA: 36] [Impact Index Per Article: 7.2] [Received: 03/07/2019] [Accepted: 05/30/2019] [Indexed: 12/26/2022] Open

Radiomics-based machine learning methods for isocitrate dehydrogenase genotype prediction of diffuse gliomas. J Cancer Res Clin Oncol 2019;145:543-550. [PMID: 30719536 PMCID: PMC6394679 DOI: 10.1007/s00432-018-2787-1] [Citation(s) in RCA: 64] [Impact Index Per Article: 12.8] [Received: 08/16/2018] [Accepted: 11/01/2018] [Indexed: 12/20/2022]

Farran B, AlWotayan R, Alkandari H, Al-Abdulrazzaq D, Channanath A, Thanaraj TA. Use of Non-invasive Parameters and Machine-Learning Algorithms for Predicting Future Risk of Type 2 Diabetes: A Retrospective Cohort Study of Health Data From Kuwait. Front Endocrinol (Lausanne) 2019;10:624. [PMID: 31572303 PMCID: PMC6749017 DOI: 10.3389/fendo.2019.00624] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Received: 11/14/2018] [Accepted: 08/28/2019] [Indexed: 12/12/2022] Open

Abstract

Objective: In recent decades, the Arab population has experienced an increase in the prevalence of type 2 diabetes (T2DM), particularly within the Gulf Cooperation Council. In this context, early intervention programmes rely on an ability to identify individuals at risk of T2DM. We aimed to build prognostic models for the risk of T2DM in the Arab population using machine-learning algorithms vs. conventional logistic regression (LR) and simple non-invasive clinical markers over three different time scales (3, 5, and 7 years from the baseline). Design: This retrospective cohort study used three models based on LR, k-nearest neighbours (k-NN), and support vector machines (SVM) with five-fold cross-validation. The models included the following baseline non-invasive parameters: age, sex, body mass index (BMI), pre-existing hypertension, family history of hypertension, and T2DM. Setting: This study was based on data from the Kuwait Health Network (KHN), which integrated primary health and hospital laboratory data into a single system. Participants: The study included 1,837 native Kuwaiti Arab individuals (equal proportion of men and women) with a mean age of 59.5 ± 11.4 years. Among them, 647 developed T2DM within 7 years of the baseline non-invasive measurements. Analytical methods: The discriminatory power of each model for classifying people at risk of T2DM within 3, 5, or 7 years and the area under the receiver operating characteristic curve (AUC) were determined. Outcome measures: Onset of T2DM at 3, 5, and 7 years. Results: The k-NN machine-learning technique, which yielded AUC values of 0.83, 0.82, and 0.79 for 3-, 5-, and 7-year prediction horizons, respectively, outperformed the most commonly used LR method and other previously reported methods. Comparable results were achieved using the SVM and LR models with corresponding AUC values of (SVM: 0.73, LR: 0.74), (SVM: 0.68, LR: 0.72), and (SVM: 0.71, LR: 0.70) for 3-, 5-, and 7-year prediction horizons, respectively. For all models, the discriminatory power decreased as the prediction horizon increased from 3 to 7 years. Conclusions: Machine-learning techniques represent a useful addition to the commonly reported LR technique. Our prognostic models for the future risk of T2DM could be used to plan and implement early prevention programmes for at-risk groups in the Arab population.
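
The evaluation design named in the abstract (k-NN scored with five-fold cross-validation and AUC) can be sketched as follows. This is an illustrative sketch under stated assumptions, not the study's code: the two features are synthetic stand-ins rather than KHN data, out-of-fold scores are pooled before a single AUC is computed, and the names `auc`, `knn_score`, and `cross_val_auc` are hypothetical. AUC is computed as the fraction of positive/negative pairs in which the positive case receives the higher score, with ties counted as one half.

```python
import random

def auc(scores, labels):
    """Probability that a random positive outranks a random negative."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def knn_score(train_x, train_y, query, k=5):
    """Risk score: share of positives among the k nearest training points."""
    nearest = sorted(
        range(len(train_x)),
        key=lambda i: sum((a - b) ** 2 for a, b in zip(train_x[i], query)),
    )[:k]
    return sum(train_y[i] for i in nearest) / k

def cross_val_auc(xs, ys, folds=5, k=5, seed=0):
    """Score every sample with a model that never saw it, then pool for AUC."""
    idx = list(range(len(xs)))
    random.Random(seed).shuffle(idx)
    scores = [0.0] * len(xs)
    for f in range(folds):
        held_out = idx[f::folds]          # every folds-th shuffled index
        held = set(held_out)
        train = [i for i in idx if i not in held]
        tx = [xs[i] for i in train]
        ty = [ys[i] for i in train]
        for i in held_out:
            scores[i] = knn_score(tx, ty, xs[i], k)
    return auc(scores, ys)

# Synthetic two-feature data (an assumption): overlapping Gaussian classes.
rng = random.Random(42)
xs = ([[rng.gauss(0, 1), rng.gauss(0, 1)] for _ in range(50)]
      + [[rng.gauss(2, 1), rng.gauss(2, 1)] for _ in range(50)])
ys = [0] * 50 + [1] * 50

cv_auc = cross_val_auc(xs, ys)
print(round(cv_auc, 3))
```

Pooling the out-of-fold scores avoids an undefined per-fold AUC when a small fold happens to contain only one class; averaging per-fold AUCs is an equally common choice when folds are stratified.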
