Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Mahajan P, Uddin S, Hajati F, Moni MA. Ensemble Learning for Disease Prediction: A Review. Healthcare (Basel) 2023;11:1808. [PMID: 37372925 DOI: 10.3390/healthcare11121808] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 05/06/2023] [Revised: 06/19/2023] [Accepted: 06/19/2023] [Indexed: 06/29/2023] Open

Cited by Other Article(s)

Kuo PH, Li YH, Yau HT. Development of feline infectious peritonitis diagnosis system by using CatBoost algorithm. Comput Biol Chem 2024;113:108227. [PMID: 39342699 DOI: 10.1016/j.compbiolchem.2024.108227] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/05/2024] [Revised: 08/29/2024] [Accepted: 09/25/2024] [Indexed: 10/01/2024]

Asadi S, Tartibian B, Moni MA, Eslami R. Prediction of white blood cell count during exercise: a comparison between standalone and hybrid intelligent algorithms. Sci Rep 2024;14:20683. [PMID: 39237538 PMCID: PMC11377723 DOI: 10.1038/s41598-024-71576-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/12/2023] [Accepted: 08/29/2024] [Indexed: 09/07/2024] Open

Abstract

Decades of research in exercise immunology have demonstrated the profound impact of exercise on the immune response, influencing an individual's disease susceptibility. Accurate prediction of white blood cell (WBC) count during exercise can help to design effective training programs that maintain optimal immune system function and prevent its suppression. In this regard, this study aimed to develop an easy-to-use and efficient modelling tool for predicting WBC count during exercise. To achieve this goal, the predictive power of a range of machine-learning algorithms was assessed, including six standalone models (M5 prime (M5P), random forest (RF), alternating model trees (AMT), reduced error pruning tree (REPT), locally weighted learning (LWL), and support vector regression (SVR)) and six hybrid models trained with a bagging (BA) algorithm (BA-M5P, BA-RF, BA-AMT, BA-REPT, BA-LWL, and BA-SVR). A comprehensive database was constructed from 200 eligible people. The models employed post-exercise training WBC counts as the output parameter and seven WBC-influencing factors (intensity and duration of exercise, pre-exercise training WBC counts, age, body fat percentage, maximal aerobic capacity, and muscle mass) as input parameters. Comparing the models' predictions with the observed WBC counts using standard statistics indicated that the BA-M5P model had the greatest potential to produce a robust prediction of the number of lymphocytes, neutrophils, monocytes, and WBCs compared to other models. Moreover, pre-exercise training WBC counts, intensity and duration of exercise, and body fat percentage were the most important features in predicting WBC counts. These findings hold significant implications for the advancement of exercise immunology and the promotion of public health.
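
The bagging (BA) hybrids named in this abstract wrap a base learner in bootstrap aggregation: each copy of the learner is fit on a resample drawn with replacement, and the predictions are averaged. The following is only a minimal sketch of that idea, not the authors' implementation. The base learner here is a one-variable least-squares line rather than M5P or the other models they used, and the names `fit_line` and `bagged_predict` and the toy data are hypothetical.

```python
import random
from statistics import mean

def fit_line(xs, ys):
    """Ordinary least squares for y = a + b*x; returns (a, b)."""
    mx, my = mean(xs), mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    if sxx == 0:  # degenerate resample (all x equal): fall back to a constant
        return my, 0.0
    b = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sxx
    return my - b * mx, b

def bagged_predict(xs, ys, x_new, n_models=50, seed=0):
    """Average the predictions of base models fit on bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    n = len(xs)
    preds = []
    for _ in range(n_models):
        idx = [rng.randrange(n) for _ in range(n)]  # sample with replacement
        a, b = fit_line([xs[i] for i in idx], [ys[i] for i in idx])
        preds.append(a + b * x_new)
    return mean(preds)

# Toy data (hypothetical): points lying exactly on y = 2x + 1.
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [2 * x + 1 for x in xs]
print(bagged_predict(xs, ys, 5.0))
```

On noisy data the averaging step is what reduces the variance of an unstable base learner; on this noiseless toy set almost every resample recovers the same line.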

Hong R, Li Q, Ma J, Lu C, Zhong Z. Computed tomography-based radiomics machine learning models for differentiating enchondroma and atypical cartilaginous tumor in long bones. Fortschr Röntgenstr 2024. [PMID: 39074797 DOI: 10.1055/a-2344-5398] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 07/31/2024]

Abstract

To explore the value of CT-based radiomics machine learning models for differentiating enchondroma from atypical cartilaginous tumor (ACT) in long bones and methods to improve model performance. 59 enchondromas and 53 ACTs in long bones confirmed by pathology were collected retrospectively. The features were extracted from preoperative CT images of these patients, and least absolute shrinkage and selection operator (LASSO) regression was used for feature selection and dimensionality reduction. The selected features were used to construct classification models by thirteen machine learning algorithms. The data set was randomly divided into a training set and a test set at a proportion of 7:3 by ten-fold cross-validation to evaluate the performance of these models. A total of 1199 features were extracted, 9 features were selected, and 13 radiomics machine learning models were constructed. The area under the curve (AUC) of 11 models was more than 0.8, and that of 3 models was more than 0.9. The Extremely Randomized Trees model achieved the best performance (AUC = 0.9375 ± 0.0884), followed by the Adaptive Boosting model (AUC = 0.9188 ± 0.1010) and the Linear Discriminant Analysis model (AUC = 0.9062 ± 0.1459). CT-based radiomics machine learning models had great ability to distinguish enchondroma and ACT in long bones. By using filters to deeply mine high-order features in the original image and selecting appropriate machine learning algorithms, the performance of the model can be improved.
· CT-based radiomics machine learning models can distinguish enchondroma and ACT in long bones.
· Using filters and selecting advanced machine learning algorithms can improve model performance.
· Clinical features have limited utility in distinguishing enchondroma and ACT in long bones.

Zhang M, Zheng Y, Maidaiti X, Liang B, Wei Y, Sun F. Integrating Machine Learning into Statistical Methods in Disease Risk Prediction Modeling: A Systematic Review. Health Data Science 2024;4:0165. [PMID: 39050273 PMCID: PMC11266123 DOI: 10.34133/hds.0165] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/03/2024] [Accepted: 06/20/2024] [Indexed: 07/27/2024]

Abstract

Background: Disease prediction models often use statistical methods or machine learning, both with their own corresponding application scenarios, raising the risk of errors when used alone. Integrating machine learning into statistical methods may yield robust prediction models. This systematic review aims to comprehensively assess the current development of global disease prediction integration models. Methods: PubMed, EMbase, Web of Science, CNKI, VIP, WanFang, and SinoMed databases were searched to collect studies on prediction models integrating machine learning into statistical methods from database inception to May 1, 2023. Information including basic characteristics of studies, integrating approaches, application scenarios, modeling details, and model performance was extracted. Results: A total of 20 eligible studies in English and 1 in Chinese were included. Five studies concentrated on diagnostic models, while 16 studies concentrated on predicting disease occurrence or prognosis. Integrating strategies of classification models included majority voting, weighted voting, stacking, and model selection (when statistical methods and machine learning disagreed). Regression models adopted strategies including simple statistics, weighted statistics, and stacking. The AUROC of integration models surpassed 0.75, and integration models performed better than statistical methods and machine learning alone in most studies. Stacking was used for situations with >100 predictors and needed a relatively larger amount of training data. Conclusion: Research on integrating machine learning into statistical methods in prediction models remains limited, but some studies have shown that integration models have great potential to outperform single models. This study provides insights for the selection of integration methods for different scenarios. Future research could emphasize the improvement and validation of integrating strategies.
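
Two of the integration strategies listed in this abstract, majority voting and weighted voting, are simple enough to sketch in a few lines. The code below is an illustration under assumptions, not a procedure taken from any of the reviewed studies: the function names, the example predictions, and the weights are all hypothetical.

```python
from collections import Counter

def majority_vote(labels):
    """Return the class label predicted by the most models."""
    return Counter(labels).most_common(1)[0][0]

def weighted_vote(probs, weights):
    """Weighted average of per-model probabilities for the positive class."""
    total = sum(weights)
    return sum(p * w for p, w in zip(probs, weights)) / total

# Hypothetical outputs from three models (e.g. one statistical, two ML).
print(majority_vote([1, 0, 1]))
print(weighted_vote([0.9, 0.4, 0.6], [2, 1, 1]))
```

In practice the weights are often derived from each model's validation performance, which is one reason weighted voting needs a held-out set that plain majority voting does not.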

Li Y, Cao Y, Wang M, Wang L, Wu Y, Fang Y, Zhao Y, Fan Y, Liu X, Liang H, Yang M, Yuan R, Zhou F, Zhang Z, Kang H. Development and validation of machine learning models to predict MDRO colonization or infection on ICU admission by using electronic health record data. Antimicrob Resist Infect Control 2024;13:74. [PMID: 38971777 PMCID: PMC11227715 DOI: 10.1186/s13756-024-01428-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/01/2024] [Accepted: 06/24/2024] [Indexed: 07/08/2024] Open

Abstract

BACKGROUND

Multidrug-resistant organisms (MDRO) pose a significant threat to public health. Intensive Care Units (ICU), characterized by the extensive use of antimicrobial agents and a high prevalence of bacterial resistance, are hotspots for MDRO proliferation. Timely identification of patients at high risk for MDRO can aid in curbing transmission, enhancing patient outcomes, and maintaining the cleanliness of the ICU environment. This study focused on developing a machine learning (ML) model to identify patients at risk of MDRO during the initial phase of their ICU stay.

METHODS

Utilizing patient data from the First Medical Center of the People's Liberation Army General Hospital (PLAGH-ICU) and the Medical Information Mart for Intensive Care (MIMIC-IV), the study analyzed variables within 24 h of ICU admission. Machine learning algorithms were applied to these datasets, emphasizing the early detection of MDRO colonization or infection. Model efficacy was evaluated by the area under the receiver operating characteristics curve (AUROC), alongside internal and external validation sets.
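
The AUROC used here for model evaluation can be read as the probability that a randomly chosen positive case receives a higher score than a randomly chosen negative one. The sketch below computes it directly from that pairwise (Mann-Whitney) definition. It is an illustrative, quadratic-time version with a hypothetical function name and made-up scores, not the evaluation code used in the study.

```python
def auroc(labels, scores):
    """AUROC as the fraction of positive/negative pairs ranked correctly,
    counting tied scores as half a win (Mann-Whitney formulation)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

# Made-up example: 1 = MDRO positive, 0 = MDRO negative.
labels = [0, 0, 1, 1]
scores = [0.1, 0.4, 0.35, 0.8]
print(auroc(labels, scores))
```

Because only the ordering of scores matters, AUROC is unaffected by prevalence, which makes it convenient for comparing cohorts with different MDRO rates.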

RESULTS

The study evaluated 3,536 patients in PLAGH-ICU and 34,923 in MIMIC-IV, revealing MDRO prevalence of 11.96% and 8.81%, respectively. Significant differences in ICU and hospital stays, along with mortality rates, were observed between MDRO positive and negative patients. In the temporal validation, the PLAGH-ICU model achieved an AUROC of 0.786 [0.748, 0.825], while the MIMIC-IV model reached 0.744 [0.723, 0.766]. External validation demonstrated reduced model performance across different datasets. Key predictors included biochemical markers and the duration of pre-ICU hospital stay.

CONCLUSIONS

The ML models developed in this study demonstrated their capability in early identification of MDRO risks in ICU patients. Continuous refinement and validation in varied clinical contexts remain essential for future applications.

Affiliation(s)

Yun Li: Medical School of Chinese PLA, Beijing, 100853, China; Department of Critical Care Medicine, The First Medical Centre, Chinese PLA General Hospital, No. 28, Fuxing Road, Haidian District, Beijing, 100853, China
Yuan Cao: Medical School of Chinese PLA, Beijing, 100853, China; Department of Critical Care Medicine, The First Medical Centre, Chinese PLA General Hospital, No. 28, Fuxing Road, Haidian District, Beijing, 100853, China
Min Wang: Medical School of Chinese PLA, Beijing, 100853, China; Department of Critical Care Medicine, The First Medical Centre, Chinese PLA General Hospital, No. 28, Fuxing Road, Haidian District, Beijing, 100853, China
Lu Wang: Medical School of Chinese PLA, Beijing, 100853, China; Department of Critical Care Medicine, The First Medical Centre, Chinese PLA General Hospital, No. 28, Fuxing Road, Haidian District, Beijing, 100853, China
Yiqi Wu: Medical School of Chinese PLA, Beijing, 100853, China; Department of Critical Care Medicine, The First Medical Centre, Chinese PLA General Hospital, No. 28, Fuxing Road, Haidian District, Beijing, 100853, China
Yuan Fang: Medical School of Chinese PLA, Beijing, 100853, China; Department of Critical Care Medicine, The First Medical Centre, Chinese PLA General Hospital, No. 28, Fuxing Road, Haidian District, Beijing, 100853, China
Yan Zhao: Department of Critical Care Medicine, The First Medical Centre, Chinese PLA General Hospital, No. 28, Fuxing Road, Haidian District, Beijing, 100853, China
Yong Fan: Center for Artificial Intelligence in Medicine, Chinese PLA General Hospital, No. 28, Fuxing Road, Haidian District, Beijing, 100853, China
Xiaoli Liu: Center for Artificial Intelligence in Medicine, Chinese PLA General Hospital, No. 28, Fuxing Road, Haidian District, Beijing, 100853, China
Hong Liang: Center for Artificial Intelligence in Medicine, Chinese PLA General Hospital, No. 28, Fuxing Road, Haidian District, Beijing, 100853, China
Mengmeng Yang: Department of Critical Care Medicine, The First Medical Centre, Chinese PLA General Hospital, No. 28, Fuxing Road, Haidian District, Beijing, 100853, China
Rui Yuan: Medical School of Chinese PLA, Beijing, 100853, China; Department of Critical Care Medicine, The First Medical Centre, Chinese PLA General Hospital, No. 28, Fuxing Road, Haidian District, Beijing, 100853, China
Feihu Zhou: Department of Critical Care Medicine, The First Medical Centre, Chinese PLA General Hospital, No. 28, Fuxing Road, Haidian District, Beijing, 100853, China
Zhengbo Zhang: Center for Artificial Intelligence in Medicine, Chinese PLA General Hospital, No. 28, Fuxing Road, Haidian District, Beijing, 100853, China
Hongjun Kang: Department of Critical Care Medicine, The First Medical Centre, Chinese PLA General Hospital, No. 28, Fuxing Road, Haidian District, Beijing, 100853, China

Giacobbe DR, Marelli C, Mora S, Cappello A, Signori A, Vena A, Guastavino S, Rosso N, Campi C, Giacomini M, Bassetti M. Prediction of candidemia with machine learning techniques: state of the art. Future Microbiol 2024;19:931-940. [PMID: 39072500 PMCID: PMC11290752 DOI: 10.2217/fmb-2023-0269] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/30/2023] [Accepted: 03/06/2024] [Indexed: 07/30/2024] Open

Ganie SM, Dutta Pramanik PK, Zhao Z. Improved liver disease prediction from clinical data through an evaluation of ensemble learning approaches. BMC Med Inform Decis Mak 2024;24:160. [PMID: 38849815 PMCID: PMC11157956 DOI: 10.1186/s12911-024-02550-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/25/2024] [Accepted: 05/21/2024] [Indexed: 06/09/2024] Open

Hagen M, Dass R, Westhues C, Blom J, Schultheiss SJ, Patz S. Interpretable machine learning decodes soil microbiome's response to drought stress. Environmental Microbiome 2024;19:35. [PMID: 38812054 PMCID: PMC11138018 DOI: 10.1186/s40793-024-00578-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/30/2023] [Accepted: 05/10/2024] [Indexed: 05/31/2024]

Kasim S, Amir Rudin PNF, Malek S, Ibrahim KS, Wan Ahmad WA, Fong AYY, Lin WY, Aziz F, Ibrahim N. Ensemble machine learning for predicting in-hospital mortality in Asian women with ST-elevation myocardial infarction (STEMI). Sci Rep 2024;14:12378. [PMID: 38811643 PMCID: PMC11137033 DOI: 10.1038/s41598-024-61151-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/21/2023] [Accepted: 05/02/2024] [Indexed: 05/31/2024] Open

Chaudhary R, Nourelahi M, Thoma FW, Gellad WF, Lo-Ciganic WH, Bliden KP, Gurbel PA, Neal MD, Jain SK, Bhonsale A, Mulukutla SR, Wang Y, Harinstein ME, Saba S, Visweswaran S. Machine Learning-Based Bleeding Risk Predictions in Atrial Fibrillation Patients on Direct Oral Anticoagulants. medRxiv 2024:2024.05.27.24307985. [PMID: 38854094 PMCID: PMC11160827 DOI: 10.1101/2024.05.27.24307985] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/11/2024]

Abstract

Importance

Accurately predicting major bleeding events in non-valvular atrial fibrillation (AF) patients on direct oral anticoagulants (DOACs) is crucial for personalized treatment and improving patient outcomes, especially with emerging alternatives like left atrial appendage closure devices. The left atrial appendage closure devices reduce stroke risk comparably but with significantly fewer non-procedural bleeding events.

Objective

To evaluate the performance of machine learning (ML) risk models in predicting clinically significant bleeding events requiring hospitalization and hemorrhagic stroke in non-valvular AF patients on DOACs compared to conventional bleeding risk scores (HAS-BLED, ORBIT, and ATRIA) at the index visit to a cardiologist for AF management.

Design

Prognostic modeling with a retrospective cohort study design using electronic health record (EHR) data, with clinical follow-up at one, two, and five years.

Setting

University of Pittsburgh Medical Center (UPMC) system.

Participants

24,468 non-valvular AF patients aged ≥18 years treated with DOACs, excluding those with prior history of significant bleeding, other indications for DOACs, on warfarin or contraindicated to DOACs.

Exposures

DOAC therapy for non-valvular AF.

Main Outcomes and Measures

The primary endpoint was clinically significant bleeding requiring hospitalization within one year of index visit. The models incorporated demographic, clinical, and laboratory variables available in the EHR at the index visit.

Results

Among 24,468 patients, 553 (2.3%) had bleeding events within one year, 829 (3.5%) within two years, and 1,292 (5.8%) within five years of index visit. We evaluated multivariate logistic regression and ML models including random forest, classification trees, k-nearest neighbor, naive Bayes, and extreme gradient boosting (XGBoost), which modestly outperformed HAS-BLED, ATRIA, and ORBIT scores in predicting clinically significant bleeding at 1-year follow-up. The best performing model (random forest) showed an area under the curve (AUC-ROC) of 0.76 (0.70-0.81), a G-Mean score of 0.67, and a net reclassification index of 0.14, compared to an AUC-ROC of 0.57 (0.50-0.63) and a G-Mean score of 0.57 for the HAS-BLED score (p-value for difference <0.001). The ML models had improved performance compared to conventional risk scores at the 2-year and 5-year time points and within the subgroup of hemorrhagic stroke. SHAP analysis identified novel risk factors, including measures from body mass index, cholesterol profile, and insurance type, beyond those used in conventional risk scores.
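
The G-Mean score reported above is commonly defined as the geometric mean of sensitivity and specificity. The abstract does not spell out its definition, so the sketch below assumes that usual one. The function name and the confusion-matrix counts are hypothetical and are not taken from the study.

```python
from math import sqrt

def g_mean(tp, fn, tn, fp):
    """Geometric mean of sensitivity (TP rate) and specificity (TN rate)."""
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return sqrt(sensitivity * specificity)

# Hypothetical counts for a rare outcome (10 events among 110 patients).
# A model that never predicts an event is 91% accurate but scores 0 here.
print(g_mean(tp=0, fn=10, tn=100, fp=0))
print(g_mean(tp=5, fn=5, tn=50, fp=50))
```

This is why the metric suits outcomes such as the 2.3% one-year bleeding rate: it penalizes a model that achieves high accuracy by ignoring the minority class.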

Conclusions and Relevance

Our findings demonstrate the superior performance of ML models compared to conventional bleeding risk scores and identify novel risk factors highlighting the potential for personalized bleeding risk assessment in AF patients on DOACs.

Hassan A, Gulzar Ahmad S, Ullah Munir E, Ali Khan I, Ramzan N. Predictive modelling and identification of key risk factors for stroke using machine learning. Sci Rep 2024;14:11498. [PMID: 38769427 PMCID: PMC11106277 DOI: 10.1038/s41598-024-61665-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/26/2024] [Accepted: 05/08/2024] [Indexed: 05/22/2024] Open

Tao L, Zhou T, Wu Z, Hu F, Yang S, Kong X, Li C. ESPDHot: An Effective Machine Learning-Based Approach for Predicting Protein-DNA Interaction Hotspots. J Chem Inf Model 2024;64:3548-3557. [PMID: 38587997 DOI: 10.1021/acs.jcim.3c02011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 04/10/2024]

Uddin S, Lu H. Confirming the statistically significant superiority of tree-based machine learning algorithms over their counterparts for tabular data. PLoS One 2024;19:e0301541. [PMID: 38635591 PMCID: PMC11025817 DOI: 10.1371/journal.pone.0301541] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/28/2024] [Accepted: 03/18/2024] [Indexed: 04/20/2024] Open

Zhang Y, Zhang L, Lv H, Zhang G. Ensemble machine learning prediction of hyperuricemia based on a prospective health checkup population. Front Physiol 2024;15:1357404. [PMID: 38665596 PMCID: PMC11043598 DOI: 10.3389/fphys.2024.1357404] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/21/2023] [Accepted: 03/11/2024] [Indexed: 04/28/2024] Open

Abstract

Objectives: An accurate prediction model for hyperuricemia (HUA) in adults remains unavailable. This study aimed to develop a stacking ensemble prediction model for HUA to identify high-risk groups and explore risk factors. Methods: A prospective health checkup cohort of 40899 subjects was examined and randomly divided into training and validation sets at a ratio of 7:3. LASSO regression was employed to screen out important features, and ROSE sampling was then used to handle the imbalanced classes. An ensemble model using a stacking strategy was constructed based on three individual models: support vector machine, decision tree C5.0, and eXtreme gradient boosting. Model validations were conducted using the area under the receiver operating characteristic curve (AUC) and the calibration curve, as well as metrics including accuracy, sensitivity, specificity, positive predictive value, negative predictive value, and F1 score. A model-agnostic instance-level variable attribution technique (iBreakdown) was used to illustrate the black-box nature of our ensemble model and to identify contributing risk factors. Results: Fifteen important features were screened out of 23 clinical variables. Our stacking ensemble model, with an AUC of 0.854, outperformed the other three models (support vector machine, decision tree C5.0, and eXtreme gradient boosting), which had AUCs of 0.848, 0.851, and 0.849, respectively. Calibration accuracy as well as other metrics including accuracy, specificity, negative predictive value, and F1 score also demonstrated our ensemble model's superiority. The contributing risk factors were estimated using six randomly selected subjects, which showed that being female and relatively younger, together with having higher baseline uric acid, body mass index, γ-glutamyl transpeptidase, total protein, triglycerides, creatinine, and fasting blood glucose, can increase the risk of HUA. To further validate our model's applicability in the health checkup population, we used another cohort of 8559 subjects, which also showed that our ensemble prediction model performed favorably, with an AUC of 0.846. Conclusion: In this study, a stacking ensemble prediction model for HUA was developed, and it outperformed the three individual models that compose it (support vector machine, decision tree C5.0, and eXtreme gradient boosting). The contributing risk factors were identified with insightful ideas.

Mukherjee A, Abraham S, Singh A, Balaji S, Mukunthan KS. From Data to Cure: A Comprehensive Exploration of Multi-omics Data Analysis for Targeted Therapies. Mol Biotechnol 2024:10.1007/s12033-024-01133-6. [PMID: 38565775 DOI: 10.1007/s12033-024-01133-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/27/2023] [Accepted: 02/27/2024] [Indexed: 04/04/2024]

Nwadiugwu M, Onwuekwe I, Ezeanolue E, Deng H. Beyond Amyloid: A Machine Learning-Driven Approach Reveals Properties of Potent GSK-3β Inhibitors Targeting Neurofibrillary Tangles. Int J Mol Sci 2024;25:2646. [PMID: 38473895 PMCID: PMC10931970 DOI: 10.3390/ijms25052646] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/13/2024] [Revised: 02/16/2024] [Accepted: 02/21/2024] [Indexed: 03/14/2024] Open

Abstract

Current treatments for Alzheimer's disease (AD) focus on slowing memory and cognitive decline, but none offer curative outcomes. This study aims to explore and curate the common properties of active, drug-like molecules that modulate glycogen synthase kinase 3β (GSK-3β), a well-documented kinase with increased activity in tau hyperphosphorylation and neurofibrillary tangles, which are hallmarks of AD pathology. Leveraging quantitative structure-activity relationship (QSAR) data from the PubChem and ChEMBL databases, we employed seven machine learning models: logistic regression (LogR), k-nearest neighbors (KNN), random forest (RF), support vector machine (SVM), extreme gradient boosting (XGB), neural networks (NNs), and ensemble majority voting. Our goal was to correctly predict active and inactive compounds that inhibit GSK-3β activity and identify their key properties. Among the six individual models, the NN demonstrated the highest performance with a 79% AUC-ROC on unbalanced external validation data, while the SVM model was superior in accurately classifying the compounds. The SVM and RF models surpassed the NN in terms of Kappa values, and the ensemble majority voting model demonstrated slightly better accuracy than the NN on the external validation data. Feature importance analysis revealed that hydrogen bonds, phenol groups, and specific electronic characteristics are important features of molecular descriptors that positively correlate with active GSK-3β inhibition. Conversely, structural features like imidazole rings, sulfides, and methoxy groups showed a negative correlation. Our study highlights the significance of structural, electronic, and physicochemical descriptors in screening active candidates against GSK-3β. These predictive features could prove useful in therapeutic strategies to understand the important properties of GSK-3β candidate inhibitors that may potentially benefit non-amyloid-based AD treatments targeting neurofibrillary tangles.
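
The Kappa values used above to compare models are presumably Cohen's kappa, which corrects raw agreement between predicted and true labels for the agreement expected by chance. A minimal sketch for the binary (active/inactive) case follows, assuming that definition; the function name and the counts are hypothetical and do not come from the study.

```python
def cohen_kappa(tp, fp, fn, tn):
    """Cohen's kappa from a 2x2 confusion matrix.
    Undefined (division by zero) when chance agreement equals 1."""
    n = tp + fp + fn + tn
    observed = (tp + tn) / n
    # Chance agreement: product of predicted and actual marginals per class.
    expected = ((tp + fp) * (tp + fn) + (fn + tn) * (fp + tn)) / (n * n)
    return (observed - expected) / (1 - expected)

# Hypothetical counts: 70% raw agreement, 50% expected by chance.
print(cohen_kappa(tp=20, fp=5, fn=10, tn=15))
```

Because kappa discounts agreement that class imbalance alone would produce, two models with similar accuracy on unbalanced data can differ noticeably in kappa.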

Kim SH, Park SH, Lee H. Machine learning for predicting hepatitis B or C virus infection in diabetic patients. Sci Rep 2023;13:21518. [PMID: 38057379 PMCID: PMC10700585 DOI: 10.1038/s41598-023-49046-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/03/2023] [Accepted: 12/04/2023] [Indexed: 12/08/2023] Open

Mohsin SN, Gapizov A, Ekhator C, Ain NU, Ahmad S, Khan M, Barker C, Hussain M, Malineni J, Ramadhan A, Halappa Nagaraj R. The Role of Artificial Intelligence in Prediction, Risk Stratification, and Personalized Treatment Planning for Congenital Heart Diseases. Cureus 2023;15:e44374. [PMID: 37664359 PMCID: PMC10469091 DOI: 10.7759/cureus.44374] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Accepted: 08/30/2023] [Indexed: 09/05/2023] Open