1
|
Nawaz MS, Nawaz MZ, Gong Y, Fournier-Viger P, Diallo AB. In silico framework for genome analysis. FUTURE GENERATION COMPUTER SYSTEMS 2025; 164:107585. [DOI: 10.1016/j.future.2024.107585] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/05/2025]
|
2
|
Wu YH, Sun J, Huang JH, Lu XY. Bioinformatics Identification of angiogenesis-related biomarkers and therapeutic targets in cerebral ischemia-reperfusion. Sci Rep 2024; 14:32096. [PMID: 39738531 PMCID: PMC11685884 DOI: 10.1038/s41598-024-83783-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2024] [Accepted: 12/17/2024] [Indexed: 01/02/2025] Open
Abstract
Promoting vascular endothelial cell regeneration can enhance recovery from cerebral ischemia reperfusion injury (CIRI), but there is a lack of bioinformatic studies on angiogenesis-related biomarkers in CIRI. In this study, we utilized the GSE97537 and GSE61616 datasets from GEO to identify 181 angiogenesis-related genes (ARGs) and analyzed differentially expressed genes (DEGs) between CIRI and control groups. We converted ARGs to 169 rat homologues and intersected them with DEGs to find DE-ARGs. RF and XGBoost models were employed to identify five biomarkers (Stat3, Hmox1, Egfr, Col18a1, Ptgs2) and conducted GSEA on these biomarkers, revealing their enrichment in pathways such as ECM-receptor interaction and hematopoietic cell lineage. We also analyzed the immune microenvironment, finding significant differences in 21 immune cells between CIRI and control groups. Furthermore, we constructed lncRNA-miRNA-mRNA networks and drug-gene networks. Finally, biomarker expression was compared between the CIRI and control groups by qRT-PCR in tissue and blood samples. Overall, our bioinformatic exploration of angiogenesis-related biomarkers in CIRI provides new insights for the diagnosis and treatment of CIRI.
Collapse
Affiliation(s)
- Yong-Hong Wu
- Key Laboratory of Biomedical Information Engineering of Ministry of Education, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, 710049, Shanxi Province, China
- School of Medical Technology & Institute of Basic Translational Medicine, Xi'an Medical University, Xi'an, 710021, Shanxi Province, China
| | - Jing Sun
- School of Medical Technology & Institute of Basic Translational Medicine, Xi'an Medical University, Xi'an, 710021, Shanxi Province, China
| | - Jun-Hua Huang
- School of Medical Technology & Institute of Basic Translational Medicine, Xi'an Medical University, Xi'an, 710021, Shanxi Province, China
| | - Xiao-Yun Lu
- Key Laboratory of Biomedical Information Engineering of Ministry of Education, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, 710049, Shanxi Province, China.
| |
Collapse
|
3
|
Yousefi M, Akhbari M, Mohamadi Z, Karami S, Dasoomi H, Atabi A, Sarkeshikian SA, Abdoullahi Dehaki M, Bayati H, Mashayekhi N, Varmazyar S, Rahimian Z, Asadi Anar M, Shafiei D, Mohebbi A. Machine learning based algorithms for virtual early detection and screening of neurodegenerative and neurocognitive disorders: a systematic-review. Front Neurol 2024; 15:1413071. [PMID: 39717687 PMCID: PMC11663744 DOI: 10.3389/fneur.2024.1413071] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2024] [Accepted: 11/05/2024] [Indexed: 12/25/2024] Open
Abstract
Background and aim Neurodegenerative disorders (e.g., Alzheimer's, Parkinson's) lead to neuronal loss; neurocognitive disorders (e.g., delirium, dementia) show cognitive decline. Early detection is crucial for effective management. Machine learning aids in more precise disease identification, potentially transforming healthcare. This comprehensive systematic review discusses how machine learning (ML), can enhance early detection of these disorders, surpassing traditional diagnostics' constraints. Methods In this review, databases were examined up to August 15th, 2023, for ML data on neurodegenerative and neurocognitive diseases using PubMed, Scopus, Google Scholar, and Web of Science. Two investigators used the RAYYAN intelligence tool for systematic reviews to conduct the screening. Six blinded reviewers reviewed titles/abstracts. Cochrane risk of bias tool was used for quality assessment. Results Our search found 7,069 research studies, of which 1,365 items were duplicates and thus removed. Four thousand three hundred and thirty four studies were screened, and 108 articles met the criteria for inclusion after preprocessing. Twelve ML algorithms were observed for dementia, showing promise in early detection. Eighteen ML algorithms were identified for Parkinson's, each effective in detection and diagnosis. Studies emphasized that ML algorithms are necessary for Alzheimer's to be successful. Fourteen ML algorithms were discovered for mild cognitive impairment, with LASSO logistic regression being the only one with unpromising results. Conclusion This review emphasizes the pressing necessity of integrating verified digital health resources into conventional medical practice. This integration may signify a new era in the early detection of neurodegenerative and neurocognitive illnesses, potentially changing the course of these conditions for millions globally. This study showcases specific and statistically significant findings to illustrate the progress in the area and the prospective influence of these advancements on the global management of neurocognitive and neurodegenerative illnesses.
Collapse
Affiliation(s)
- Milad Yousefi
- Institute for Cognitive and Brain Sciences, Shahid Beheshti University, Tehran, Iran
| | - Matin Akhbari
- Faculty of Medicine, Istanbul Yeni Yuzyil University, Istanbul, Türkiye
| | - Zhina Mohamadi
- School of Medicine, Kermanshah University of Medical Sciences, Kermanshah, Iran
| | - Shaghayegh Karami
- School of Medicine, Tehran University of Medical Sciences, Tehran, Iran
| | - Hediyeh Dasoomi
- Student Research Committee, Ahvaz Jundishapur University of Medical Sciences, Ahvaz, Iran
| | - Alireza Atabi
- School of Medicine, Ahvaz Jundishapur University of Medical Sciences, Ahvaz, Iran
| | | | - Mahdi Abdoullahi Dehaki
- Master’s of AI Engineering, Islamic Azad University Tehran Science and Research Branch, Tehran, Iran
| | - Hesam Bayati
- Department of Radiology, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Negin Mashayekhi
- Department of Neuroscience, Bahçeşehir University, Istanbul, Türkiye
| | - Shirin Varmazyar
- School of Medicine, Shahroud University of Medical Sciences, Shahrud, Iran
| | - Zahra Rahimian
- School of Medicine, Shiraz University of Medical Sciences, Shiraz, Iran
| | - Mahsa Asadi Anar
- Student Research Committee, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Daniel Shafiei
- School of Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Alireza Mohebbi
- Students Research Committee, Ardabil University of Medical Sciences, Ardabil, Iran
| |
Collapse
|
4
|
Ahammad I, Lamisa AB, Bhattacharjee A, Jamal TB, Arefin MS, Chowdhury ZM, Hossain MU, Das KC, Keya CA, Salimullah M. AITeQ: a machine learning framework for Alzheimer's prediction using a distinctive five-gene signature. Brief Bioinform 2024; 25:bbae291. [PMID: 38877887 PMCID: PMC11179120 DOI: 10.1093/bib/bbae291] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Revised: 05/23/2024] [Accepted: 06/04/2024] [Indexed: 06/18/2024] Open
Abstract
Neurodegenerative diseases, such as Alzheimer's disease, pose a significant global health challenge with their complex etiology and elusive biomarkers. In this study, we developed the Alzheimer's Identification Tool (AITeQ) using ribonucleic acid-sequencing (RNA-seq), a machine learning (ML) model based on an optimized ensemble algorithm for the identification of Alzheimer's from RNA-seq data. Analysis of RNA-seq data from several studies identified 87 differentially expressed genes. This was followed by a ML protocol involving feature selection, model training, performance evaluation, and hyperparameter tuning. The feature selection process undertaken in this study, employing a combination of four different methodologies, culminated in the identification of a compact yet impactful set of five genes. Twelve diverse ML models were trained and tested using these five genes (CNKSR1, EPHA2, CLSPN, OLFML3, and TARBP1). Performance metrics, including precision, recall, F1 score, accuracy, Matthew's correlation coefficient, and receiver operating characteristic area under the curve were assessed for the finally selected model. Overall, the ensemble model consisting of logistic regression, naive Bayes classifier, and support vector machine with optimized hyperparameters was identified as the best and was used to develop AITeQ. AITeQ is available at: https://github.com/ishtiaque-ahammad/AITeQ.
Collapse
Affiliation(s)
- Ishtiaque Ahammad
- Bioinformatics Division, National Institute of Biotechnology, Ganakbari, Ashulia, Savar, Dhaka 1349, Bangladesh
| | - Anika Bushra Lamisa
- Bioinformatics Division, National Institute of Biotechnology, Ganakbari, Ashulia, Savar, Dhaka 1349, Bangladesh
| | - Arittra Bhattacharjee
- Bioinformatics Division, National Institute of Biotechnology, Ganakbari, Ashulia, Savar, Dhaka 1349, Bangladesh
| | - Tabassum Binte Jamal
- Bioinformatics Division, National Institute of Biotechnology, Ganakbari, Ashulia, Savar, Dhaka 1349, Bangladesh
| | - Md Shamsul Arefin
- Department of Biochemistry and Microbiology, North South University, Bashundhara, Dhaka 1229, Bangladesh
| | - Zeshan Mahmud Chowdhury
- Bioinformatics Division, National Institute of Biotechnology, Ganakbari, Ashulia, Savar, Dhaka 1349, Bangladesh
| | - Mohammad Uzzal Hossain
- Bioinformatics Division, National Institute of Biotechnology, Ganakbari, Ashulia, Savar, Dhaka 1349, Bangladesh
| | - Keshob Chandra Das
- Molecular Biotechnology Division, National Institute of Biotechnology, Ganakbari, Ashulia, Savar, Dhaka 1349, Bangladesh
| | - Chaman Ara Keya
- Department of Biochemistry and Microbiology, North South University, Bashundhara, Dhaka 1229, Bangladesh
| | - Md Salimullah
- Molecular Biotechnology Division, National Institute of Biotechnology, Ganakbari, Ashulia, Savar, Dhaka 1349, Bangladesh
| |
Collapse
|
5
|
Flaherty R, Sui YV, Masurkar AV, Betensky RA, Rusinek H, Lazar M. Diffusion imaging markers of accelerated aging of the lower cingulum in subjective cognitive decline. Front Neurol 2024; 15:1360273. [PMID: 38784911 PMCID: PMC11111894 DOI: 10.3389/fneur.2024.1360273] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2023] [Accepted: 04/29/2024] [Indexed: 05/25/2024] Open
Abstract
Introduction Alzheimer's Disease (AD) typically starts in the medial temporal lobe, then develops into a neurodegenerative cascade which spreads to other brain regions. People with subjective cognitive decline (SCD) are more likely to develop dementia, especially in the presence of amyloid pathology. Thus, we were interested in the white matter microstructure of the medial temporal lobe in SCD, specifically the lower cingulum bundle that leads into the hippocampus. Diffusion tensor imaging (DTI) has been shown to differentiate SCD participants who will progress to mild cognitive impairment from those who will not. However, the biology underlying these DTI metrics is unclear, and results in the medial temporal lobe have been inconsistent. Methods To better characterize the microstructure of this region, we applied DTI to cognitively normal participants in the Cam-CAN database over the age of 55 with cognitive testing and diffusion MRI available (N = 325, 127 SCD). Diffusion MRI was processed to generate regional and voxel-wise diffusion tensor values in bilateral lower cingulum white matter, while T1-weighted MRI was processed to generate regional volume and cortical thickness in the medial temporal lobe white matter, entorhinal cortex, temporal pole, and hippocampus. Results SCD participants had thinner cortex in bilateral entorhinal cortex and right temporal pole. No between-group differences were noted for any of the microstructural metrics of the lower cingulum. However, correlations with delayed story recall were significant for all diffusion microstructure metrics in the right lower cingulum in SCD, but not in controls, with a significant interaction effect. Additionally, the SCD group showed an accelerated aging effect in bilateral lower cingulum with MD, AxD, and RD. Discussion The diffusion profiles observed in both interaction effects are suggestive of a mixed neuroinflammatory and neurodegenerative pathology. Left entorhinal cortical thinning correlated with decreased FA and increased RD, suggestive of demyelination. However, right entorhinal cortical thinning also correlated with increased AxD, suggestive of a mixed pathology. This may reflect combined pathologies implicated in early AD. DTI was more sensitive than cortical thickness to the associations between SCD, memory, and age. The combined effects of mixed pathology may increase the sensitivity of DTI metrics to variations with age and cognition.
Collapse
Affiliation(s)
- Ryn Flaherty
- Center for Advanced Imaging Innovation and Research, Department of Radiology, New York University Grossman School of Medicine, New York, NY, United States
- Vilcek Institute of Graduate Biomedical Sciences, New York University Grossman School of Medicine, New York, NY, United States
| | - Yu Veronica Sui
- Center for Advanced Imaging Innovation and Research, Department of Radiology, New York University Grossman School of Medicine, New York, NY, United States
| | - Arjun V. Masurkar
- Department of Neurology, New York University Grossman School of Medicine, New York, NY, United States
- Neuroscience Institute, New York University Grossman School of Medicine, New York, NY, United States
| | - Rebecca A. Betensky
- Department of Biostatistics, New York University School of Global Public Health, New York, NY, United States
| | - Henry Rusinek
- Center for Advanced Imaging Innovation and Research, Department of Radiology, New York University Grossman School of Medicine, New York, NY, United States
- Department of Psychiatry, New York University Grossman School of Medicine, New York, NY, United States
| | - Mariana Lazar
- Center for Advanced Imaging Innovation and Research, Department of Radiology, New York University Grossman School of Medicine, New York, NY, United States
| |
Collapse
|
6
|
Ejiyi CJ, Qin Z, Ukwuoma CC, Nneji GU, Monday HN, Ejiyi MB, Ejiyi TU, Okechukwu U, Bamisile OO. Comparative performance analysis of Boruta, SHAP, and Borutashap for disease diagnosis: A study with multiple machine learning algorithms. NETWORK (BRISTOL, ENGLAND) 2024:1-38. [PMID: 38511557 DOI: 10.1080/0954898x.2024.2331506] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/12/2023] [Accepted: 03/13/2024] [Indexed: 03/22/2024]
Abstract
Interpretable machine learning models are instrumental in disease diagnosis and clinical decision-making, shedding light on relevant features. Notably, Boruta, SHAP (SHapley Additive exPlanations), and BorutaShap were employed for feature selection, each contributing to the identification of crucial features. These selected features were then utilized to train six machine learning algorithms, including LR, SVM, ETC, AdaBoost, RF, and LR, using diverse medical datasets obtained from public sources after rigorous preprocessing. The performance of each feature selection technique was evaluated across multiple ML models, assessing accuracy, precision, recall, and F1-score metrics. Among these, SHAP showcased superior performance, achieving average accuracies of 80.17%, 85.13%, 90.00%, and 99.55% across diabetes, cardiovascular, statlog, and thyroid disease datasets, respectively. Notably, the LGBM emerged as the most effective algorithm, boasting an average accuracy of 91.00% for most disease states. Moreover, SHAP enhanced the interpretability of the models, providing valuable insights into the underlying mechanisms driving disease diagnosis. This comprehensive study contributes significant insights into feature selection techniques and machine learning algorithms for disease diagnosis, benefiting researchers and practitioners in the medical field. Further exploration of feature selection methods and algorithms holds promise for advancing disease diagnosis methodologies, paving the way for more accurate and interpretable diagnostic models.
Collapse
Affiliation(s)
- Chukwuebuka Joseph Ejiyi
- School of Information and Software Engineering, University of Electronic Science and Technology of China, Chengdu, China
| | - Zhen Qin
- School of Information and Software Engineering, University of Electronic Science and Technology of China, Chengdu, China
| | - Chiagoziem Chima Ukwuoma
- School of Information and Software Engineering, University of Electronic Science and Technology of China, Chengdu, China
| | - Grace Ugochi Nneji
- Software Engineering Department, Sino-British Collaborative Education, Chengdu University of Technology, Oxford Brookes University, Chengdu, China
| | - Happy Nkanta Monday
- Software Engineering Department, Sino-British Collaborative Education, Chengdu University of Technology, Oxford Brookes University, Chengdu, China
| | | | - Thomas Ugochukwu Ejiyi
- Department of Pure and Industrial Chemistry, University of Nigeria Nsukka, Enugu, Nigeria
| | | | - Olusola O Bamisile
- Sichuan Industrial Internet Intelligent Monitoring and Application Engineering Technology Research Centre, Chengdu University of Technology, Chengdu, China
| |
Collapse
|
7
|
Zeinali N, Youn N, Albashayreh A, Fan W, Gilbertson White S. Machine Learning Approaches to Predict Symptoms in People With Cancer: Systematic Review. JMIR Cancer 2024; 10:e52322. [PMID: 38502171 PMCID: PMC10988375 DOI: 10.2196/52322] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2023] [Revised: 01/18/2024] [Accepted: 01/19/2024] [Indexed: 03/20/2024] Open
Abstract
BACKGROUND People with cancer frequently experience severe and distressing symptoms associated with cancer and its treatments. Predicting symptoms in patients with cancer continues to be a significant challenge for both clinicians and researchers. The rapid evolution of machine learning (ML) highlights the need for a current systematic review to improve cancer symptom prediction. OBJECTIVE This systematic review aims to synthesize the literature that has used ML algorithms to predict the development of cancer symptoms and to identify the predictors of these symptoms. This is essential for integrating new developments and identifying gaps in existing literature. METHODS We conducted this systematic review in accordance with the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) checklist. We conducted a systematic search of CINAHL, Embase, and PubMed for English records published from 1984 to August 11, 2023, using the following search terms: cancer, neoplasm, specific symptoms, neural networks, machine learning, specific algorithm names, and deep learning. All records that met the eligibility criteria were individually reviewed by 2 coauthors, and key findings were extracted and synthesized. We focused on studies using ML algorithms to predict cancer symptoms, excluding nonhuman research, technical reports, reviews, book chapters, conference proceedings, and inaccessible full texts. RESULTS A total of 42 studies were included, the majority of which were published after 2017. Most studies were conducted in North America (18/42, 43%) and Asia (16/42, 38%). The sample sizes in most studies (27/42, 64%) typically ranged from 100 to 1000 participants. The most prevalent category of algorithms was supervised ML, accounting for 39 (93%) of the 42 studies. Each of the methods-deep learning, ensemble classifiers, and unsupervised ML-constituted 3 (3%) of the 42 studies. The ML algorithms with the best performance were logistic regression (9/42, 17%), random forest (7/42, 13%), artificial neural networks (5/42, 9%), and decision trees (5/42, 9%). The most commonly included primary cancer sites were the head and neck (9/42, 22%) and breast (8/42, 19%), with 17 (41%) of the 42 studies not specifying the site. The most frequently studied symptoms were xerostomia (9/42, 14%), depression (8/42, 13%), pain (8/42, 13%), and fatigue (6/42, 10%). The significant predictors were age, gender, treatment type, treatment number, cancer site, cancer stage, chemotherapy, radiotherapy, chronic diseases, comorbidities, physical factors, and psychological factors. CONCLUSIONS This review outlines the algorithms used for predicting symptoms in individuals with cancer. Given the diversity of symptoms people with cancer experience, analytic approaches that can handle complex and nonlinear relationships are critical. This knowledge can pave the way for crafting algorithms tailored to a specific symptom. In addition, to improve prediction precision, future research should compare cutting-edge ML strategies such as deep learning and ensemble methods with traditional statistical models.
Collapse
Affiliation(s)
- Nahid Zeinali
- Department of Computer Science and Informatics, University of Iowa, Iowa City, IA, United States
| | - Nayung Youn
- College of Nursing, University of Iowa, Iowa City, IA, United States
| | - Alaa Albashayreh
- College of Nursing, University of Iowa, Iowa City, IA, United States
| | - Weiguo Fan
- Department of Business Analytics, University of Iowa, Iowa City, IA, United States
| | | |
Collapse
|
8
|
Dahdah A, de Silva NH, Maniam S, Blanch EW. Characterizing fibril morphological changes by spirooxindoles for neurodegenerative disease application. Analyst 2024; 149:1229-1237. [PMID: 38224234 DOI: 10.1039/d3an01773g] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2024]
Abstract
Fibrillation of proteins and polypeptides, which leads to the deposition of plaques in cells and tissues has been widely associated with many neuropathological diseases. Inhibition of protein misfolding and aggregation is crucial for the prevention and treatment of these conditions. The growing interest in identifying inhibitor molecules to prevent the formation of fibrils in vivo has led to the results highlighted in this study. Due to their hydrophobic structure and potential to readily cross the blood brain barrier, a library of spirooxindole compounds were synthesized with those labelled Hd-63, Hd-66 and Hd-74 proving to be the most potent against fibril formation. Our spectroscopic analysis provides detailed insight, that the introduction of these spirooxindole compounds leads to morphological changes in the mechanism of fibril formation which prevent the formation of highly ordered fibrils, instead results in the formation of disordered aggregates which are not fibrillar in nature.
Collapse
Affiliation(s)
- Anthony Dahdah
- School of Science, STEM College, RMIT University, 124 La Trobe Street, Melbourne, VIC, 3001, Australia.
| | - Nilamuni H de Silva
- School of Science, STEM College, RMIT University, 124 La Trobe Street, Melbourne, VIC, 3001, Australia.
| | - Subashani Maniam
- School of Science, STEM College, RMIT University, 124 La Trobe Street, Melbourne, VIC, 3001, Australia.
| | - Ewan W Blanch
- School of Science, STEM College, RMIT University, 124 La Trobe Street, Melbourne, VIC, 3001, Australia.
| |
Collapse
|