1
|
R HC, Datta A, S UK, Zayed H, D TK, C GPD. Decoding genetic and pathophysiological mechanisms in amyotrophic lateral sclerosis and primary lateral sclerosis: A comparative study of differentially expressed genes and implicated pathways in motor neuron disorders. ADVANCES IN PROTEIN CHEMISTRY AND STRUCTURAL BIOLOGY 2024; 141:177-201. [PMID: 38960473 DOI: 10.1016/bs.apcsb.2023.12.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/05/2024]
Abstract
Motor Neuron Disorders (MNDs), characterized by the degradation and loss of function of motor neurons, are recognized as fatal conditions with limited treatment options and no known cure. The present study aimed to identify the pathophysiological functions and affected genes in patients with MNDs, specifically Amyotrophic Lateral Sclerosis (ALS) and Primary Lateral Sclerosis (PLS). The GSE56808 dataset comprised three sample groups: six patients diagnosed with ALS (GSM1369650, GSM1369652, GSM1369654, GSM1369656, GSM1369657, GSM1369658), five patients diagnosed with PLS (GSM1369648, GSM1369649, GSM1369653, GSM1369655, GSM1369659), and six normal controls (GSM1369642, GSM1369643, GSM1369644, GSM1369645, GSM1369646, and GSM1369647). The application of computational analysis of microarray gene expression profiles enabled us to identify 346 significantly differentially expressed genes (DEGs), 169 genes for the ALS sample study, and 177 genes for the PLS sample study. Enrichment was carried out using MCODE, a Cytoscape plugin. Functional annotation of DEGs was carried out via ClueGO/CluePedia (v2.5.9) and further validated via the DAVID database. NRP2, SEMA3D, ROBO3 and, CACNB1, CACNG2 genes were identified as the gene of interest for ALS and PLS sample groups, respectively. Axonal guidance (GO:0007411) and calcium ion transmembrane transport (GO:0070588) were identified to be some of the significantly dysregulated gene ontology (GO) terms, with arrhythmogenic right ventricular cardiomyopathy (KEGG:05412) to be the top relevant KEGG pathway which is affected in MND patients. ROBO3 gene was observed to have distinctive roles in ALS and PLS-affected patients, hinting towards the differential progression of ALS from PLS. The insights derived from our comprehensive analysis accentuate the distinct variances in the underlying molecular pathogenesis of ALS and PLS. Further research should investigate the mechanistic roles of the identified DEGs and molecular pathways, leading to potential targeted therapies for ALS and PLS.
Collapse
Affiliation(s)
- Hephzibah Cathryn R
- Laboratory of Integrative Genomics, Department of Integrative Biology, School of Biosciences and Technology, Vellore Institute of Technology (VIT), Vellore, Tamil Nadu, India
| | - Ankur Datta
- Laboratory of Integrative Genomics, Department of Integrative Biology, School of Biosciences and Technology, Vellore Institute of Technology (VIT), Vellore, Tamil Nadu, India
| | - Udhaya Kumar S
- Laboratory of Integrative Genomics, Department of Integrative Biology, School of Biosciences and Technology, Vellore Institute of Technology (VIT), Vellore, Tamil Nadu, India; Department of Medicine, Division Endocrinology, Diabetes and Metabolism, Baylor College of Medicine, Houston, TX, United States
| | - Hatem Zayed
- Department of Biomedical Sciences, College of Health and Sciences, Qatar University, Doha, Qatar
| | - Thirumal Kumar D
- Faculty of Allied Health Sciences, Meenakshi Academy of Higher Education and Research, Chennai, India
| | - George Priya Doss C
- Laboratory of Integrative Genomics, Department of Integrative Biology, School of Biosciences and Technology, Vellore Institute of Technology (VIT), Vellore, Tamil Nadu, India.
| |
Collapse
|
2
|
Wang Z, Zhang G, Fu J, Li G, Zhao Z, Choe H, Ding K, Ma J, Wei J, Shang D, Zhang L. Mechanism exploration and biomarker identification of glycemic deterioration in patients with diseases of the exocrine pancreas. Sci Rep 2024; 14:4374. [PMID: 38388766 PMCID: PMC10883946 DOI: 10.1038/s41598-024-52956-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2023] [Accepted: 01/25/2024] [Indexed: 02/24/2024] Open
Abstract
The damage to the endocrine pancreas among patients with diseases of the exocrine pancreas (DP) leads to reduced glycemic deterioration, ultimately resulting in diabetes of the exocrine pancreas (DEP). The present research aims to investigate the mechanism responsible for glycemic deterioration in DP patients, and to identify useful biomarkers, with the ultimate goal of enhancing clinical practice awareness. Gene expression profiles of patients with DP in this study were acquired from the Gene Expression Omnibus database. The original study defines DP patients to belong in one of three categories: non-diabetic (ND), impaired glucose tolerance (IGT) and DEP, which correspond to normoglycemia, early and late glycemic deterioration, respectively. After ensuring quality control, the discovery cohort included 8 ND, 20 IGT, and 12 DEP, while the validation cohort included 27 ND, 15 IGT, and 20 DEP. Gene set enrichment analysis (GSEA) employed differentially expressed genes (DEGs), while immunocyte infiltration was determined using single sample gene set enrichment analysis (ssGSEA). Additionally, correlation analysis was conducted to establish the link between clinical characteristics and immunocyte infiltration. The least absolute shrinkage and selection operator regression and random forest combined to identify biomarkers indicating glycemic deterioration in DP patients. These biomarkers were further validated through independent cohorts and animal experiments. With glycemic deterioration, biological processes in the pancreatic islets such as nutrient metabolism and complex immune responses are disrupted in DP patients. The expression of ACOT4, B2M, and ACKR2 was upregulated, whereas the expression of CACNA1F was downregulated. Immunocyte infiltration in the islet microenvironment showed a significant positive correlation with the age, body mass index (BMI), HbA1c and glycemia at the 2-h of patients. It was a crucial factor in glycemic deterioration. Additionally, B2M demonstrated a significant positive correlation with immunocyte infiltration and clinical features. Quantitative real-time PCR (qRT-PCR) and western blotting confirmed the upregulation in B2M. Immunofluorescent staining suggested the alteration of B2M was mainly in the alpha cells and beta cells. Overall, the study showed that gradually increased immunocyte infiltration was a significant contributor to glycemic deterioration in patients with DP, and it also highlighted B2M as a biomarker.
Collapse
Affiliation(s)
- Zhen Wang
- Institute (College) of Integrative Medicine, Dalian Medical University, Dalian, 116044, China
- Department of General Surgery, The First Affiliated Hospital of Dalian Medical University, Dalian, 116000, China
| | - Guolin Zhang
- Department of Cardiology II, The Second Affiliated Hospital of Dalian Medical University, Dalian, 116027, China
| | - Jixian Fu
- Department of Interventional Radiology, The First Hospital of China Medical University, Shenyang, 110001, China
| | - Guangxing Li
- Institute (College) of Integrative Medicine, Dalian Medical University, Dalian, 116044, China
| | - Zhihao Zhao
- Institute (College) of Integrative Medicine, Dalian Medical University, Dalian, 116044, China
| | - HyokChol Choe
- Institute (College) of Integrative Medicine, Dalian Medical University, Dalian, 116044, China
- Department of Clinical Medicine, Sinuiju Medical University, Sinuiju, Republic of Korea
| | - Kaiyue Ding
- Institute (College) of Integrative Medicine, Dalian Medical University, Dalian, 116044, China
| | - Junnan Ma
- Institute (College) of Integrative Medicine, Dalian Medical University, Dalian, 116044, China
| | - Jing Wei
- Department of Immunology, College of Basic Medical Science, Dalian Medical University, Dalian, 116044, China.
| | - Dong Shang
- Department of General Surgery, The First Affiliated Hospital of Dalian Medical University, Dalian, 116000, China.
| | - Lin Zhang
- Institute (College) of Integrative Medicine, Dalian Medical University, Dalian, 116044, China.
| |
Collapse
|
3
|
Al-Azani S, Alkhnbashi OS, Ramadan E, Alfarraj M. Gene Expression-Based Cancer Classification for Handling the Class Imbalance Problem and Curse of Dimensionality. Int J Mol Sci 2024; 25:2102. [PMID: 38396779 PMCID: PMC10889442 DOI: 10.3390/ijms25042102] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2023] [Revised: 12/25/2023] [Accepted: 12/27/2023] [Indexed: 02/25/2024] Open
Abstract
Cancer is a leading cause of death globally. The majority of cancer cases are only diagnosed in the late stages of cancer due to the use of conventional methods. This reduces the chance of survival for cancer patients. Therefore, early detection consequently followed by early diagnoses are important tasks in cancer research. Gene expression microarray technology has been applied to detect and diagnose most types of cancers in their early stages and has gained encouraging results. In this paper, we address the problem of classifying cancer based on gene expression for handling the class imbalance problem and the curse of dimensionality. The oversampling technique is utilized to overcome this problem by adding synthetic samples. Another common issue related to the gene expression dataset addressed in this paper is the curse of dimensionality. This problem is addressed by applying chi-square and information gain feature selection techniques. After applying these techniques individually, we proposed a method to select the most significant genes by combining those two techniques (CHiS and IG). We investigated the effect of these techniques individually and in combination. Four benchmarking biomedical datasets (Leukemia-subtypes, Leukemia-ALLAML, Colon, and CuMiDa) were used. The experimental results reveal that the oversampling techniques improve the results in most cases. Additionally, the performance of the proposed feature selection technique outperforms individual techniques in nearly all cases. In addition, this study provides an empirical study for evaluating several oversampling techniques along with ensemble-based learning. The experimental results also reveal that SVM-SMOTE, along with the random forests classifier, achieved the highest results, with a reporting accuracy of 100%. The obtained results surpass the findings in the existing literature as well.
Collapse
Affiliation(s)
- Sadam Al-Azani
- SDAIA-KFUPM Joint Research Center for Artificial Intelligence, King Fahd University of Petroleum and Minerals (KFUPM), Dhahran 31261, Saudi Arabia;
| | - Omer S. Alkhnbashi
- Information and Computer Science Department, King Fahd University of Petroleum and Minerals (KFUPM), Dhahran 31261, Saudi Arabia; (O.S.A.); (E.R.)
| | - Emad Ramadan
- Information and Computer Science Department, King Fahd University of Petroleum and Minerals (KFUPM), Dhahran 31261, Saudi Arabia; (O.S.A.); (E.R.)
| | - Motaz Alfarraj
- SDAIA-KFUPM Joint Research Center for Artificial Intelligence, King Fahd University of Petroleum and Minerals (KFUPM), Dhahran 31261, Saudi Arabia;
- Information and Computer Science Department, King Fahd University of Petroleum and Minerals (KFUPM), Dhahran 31261, Saudi Arabia; (O.S.A.); (E.R.)
- Electrical Engineering Department, King Fahd University of Petroleum and Minerals (KFUPM), Dhahran 31261, Saudi Arabia
| |
Collapse
|
4
|
Zhou Q, Liu J, Xin L, Hu Y, Qi Y. The Diagnostic Features of Peripheral Blood Biomarkers in Identifying Osteoarthritis Individuals: Machine Learning Strategies and Clinical Evidence. Curr Comput Aided Drug Des 2024; 20:928-942. [PMID: 37594094 DOI: 10.2174/1573409920666230818092427] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2023] [Revised: 07/04/2023] [Accepted: 07/14/2023] [Indexed: 08/19/2023]
Abstract
BACKGROUND People with osteoarthritis place a huge burden on society. Early diagnosis is essential to prevent disease progression and to select the best treatment strategy more effectively. In this study, the aim was to examine the diagnostic features and clinical value of peripheral blood biomarkers for osteoarthritis. OBJECTIVE The goal of this project was to investigate the diagnostic features of peripheral blood and immune cell infiltration in osteoarthritis (OA). METHODS Two eligible datasets (GSE63359 and GSE48556) were obtained from the GEO database to discern differentially expressed genes (DEGs). The machine learning strategy was employed to filtrate diagnostic biomarkers for OA. Additional verification was implemented by collecting clinical samples of OA. The CIBERSORT website estimated relative subsets of RNA transcripts to evaluate the immune-inflammatory states of OA. The link between specific DEGs and clinical immune-inflammatory markers was found by correlation analysis. RESULTS Overall, 67 robust DEGs were identified. The nuclear receptor subfamily 2 group C member 2 (NR2C2), transcription factor 4 (TCF4), stromal antigen 1 (STAG1), and interleukin 18 receptor accessory protein (IL18RAP) were identified as effective diagnostic markers of OA in peripheral blood. All four diagnostic markers showed significant increases in expression in OA. Analysis of immune cell infiltration revealed that macrophages are involved in the occurrence of OA. Candidate diagnostic markers were correlated with clinical immune-inflammatory indicators of OA patients. CONCLUSION We highlight that DEGs associated with immune inflammation (NR2C2, TCF4, STAG1, and IL18RAP) may be potential biomarkers for peripheral blood in OA, which are also associated with clinical immune-inflammatory indicators.
Collapse
Affiliation(s)
- Qiao Zhou
- Department of Rheumatism Immunity, The First Affiliated Hospital of Anhui University of Chinese Medicine, Hefei, Anhui, 230031, China
- Department of Geriatrics, The Second Affiliated Hospital of Anhui University of Chinese Medicine, Hefei, Anhui, 230061, China
- Institute of Rheumatism Prevention and Treatment of Traditional Chinese Medicine, Anhui Academy of Chinese Medicine Sciences, Hefei, Anhui, 230031, China
- The First Clinical School of Medicine, Anhui University of Chinese Medicine, Hefei, Anhui, 230012, China
| | - Jian Liu
- Department of Rheumatism Immunity, The First Affiliated Hospital of Anhui University of Chinese Medicine, Hefei, Anhui, 230031, China
- Institute of Rheumatism Prevention and Treatment of Traditional Chinese Medicine, Anhui Academy of Chinese Medicine Sciences, Hefei, Anhui, 230031, China
| | - Ling Xin
- Institute of Rheumatism Prevention and Treatment of Traditional Chinese Medicine, Anhui Academy of Chinese Medicine Sciences, Hefei, Anhui, 230031, China
| | - Yuedi Hu
- Institute of Rheumatism Prevention and Treatment of Traditional Chinese Medicine, Anhui Academy of Chinese Medicine Sciences, Hefei, Anhui, 230031, China
- The First Clinical School of Medicine, Anhui University of Chinese Medicine, Hefei, Anhui, 230012, China
| | - Yajun Qi
- Institute of Rheumatism Prevention and Treatment of Traditional Chinese Medicine, Anhui Academy of Chinese Medicine Sciences, Hefei, Anhui, 230031, China
- The First Clinical School of Medicine, Anhui University of Chinese Medicine, Hefei, Anhui, 230012, China
| |
Collapse
|
5
|
Sancha-Velasco A, Uceda-Heras A, García-Cabezas MÁ. Cortical type: a conceptual tool for meaningful biological interpretation of high-throughput gene expression data in the human cerebral cortex. Front Neuroanat 2023; 17:1187280. [PMID: 37426901 PMCID: PMC10323436 DOI: 10.3389/fnana.2023.1187280] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2023] [Accepted: 05/31/2023] [Indexed: 07/11/2023] Open
Abstract
The interpretation of massive high-throughput gene expression data requires computational and biological analyses to identify statistically and biologically significant differences, respectively. There are abundant sources that describe computational tools for statistical analysis of massive gene expression data but few address data analysis for biological significance. In the present article we exemplify the importance of selecting the proper biological context in the human brain for gene expression data analysis and interpretation. For this purpose, we use cortical type as conceptual tool to make predictions about gene expression in areas of the human temporal cortex. We predict that the expression of genes related to glutamatergic transmission would be higher in areas of simpler cortical type, the expression of genes related to GABAergic transmission would be higher in areas of more complex cortical type, and the expression of genes related to epigenetic regulation would be higher in areas of simpler cortical type. Then, we test these predictions with gene expression data from several regions of the human temporal cortex obtained from the Allen Human Brain Atlas. We find that the expression of several genes shows statistically significant differences in agreement with the predicted gradual expression along the laminar complexity gradient of the human cortex, suggesting that simpler cortical types may have greater glutamatergic excitability and epigenetic turnover compared to more complex types; on the other hand, complex cortical types seem to have greater GABAergic inhibitory control compared to simpler types. Our results show that cortical type is a good predictor of synaptic plasticity, epigenetic turnover, and selective vulnerability in human cortical areas. Thus, cortical type can provide a meaningful context for interpreting high-throughput gene expression data in the human cerebral cortex.
Collapse
Affiliation(s)
- Ariadna Sancha-Velasco
- Department of Anatomy, Histology and Neuroscience, School of Medicine, Autonomous University of Madrid, Madrid, Spain
- Master Program in Neuroscience, Autonomous University of Madrid, Madrid, Spain
| | - Alicia Uceda-Heras
- Master Program in Neuroscience, Autonomous University of Madrid, Madrid, Spain
- Ph.D. Program in Neuroscience UAM-Cajal, Autonomous University of Madrid, Madrid, Spain
| | - Miguel Ángel García-Cabezas
- Department of Anatomy, Histology and Neuroscience, School of Medicine, Autonomous University of Madrid, Madrid, Spain
- Master Program in Neuroscience, Autonomous University of Madrid, Madrid, Spain
- Ph.D. Program in Neuroscience UAM-Cajal, Autonomous University of Madrid, Madrid, Spain
- Neural Systems Laboratory, Department of Health Sciences, Boston University, Boston, MA, United States
| |
Collapse
|
6
|
Alharbi F, Vakanski A. Machine Learning Methods for Cancer Classification Using Gene Expression Data: A Review. Bioengineering (Basel) 2023; 10:bioengineering10020173. [PMID: 36829667 PMCID: PMC9952758 DOI: 10.3390/bioengineering10020173] [Citation(s) in RCA: 16] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2022] [Revised: 01/24/2023] [Accepted: 01/26/2023] [Indexed: 01/31/2023] Open
Abstract
Cancer is a term that denotes a group of diseases caused by the abnormal growth of cells that can spread in different parts of the body. According to the World Health Organization (WHO), cancer is the second major cause of death after cardiovascular diseases. Gene expression can play a fundamental role in the early detection of cancer, as it is indicative of the biochemical processes in tissue and cells, as well as the genetic characteristics of an organism. Deoxyribonucleic acid (DNA) microarrays and ribonucleic acid (RNA)-sequencing methods for gene expression data allow quantifying the expression levels of genes and produce valuable data for computational analysis. This study reviews recent progress in gene expression analysis for cancer classification using machine learning methods. Both conventional and deep learning-based approaches are reviewed, with an emphasis on the application of deep learning models due to their comparative advantages for identifying gene patterns that are distinctive for various types of cancers. Relevant works that employ the most commonly used deep neural network architectures are covered, including multi-layer perceptrons, as well as convolutional, recurrent, graph, and transformer networks. This survey also presents an overview of the data collection methods for gene expression analysis and lists important datasets that are commonly used for supervised machine learning for this task. Furthermore, we review pertinent techniques for feature engineering and data preprocessing that are typically used to handle the high dimensionality of gene expression data, caused by a large number of genes present in data samples. The paper concludes with a discussion of future research directions for machine learning-based gene expression analysis for cancer classification.
Collapse
|