1
|
Baldi S, He Y, Ivanov I, Khamgan H, Safi M, Alradhi M, Shopit A, Al-Danakh A, Al-Nusaif M, Gao Y, Tian H. Aberrantly hypermethylated ARID1B is a novel biomarker and potential therapeutic target of colon adenocarcinoma. Front Genet 2022; 13:914354. [PMID: 36313455 PMCID: PMC9614077 DOI: 10.3389/fgene.2022.914354] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2022] [Accepted: 07/21/2022] [Indexed: 11/18/2022] Open
Abstract
Background and Objective: Understanding the tumor microenvironment (TME) and immune cell infiltration (ICI) may help guide immunotherapy efforts for colon cancer (COAD). However, whether ARID1B is truly regulated by hypermethylation or linked to immune infiltration remains unknown. The current work focused on the ARID1B gene expression and methylation in COAD, as well as its relation with ICI. Methods and Results: Multiple tools based on TCGA were used to analyze the differences in the expression of the ARID1B gene, DNA methylation, and its association with various clinicopathological features, somatic mutations, copy number variation, and the prognosis of patients with COAD. According to the analysis results, patients with high mRNA, low methylation levels showed better overall survival than patients with low mRNA, high methylation levels. The correlation analysis of immune cell infiltration and immune checkpoint gene expression showed that the infiltration rates of the main ICI subtypes, cancer-associated fibroblast, and myeloid cells were significantly enriched and correlated with ARID1B in COAD. An association between ARID1B expression and immune infiltration in COAD was found by correlating ICI indicators with ARID1B expression in the immune cell composition of the COAD microenvironment. Notably, M2 chemokines were related to ARID1B expression, while M1 chemokines were not. Conclusion: This study provided evidence that ARID1B may have a role in the pathogenesis of COAD. The specific underlying mechanisms that could be responsible for ARID1B’s downregulation in COAD will need to be investigated in the future.
Collapse
Affiliation(s)
- Salem Baldi
- Research Center of Molecular Diagnostics and Sequencing, Axbio Biotechnology (Shenzhen) Co.,Ltd, Shenzhen, Guangdong, China
- *Correspondence: Salem Baldi, ; Yaping Gao, ; Hui Tian,
| | - Yun He
- Research Center of Molecular Diagnostics and Sequencing, Axbio Biotechnology (Shenzhen) Co.,Ltd, Shenzhen, Guangdong, China
| | - Igor Ivanov
- Research Center of Molecular Diagnostics and Sequencing, Axbio Biotechnology (Shenzhen) Co.,Ltd, Shenzhen, Guangdong, China
| | - Hassan Khamgan
- Department of Molecular Diagnostics and Therapeutics, Genetic Engineering and Biotechnology Research Institute, University of Sadat City, Sadat, Egypt
| | - Mohammed Safi
- Department of respiratory, Shandong Second Provincial General Hospital, Shandong University, Jinan, China
| | - Mohammed Alradhi
- Department of Urology, The Affiliated Hospital of Qingdao Binhai University, Qingdao, China
| | - Abdullah Shopit
- Academic Integrated Medicine and Collage of Pharmacy, School of Pharmacy, Department of Pharmacology, Dalian Medical University, Dalian, China
| | - Abdullah Al-Danakh
- Department of Urology, First Affiliated Hospital of Dalian Medical University, Dalian, China
| | - Murad Al-Nusaif
- Center for Clinical Research on Neurological Diseases, First Affiliated Hospital, Dalian Medical University, Dalian, China
| | - Yaping Gao
- Research Center of Molecular Diagnostics and Sequencing, Research Institute of Tsinghua University in Shenzhen, Shenzhen, Guangdong, China
- *Correspondence: Salem Baldi, ; Yaping Gao, ; Hui Tian,
| | - Hui Tian
- Research Center of Molecular Diagnostics and Sequencing, Axbio Biotechnology (Shenzhen) Co.,Ltd, Shenzhen, Guangdong, China
- *Correspondence: Salem Baldi, ; Yaping Gao, ; Hui Tian,
| |
Collapse
|
2
|
Savino A, Provero P, Poli V. Differential Co-Expression Analyses Allow the Identification of Critical Signalling Pathways Altered during Tumour Transformation and Progression. Int J Mol Sci 2020; 21:E9461. [PMID: 33322692 PMCID: PMC7764314 DOI: 10.3390/ijms21249461] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2020] [Revised: 12/02/2020] [Accepted: 12/09/2020] [Indexed: 02/02/2023] Open
Abstract
Biological systems respond to perturbations through the rewiring of molecular interactions, organised in gene regulatory networks (GRNs). Among these, the increasingly high availability of transcriptomic data makes gene co-expression networks the most exploited ones. Differential co-expression networks are useful tools to identify changes in response to an external perturbation, such as mutations predisposing to cancer development, and leading to changes in the activity of gene expression regulators or signalling. They can help explain the robustness of cancer cells to perturbations and identify promising candidates for targeted therapy, moreover providing higher specificity with respect to standard co-expression methods. Here, we comprehensively review the literature about the methods developed to assess differential co-expression and their applications to cancer biology. Via the comparison of normal and diseased conditions and of different tumour stages, studies based on these methods led to the definition of pathways involved in gene network reorganisation upon oncogenes' mutations and tumour progression, often converging on immune system signalling. A relevant implementation still lagging behind is the integration of different data types, which would greatly improve network interpretability. Most importantly, performance and predictivity evaluation of the large variety of mathematical models proposed would urgently require experimental validations and systematic comparisons. We believe that future work on differential gene co-expression networks, complemented with additional omics data and experimentally tested, will considerably improve our insights into the biology of tumours.
Collapse
Affiliation(s)
- Aurora Savino
- Molecular Biotechnology Center, Department of Molecular Biotechnology and Health Sciences, University of Turin, Via Nizza 52, 10126 Turin, Italy
| | - Paolo Provero
- Department of Neurosciences “Rita Levi Montalcini”, University of Turin, Corso Massimo D’Ázeglio 52, 10126 Turin, Italy;
- Center for Omics Sciences, Ospedale San Raffaele IRCCS, Via Olgettina 60, 20132 Milan, Italy
| | - Valeria Poli
- Molecular Biotechnology Center, Department of Molecular Biotechnology and Health Sciences, University of Turin, Via Nizza 52, 10126 Turin, Italy
| |
Collapse
|
3
|
Mishra B, Kumar N, Mukhtar MS. Systems Biology and Machine Learning in Plant-Pathogen Interactions. MOLECULAR PLANT-MICROBE INTERACTIONS : MPMI 2019; 32:45-55. [PMID: 30418085 DOI: 10.1094/mpmi-08-18-0221-fi] [Citation(s) in RCA: 47] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/02/2023]
Abstract
Systems biology is an inclusive approach to study the static and dynamic emergent properties on a global scale by integrating multiomics datasets to establish qualitative and quantitative associations among multiple biological components. With an abundance of improved high throughput -omics datasets, network-based analyses and machine learning technologies are playing a pivotal role in comprehensive understanding of biological systems. Network topological features reveal most important nodes within a network as well as prioritize significant molecular components for diverse biological networks, including coexpression, protein-protein interaction, and gene regulatory networks. Machine learning techniques provide enormous predictive power through specific feature extraction from biological data. Deep learning, a subtype of machine learning, has plausible future applications because a domain expert for feature extraction is not needed in this algorithm. Inspired by diverse domains of biology, we here review classic systems biology techniques applied in plant immunity thus far. We also discuss additional advanced approaches in both graph theory and machine learning, which may provide new insights for understanding plant-microbe interactions. Finally, we propose a hybrid approach in plant immune systems that harnesses the power of both network biology and machine learning, with a potential to be applicable to both model systems and agronomically important crop plants.
Collapse
Affiliation(s)
| | | | - M Shahid Mukhtar
- 1 Department of Biology, and
- 2 Nutrition Obesity Research Center, University of Alabama at Birmingham, 1300 University Blvd., Birmingham 35294, U.S.A
| |
Collapse
|
4
|
Cui X, Li X, Li J, Wang X, Sun W, Cheng Z, Ding J, Wang H. inFRank: a ranking-based identification of influential genes in biological networks. Oncotarget 2018; 8:43810-43821. [PMID: 27623074 PMCID: PMC5546442 DOI: 10.18632/oncotarget.11878] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2016] [Accepted: 07/27/2016] [Indexed: 11/25/2022] Open
Abstract
Capturing the predominant driver genes is critical in the analysis of high-throughput experimental data; however, existing methods scarcely include the unique characters of biological networks. Herein we introduced a ranking-based computational framework (inFRank) to rank the proteins by their influence. Using inFRank, we identified the top 20 influential genes in hepatocellular carcinoma (HCC). Network analysis revealed a prominent community composed of 7 influential genes. Intriguingly, five genes among the community were critical for mitotic spindle assembly checkpoint (SAC), suggesting that dysregulation of SAC could be a distinct feature of HCC and targeting SAC-associated genes might be a promising therapeutic strategy. Cox regression analysis revealed that CDC20 exerted as an independent risk factor for patient survival, indicating that CDC20 could be a novel biomarker for HCC prognosis. inFRank was then used for pan-cancer study, and all of the most influential genes in 18 cancers were achieved. We identified altogether 19 genes that were important in multiple cancers, and observed that cancers originating from the same organ or function-related organs tended to share more influential genes. Collectively, our results demonstrated that the inFRank was a powerful approach for deep interpretation of high-throughput data and better understanding of complex diseases.
Collapse
Affiliation(s)
- Xiuliang Cui
- The International Cooperation Laboratory on Signal Transduction, Shanghai, 200433, China.,National Center for Liver Cancer, Eastern Hepatobiliary Surgery Institute, Shanghai, 200433, China
| | - Xiaofeng Li
- National Center for Liver Cancer, Eastern Hepatobiliary Surgery Institute, Shanghai, 200433, China
| | - Jing Li
- Department of Surgery, The Second Military Medical University, Shanghai, 200433, China
| | - Xue Wang
- The International Cooperation Laboratory on Signal Transduction, Shanghai, 200433, China.,National Center for Liver Cancer, Eastern Hepatobiliary Surgery Institute, Shanghai, 200433, China
| | - Wen Sun
- The International Cooperation Laboratory on Signal Transduction, Shanghai, 200433, China.,National Center for Liver Cancer, Eastern Hepatobiliary Surgery Institute, Shanghai, 200433, China
| | - Zhuo Cheng
- The International Cooperation Laboratory on Signal Transduction, Shanghai, 200433, China.,National Center for Liver Cancer, Eastern Hepatobiliary Surgery Institute, Shanghai, 200433, China
| | - Jin Ding
- The International Cooperation Laboratory on Signal Transduction, Shanghai, 200433, China.,National Center for Liver Cancer, Eastern Hepatobiliary Surgery Institute, Shanghai, 200433, China
| | - Hongyang Wang
- The International Cooperation Laboratory on Signal Transduction, Shanghai, 200433, China.,National Center for Liver Cancer, Eastern Hepatobiliary Surgery Institute, Shanghai, 200433, China
| |
Collapse
|
5
|
Huang X, Liu H, Li X, Guan L, Li J, Tellier LCAM, Yang H, Wang J, Zhang J. Revealing Alzheimer's disease genes spectrum in the whole-genome by machine learning. BMC Neurol 2018; 18:5. [PMID: 29320986 PMCID: PMC5763548 DOI: 10.1186/s12883-017-1010-3] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2017] [Accepted: 12/21/2017] [Indexed: 11/23/2022] Open
Abstract
Background Alzheimer’s disease (AD) is an important, progressive neurodegenerative disease, with a complex genetic architecture. A key goal of biomedical research is to seek out disease risk genes, and to elucidate the function of these risk genes in the development of disease. For this purpose, expanding the AD-associated gene set is necessary. In past research, the prediction methods for AD related genes has been limited in their exploration of the target genome regions. We here present a genome-wide method for AD candidate genes predictions. Methods We present a machine learning approach (SVM), based upon integrating gene expression data with human brain-specific gene network data, to discover the full spectrum of AD genes across the whole genome. Results We classified AD candidate genes with an accuracy and the area under the receiver operating characteristic (ROC) curve of 84.56% and 94%. Our approach provides a supplement for the spectrum of AD-associated genes extracted from more than 20,000 genes in a genome wide scale. Conclusions In this study, we have elucidated the whole-genome spectrum of AD, using a machine learning approach. Through this method, we expect for the candidate gene catalogue to provide a more comprehensive annotation of AD for researchers. Electronic supplementary material The online version of this article (10.1186/s12883-017-1010-3) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Xiaoyan Huang
- BGI Education Center, University of Chinese Academy of Sciences, Shenzhen, 518083, China.,BGI-Shenzhen, Shenzhen, 518083, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, 518120, China
| | - Hankui Liu
- BGI-Shenzhen, Shenzhen, 518083, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, 518120, China
| | - Xinming Li
- College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, China
| | - Liping Guan
- BGI-Shenzhen, Shenzhen, 518083, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, 518120, China
| | - Jiankang Li
- BGI-Shenzhen, Shenzhen, 518083, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, 518120, China
| | - Laurent Christian Asker M Tellier
- BGI-Shenzhen, Shenzhen, 518083, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, 518120, China.,Department of Biology, Bioinformatics, University of Copenhagen, Copenhagen, Denmark
| | - Huanming Yang
- BGI-Shenzhen, Shenzhen, 518083, China.,James D. Watson Institute of Genome Sciences, Hangzhou, 310058, China
| | - Jian Wang
- BGI-Shenzhen, Shenzhen, 518083, China.,James D. Watson Institute of Genome Sciences, Hangzhou, 310058, China
| | - Jianguo Zhang
- BGI-Shenzhen, Shenzhen, 518083, China. .,China National GeneBank, BGI-Shenzhen, Shenzhen, 518120, China. .,Shenzhen Key Lab of Neurogenomics, BGI-Shenzhen, Shenzhen, 518120, China.
| |
Collapse
|
6
|
Caldera M, Buphamalai P, Müller F, Menche J. Interactome-based approaches to human disease. ACTA ACUST UNITED AC 2017. [DOI: 10.1016/j.coisb.2017.04.015] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
|
7
|
Sagvekar P, Mangoli V, Desai S, Patil A, Mukherjee S. LINE1 CpG-DNA Hypomethylation in Granulosa Cells and Blood Leukocytes Is Associated With PCOS and Related Traits. J Clin Endocrinol Metab 2017; 102:1396-1405. [PMID: 28324041 DOI: 10.1210/jc.2016-2645] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/11/2016] [Accepted: 01/03/2017] [Indexed: 02/07/2023]
Abstract
CONTEXT Altered global DNA methylation is indicative of epigenomic instability concerning chronic diseases. Investigating its incidence and association with polycystic ovary syndrome (PCOS) is essential to understand the etiopathogenesis of this disorder. OBJECTIVES We assessed global DNA methylation differences in peripheral blood leukocytes (PBLs) and cumulus granulosa cells (CGCs) of controls and women with PCOS; and their association with PCOS and its traits. DESIGN, SETTING, PARTICIPANTS, MAIN OUTCOME MEASURE This study included a total of 102 controls and women with PCOS. Forty-one women undergoing controlled ovarian hyperstimulation (COH) and 61 women not undergoing COH were recruited from in vitro fertilization (IVF) and infertility clinics. DNA methylation was measured by ELISA for 5'-methyl-cytosine content and bisulfite sequencing of 5'-untranslated region (5'-UTR) of long interspersed nucleotide element-1 (LINE1/L1). RESULTS Total 5'-methyl-cytosine and L1 methylation levels in PBLs and CGCs were similar between controls and women with PCOS. Methylation assessed at CpG sites of L1 5'-UTR revealed a single CpG-site (CpG-4) to be consistently hypomethylated in PBLs of both PCOS groups and CGCs of stimulated PCOS group. In unstimulated women, hypomethylation at CpG-4 was strongly associated with PCOS susceptibility, whereas in stimulated group it showed strong associations with PCOS and its hormonal traits. Furthermore, CGCs demonstrated consistent global and CpG-DNA hypomethylation relative to PBLs, irrespective of normal or disease states. CONCLUSION Our study revealed strong association of single hypomethylated CpG-site with PCOS. Identification and characterization of more such methyl-CpG signatures in repetitive elements in larger study populations would provide valuable epigenetic insights into PCOS.
Collapse
Affiliation(s)
| | - Vijay Mangoli
- Fertility Clinic and IVF Center, Gamdevi, Mumbai 400007, Maharashtra, India
| | - Sadhana Desai
- Fertility Clinic and IVF Center, Gamdevi, Mumbai 400007, Maharashtra, India
| | - Anushree Patil
- Department of Infertility and Endocrinology, National Institute for Research in Reproductive Health, Parel, Mumbai 400012, India
| | | |
Collapse
|
8
|
Chen B, Li M, Wang J, Shang X, Wu FX. A fast and high performance multiple data integration algorithm for identifying human disease genes. BMC Med Genomics 2015; 8 Suppl 3:S2. [PMID: 26399620 PMCID: PMC4582601 DOI: 10.1186/1755-8794-8-s3-s2] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
BACKGROUND Integrating multiple data sources is indispensable in improving disease gene identification. It is not only due to the fact that disease genes associated with similar genetic diseases tend to lie close with each other in various biological networks, but also due to the fact that gene-disease associations are complex. Although various algorithms have been proposed to identify disease genes, their prediction performances and the computational time still should be further improved. RESULTS In this study, we propose a fast and high performance multiple data integration algorithm for identifying human disease genes. A posterior probability of each candidate gene associated with individual diseases is calculated by using a Bayesian analysis method and a binary logistic regression model. Two prior probability estimation strategies and two feature vector construction methods are developed to test the performance of the proposed algorithm. CONCLUSIONS The proposed algorithm is not only generated predictions with high AUC scores, but also runs very fast. When only a single PPI network is employed, the AUC score is 0.769 by using F2 as feature vectors. The average running time for each leave-one-out experiment is only around 1.5 seconds. When three biological networks are integrated, the AUC score using F3 as feature vectors increases to 0.830, and the average running time for each leave-one-out experiment takes only about 12.54 seconds. It is better than many existing algorithms.
Collapse
Affiliation(s)
- Bolin Chen
- School of Computer Science, Northwestern Polytechnical University, 127 Youyi West Road, 710072, Xi'an, P.R. China
| | - Min Li
- School of Information Science and Engineering, Central South University, 410083, Changsha, P.R.China
| | - Jianxin Wang
- School of Information Science and Engineering, Central South University, 410083, Changsha, P.R.China
| | - Xuequn Shang
- School of Computer Science, Northwestern Polytechnical University, 127 Youyi West Road, 710072, Xi'an, P.R. China
| | - Fang-Xiang Wu
- Division of Biomedical Engineering, University of Saskatchewan, 57 Campus Dr., S7N 5A9, Saskatoon, Canada
- Department of Mechanical Engineering, University of Saskatchewan, 57 Campus Dr., S7N 5A9, Saskatoon, Canada
| |
Collapse
|
9
|
Yeger-Lotem E, Sharan R. Human protein interaction networks across tissues and diseases. Front Genet 2015; 6:257. [PMID: 26347769 PMCID: PMC4541328 DOI: 10.3389/fgene.2015.00257] [Citation(s) in RCA: 55] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2015] [Accepted: 07/17/2015] [Indexed: 11/13/2022] Open
Abstract
Protein interaction networks are an important framework for studying protein function, cellular processes, and genotype-to-phenotype relationships. While our view of the human interaction network is constantly expanding, less is known about networks that form in biologically important contexts such as within distinct tissues or in disease conditions. Here we review efforts to characterize these networks and to harness them to gain insights into the molecular mechanisms underlying human disease.
Collapse
Affiliation(s)
- Esti Yeger-Lotem
- Department of Clinical Biochemistry and Pharmacology, Ben-Gurion University of the Negev Beer-Sheva, Israel
| | - Roded Sharan
- Blavatnik School of Computer Science, Tel Aviv University Tel Aviv, Israel
| |
Collapse
|
10
|
ProSim: A Method for Prioritizing Disease Genes Based on Protein Proximity and Disease Similarity. BIOMED RESEARCH INTERNATIONAL 2015; 2015:213750. [PMID: 26339594 PMCID: PMC4538409 DOI: 10.1155/2015/213750] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/15/2014] [Accepted: 01/16/2015] [Indexed: 01/19/2023]
Abstract
Predicting disease genes for a particular genetic disease is very challenging in bioinformatics. Based on current research studies, this challenge can be tackled via network-based approaches. Furthermore, it has been highlighted that it is necessary to consider disease similarity along with the protein's proximity to disease genes in a protein-protein interaction (PPI) network in order to improve the accuracy of disease gene prioritization. In this study we propose a new algorithm called proximity disease similarity algorithm (ProSim), which takes both of the aforementioned properties into consideration, to prioritize disease genes. To illustrate the proposed algorithm, we have conducted six case studies, namely, prostate cancer, Alzheimer's disease, diabetes mellitus type 2, breast cancer, colorectal cancer, and lung cancer. We employed leave-one-out cross validation, mean enrichment, tenfold cross validation, and ROC curves to evaluate our proposed method and other existing methods. The results show that our proposed method outperforms existing methods such as PRINCE, RWR, and DADA.
Collapse
|
11
|
Ganegoda GU, Li M, Wang W, Feng Q. Heterogeneous Network Model to Infer Human Disease-Long Intergenic Non-Coding RNA Associations. IEEE Trans Nanobioscience 2015; 14:175-83. [DOI: 10.1109/tnb.2015.2391133] [Citation(s) in RCA: 45] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|