101
|
Liu J, Su B. Integrated analysis supports ATXN1 as a schizophrenia risk gene. Schizophr Res 2018; 195:298-305. [PMID: 29055568 DOI: 10.1016/j.schres.2017.10.010] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/16/2017] [Revised: 07/27/2017] [Accepted: 10/08/2017] [Indexed: 12/13/2022]
Abstract
Protein-protein interaction (PPI) is informative in identifying hidden disease risk genes that tend to interact with known risk genes usually working together in the same disease module. With the use of an integrated approach combining PPI information with pathway and expression analysis as well as genome-wide association study (GWAS), we intended to find new risk genes for schizophrenia (SCZ). We showed that ATXN1 was the only direct PPI partner of the know SCZ risk gene ZNF804A, and it also had direct PPIs with other 18 known SCZ risk genes. ATXN1 serves as one of the hub genes in the PPI network containing many known SCZ risk genes, and this network is significantly enriched for the MAPK signaling pathway. Further gene expression analysis indicated that ATXN1 is highly expressed in prefrontal cortex, and SCZ patients had significantly decreased expression compared with healthy controls. Finally, the published GWAS data supports an association of ATXN1 with SCZ as well as other psychiatric disorders though not reaching genome-wide significance. These convergent evidences support ATXN1 as a promising risk gene for SCZ, and the integrated approach serves as a useful tool for dissecting the genetic basis of psychiatric disorders.
Collapse
Affiliation(s)
- Jiewei Liu
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, Yunnan, China; Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming 650204, Yunnan, China
| | - Bing Su
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, Yunnan, China.
| |
Collapse
|
102
|
Abstract
Genome-wide association studies (GWAS) have identified more than 100 loci that show robust association with schizophrenia risk. However, due to the complexity of linkage disequilibrium and gene regulatory, it is challenging to pinpoint the causal genes at the risk loci and translate the genetic findings from GWAS into disease mechanism and clinical treatment. Here we systematically predicted the plausible candidate causal genes for schizophrenia at genome-wide level. We utilized different approaches and strategies to predict causal genes for schizophrenia, including Sherlock, SMR, DAPPLE, Prix Fixe, NetWAS, and DEPICT. By integrating the results from different prediction approaches, we identified six top candidates that represent promising causal genes for schizophrenia, including CNTN4, GATAD2A, GPM6A, MMP16, PSMA4, and TCF4. Besides, we also identified 35 additional high-confidence causal genes for schizophrenia. The identified causal genes showed distinct spatio-temporal expression patterns in developing and adult human brain. Cell-type-specific expression analysis indicated that the expression level of the predicted causal genes was significantly higher in neurons compared with oligodendrocytes and microglia (P < 0.05). We found that synaptic transmission-related genes were significantly enriched among the identified causal genes (P < 0.05), providing further support for the dysregulation of synaptic transmission in schizophrenia. Finally, we showed that the top six causal genes are dysregulated in schizophrenia cases compared with controls and knockdown of these genes impaired the proliferation of neuronal cells. Our study depicts the landscape of plausible schizophrenia causal genes for the first time. Further genetic and functional validation of these genes will provide mechanistic insights into schizophrenia pathogenesis and may facilitate to provide potential targets for future therapeutics and diagnostics.
Collapse
Affiliation(s)
- Changguo Ma
- 0000000119573309grid.9227.eKey Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences and Yunnan Province, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650223 China
| | - Chunjie Gu
- 0000000119573309grid.9227.eKey Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences and Yunnan Province, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650223 China
| | - Yongxia Huo
- 0000000119573309grid.9227.eKey Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences and Yunnan Province, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650223 China
| | - Xiaoyan Li
- 0000000119573309grid.9227.eKey Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences and Yunnan Province, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650223 China
| | - Xiong-Jian Luo
- Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences and Yunnan Province, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, 650223, China. .,Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, Yunnan, 650223, China.
| |
Collapse
|
103
|
Abstract
Computational identification of special protein molecules is a key issue in understanding protein function. It can guide molecular experiments and help to save costs. I assessed 18 papers published in the special issue of Int. J. Mol. Sci., and also discussed the related works. The computational methods employed in this special issue focused on machine learning, network analysis, and molecular docking. New methods and new topics were also proposed. There were in addition several wet experiments, with proven results showing promise. I hope our special issue will help in protein molecules identification researches.
Collapse
|
104
|
Zou Q, He W. Special Protein Molecules Computational Identification. Int J Mol Sci 2018; 19:ijms19020536. [PMID: 29439426 PMCID: PMC5855758 DOI: 10.3390/ijms19020536] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2018] [Revised: 02/02/2018] [Accepted: 02/10/2018] [Indexed: 01/29/2023] Open
Abstract
Computational identification of special protein molecules is a key issue in understanding protein function. It can guide molecular experiments and help to save costs. I assessed 18 papers published in the special issue of Int. J. Mol. Sci., and also discussed the related works. The computational methods employed in this special issue focused on machine learning, network analysis, and molecular docking. New methods and new topics were also proposed. There were in addition several wet experiments, with proven results showing promise. I hope our special issue will help in protein molecules identification researches.
Collapse
Affiliation(s)
- Quan Zou
- School of Computer Science and Technology, Tianjin University, Tianjin 300354, China.
| | - Wenying He
- School of Computer Science and Technology, Tianjin University, Tianjin 300354, China.
| |
Collapse
|
105
|
Zhou J, Fu BQ. The research on gene-disease association based on text-mining of PubMed. BMC Bioinformatics 2018; 19:37. [PMID: 29415654 PMCID: PMC5804013 DOI: 10.1186/s12859-018-2048-y] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2017] [Accepted: 01/29/2018] [Indexed: 11/23/2022] Open
Abstract
Background The associations between genes and diseases are of critical significance in aspects of prevention, diagnosis and treatment. Although gene-disease relationships have been investigated extensively, much of the underpinnings of these associations are yet to be elucidated. Methods A novel method integrates MeSH database, term weight (TW), and co-occurrence methods to predict gene-disease associations based on the cosine similarity between gene vectors and disease vectors. Vectors are transformed from the texts of documents in the PubMed database according to the appearance and location of the gene or disease terms. The disease related text data has been optimized during the process of constructing vectors. Results The overall distribution of cosine similarity value was investigated. By using the gene-disease association data in OMIM database as golden standard, the performance of cosine similarity in predicting gene-disease linkage was evaluated. The effects of applying weight matrix, penalty weights for keywords (PWK), and normalization were also investigated. Finally, we demonstrated that our method outperforms heterogeneous network edge prediction (HNEP) in aspects of precision rate and recall rate. Conclusions Our method proposed in this paper is easy to be conducted and the results can be integrated with other models to improve the overall performance of gene-disease association predictions.
Collapse
Affiliation(s)
- Jie Zhou
- Guangdong Key Laboratory of Computer Network, School of Computer Science and Engineering, South China University of Technology, Guangzhou, 510006, China.
| | - Bo-Quan Fu
- Guangdong Key Laboratory of Computer Network, School of Computer Science and Engineering, South China University of Technology, Guangzhou, 510006, China
| |
Collapse
|
106
|
Kim D, Shivakumar M, Han S, Sinclair MS, Lee YJ, Zheng Y, Olopade OI, Kim D, Lee Y. Population-dependent Intron Retention and DNA Methylation in Breast Cancer. Mol Cancer Res 2018; 16:461-469. [PMID: 29330282 DOI: 10.1158/1541-7786.mcr-17-0227] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2017] [Revised: 09/15/2017] [Accepted: 11/27/2017] [Indexed: 12/31/2022]
Abstract
Regulation of gene expression by DNA methylation in gene promoter regions is well studied; however, the effects of methylation in the gene body (exons and introns) on gene expression are comparatively understudied. Recently, hypermethylation has been implicated in the inclusion of alternatively spliced exons; moreover, exon recognition can be enhanced by recruiting the methyl-CpG-binding protein (MeCP2) to hypermethylated sites. This study examines whether the methylation status of an intron is correlated with how frequently the intron is retained during splicing using DNA methylation and RNA sequencing data from breast cancer tissue specimens in The Cancer Genome Atlas. Interestingly, hypomethylation of introns is correlated with higher levels of intron expression in mRNA and the methylation level of an intron is inversely correlated with its retention in mRNA from the gene in which it is located. Furthermore, significant population differences were observed in the methylation level of retained introns. In African-American donors, retained introns were not only less methylated compared to European-American donors, but also were more highly expressed. This underscores the need for understanding epigenetic differences in populations and their correlation with breast cancer is an important step toward achieving personalized cancer care.Implications: This research contributes to the understanding of how epigenetic markers in the gene body communicate with the transcriptional machinery to control transcript diversity and differential biological response to changes in methylation status could underlie some of the known, yet unexplained, disparities in certain breast cancer patient populations. Mol Cancer Res; 16(3); 461-9. ©2018 AACR.
Collapse
Affiliation(s)
- Dongwook Kim
- Department of Biomedical Informatics, University of Utah School of Medicine, Salt Lake City, Utah
| | - Manu Shivakumar
- Department of Biomedical and Translational Informatics, Geisinger Health System, Danville, Pennsylvania
| | - Seonggyun Han
- Department of Biomedical Informatics, University of Utah School of Medicine, Salt Lake City, Utah
| | - Michael S Sinclair
- Department of Biomedical Informatics, University of Utah School of Medicine, Salt Lake City, Utah
| | - Young-Ji Lee
- Department of Health and Community Systems, University of Pittsburgh School of Nursing, Pittsburgh, Pennsylvania
| | - Yonglan Zheng
- Department of Medicine, University of Chicago, Chicago, Illinois
| | | | - Dokyoon Kim
- Department of Biomedical and Translational Informatics, Geisinger Health System, Danville, Pennsylvania.
- The Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, Pennsylvania
| | - Younghee Lee
- Department of Biomedical Informatics, University of Utah School of Medicine, Salt Lake City, Utah.
- Huntsman Cancer Institute, Salt Lake City, Utah
| |
Collapse
|
107
|
Vlaic S, Conrad T, Tokarski-Schnelle C, Gustafsson M, Dahmen U, Guthke R, Schuster S. ModuleDiscoverer: Identification of regulatory modules in protein-protein interaction networks. Sci Rep 2018; 8:433. [PMID: 29323246 PMCID: PMC5764996 DOI: 10.1038/s41598-017-18370-2] [Citation(s) in RCA: 40] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2017] [Accepted: 12/06/2017] [Indexed: 02/08/2023] Open
Abstract
The identification of disease-associated modules based on protein-protein interaction networks (PPINs) and gene expression data has provided new insights into the mechanistic nature of diverse diseases. However, their identification is hampered by the detection of protein communities within large-scale, whole-genome PPINs. A presented successful strategy detects a PPIN's community structure based on the maximal clique enumeration problem (MCE), which is a non-deterministic polynomial time-hard problem. This renders the approach computationally challenging for large PPINs implying the need for new strategies. We present ModuleDiscoverer, a novel approach for the identification of regulatory modules from PPINs and gene expression data. Following the MCE-based approach, ModuleDiscoverer uses a randomization heuristic-based approximation of the community structure. Given a PPIN of Rattus norvegicus and public gene expression data, we identify the regulatory module underlying a rodent model of non-alcoholic steatohepatitis (NASH), a severe form of non-alcoholic fatty liver disease (NAFLD). The module is validated using single-nucleotide polymorphism (SNP) data from independent genome-wide association studies and gene enrichment tests. Based on gene enrichment tests, we find that ModuleDiscoverer performs comparably to three existing module-detecting algorithms. However, only our NASH-module is significantly enriched with genes linked to NAFLD-associated SNPs. ModuleDiscoverer is available at http://www.hki-jena.de/index.php/0/2/490 (Others/ModuleDiscoverer).
Collapse
Affiliation(s)
- Sebastian Vlaic
- Leibniz Institute for Natural Product Research and Infection Biology - Hans-Knöll-Institute, Systems Biology and Bioinformatics, Jena, 07745, Germany.
- Friedrich-Schiller-University, Department of Bioinformatics, Jena, 07743, Germany.
| | - Theresia Conrad
- Leibniz Institute for Natural Product Research and Infection Biology - Hans-Knöll-Institute, Systems Biology and Bioinformatics, Jena, 07745, Germany
| | - Christian Tokarski-Schnelle
- Leibniz Institute for Natural Product Research and Infection Biology - Hans-Knöll-Institute, Systems Biology and Bioinformatics, Jena, 07745, Germany
- University Hospital Jena, Friedrich-Schiller-University, General, Visceral and Vascular Surgery, Jena, 07749, Germany
| | - Mika Gustafsson
- Linköping University, Bioinformatics, Department of Physics, Chemistry and Biology, Linköping, 581 83, Sweden
| | - Uta Dahmen
- University Hospital Jena, Friedrich-Schiller-University, General, Visceral and Vascular Surgery, Jena, 07749, Germany
| | - Reinhard Guthke
- Leibniz Institute for Natural Product Research and Infection Biology - Hans-Knöll-Institute, Systems Biology and Bioinformatics, Jena, 07745, Germany
| | - Stefan Schuster
- Friedrich-Schiller-University, Department of Bioinformatics, Jena, 07743, Germany
| |
Collapse
|
108
|
Huang X, Liu H, Li X, Guan L, Li J, Tellier LCAM, Yang H, Wang J, Zhang J. Revealing Alzheimer's disease genes spectrum in the whole-genome by machine learning. BMC Neurol 2018; 18:5. [PMID: 29320986 PMCID: PMC5763548 DOI: 10.1186/s12883-017-1010-3] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2017] [Accepted: 12/21/2017] [Indexed: 11/23/2022] Open
Abstract
Background Alzheimer’s disease (AD) is an important, progressive neurodegenerative disease, with a complex genetic architecture. A key goal of biomedical research is to seek out disease risk genes, and to elucidate the function of these risk genes in the development of disease. For this purpose, expanding the AD-associated gene set is necessary. In past research, the prediction methods for AD related genes has been limited in their exploration of the target genome regions. We here present a genome-wide method for AD candidate genes predictions. Methods We present a machine learning approach (SVM), based upon integrating gene expression data with human brain-specific gene network data, to discover the full spectrum of AD genes across the whole genome. Results We classified AD candidate genes with an accuracy and the area under the receiver operating characteristic (ROC) curve of 84.56% and 94%. Our approach provides a supplement for the spectrum of AD-associated genes extracted from more than 20,000 genes in a genome wide scale. Conclusions In this study, we have elucidated the whole-genome spectrum of AD, using a machine learning approach. Through this method, we expect for the candidate gene catalogue to provide a more comprehensive annotation of AD for researchers. Electronic supplementary material The online version of this article (10.1186/s12883-017-1010-3) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Xiaoyan Huang
- BGI Education Center, University of Chinese Academy of Sciences, Shenzhen, 518083, China.,BGI-Shenzhen, Shenzhen, 518083, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, 518120, China
| | - Hankui Liu
- BGI-Shenzhen, Shenzhen, 518083, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, 518120, China
| | - Xinming Li
- College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, China
| | - Liping Guan
- BGI-Shenzhen, Shenzhen, 518083, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, 518120, China
| | - Jiankang Li
- BGI-Shenzhen, Shenzhen, 518083, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, 518120, China
| | - Laurent Christian Asker M Tellier
- BGI-Shenzhen, Shenzhen, 518083, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, 518120, China.,Department of Biology, Bioinformatics, University of Copenhagen, Copenhagen, Denmark
| | - Huanming Yang
- BGI-Shenzhen, Shenzhen, 518083, China.,James D. Watson Institute of Genome Sciences, Hangzhou, 310058, China
| | - Jian Wang
- BGI-Shenzhen, Shenzhen, 518083, China.,James D. Watson Institute of Genome Sciences, Hangzhou, 310058, China
| | - Jianguo Zhang
- BGI-Shenzhen, Shenzhen, 518083, China. .,China National GeneBank, BGI-Shenzhen, Shenzhen, 518120, China. .,Shenzhen Key Lab of Neurogenomics, BGI-Shenzhen, Shenzhen, 518120, China.
| |
Collapse
|
109
|
|
110
|
Lin JR, Zhang Q, Cai Y, Morrow BE, Zhang ZD. Integrated rare variant-based risk gene prioritization in disease case-control sequencing studies. PLoS Genet 2017; 13:e1007142. [PMID: 29281626 PMCID: PMC5760082 DOI: 10.1371/journal.pgen.1007142] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2017] [Revised: 01/09/2018] [Accepted: 12/01/2017] [Indexed: 12/17/2022] Open
Abstract
Rare variants of major effect play an important role in human complex diseases and can be discovered by sequencing-based genome-wide association studies. Here, we introduce an integrated approach that combines the rare variant association test with gene network and phenotype information to identify risk genes implicated by rare variants for human complex diseases. Our data integration method follows a 'discovery-driven' strategy without relying on prior knowledge about the disease and thus maintains the unbiased character of genome-wide association studies. Simulations reveal that our method can outperform a widely-used rare variant association test method by 2 to 3 times. In a case study of a small disease cohort, we uncovered putative risk genes and the corresponding rare variants that may act as genetic modifiers of congenital heart disease in 22q11.2 deletion syndrome patients. These variants were missed by a conventional approach that relied on the rare variant association test alone. Case-control sequencing studies are a promising design to uncover risk genes of human complex diseases implicated by rare variants. The recent development of different types of rare variant association tests has improved the statistical power to identify disease genes that harbor risk rare variants. However, none of the recent sequencing-based genome-wide association studies identified robust disease association of rare variants or genes based on them. Due to limited sample sizes that can be feasibly achieved in real applications, current rare variant association tests can only generate marginal association signals for most risk genes. Here we proposed an integrated method that combined association signals with orthogonal biological evidence to uncover risk genes in sequencing studies. Designed to address the lack-of-power issue, our method was shown to effectively uncover risk genes with marginal association signals in data simulation. Indeed, in a real application demonstrated in our case study our method disclosed important risk genes of congenital heart disease in 22q11.2 deletion syndrome that were missed by the previous study.
Collapse
Affiliation(s)
- Jhih-Rong Lin
- Department of Genetics, Albert Einstein College of Medicine, Bronx, New York, United States of America
| | - Quanwei Zhang
- Department of Genetics, Albert Einstein College of Medicine, Bronx, New York, United States of America
| | - Ying Cai
- Department of Genetics, Albert Einstein College of Medicine, Bronx, New York, United States of America
| | - Bernice E Morrow
- Department of Genetics, Albert Einstein College of Medicine, Bronx, New York, United States of America
| | - Zhengdong D Zhang
- Department of Genetics, Albert Einstein College of Medicine, Bronx, New York, United States of America
| |
Collapse
|
111
|
Zhu L, Su F, Xu Y, Zou Q. Network-based method for mining novel HPV infection related genes using random walk with restart algorithm. Biochim Biophys Acta Mol Basis Dis 2017; 1864:2376-2383. [PMID: 29197659 DOI: 10.1016/j.bbadis.2017.11.021] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2017] [Revised: 11/03/2017] [Accepted: 11/26/2017] [Indexed: 12/27/2022]
Abstract
The human papillomavirus (HPV), a common virus that infects the reproductive tract, may lead to malignant changes within the infection area in certain cases and is directly associated with such cancers as cervical cancer, anal cancer, and vaginal cancer. Identification of novel HPV infection related genes can lead to a better understanding of the specific signal pathways and cellular processes related to HPV infection, providing information for the development of more efficient therapies. In this study, several novel HPV infection related genes were predicted by a computation method based on the known genes involved in HPV infection from HPVbase. This method applied the algorithm of random walk with restart (RWR) to a protein-protein interaction (PPI) network. The candidate genes were further filtered by the permutation and association tests. These steps eliminated genes occupying special positions in the PPI network and selected key genes with strong associations to known HPV infection related genes based on the interaction confidence and functional similarity obtained from published databases, such as STRING, gene ontology (GO) terms and KEGG pathways. Our study identified 104 novel HPV infection related genes, a number of which were confirmed to relate to the infection processes and complications of HPV infection, as reported in the literature. These results demonstrate the reliability of our method in identifying HPV infection related genes. This article is part of a Special Issue entitled: Accelerating Precision Medicine through Genetic and Genomic Big Data Analysis edited by Yudong Cai & Tao Huang.
Collapse
Affiliation(s)
- Liucun Zhu
- School of Life Sciences, Shanghai University, Shanghai 200444, China.
| | - Fangchu Su
- School of Life Sciences, Shanghai University, Shanghai 200444, China.
| | - YaoChen Xu
- Institute of Biochemistry and Cell Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China.
| | - Quan Zou
- School of Computer Science and Technology, TianJin University, Tianjin 300350, China.
| |
Collapse
|
112
|
Liu G, Wang H, Chu H, Yu J, Zhou X. Functional diversity of topological modules in human protein-protein interaction networks. Sci Rep 2017; 7:16199. [PMID: 29170401 PMCID: PMC5701033 DOI: 10.1038/s41598-017-16270-z] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2017] [Accepted: 11/09/2017] [Indexed: 01/18/2023] Open
Abstract
A large-scale molecular interaction network of protein-protein interactions (PPIs) enables the automatic detection of molecular functional modules through a computational approach. However, the functional modules that are typically detected by topological community detection algorithms may be diverse in functional homogeneity and are empirically considered to be default functional modules. Thus, a significant challenge that has been described but not elucidated is investigating the relationship between topological modules and functional modules. We systematically investigated this issue by initially using seven widely used community detection algorithms to partition the PPI network into communities. Four homogeneity measures were subsequently implemented to evaluate the functional homogeneity of protein community. We determined that a significant portion of topological modules with heterogeneous functionality exists and should be further investigated; moreover, these findings indicated that topologically based functional module detection approaches must be reconsidered. Furthermore, we found that the functional homogeneity of topological modules is positively correlated with their edge densities, degree of association with diseases and general Gene Ontology (GO) terms. Thus, topologically based module detection approaches should be used with caution in the identification of functional modules with high homogeneity
Collapse
Affiliation(s)
- Guangming Liu
- School of Computer and Information Technology and Beijing Key Lab of Traffic Data Analysis and Mining, Beijing Jiaotong University, Beijing, 100044, China
| | - Huixin Wang
- School of Computer and Information Technology and Beijing Key Lab of Traffic Data Analysis and Mining, Beijing Jiaotong University, Beijing, 100044, China
| | - Hongwei Chu
- Dalian University of Technology, Dalian, 116024, China.,Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian, 116023, China
| | - Jian Yu
- School of Computer and Information Technology and Beijing Key Lab of Traffic Data Analysis and Mining, Beijing Jiaotong University, Beijing, 100044, China.
| | - Xuezhong Zhou
- School of Computer and Information Technology and Beijing Key Lab of Traffic Data Analysis and Mining, Beijing Jiaotong University, Beijing, 100044, China.
| |
Collapse
|
113
|
Peng C, Li A, Wang M. Discovery of Bladder Cancer-related Genes Using Integrative Heterogeneous Network Modeling of Multi-omics Data. Sci Rep 2017; 7:15639. [PMID: 29142286 PMCID: PMC5688092 DOI: 10.1038/s41598-017-15890-9] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2017] [Accepted: 11/02/2017] [Indexed: 02/06/2023] Open
Abstract
In human health, a fundamental challenge is the identification of disease-related genes. Bladder cancer (BC) is a worldwide malignant tumor, which has resulted in 170,000 deaths in 2010 up from 114,000 in 1990. Moreover, with the emergence of multi-omics data, more comprehensive analysis of human diseases become possible. In this study, we propose a multi-step approach for the identification of BC-related genes by using integrative Heterogeneous Network Modeling of Multi-Omics data (iHNMMO). The heterogeneous network model properly and comprehensively reflects the multiple kinds of relationships between genes in the multi-omics data of BC, including general relationships, unique relationships under BC condition, correlational relationships within each omics and regulatory relationships between different omics. Besides, a network-based propagation algorithm with resistance is utilized to quantize the relationships between genes and BC precisely. The results of comprehensive performance evaluation suggest that iHNMMO significantly outperforms other approaches. Moreover, further analysis suggests that the top ranked genes may be functionally implicated in BC, which also confirms the superiority of iHNMMO. In summary, this study shows that disease-related genes can be better identified through reasonable integration of multi-omics data.
Collapse
Affiliation(s)
- Chen Peng
- School of Information Science and Technology, University of Science and Technology of China, Hefei, AH230027, China
- Institute of Machine Learning and Systems Biology, College of Electronics and Information Engineering, Tongji University, Shanghai, 201804, P.R. China
| | - Ao Li
- School of Information Science and Technology, University of Science and Technology of China, Hefei, AH230027, China.
- Centers for Biomedical Engineering, University of Science and Technology of China, Hefei, AH230037, China.
| | - Minghui Wang
- School of Information Science and Technology, University of Science and Technology of China, Hefei, AH230027, China
- Centers for Biomedical Engineering, University of Science and Technology of China, Hefei, AH230037, China
| |
Collapse
|
114
|
Nakaya N, Sultana A, Tomarev SI. Impaired AMPA receptor trafficking by a double knockout of zebrafish olfactomedin1a/b. J Neurochem 2017; 143:635-644. [PMID: 28975619 DOI: 10.1111/jnc.14231] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2017] [Revised: 09/15/2017] [Accepted: 09/27/2017] [Indexed: 01/06/2023]
Abstract
The olfm1a and olfm1b genes in zebrafish encode conserved secreted glycoproteins. These genes are preferentially expressed in the brain and retina starting from 16 h post-fertilization until adulthood. Functions of the Olfm1 gene is still unclear. Here, we produced and analyzed a null zebrafish mutant of both olfm1a and olfm1b genes (olfm1 null). olfm1 null fish were born at a normal Mendelian ratio and showed normal body shape and fertility as well as no visible defects from larval stages to adult. Olfm1 proteins were preferentially localized in the synaptosomes of the adult brain. Olfm1 co-immunoprecipitated with GluR2 and soluble NSF attachment protein receptor complexes indicating participation of Olfm1 in both pre- and post-synaptic events. Phosphorylation of GluR2 was not changed while palmitoylation of GluR2 was decreased in the brain synaptosomal membrane fraction of olfm1 null compared with wt fish. The levels of GluR2, SNAP25, flotillin1, and VAMP2 were markedly reduced in the synaptic microdomain of olfm1 null brain compared with wt. The internalization of GluR2 in retinal cells and the localization of VAMP2 in brain synaptosome were modified by olfm1 null mutation. This indicates that Olfm1 may regulate receptor trafficking from the intracellular compartments to the synaptic membrane microdomain, partly through the alteration of post-translational GluR2 modifications such as palmitoylation. Olfm1 may be considered a novel regulator of the composition and function of the α-amino-3-hydroxy-5-methylisoxazole-4-propionate receptor complex.
Collapse
Affiliation(s)
- Naoki Nakaya
- Section of Retinal Ganglion Cell Biology, Laboratory of Retinal Cell and Molecular Biology, National Eye Institute, NIH, Bethesda, Maryland, USA
| | - Afia Sultana
- Section of Retinal Ganglion Cell Biology, Laboratory of Retinal Cell and Molecular Biology, National Eye Institute, NIH, Bethesda, Maryland, USA
| | - Stanislav I Tomarev
- Section of Retinal Ganglion Cell Biology, Laboratory of Retinal Cell and Molecular Biology, National Eye Institute, NIH, Bethesda, Maryland, USA
| |
Collapse
|
115
|
Grewal N, Singh S, Chand T. Effect of Aggregation Operators on Network-Based Disease Gene Prioritization: A Case Study on Blood Disorders. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2017; 14:1276-1287. [PMID: 29220322 DOI: 10.1109/tcbb.2016.2599155] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
Owing to the innate noise in the biological data sources, a single source or a single measure do not suffice for an effective disease gene prioritization. So, the integration of multiple data sources or aggregation of multiple measures is the need of the hour. The aggregation operators combine multiple related data values to a single value such that the combined value has the effect of all the individual values. In this paper, an attempt has been made for applying the fuzzy aggregation on the network-based disease gene prioritization and investigate its effect under noise conditions. This study has been conducted for a set of 15 blood disorders by fusing four different network measures, computed from the protein interaction network, using a selected set of aggregation operators and ranking the genes on the basis of the aggregated value. The aggregation operator-based rankings have been compared with the "Random walk with restart" gene prioritization method. The impact of noise has also been investigated by adding varying proportions of noise to the seed set. The results reveal that for all the selected blood disorders, the Mean of Maximal operator has relatively outperformed the other aggregation operators for noisy as well as non-noisy data.
Collapse
|
116
|
Maji P, Shah E. Significance and Functional Similarity for Identification of Disease Genes. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2017; 14:1419-1433. [PMID: 28113633 DOI: 10.1109/tcbb.2016.2598163] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
One of the most significant research issues in functional genomics is insilico identification of disease related genes. In this regard, the paper presents a new gene selection algorithm, termed as SiFS, for identification of disease genes. It integrates the information obtained from interaction network of proteins and gene expression profiles. The proposed SiFS algorithm culls out a subset of genes from microarray data as disease genes by maximizing both significance and functional similarity of the selected gene subset. Based on the gene expression profiles, the significance of a gene with respect to another gene is computed using mutual information. On the other hand, a new measure of similarity is introduced to compute the functional similarity between two genes. Information derived from the protein-protein interaction network forms the basis of the proposed SiFS algorithm. The performance of the proposed gene selection algorithm and new similarity measure, is compared with that of other related methods and similarity measures, using several cancer microarray data sets.
Collapse
|
117
|
Joe MK, Lieberman RL, Nakaya N, Tomarev SI. Myocilin Regulates Metalloprotease 2 Activity Through Interaction With TIMP3. Invest Ophthalmol Vis Sci 2017; 58:5308-5318. [PMID: 29049729 PMCID: PMC5644706 DOI: 10.1167/iovs.16-20336] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023] Open
Abstract
Purpose To elucidate functions of wild-type myocilin, a secreted glycoprotein associated with glaucoma. Methods Lysates of mouse eyes were used for immunoprecipitation with affinity-purified antibodies against mouse myocilin. Shotgun proteomic analysis was used for the identification of proteins interacting with myocilin. Colocalization of myocilin and tissue inhibitor of metalloproteinases 3 (TIMP3) in different eye structures was investigated by a multiplex fluorescent in situ hybridization and immunofluorescent labeling with subsequent confocal microscopy. Matrix metalloproteinase 2 (MMP2) activity assay was used to test effects of myocilin on TIMP3 inhibitory action. Results TIMP3 was identified by a shotgun proteomic analysis as a protein that was coimmunoprecipitated with myocilin from eye lysates of wild-type and transgenic mice expressing elevated levels of mouse myocilin but not from lysates of transgenic mice expressing mutated mouse myocilin. Interaction of myocilin and TIMP3 was confirmed by coimmunoprecipitation of myocilin and TIMP3 from HEK293 cells transiently transfected with cDNAs encoding these proteins. The olfactomedin domain of myocilin is essential for interaction with TIMP3. In the eye, the main sites of myocilin and TIMP3 colocalization are the trabecular meshwork, sclera, and choroid. Using purified proteins, it has been shown that myocilin markedly enhanced the inhibitory activity of TIMP3 toward MMP2. Conclusions Myocilin may serve as a modulator of TIMP3 activity via interactions with the myocilin olfactomedin domain. Our data imply that in the case of MYOCILIN null or some glaucoma-causing mutations, inhibitory activity of TIMP3 toward MMP2 might be reduced, mimicking deleterious mutations in the TIMP3 gene.
Collapse
Affiliation(s)
- Myung Kuk Joe
- Section of Retinal Ganglion Cell Biology, Laboratory of Retinal Cell and Molecular Biology, National Eye Institute, National Institutes of Health, Bethesda, Maryland, United States
| | - Raquel L Lieberman
- School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia, United States
| | - Naoki Nakaya
- Section of Retinal Ganglion Cell Biology, Laboratory of Retinal Cell and Molecular Biology, National Eye Institute, National Institutes of Health, Bethesda, Maryland, United States
| | - Stanislav I Tomarev
- Section of Retinal Ganglion Cell Biology, Laboratory of Retinal Cell and Molecular Biology, National Eye Institute, National Institutes of Health, Bethesda, Maryland, United States
| |
Collapse
|
118
|
Tian Z, Guo M, Wang C, Xing L, Wang L, Zhang Y. Constructing an integrated gene similarity network for the identification of disease genes. J Biomed Semantics 2017; 8:32. [PMID: 29297379 PMCID: PMC5763299 DOI: 10.1186/s13326-017-0141-1] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
BACKGROUND Discovering novel genes that are involved human diseases is a challenging task in biomedical research. In recent years, several computational approaches have been proposed to prioritize candidate disease genes. Most of these methods are mainly based on protein-protein interaction (PPI) networks. However, since these PPI networks contain false positives and only cover less half of known human genes, their reliability and coverage are very low. Therefore, it is highly necessary to fuse multiple genomic data to construct a credible gene similarity network and then infer disease genes on the whole genomic scale. RESULTS We proposed a novel method, named RWRB, to infer causal genes of interested diseases. First, we construct five individual gene (protein) similarity networks based on multiple genomic data of human genes. Then, an integrated gene similarity network (IGSN) is reconstructed based on similarity network fusion (SNF) method. Finally, we employee the random walk with restart algorithm on the phenotype-gene bilayer network, which combines phenotype similarity network, IGSN as well as phenotype-gene association network, to prioritize candidate disease genes. We investigate the effectiveness of RWRB through leave-one-out cross-validation methods in inferring phenotype-gene relationships. Results show that RWRB is more accurate than state-of-the-art methods on most evaluation metrics. Further analysis shows that the success of RWRB is benefited from IGSN which has a wider coverage and higher reliability comparing with current PPI networks. Moreover, we conduct a comprehensive case study for Alzheimer's disease and predict some novel disease genes that supported by literature. CONCLUSIONS RWRB is an effective and reliable algorithm in prioritizing candidate disease genes on the genomic scale. Software and supplementary information are available at http://nclab.hit.edu.cn/~tianzhen/RWRB/ .
Collapse
Affiliation(s)
- Zhen Tian
- School of Computer Science and Engineering, Harbin Institute of Technology, Harbin, 150001 People’s Republic of China
| | - Maozu Guo
- School of Computer Science and Engineering, Harbin Institute of Technology, Harbin, 150001 People’s Republic of China
| | - Chunyu Wang
- School of Computer Science and Engineering, Harbin Institute of Technology, Harbin, 150001 People’s Republic of China
| | - LinLin Xing
- School of Computer Science and Engineering, Harbin Institute of Technology, Harbin, 150001 People’s Republic of China
| | - Lei Wang
- Institute of Health Service and Medical Information Academy of Military Medical Sciences Beijing, Beijing, 100850 China
| | - Yin Zhang
- Institute of Health Service and Medical Information Academy of Military Medical Sciences Beijing, Beijing, 100850 China
| |
Collapse
|
119
|
Li L, Wang Y, An L, Kong X, Huang T. A network-based method using a random walk with restart algorithm and screening tests to identify novel genes associated with Menière's disease. PLoS One 2017; 12:e0182592. [PMID: 28787010 PMCID: PMC5546581 DOI: 10.1371/journal.pone.0182592] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2017] [Accepted: 07/20/2017] [Indexed: 12/28/2022] Open
Abstract
As a chronic illness derived from hair cells of the inner ear, Menière’s disease (MD) negatively influences the quality of life of individuals and leads to a number of symptoms, such as dizziness, temporary hearing loss, and tinnitus. The complete identification of novel genes related to MD would help elucidate its underlying pathological mechanisms and improve its diagnosis and treatment. In this study, a network-based method was developed to identify novel MD-related genes based on known MD-related genes. A human protein-protein interaction (PPI) network was constructed using the PPI information reported in the STRING database. A classic ranking algorithm, the random walk with restart (RWR) algorithm, was employed to search for novel genes using known genes as seed nodes. To make the identified genes more reliable, a series of screening tests, including a permutation test, an interaction test and an enrichment test, were designed to select essential genes from those obtained by the RWR algorithm. As a result, several inferred genes, such as CD4, NOTCH2 and IL6, were discovered. Finally, a detailed biological analysis was performed on fifteen of the important inferred genes, which indicated their strong associations with MD.
Collapse
Affiliation(s)
- Lin Li
- Department of Otorhinolaryngology and Head & Neck, China-Japan Union Hospital of Jilin University, Changchun, China
| | - YanShu Wang
- Department of Anesthesia, The First Hospital of Jilin University, Changchun, China
| | - Lifeng An
- Department of Otorhinolaryngology and Head & Neck, China-Japan Union Hospital of Jilin University, Changchun, China
- * E-mail:
| | - XiangYin Kong
- Institute of Health Sciences, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China
| | - Tao Huang
- Institute of Health Sciences, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China
| |
Collapse
|
120
|
Abstract
Biological networks are powerful resources for the discovery of genes and genetic modules that drive disease. Fundamental to network analysis is the concept that genes underlying the same phenotype tend to interact; this principle can be used to combine and to amplify signals from individual genes. Recently, numerous bioinformatic techniques have been proposed for genetic analysis using networks, based on random walks, information diffusion and electrical resistance. These approaches have been applied successfully to identify disease genes, genetic modules and drug targets. In fact, all these approaches are variations of a unifying mathematical machinery - network propagation - suggesting that it is a powerful data transformation method of broad utility in genetic research.
Collapse
|
121
|
Identification of oral cancer related candidate genes by integrating protein-protein interactions, gene ontology, pathway analysis and immunohistochemistry. Sci Rep 2017; 7:2472. [PMID: 28559546 PMCID: PMC5449392 DOI: 10.1038/s41598-017-02522-5] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2016] [Accepted: 04/10/2017] [Indexed: 12/12/2022] Open
Abstract
In the recent years, bioinformatics methods have been reported with a high degree of success for candidate gene identification. In this milieu, we have used an integrated bioinformatics approach assimilating information from gene ontologies (GO), protein–protein interaction (PPI) and network analysis to predict candidate genes related to oral squamous cell carcinoma (OSCC). A total of 40973 PPIs were considered for 4704 cancer-related genes to construct human cancer gene network (HCGN). The importance of each node was measured in HCGN by ten different centrality measures. We have shown that the top ranking genes are related to a significantly higher number of diseases as compared to other genes in HCGN. A total of 39 candidate oral cancer target genes were predicted by combining top ranked genes and the genes corresponding to significantly enriched oral cancer related GO terms. Initial verification using literature and available experimental data indicated that 29 genes were related with OSCC. A detailed pathway analysis led us to propose a role for the selected candidate genes in the invasion and metastasis in OSCC. We further validated our predictions using immunohistochemistry (IHC) and found that the gene FLNA was upregulated while the genes ARRB1 and HTT were downregulated in the OSCC tissue samples.
Collapse
|
122
|
Qu S, Du Y, Chang S, Guo L, Fang K, Li Y, Zhang F, Zhang K, Wang J. Common variants near IKZF1 are associated with primary Sjögren's syndrome in Han Chinese. PLoS One 2017; 12:e0177320. [PMID: 28552951 PMCID: PMC5446195 DOI: 10.1371/journal.pone.0177320] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2016] [Accepted: 04/25/2017] [Indexed: 01/02/2023] Open
Abstract
Primary Sjögren's syndrome (pSS) is a systematic autoimmune disease with evidence of genetic predisposition. The IKZF1 (IKAROS family zinc finger 1 (Ikaros)) gene is located at 7p12.2, encodes a transcription factor related to chromatin remodeling, regulates lymphocyte differentiation, and has been reported to be associated with some autoimmune diseases. However, there have been no reports of an association between IKZF1 and pSS. To investigate the possibility of an association between the IKZF1 locus and pSS, we selected two single nucleotide polymorphisms (SNPs) in the IKZF1 locus, rs4917129 and rs4917014, based on a detailed analysis of genome-wide association study (GWAS) data and performed genotyping in 665 Han Chinese pSS patients and 863 healthy controls. The results of an association test showed significant association signals (rs4917129: P-value = 5.5e-4, OR (odds ratio) = 0.72, 95% CI (confidence interval) = 0.60-0.87; rs4917014: P-value = 1.2e-3, OR = 0.76, 95% CI = 0.64-0.89). A meta-analysis that combined the above results with data from previous GWAS, further confirmed these associations (rs4917129: Pmeta = 4.24e-8, ORmeta = 0.70, 95% CI = 0.61-0.79; rs4917014: Pmeta = 6.0e-8, ORmeta = 0.72, 95% CI = 0.64-0.81). A bioinformatics analysis indicated that both SNPs were located in a putative enhancer area in immune-related cell lines and tissues. A protein-protein interaction analysis found that IKZF1, together with GTF2I (an SS susceptibility gene newly identified through GWAS), could interact with histone deacetylase family proteins. In summary, this is the first study to report an association between IKZF1 and SS in Han Chinese.
Collapse
Affiliation(s)
- Susu Qu
- CAS Key Laboratory of Mental Health, Institute of Psychology, Beijing, China
- Department of Psychology, University of Chinese Academy of Sciences, Beijing, China
| | - Yang Du
- CAS Key Laboratory of Mental Health, Institute of Psychology, Beijing, China
- Department of Psychology, University of Chinese Academy of Sciences, Beijing, China
| | - Suhua Chang
- CAS Key Laboratory of Mental Health, Institute of Psychology, Beijing, China
- Department of Psychology, University of Chinese Academy of Sciences, Beijing, China
| | - Liyuan Guo
- CAS Key Laboratory of Mental Health, Institute of Psychology, Beijing, China
- Department of Psychology, University of Chinese Academy of Sciences, Beijing, China
| | - Kechi Fang
- CAS Key Laboratory of Mental Health, Institute of Psychology, Beijing, China
- Department of Psychology, University of Chinese Academy of Sciences, Beijing, China
| | - Yongzhe Li
- Department of Rheumatology and Clinical Immunology, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, Key Laboratory of Rheumatology and Clinical Immunology, Ministry of Education, Beijing, China
| | - Fengchun Zhang
- Department of Rheumatology and Clinical Immunology, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, Key Laboratory of Rheumatology and Clinical Immunology, Ministry of Education, Beijing, China
| | - Kunlin Zhang
- CAS Key Laboratory of Mental Health, Institute of Psychology, Beijing, China
- Department of Psychology, University of Chinese Academy of Sciences, Beijing, China
- * E-mail: (J. Wang); (K. Zhang)
| | - Jing Wang
- CAS Key Laboratory of Mental Health, Institute of Psychology, Beijing, China
- Department of Psychology, University of Chinese Academy of Sciences, Beijing, China
- * E-mail: (J. Wang); (K. Zhang)
| |
Collapse
|
123
|
Lu S, Yan Y, Li Z, Chen L, Yang J, Zhang Y, Wang S, Liu L. Determination of Genes Related to Uveitis by Utilization of the Random Walk with Restart Algorithm on a Protein-Protein Interaction Network. Int J Mol Sci 2017; 18:ijms18051045. [PMID: 28505077 PMCID: PMC5454957 DOI: 10.3390/ijms18051045] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2017] [Revised: 05/08/2017] [Accepted: 05/09/2017] [Indexed: 12/14/2022] Open
Abstract
Uveitis, defined as inflammation of the uveal tract, may cause blindness in both young and middle-aged people. Approximately 10–15% of blindness in the West is caused by uveitis. Therefore, a comprehensive investigation to determine the disease pathogenesis is urgent, as it will thus be possible to design effective treatments. Identification of the disease genes that cause uveitis is an important requirement to achieve this goal. To begin to answer this question, in this study, a computational method was proposed to identify novel uveitis-related genes. This method was executed on a large protein–protein interaction network and employed a popular ranking algorithm, the Random Walk with Restart (RWR) algorithm. To improve the utility of the method, a permutation test and a procedure for selecting core genes were added, which helped to exclude false discoveries and select the most important candidate genes. The five-fold cross-validation was adopted to evaluate the method, yielding the average F1-measure of 0.189. In addition, we compared our method with a classic GBA-based method to further indicate its utility. Based on our method, 56 putative genes were chosen for further assessment. We have determined that several of these genes (e.g., CCL4, Jun, and MMP9) are likely to be important for the pathogenesis of uveitis.
Collapse
Affiliation(s)
- Shiheng Lu
- Department of Ophthalmology, Ren Ji Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai 200127, China.
| | - Yan Yan
- Department of Ophthalmology, Ren Ji Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai 200127, China.
| | - Zhen Li
- Department of Ophthalmology, Ren Ji Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai 200127, China.
| | - Lei Chen
- College of Information Engineering, Shanghai Maritime University, Shanghai 201306, China.
| | - Jing Yang
- School of Life Sciences, Shanghai University, Shanghai 200444, China.
| | - Yuhang Zhang
- Institute of Health Sciences, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China.
| | - Shaopeng Wang
- School of Life Sciences, Shanghai University, Shanghai 200444, China.
| | - Lin Liu
- Department of Ophthalmology, Ren Ji Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai 200127, China.
| |
Collapse
|
124
|
Peng C, Li A. A Heterogeneous Network Based Method for Identifying GBM-Related Genes by Integrating Multi-Dimensional Data. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2017; 14:713-720. [PMID: 28113912 DOI: 10.1109/tcbb.2016.2555314] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
The emergence of multi-dimensional data offers opportunities for more comprehensive analysis of the molecular characteristics of human diseases and therefore improving diagnosis, treatment, and prevention. In this study, we proposed a heterogeneous network based method by integrating multi-dimensional data (HNMD) to identify GBM-related genes. The novelty of the method lies in that the multi-dimensional data of GBM from TCGA dataset that provide comprehensive information of genes, are combined with protein-protein interactions to construct a weighted heterogeneous network, which reflects both the general and disease-specific relationships between genes. In addition, a propagation algorithm with resistance is introduced to precisely score and rank GBM-related genes. The results of comprehensive performance evaluation show that the proposed method significantly outperforms the network based methods with single-dimensional data and other existing approaches. Subsequent analysis of the top ranked genes suggests they may be functionally implicated in GBM, which further corroborates the superiority of the proposed method. The source code and the results of HNMD can be downloaded from the following URL: http://bioinformatics.ustc.edu.cn/hnmd/ .
Collapse
|
125
|
Chu H, Sun P, Yin J, Liu G, Wang Y, Zhao P, Zhu Y, Yang X, Zheng T, Zhou X, Jin W, Sun C. Integrated network analysis reveals potentially novel molecular mechanisms and therapeutic targets of refractory epilepsies. PLoS One 2017; 12:e0174964. [PMID: 28388656 PMCID: PMC5384674 DOI: 10.1371/journal.pone.0174964] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2016] [Accepted: 03/19/2017] [Indexed: 01/03/2023] Open
Abstract
Epilepsy is a complex neurological disorder and a significant health problem. The pathogenesis of epilepsy remains obscure in a significant number of patients and the current treatment options are not adequate in about a third of individuals which were known as refractory epilepsies (RE). Network medicine provides an effective approach for studying the molecular mechanisms underlying complex diseases. Here we integrated 1876 disease-gene associations of RE and located those genes to human protein-protein interaction (PPI) network to obtain 42 significant RE-associated disease modules. The functional analysis of these disease modules showed novel molecular pathological mechanisms of RE, such as the novel enriched pathways (e.g., "presynaptic nicotinic acetylcholine receptors", "signaling by insulin receptor"). Further analysis on the relationships between current drug targets and the RE-related disease genes showed the rational mechanisms of most antiepileptic drugs. In addition, we detected ten potential novel drug targets (e.g., KCNA1, KCNA4-6, KCNC3, KCND2, KCNMA1, CAMK2G, CACNB4 and GRM1) located in three RE related disease modules, which might provide novel insights into the new drug discovery for RE therapy.
Collapse
Affiliation(s)
- Hongwei Chu
- Department of Biomedical Engineering, Faculty of Electronic Information and Electrical Engineering, Dalian University of Technology, Dalian, China
- Liaoning Provincial Key Laboratory of Cerebral Diseases, Institute for Brain Disorders, Dalian Medical University, Dalian, China
| | - Pin Sun
- Shanghai Medical College, Fudan University, Shanghai, China
| | - Jiahui Yin
- College of Electronics and Information Engineering, Tongji University, Shanghai, China
| | - Guangming Liu
- School of Computer and Information Technology and Beijing Key Lab of Traffic Data Analysis and Mining, Beijing Jiaotong University, Beijing, China
| | - Yiwei Wang
- Liaoning Provincial Key Laboratory of Cerebral Diseases, Institute for Brain Disorders, Dalian Medical University, Dalian, China
| | - Pengyao Zhao
- School of Computer and Information Technology and Beijing Key Lab of Traffic Data Analysis and Mining, Beijing Jiaotong University, Beijing, China
| | - Yizhun Zhu
- Department of Pharmacology, School of Pharmacy, Fudan University, Shanghai, China
| | - Xiaohan Yang
- Liaoning Provincial Key Laboratory of Cerebral Diseases, Institute for Brain Disorders, Dalian Medical University, Dalian, China
| | - Tiezheng Zheng
- Liaoning Provincial Key Laboratory of Cerebral Diseases, Institute for Brain Disorders, Dalian Medical University, Dalian, China
| | - Xuezhong Zhou
- School of Computer and Information Technology and Beijing Key Lab of Traffic Data Analysis and Mining, Beijing Jiaotong University, Beijing, China
| | - Weilin Jin
- Institute of Nano Biomedicine and Engineering, Department of Instrument Science and Engineering, Key Lab. for Thin Film and Microfabrication Technology of Ministry of Education, School of Electronic Information and Electronic Engineering, Shanghai Jiao Tong University, Shanghai, China
| | - Changkai Sun
- Department of Biomedical Engineering, Faculty of Electronic Information and Electrical Engineering, Dalian University of Technology, Dalian, China
- Liaoning Provincial Key Laboratory of Cerebral Diseases, Institute for Brain Disorders, Dalian Medical University, Dalian, China
- Research Center for the Control Engineering of Translational Precision Medicine, Dalian University of Technology, Dalian, China
- State Key Laboratory of Fine Chemicals, Dalian R&D Center for Stem Cell and Tissue Engineering, Dalian University of Technology, Dalian, China
| |
Collapse
|
126
|
Vrijens K, Winckelmans E, Tsamou M, Baeyens W, De Boever P, Jennen D, de Kok TM, Den Hond E, Lefebvre W, Plusquin M, Reynders H, Schoeters G, Van Larebeke N, Vanpoucke C, Kleinjans J, Nawrot TS. Sex-Specific Associations between Particulate Matter Exposure and Gene Expression in Independent Discovery and Validation Cohorts of Middle-Aged Men and Women. ENVIRONMENTAL HEALTH PERSPECTIVES 2017; 125:660-669. [PMID: 27740511 PMCID: PMC5381989 DOI: 10.1289/ehp370] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/21/2015] [Revised: 08/12/2016] [Accepted: 08/22/2016] [Indexed: 05/05/2023]
Abstract
BACKGROUND Particulate matter (PM) exposure leads to premature death, mainly due to respiratory and cardiovascular diseases. OBJECTIVES Identification of transcriptomic biomarkers of air pollution exposure and effect in a healthy adult population. METHODS Microarray analyses were performed in 98 healthy volunteers (48 men, 50 women). The expression of eight sex-specific candidate biomarker genes (significantly associated with PM10 in the discovery cohort and with a reported link to air pollution-related disease) was measured with qPCR in an independent validation cohort (75 men, 94 women). Pathway analysis was performed using Gene Set Enrichment Analysis. Average daily PM2.5 and PM10 exposures over 2-years were estimated for each participant's residential address using spatiotemporal interpolation in combination with a dispersion model. RESULTS Average long-term PM10 was 25.9 (± 5.4) and 23.7 (± 2.3) μg/m3 in the discovery and validation cohorts, respectively. In discovery analysis, associations between PM10 and the expression of individual genes differed by sex. In the validation cohort, long-term PM10 was associated with the expression of DNAJB5 and EAPP in men and ARHGAP4 (p = 0.053) in women. AKAP6 and LIMK1 were significantly associated with PM10 in women, although associations differed in direction between the discovery and validation cohorts. Expression of the eight candidate genes in the discovery cohort differentiated between validation cohort participants with high versus low PM10 exposure (area under the receiver operating curve = 0.92; 95% CI: 0.85, 1.00; p = 0.0002 in men, 0.86; 95% CI: 0.76, 0.96; p = 0.004 in women). CONCLUSIONS Expression of the sex-specific candidate genes identified in the discovery population predicted PM10 exposure in an independent cohort of adults from the same area. Confirmation in other populations may further support this as a new approach for exposure assessment, and may contribute to the discovery of molecular mechanisms for PM-induced health effects.
Collapse
Affiliation(s)
- Karen Vrijens
- Centre for Environmental Sciences, Hasselt University, Diepenbeek, Belgium
| | - Ellen Winckelmans
- Centre for Environmental Sciences, Hasselt University, Diepenbeek, Belgium
| | - Maria Tsamou
- Centre for Environmental Sciences, Hasselt University, Diepenbeek, Belgium
| | - Willy Baeyens
- Department of Analytical and Environmental Chemistry, Free University of Brussels, Brussels, Belgium
| | - Patrick De Boever
- Centre for Environmental Sciences, Hasselt University, Diepenbeek, Belgium
- Environmental Risk and Health Unit, Flemish Institute for Technological Research (VITO), Mol, Belgium
| | - Danyel Jennen
- Department of Toxicogenomics, Maastricht University, Maastricht, Netherlands
| | - Theo M. de Kok
- Department of Toxicogenomics, Maastricht University, Maastricht, Netherlands
| | - Elly Den Hond
- Environmental Risk and Health Unit, Flemish Institute for Technological Research (VITO), Mol, Belgium
- Provincial Institute for Hygiene, Antwerp, Belgium
| | - Wouter Lefebvre
- Environmental Risk and Health Unit, Flemish Institute for Technological Research (VITO), Mol, Belgium
| | - Michelle Plusquin
- Centre for Environmental Sciences, Hasselt University, Diepenbeek, Belgium
| | - Hans Reynders
- Environment, Nature and Energy Department, Flemish Government, Brussels, Belgium
| | - Greet Schoeters
- Environmental Risk and Health Unit, Flemish Institute for Technological Research (VITO), Mol, Belgium
- Department of Biomedical Sciences, University of Antwerp, Antwerp, Belgium
- University of Southern Denmark, Institute of Public Health, Department of Environmental Medicine, Odense, Denmark
| | - Nicolas Van Larebeke
- Department of Radiotherapy and Nuclear Medicine, Ghent University, Ghent, Belgium
| | | | - Jos Kleinjans
- Department of Toxicogenomics, Maastricht University, Maastricht, Netherlands
| | - Tim S. Nawrot
- Centre for Environmental Sciences, Hasselt University, Diepenbeek, Belgium
- Department of Public Health and Primary Care, Leuven University, Leuven, Belgium
- Address correspondence to T.S. Nawrot, Centre for Environmental Sciences, Hasselt University, Agoralaan gebouw D, B-3590 Diepenbeek, Belgium. Telephone: 0032/11-26.83.82. E-mail:
| |
Collapse
|
127
|
Maji P, Shah E, Paul S. RelSim: An integrated method to identify disease genes using gene expression profiles and PPIN based similarity measure. Inf Sci (N Y) 2017. [DOI: 10.1016/j.ins.2016.06.034] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
|
128
|
Rough Hypercuboid and Modified Kulczynski Coefficient for Disease Gene Identification. ACTA ACUST UNITED AC 2017. [DOI: 10.1007/978-3-319-54430-4_45] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/04/2023]
|
129
|
Identification of Candidate Genes Related to Inflammatory Bowel Disease Using Minimum Redundancy Maximum Relevance, Incremental Feature Selection, and the Shortest-Path Approach. BIOMED RESEARCH INTERNATIONAL 2017; 2017:5741948. [PMID: 28293637 PMCID: PMC5331171 DOI: 10.1155/2017/5741948] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/26/2016] [Accepted: 01/11/2017] [Indexed: 02/08/2023]
Abstract
Identification of disease genes is a hot topic in biomedicine and genomics. However, it is a challenging problem because of the complexity of diseases. Inflammatory bowel disease (IBD) is an idiopathic disease caused by a dysregulated immune response to host intestinal microflora. It has been proven to be associated with the development of intestinal malignancies. Although the specific pathological characteristics and genetic background of IBD have been partially revealed, it is still an overdetermined disease and the blueprint of all genetic variants still needs to be improved. In this study, a novel computational method was built to identify genes related to IBD. Samples from two subtypes of IBD (ulcerative colitis and Crohn's disease) and normal samples were employed. By analyzing the gene expression profiles of these samples using minimum redundancy maximum relevance and incremental feature selection, 21 genes were obtained that could effectively distinguish samples from the two subtypes of IBD and the normal samples. Then, the shortest-path approach was used to search for an additional 20 genes in a large network constructed using protein-protein interactions based on the above-mentioned 21 genes. Analyses of the 41 genes obtained indicate that they are closely associated with this disease.
Collapse
|
130
|
Guo W, Shang DM, Cao JH, Feng K, He YC, Jiang Y, Wang S, Gao YF. Identifying and Analyzing Novel Epilepsy-Related Genes Using Random Walk with Restart Algorithm. BIOMED RESEARCH INTERNATIONAL 2017; 2017:6132436. [PMID: 28255556 PMCID: PMC5309434 DOI: 10.1155/2017/6132436] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/23/2016] [Accepted: 01/15/2017] [Indexed: 02/07/2023]
Abstract
As a pathological condition, epilepsy is caused by abnormal neuronal discharge in brain which will temporarily disrupt the cerebral functions. Epilepsy is a chronic disease which occurs in all ages and would seriously affect patients' personal lives. Thus, it is highly required to develop effective medicines or instruments to treat the disease. Identifying epilepsy-related genes is essential in order to understand and treat the disease because the corresponding proteins encoded by the epilepsy-related genes are candidates of the potential drug targets. In this study, a pioneering computational workflow was proposed to predict novel epilepsy-related genes using the random walk with restart (RWR) algorithm. As reported in the literature RWR algorithm often produces a number of false positive genes, and in this study a permutation test and functional association tests were implemented to filter the genes identified by RWR algorithm, which greatly reduce the number of suspected genes and result in only thirty-three novel epilepsy genes. Finally, these novel genes were analyzed based upon some recently published literatures. Our findings implicate that all novel genes were closely related to epilepsy. It is believed that the proposed workflow can also be applied to identify genes related to other diseases and deepen our understanding of the mechanisms of these diseases.
Collapse
Affiliation(s)
- Wei Guo
- Department of Outpatient, China-Japan Union Hospital of Jilin University, Changchun 130033, China
| | - Dong-Mei Shang
- Department of Outpatient, China-Japan Union Hospital of Jilin University, Changchun 130033, China
| | - Jing-Hui Cao
- Department of Neurosurgery, China-Japan Union Hospital of Jilin University, Changchun 130033, China
| | - Kaiyan Feng
- Department of Computer Science, Guangdong AIB Polytechnic, Guangzhou 510507, China
| | - Yi-Chun He
- Department of Neurosurgery, China-Japan Union Hospital of Jilin University, Changchun 130033, China
| | - Yang Jiang
- Department of Surgery, China-Japan Union Hospital of Jilin University, Changchun 130033, China
| | - ShaoPeng Wang
- School of Life Sciences, Shanghai University, Shanghai 200444, China
| | - Yu-Fei Gao
- Department of Neurosurgery, China-Japan Union Hospital of Jilin University, Changchun 130033, China
| |
Collapse
|
131
|
Abstract
Background Identifying the genes associated to human diseases is crucial for disease diagnosis and drug design. Computational approaches, esp. the network-based approaches, have been recently developed to identify disease-related genes effectively from the existing biomedical networks. Meanwhile, the advance in biotechnology enables researchers to produce multi-omics data, enriching our understanding on human diseases, and revealing the complex relationships between genes and diseases. However, none of the existing computational approaches is able to integrate the huge amount of omics data into a weighted integrated network and utilize it to enhance disease related gene discovery. Results We propose a new network-based disease gene prediction method called SLN-SRW (Simplified Laplacian Normalization-Supervised Random Walk) to generate and model the edge weights of a new biomedical network that integrates biomedical data from heterogeneous sources, thus far enhancing the disease related gene discovery. Conclusions The experiment results show that SLN-SRW significantly improves the performance of disease gene prediction on both the real and the synthetic data sets. Electronic supplementary material The online version of this article (doi:10.1186/s12864-016-3263-4) contains supplementary material, which is available to authorized users.
Collapse
|
132
|
Chang NW, Dai HJ, Shih YY, Wu CY, Dela Rosa MAC, Obena RP, Chen YJ, Hsu WL, Oyang YJ. Biomarker identification of hepatocellular carcinoma using a methodical literature mining strategy. Database (Oxford) 2017; 2017:bax082. [PMID: 31725857 PMCID: PMC7243925 DOI: 10.1093/database/bax082] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2016] [Revised: 10/11/2017] [Accepted: 10/11/2017] [Indexed: 12/31/2022]
Abstract
Hepatocellular carcinoma (HCC), one of the most common causes of cancer-related deaths, carries a 5-year survival rate of 18%, underscoring the need for robust biomarkers. In spite of the increased availability of HCC related literatures, many of the promising biomarkers reported have not been validated for clinical use. To narrow down the wide range of possible biomarkers for further clinical validation, bioinformaticians need to sort them out using information provided in published works. Biomedical text mining is an automated way to obtain information of interest within the massive collection of biomedical knowledge, thus enabling extraction of data for biomarkers associated with certain diseases. This method can significantly reduce both the time and effort spent on studying important maladies such as liver diseases. Herein, we report a text mining-aided curation pipeline to identify potential biomarkers for liver cancer. The curation pipeline integrates PubMed E-Utilities to collect abstracts from PubMed and recognize several types of named entities by machine learning-based and pattern-based methods. Genes/proteins from evidential sentences were classified as candidate biomarkers using a convolutional neural network. Lastly, extracted biomarkers were ranked depending on several criteria, such as the frequency of keywords and articles and the journal impact factor, and then integrated into a meaningful list for bioinformaticians. Based on the developed pipeline, we constructed MarkerHub, which contains 2128 candidate biomarkers extracted from PubMed publications from 2008 to 2017. Database URL: http://markerhub.iis.sinica.edu.tw.
Collapse
Affiliation(s)
- Nai-Wen Chang
- Graduate Institute of Biomedical Electronics and Bioinformatics, National Taiwan University, Taipei, Taiwan
- Institute of Information Science, Academia Sinica, Taipei, Taiwan
| | - Hong-Jie Dai
- Department of Computer Science and Information Engineering, National Taitung University, Taitung, Taiwan
- Interdisciplinary Program of Green and Information Technology, National Taitung University, Taitung, Taiwan
| | - Yung-Yu Shih
- Institute of Information Science, Academia Sinica, Taipei, Taiwan
| | - Chi-Yang Wu
- Institute of Information Science, Academia Sinica, Taipei, Taiwan
| | | | | | - Yu-Ju Chen
- Institute of Chemistry, Academia Sinica, Taipei, Taiwan
| | - Wen-Lian Hsu
- Institute of Information Science, Academia Sinica, Taipei, Taiwan
| | - Yen-Jen Oyang
- Graduate Institute of Biomedical Electronics and Bioinformatics, National Taiwan University, Taipei, Taiwan
| |
Collapse
|
133
|
Exploring the molecular basis of age-related disease comorbidities using a multi-omics graphical model. Sci Rep 2016; 6:37646. [PMID: 27886242 PMCID: PMC5122881 DOI: 10.1038/srep37646] [Citation(s) in RCA: 42] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2016] [Accepted: 10/28/2016] [Indexed: 12/28/2022] Open
Abstract
Although association studies have unveiled numerous correlations of biochemical markers with age and age-related diseases, we still lack an understanding of their mutual dependencies. To find molecular pathways that underlie age-related diseases as well as their comorbidities, we integrated aging markers from four different high-throughput omics datasets, namely epigenomics, transcriptomics, glycomics and metabolomics, with a comprehensive set of disease phenotypes from 510 participants of the TwinsUK cohort. We used graphical random forests to assess conditional dependencies between omics markers and phenotypes while eliminating mediated associations. Applying this novel approach for multi-omics data integration yields a model consisting of seven modules that represent distinct aspects of aging. These modules are connected by hubs that potentially trigger comorbidities of age-related diseases. As an example, we identified urate as one of these key players mediating the comorbidity of renal disease with body composition and obesity. Body composition variables are in turn associated with inflammatory IgG markers, mediated by the expression of the hormone oxytocin. Thus, oxytocin potentially contributes to the development of chronic low-grade inflammation, which often accompanies obesity. Our multi-omics graphical model demonstrates the interconnectivity of age-related diseases and highlights molecular markers of the aging process that might drive disease comorbidities.
Collapse
|
134
|
He FQ, Ollert M. Network-Guided Key Gene Discovery for a Given Cellular Process. ADVANCES IN BIOCHEMICAL ENGINEERING/BIOTECHNOLOGY 2016. [PMID: 27783134 DOI: 10.1007/10_2016_39] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/10/2022]
Abstract
Identification of key genes for a given physiological or pathological process is an essential but still very challenging task for the entire biomedical research community. Statistics-based approaches, such as genome-wide association study (GWAS)- or quantitative trait locus (QTL)-related analysis have already made enormous contributions to identifying key genes associated with a given disease or phenotype, the success of which is however very much dependent on a huge number of samples. Recent advances in network biology, especially network inference directly from genome-scale data and the following-up network analysis, opens up new avenues to predict key genes driving a given biological process or cellular function. Here we review and compare the current approaches in predicting key genes, which have no chances to stand out by classic differential expression analysis, from gene-regulatory, protein-protein interaction, or gene expression correlation networks. We elaborate these network-based approaches mainly in the context of immunology and infection, and urge more usage of correlation network-based predictions. Such network-based key gene discovery approaches driven by information-enriched 'omics' data should be very useful for systematic key gene discoveries for any given biochemical process or cellular function, and also valuable for novel drug target discovery and novel diagnostic, prognostic and therapeutic-efficiency marker prediction for a specific disease or disorder.
Collapse
Affiliation(s)
- Feng Q He
- Department of Infection and Immunity, Group of Immune Systems Biology, Luxembourg Institute of Health, 29, rue Henri Koch, 4354, Esch-sur-Alzette, Luxembourg.
| | - Markus Ollert
- Department of Infection and Immunity, Group of Allergy and Clinical Immunology, Luxembourg Institute of Health, 29, rue Henri Koch, 4354, Esch-sur-Alzette, Luxembourg
- Odense Research Center for Anaphylaxis, Department of Dermatology and Allergy Center, Odense University Hospital, University of Southern Denmark, 5000, Odense C, Denmark
| |
Collapse
|
135
|
Ivanov PC, Liu KKL, Bartsch RP. Focus on the emerging new fields of Network Physiology and Network Medicine. NEW JOURNAL OF PHYSICS 2016; 18:100201. [PMID: 30881198 PMCID: PMC6415921 DOI: 10.1088/1367-2630/18/10/100201] [Citation(s) in RCA: 126] [Impact Index Per Article: 15.8] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
Despite the vast progress and achievements in systems biology and integrative physiology in the last decades, there is still a significant gap in understanding the mechanisms through which (i) genomic, proteomic and metabolic factors and signaling pathways impact vertical processes across cells, tissues and organs leading to the expression of different disease phenotypes and influence the functional and clinical associations between diseases, and (ii) how diverse physiological systems and organs coordinate their functions over a broad range of space and time scales and horizontally integrate to generate distinct physiologic states at the organism level. Two emerging fields, network medicine and network physiology, aim to address these fundamental questions. Novel concepts and approaches derived from recent advances in network theory, coupled dynamical systems, statistical and computational physics show promise to provide new insights into the complexity of physiological structure and function in health and disease, bridging the genetic and sub-cellular level with inter-cellular interactions and communications among integrated organ systems and sub-systems. These advances form first building blocks in the methodological formalism and theoretical framework necessary to address fundamental problems and challenges in physiology and medicine. This 'focus on' issue contains 26 articles representing state-of-the-art contributions covering diverse systems from the sub-cellular to the organism level where physicists have key role in laying the foundations of these new fields.
Collapse
Affiliation(s)
- Plamen Ch Ivanov
- Keck Laboratory for Network Physiology, Department of Physics, Boston University, Boston, Massachusetts, USA
- Harvard Medical School and Division of Sleep Medicine, Brigham and Women Hospital, Boston, MA 02115, USA
- Institute of Solid State Physics, Bulgarian Academy of Sciences, Sofia 1784, Bulgaria
- (Editor of the ‘focus on’ issue)
| | - Kang K L Liu
- Keck Laboratory for Network Physiology, Department of Physics, Boston University, Boston, Massachusetts, USA
- Department of Neurology, Beth Israel Deaconess Medical Center and Harvard Medical School, Boston, Massachusetts, USA
| | - Ronny P Bartsch
- Department of Physics, Bar-Ilan University, Ramat Gan, 5290002, Israel
| |
Collapse
|
136
|
Tang X, Hu X, Yang X, Fan Y, Li Y, Hu W, Liao Y, Zheng MC, Peng W, Gao L. Predicting diabetes mellitus genes via protein-protein interaction and protein subcellular localization information. BMC Genomics 2016; 17 Suppl 4:433. [PMID: 27535125 PMCID: PMC5001230 DOI: 10.1186/s12864-016-2795-y] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND Diabetes mellitus characterized by hyperglycemia as a result of insufficient production of or reduced sensitivity to insulin poses a growing threat to the health of people. It is a heterogeneous disorder with multiple etiologies consisting of type 1 diabetes, type 2 diabetes, gestational diabetes and so on. Diabetes-associated protein/gene prediction is a key step to understand the cellular mechanisms related to diabetes mellitus. Compared with experimental methods, computational predictions of candidate proteins/genes are cheaper and more effortless. Protein-protein interaction (PPI) data produced by the high-throughput technology have been used to prioritize candidate disease genes/proteins. However, the false interactions in the PPI data seriously hurt computational methods performance. In order to address that particular question, new methods are developed to identify candidate disease genes/proteins via integrating biological data from other sources. RESULTS In this study, a new framework called PDMG is proposed to predict candidate disease genes/proteins. First, the weighted networks are building in terms of the combination of the subcellular localization information and PPI data. To form the weighted networks, the importance of each compartment is evaluated based on the number of interacted proteins in this compartment. This is because the very different roles played by different compartments in cell activities. Besides, some compartments are more important than others. Based on the evaluated compartments, the interactions between proteins are scored and the weighted PPI networks are constructed. Second, the known disease genes are extracted from OMIM database as the seed genes to expand disease-specific networks based on the weighted networks. Third, the weighted values between a protein and its neighbors in the disease-related networks are added together and the sum is as the score of the protein. Last but not least, the proteins are ranked based on descending order of their scores. The candidate proteins in the top are considered to be associated with the diseases and are potential disease-related proteins. Various types of data, such as type 2 diabetes-associated genes, subcellular localizations and protein interactions, are used to test PDMG method. CONCLUSIONS The results show that the proteins/genes functionally exerting a direct influence over diabetes are consistently placed at the head of the queue. PDMG expands and ranks 445 candidate proteins from the seed set including original 27 type 2 diabetes proteins. Out of the top 27 proteins, 14 proteins are the real type 2 diabetes proteins. The literature extracted from the PubMed database has proved that, out of 13 novel proteins, 8 proteins are associated with diabetes.
Collapse
Affiliation(s)
- Xiwei Tang
- School of Information Science and Engineering, Hunan First Normal University, Changsha, 410205, China.
- College of Computing and Informatics, Drexel University, Philadelphia, PA 19104, USA.
- College of Computer, National University of Defense Technology, Changsha, 410073, China.
| | - Xiaohua Hu
- College of Computing and Informatics, Drexel University, Philadelphia, PA 19104, USA.
- School of Computer, Central China Normal University, Hubei, 430079, China.
| | - Xuejun Yang
- College of Computer, National University of Defense Technology, Changsha, 410073, China
| | - Yetian Fan
- School of Mathematical Sciences, Dalian University of Technology, Dalian, 116023, China
| | - Yongfan Li
- School of Information Science and Engineering, Hunan First Normal University, Changsha, 410205, China
| | - Wei Hu
- School of Information Science and Engineering, Hunan First Normal University, Changsha, 410205, China
| | - Yongzhong Liao
- School of Information Science and Engineering, Hunan First Normal University, Changsha, 410205, China
| | - Ming Cai Zheng
- School of Information Science and Engineering, Hunan First Normal University, Changsha, 410205, China
| | - Wei Peng
- Computer Center, Kunming University of Science and Technology, Kunming, 650500, China
| | - Li Gao
- School of Computer, Central China Normal University, Hubei, 430079, China
| |
Collapse
|
137
|
Biane C, Delaplace F, Klaudel H. Networks and games for precision medicine. Biosystems 2016; 150:52-60. [PMID: 27543134 DOI: 10.1016/j.biosystems.2016.08.006] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2016] [Revised: 07/20/2016] [Accepted: 08/11/2016] [Indexed: 12/13/2022]
Abstract
Recent advances in omics technologies provide the leverage for the emergence of precision medicine that aims at personalizing therapy to patient. In this undertaking, computational methods play a central role for assisting physicians in their clinical decision-making by combining data analysis and systems biology modelling. Complex diseases such as cancer or diabetes arise from the intricate interplay of various biological molecules. Therefore, assessing drug efficiency requires to study the effects of elementary perturbations caused by diseases on relevant biological networks. In this paper, we propose a computational framework called Network-Action Game applied to best drug selection problem combining Game Theory and discrete models of dynamics (Boolean networks). Decision-making is modelled using Game Theory that defines the process of drug selection among alternative possibilities, while Boolean networks are used to model the effects of the interplay between disease and drugs actions on the patient's molecular system. The actions/strategies of disease and drugs are focused on arc alterations of the interactome. The efficiency of this framework has been evaluated for drug prediction on a model of breast cancer signalling.
Collapse
Affiliation(s)
- Célia Biane
- IBISC Laboratory, Evry Val d'Essonne University, Evry, France.
| | | | - Hanna Klaudel
- IBISC Laboratory, Evry Val d'Essonne University, Evry, France.
| |
Collapse
|
138
|
Li J, Lin X, Teng Y, Qi S, Xiao D, Zhang J, Kang Y. A Comprehensive Evaluation of Disease Phenotype Networks for Gene Prioritization. PLoS One 2016; 11:e0159457. [PMID: 27415759 PMCID: PMC4944959 DOI: 10.1371/journal.pone.0159457] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2016] [Accepted: 07/01/2016] [Indexed: 12/31/2022] Open
Abstract
Identification of disease-causing genes is a fundamental challenge for human health studies. The phenotypic similarity among diseases may reflect the interactions at the molecular level, and phenotype comparison can be used to predict disease candidate genes. Online Mendelian Inheritance in Man (OMIM) is a database of human genetic diseases and related genes that has become an authoritative source of disease phenotypes. However, disease phenotypes have been described by free text; thus, standardization of phenotypic descriptions is needed before diseases can be compared. Several disease phenotype networks have been established in OMIM using different standardization methods. Two of these networks are important for phenotypic similarity analysis: the first and most commonly used network (mimMiner) is standardized by medical subject heading, and the other network (resnikHPO) is the first to be standardized by human phenotype ontology. This paper comprehensively evaluates for the first time the accuracy of these two networks in gene prioritization based on protein–protein interactions using large-scale, leave-one-out cross-validation experiments. The results show that both networks can effectively prioritize disease-causing genes, and the approach that relates two diseases using a logistic function improves prioritization performance. Tanimoto, one of four methods for normalizing resnikHPO, generates a symmetric network and it performs similarly to mimMiner. Furthermore, an integration of these two networks outperforms either network alone in gene prioritization, indicating that these two disease networks are complementary.
Collapse
Affiliation(s)
- Jianhua Li
- Department of Biomedical Informatics, Sino-Dutch Biomedical and Information Engineering School, Northeastern University, Shenyang, Liaoning, China
- Key Laboratory of Medical Image Computing of Northeastern University, Ministry of Education, Shenyang, Liaoning, China
| | - Xiaoyan Lin
- Department of Biomedical Informatics, Sino-Dutch Biomedical and Information Engineering School, Northeastern University, Shenyang, Liaoning, China
| | - Yueyang Teng
- Department of Biomedical Imaging, Sino-Dutch Biomedical and Information Engineering School, Northeastern University, Shenyang, Liaoning, China
| | - Shouliang Qi
- Key Laboratory of Medical Image Computing of Northeastern University, Ministry of Education, Shenyang, Liaoning, China
- Department of Biomedical Imaging, Sino-Dutch Biomedical and Information Engineering School, Northeastern University, Shenyang, Liaoning, China
| | - Dayu Xiao
- Department of Biomedical Imaging, Sino-Dutch Biomedical and Information Engineering School, Northeastern University, Shenyang, Liaoning, China
| | - Jianying Zhang
- Department of Biomedical Informatics, Sino-Dutch Biomedical and Information Engineering School, Northeastern University, Shenyang, Liaoning, China
- Border Biomedical Research Center, Department of Biological Sciences, The University of Texas at El Paso, El Paso, Texas, United States of America
| | - Yan Kang
- Key Laboratory of Medical Image Computing of Northeastern University, Ministry of Education, Shenyang, Liaoning, China
- Department of Biomedical Imaging, Sino-Dutch Biomedical and Information Engineering School, Northeastern University, Shenyang, Liaoning, China
- * E-mail:
| |
Collapse
|
139
|
Ung MH, Liu CC, Cheng C. Integrative analysis of cancer genes in a functional interactome. Sci Rep 2016; 6:29228. [PMID: 27356765 PMCID: PMC4928112 DOI: 10.1038/srep29228] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2016] [Accepted: 06/16/2016] [Indexed: 11/09/2022] Open
Abstract
The post-genomic era has resulted in the accumulation of high-throughput cancer data from a vast array of genomic technologies including next-generation sequencing and microarray. As such, the large amounts of germline variant and somatic mutation data that have been generated from GWAS and sequencing projects, respectively, show great promise in providing a systems-level view of these genetic aberrations. In this study, we analyze publicly available GWAS, somatic mutation, and drug target data derived from large databanks using a network-based approach that incorporates directed edge information under a randomized network hypothesis testing procedure. We show that these three classes of disease-associated nodes exhibit non-random topological characteristics in the context of a functional interactome. Specifically, we show that drug targets tend to lie upstream of somatic mutations and disease susceptibility germline variants. In addition, we introduce a new approach to measuring hierarchy between drug targets, somatic mutants, and disease susceptibility genes by utilizing directionality and path length information. Overall, our results provide new insight into the intrinsic relationships between these node classes that broaden our understanding of cancer. In addition, our results align with current knowledge on the therapeutic actionability of GWAS and somatic mutant nodes, while demonstrating relationships between node classes from a global network perspective.
Collapse
Affiliation(s)
- Matthew H Ung
- Department of Molecular and Systems Biology, Geisel School of Medicine at Dartmouth, Hanover, New Hampshire, 03755 USA.,Program in Quantitative Biomedical Sciences, Geisel School of Medicine at Dartmouth, Lebanon, New Hampshire, 03755 USA
| | - Chun-Chi Liu
- Institute of Genomics and Bioinformatics, National Chung Hsing University, Taiwan
| | - Chao Cheng
- Department of Molecular and Systems Biology, Geisel School of Medicine at Dartmouth, Hanover, New Hampshire, 03755 USA.,Program in Quantitative Biomedical Sciences, Geisel School of Medicine at Dartmouth, Lebanon, New Hampshire, 03755 USA.,Norris Cotton Cancer Center, Geisel School of Medicine at Dartmouth, Lebanon, New Hampshire, 03766 USA
| |
Collapse
|
140
|
Kontou PI, Pavlopoulou A, Dimou NL, Pavlopoulos GA, Bagos PG. Network analysis of genes and their association with diseases. Gene 2016; 590:68-78. [PMID: 27265032 DOI: 10.1016/j.gene.2016.05.044] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2015] [Revised: 05/20/2016] [Accepted: 05/30/2016] [Indexed: 12/21/2022]
Abstract
A plethora of network-based approaches within the Systems Biology universe have been applied, to date, to investigate the underlying molecular mechanisms of various human diseases. In the present study, we perform a bipartite, topological and clustering graph analysis in order to gain a better understanding of the relationships between human genetic diseases and the relationships between the genes that are implicated in them. For this purpose, disease-disease and gene-gene networks were constructed from combined gene-disease association networks. The latter, were created by collecting and integrating data from three diverse resources, each one with different content covering from rare monogenic disorders to common complex diseases. This data pluralism enabled us to uncover important associations between diseases with unrelated phenotypic manifestations but with common genetic origin. For our analysis, the topological attributes and the functional implications of the individual networks were taken into account and are shortly discussed. We believe that some observations of this study could advance our understanding regarding the etiology of a disease with distinct pathological manifestations, and simultaneously provide the springboard for the development of preventive and therapeutic strategies and its underlying genetic mechanisms.
Collapse
Affiliation(s)
- Panagiota I Kontou
- Department of Computer Science and Biomedical Informatics, University of Thessaly, Greece
| | - Athanasia Pavlopoulou
- Department of Computer Science and Biomedical Informatics, University of Thessaly, Greece
| | - Niki L Dimou
- Department of Computer Science and Biomedical Informatics, University of Thessaly, Greece
| | - Georgios A Pavlopoulos
- Lawrence Berkeley Lab, Joint Genome Institute, United States Department of Energy, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA
| | - Pantelis G Bagos
- Department of Computer Science and Biomedical Informatics, University of Thessaly, Greece.
| |
Collapse
|
141
|
UMADEVI S, PREMKUMAR K, VALARMATHI S, AYYASAMY PM, RAJAKUMAR S. IDENTIFICATION OF NOVEL GENES RELATED TO DIABETIC RETINOPATHY USING PROTEIN–PROTEIN INTERACTION NETWORK AND GENE ONTOLOGIES. J BIOL SYST 2016. [DOI: 10.1142/s0218339016500066] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Diabetic retinopathy is the most common cause of blindness, associated with many biochemical pathways mediated by several genes and proteins. Disease gene identification can be achieved through several approaches but still it is a challenging task. This study, aimed to find out the novel genes associated with diabetic retinopathy. In this study, all the well-known genes associated with diabetic retinopathy were collected from databases and the protein interaction partners were identified. The interacting candidate genes were chosen by chromosomal locations, sharing with disease genes. The protein–protein interaction network was constructed and the key nodes (genes) were identified by degree, betweenness centrality, closeness centrality and eccentricity centrality. Further, the ontological terms, molecular function, biological process and cellular components were related with that of the disease genes with p-value [Formula: see text]. The genes UBC, FOS, ITGB1, FOXA2, CCND1, FOSL1, RXRA and NCAM1 were identified as potential genes associated with diabetic retinopathy. The molecular functions of these genes include protein binding, receptor activity, receptor binding, oxidoreductase activity, protein kinase activity, serine-type peptidase activity and growth factor. Many of the identified genes were clinically related as evidence by the literature.
Collapse
Affiliation(s)
- S. UMADEVI
- Department of Biomedical Science, Bharathidasan University, Tiruchirappalli 620024, Tamil Nadu, India
| | - K. PREMKUMAR
- Department of Biomedical Science, Bharathidasan University, Tiruchirappalli 620024, Tamil Nadu, India
| | - S. VALARMATHI
- Department of Biochemistry, PSG College of Arts and Science, Coimbatore 641014, Tamil Nadu, India
| | - P. M. AYYASAMY
- Department of Microbiology, Periyar University, Salem 636011, Tamil Nadu, India
| | - S. RAJAKUMAR
- Department of Marine Biotechnology, Bharathidasan University, Tiruchirappalli 620024, Tamil Nadu, India
| |
Collapse
|
142
|
Rahmani H, Blockeel H, Bender A. Using a Human Drug Network for generating novel hypotheses about drugs. INTELL DATA ANAL 2016. [DOI: 10.3233/ida-150800] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Affiliation(s)
- Hossein Rahmani
- School of Computer Engineering, Iran University of Science and Technology, Tehran, Iran
- Department of Knowledge Engineering, Universiteit Maastricht, Maastricht, The Netherlands
| | - Hendrik Blockeel
- Department of Computer Science, KU Leuven, Leuven, Belgium
- Leiden Institute of Advanced Computer Science, Leiden University, CA Leiden, The Netherlands
| | - Andreas Bender
- Unilever Centre for Molecular Science Informatics, Department of Chemistry, University of Cambridge, Cambridge, UK
| |
Collapse
|
143
|
Al-Harazi O, Al Insaif S, Al-Ajlan MA, Kaya N, Dzimiri N, Colak D. Integrated Genomic and Network-Based Analyses of Complex Diseases and Human Disease Network. J Genet Genomics 2015; 43:349-67. [PMID: 27318646 DOI: 10.1016/j.jgg.2015.11.002] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2015] [Revised: 10/22/2015] [Accepted: 11/20/2015] [Indexed: 12/16/2022]
Abstract
A disease phenotype generally reflects various pathobiological processes that interact in a complex network. The highly interconnected nature of the human protein interaction network (interactome) indicates that, at the molecular level, it is difficult to consider diseases as being independent of one another. Recently, genome-wide molecular measurements, data mining and bioinformatics approaches have provided the means to explore human diseases from a molecular basis. The exploration of diseases and a system of disease relationships based on the integration of genome-wide molecular data with the human interactome could offer a powerful perspective for understanding the molecular architecture of diseases. Recently, subnetwork markers have proven to be more robust and reliable than individual biomarker genes selected based on gene expression profiles alone, and achieve higher accuracy in disease classification. We have applied one of these methodologies to idiopathic dilated cardiomyopathy (IDCM) data that we have generated using a microarray and identified significant subnetworks associated with the disease. In this paper, we review the recent endeavours in this direction, and summarize the existing methodologies and computational tools for network-based analysis of complex diseases and molecular relationships among apparently different disorders and human disease network. We also discuss the future research trends and topics of this promising field.
Collapse
Affiliation(s)
- Olfat Al-Harazi
- Department of Biostatistics, Epidemiology and Scientific Computing, King Faisal Specialist Hospital and Research Centre, Riyadh 11211, Saudi Arabia
| | - Sadiq Al Insaif
- Department of Biostatistics, Epidemiology and Scientific Computing, King Faisal Specialist Hospital and Research Centre, Riyadh 11211, Saudi Arabia
| | - Monirah A Al-Ajlan
- Department of Biostatistics, Epidemiology and Scientific Computing, King Faisal Specialist Hospital and Research Centre, Riyadh 11211, Saudi Arabia; College of Computer and Information Sciences, King Saud University, Riyadh 11451, Saudi Arabia
| | - Namik Kaya
- Department of Genetics, King Faisal Specialist Hospital and Research Centre, Riyadh 11211, Saudi Arabia
| | - Nduna Dzimiri
- Department of Genetics, King Faisal Specialist Hospital and Research Centre, Riyadh 11211, Saudi Arabia
| | - Dilek Colak
- Department of Biostatistics, Epidemiology and Scientific Computing, King Faisal Specialist Hospital and Research Centre, Riyadh 11211, Saudi Arabia.
| |
Collapse
|
144
|
Convergent Lines of Evidence Support LRP8 as a Susceptibility Gene for Psychosis. Mol Neurobiol 2015; 53:6608-6619. [PMID: 26637325 DOI: 10.1007/s12035-015-9559-6] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2015] [Accepted: 11/22/2015] [Indexed: 12/23/2022]
Abstract
Reelin (RELN) is identified as a risk gene for major psychiatric disorders such as schizophrenia (SCZ) and bipolar disorder (BPD). However, the role of its downstream signaling molecule, the low-density lipoprotein receptor-related protein 8 (LRP8) in these illnesses is still unclear. To detect whether LRP8 is a susceptibility gene for SCZ and BPD, we analyzed the associations of single nucleotide polymorphisms (SNPs) in LRP8 in a total of 47,187 subjects (including 9379 SCZ patients; 6990 BPD patients; and 12,556 controls in a screening sample, and 1397 SCZ families, 3947 BPD patients, and 8387 controls in independent replications), and identified a non-synonymous SNP rs5174 in LRP8 significantly associated with SCZ and BPD as well as the combined psychosis phenotype (P meta = 1.99 × 10-5, odds ratio (OR) = 1.066, 95 % confidence interval (CI) = 1.035-1.098). The risk SNP rs5174 was also associated with LRP8 messenger RNA (mRNA) expression in multiple brain tissues across independent samples (lowest P = 0.00005). Further exploratory analysis revealed that LRP8 was preferentially expressed in fetal brain tissues. Protein-protein interaction (PPI) analysis demonstrated that LRP8 significantly participated in a highly interconnected PPI network build by top risk genes for SCZ and BPD (P = 7.0 × 10-4). Collectively, we confirmed that LRP8 is a risk gene for psychosis, and our results provide useful information toward a better understanding of genetic mechanism involving LRP8 underlying risk of complex psychiatric disorders.
Collapse
|
145
|
Szklarczyk D, Santos A, von Mering C, Jensen LJ, Bork P, Kuhn M. STITCH 5: augmenting protein-chemical interaction networks with tissue and affinity data. Nucleic Acids Res 2015; 44:D380-4. [PMID: 26590256 PMCID: PMC4702904 DOI: 10.1093/nar/gkv1277] [Citation(s) in RCA: 883] [Impact Index Per Article: 98.1] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2015] [Accepted: 11/03/2015] [Indexed: 12/21/2022] Open
Abstract
Interactions between proteins and small molecules are an integral part of biological processes in living organisms. Information on these interactions is dispersed over many databases, texts and prediction methods, which makes it difficult to get a comprehensive overview of the available evidence. To address this, we have developed STITCH (‘Search Tool for Interacting Chemicals’) that integrates these disparate data sources for 430 000 chemicals into a single, easy-to-use resource. In addition to the increased scope of the database, we have implemented a new network view that gives the user the ability to view binding affinities of chemicals in the interaction network. This enables the user to get a quick overview of the potential effects of the chemical on its interaction partners. For each organism, STITCH provides a global network; however, not all proteins have the same pattern of spatial expression. Therefore, only a certain subset of interactions can occur simultaneously. In the new, fifth release of STITCH, we have implemented functionality to filter out the proteins and chemicals not associated with a given tissue. The STITCH database can be downloaded in full, accessed programmatically via an extensive API, or searched via a redesigned web interface at http://stitch.embl.de.
Collapse
Affiliation(s)
- Damian Szklarczyk
- Institute of Molecular Life Sciences, University of Zurich and Swiss Institute of Bioinformatics, Winterthurerstrasse 190, 8057 Zurich, Switzerland
| | - Alberto Santos
- Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, 2200 Copenhagen N, Denmark
| | - Christian von Mering
- Institute of Molecular Life Sciences, University of Zurich and Swiss Institute of Bioinformatics, Winterthurerstrasse 190, 8057 Zurich, Switzerland
| | - Lars Juhl Jensen
- Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, 2200 Copenhagen N, Denmark
| | - Peer Bork
- European Molecular Biology Laboratory, Structural and Computational Biology Unit, Molecular Medicine Partnership Unit, Meyerhofstrasse 1, 69117 Heidelberg, Germany Max-Delbrück-Centre for Molecular Medicine, Robert-Rössle-Strasse 10, 13092 Berlin, Germany
| | - Michael Kuhn
- Max Planck Institute of Molecular Cell Biology and Genetics, Pfotenhauerstrasse 108, 01307 Dresden
| |
Collapse
|
146
|
Ou-Yang L, Dai DQ, Zhang XF. Detecting Protein Complexes from Signed Protein-Protein Interaction Networks. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2015; 12:1333-1344. [PMID: 26671805 DOI: 10.1109/tcbb.2015.2401014] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
Identification of protein complexes is fundamental for understanding the cellular functional organization. With the accumulation of physical protein-protein interaction (PPI) data, computational detection of protein complexes from available PPI networks has drawn a lot of attentions. While most of the existing protein complex detection algorithms focus on analyzing the physical protein-protein interaction network, none of them take into account the "signs" (i.e., activation-inhibition relationships) of physical interactions. As the "signs" of interactions reflect the way proteins communicate, considering the "signs" of interactions can not only increase the accuracy of protein complex identification, but also deepen our understanding of the mechanisms of cell functions. In this study, we proposed a novel Signed Graph regularized Nonnegative Matrix Factorization (SGNMF) model to identify protein complexes from signed PPI networks. In our experiments, we compared the results collected by our model on signed PPI networks with those predicted by the state-of-the-art complex detection techniques on the original unsigned PPI networks. We observed that considering the "signs" of interactions significantly benefits the detection of protein complexes. Furthermore, based on the predicted complexes, we predicted a set of signed complex-complex interactions for each dataset, which provides a novel insight of the higher level organization of the cell. All the experimental results and codes can be downloaded from http://mail.sysu.edu.cn/home/stsddq@mail.sysu.edu.cn/dai/others/SGNMF.zip.
Collapse
|
147
|
Liu B, Jin M, Zeng P. Prioritization of candidate disease genes by combining topological similarity and semantic similarity. J Biomed Inform 2015; 57:1-5. [DOI: 10.1016/j.jbi.2015.07.005] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2014] [Revised: 07/01/2015] [Accepted: 07/06/2015] [Indexed: 10/23/2022]
|
148
|
Chen B, Li M, Wang J, Shang X, Wu FX. A fast and high performance multiple data integration algorithm for identifying human disease genes. BMC Med Genomics 2015; 8 Suppl 3:S2. [PMID: 26399620 PMCID: PMC4582601 DOI: 10.1186/1755-8794-8-s3-s2] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
BACKGROUND Integrating multiple data sources is indispensable in improving disease gene identification. It is not only due to the fact that disease genes associated with similar genetic diseases tend to lie close with each other in various biological networks, but also due to the fact that gene-disease associations are complex. Although various algorithms have been proposed to identify disease genes, their prediction performances and the computational time still should be further improved. RESULTS In this study, we propose a fast and high performance multiple data integration algorithm for identifying human disease genes. A posterior probability of each candidate gene associated with individual diseases is calculated by using a Bayesian analysis method and a binary logistic regression model. Two prior probability estimation strategies and two feature vector construction methods are developed to test the performance of the proposed algorithm. CONCLUSIONS The proposed algorithm is not only generated predictions with high AUC scores, but also runs very fast. When only a single PPI network is employed, the AUC score is 0.769 by using F2 as feature vectors. The average running time for each leave-one-out experiment is only around 1.5 seconds. When three biological networks are integrated, the AUC score using F3 as feature vectors increases to 0.830, and the average running time for each leave-one-out experiment takes only about 12.54 seconds. It is better than many existing algorithms.
Collapse
Affiliation(s)
- Bolin Chen
- School of Computer Science, Northwestern Polytechnical University, 127 Youyi West Road, 710072, Xi'an, P.R. China
| | - Min Li
- School of Information Science and Engineering, Central South University, 410083, Changsha, P.R.China
| | - Jianxin Wang
- School of Information Science and Engineering, Central South University, 410083, Changsha, P.R.China
| | - Xuequn Shang
- School of Computer Science, Northwestern Polytechnical University, 127 Youyi West Road, 710072, Xi'an, P.R. China
| | - Fang-Xiang Wu
- Division of Biomedical Engineering, University of Saskatchewan, 57 Campus Dr., S7N 5A9, Saskatoon, Canada
- Department of Mechanical Engineering, University of Saskatchewan, 57 Campus Dr., S7N 5A9, Saskatoon, Canada
| |
Collapse
|
149
|
ProSim: A Method for Prioritizing Disease Genes Based on Protein Proximity and Disease Similarity. BIOMED RESEARCH INTERNATIONAL 2015; 2015:213750. [PMID: 26339594 PMCID: PMC4538409 DOI: 10.1155/2015/213750] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/15/2014] [Accepted: 01/16/2015] [Indexed: 01/19/2023]
Abstract
Predicting disease genes for a particular genetic disease is very challenging in bioinformatics. Based on current research studies, this challenge can be tackled via network-based approaches. Furthermore, it has been highlighted that it is necessary to consider disease similarity along with the protein's proximity to disease genes in a protein-protein interaction (PPI) network in order to improve the accuracy of disease gene prioritization. In this study we propose a new algorithm called proximity disease similarity algorithm (ProSim), which takes both of the aforementioned properties into consideration, to prioritize disease genes. To illustrate the proposed algorithm, we have conducted six case studies, namely, prostate cancer, Alzheimer's disease, diabetes mellitus type 2, breast cancer, colorectal cancer, and lung cancer. We employed leave-one-out cross validation, mean enrichment, tenfold cross validation, and ROC curves to evaluate our proposed method and other existing methods. The results show that our proposed method outperforms existing methods such as PRINCE, RWR, and DADA.
Collapse
|
150
|
Ullah MZ, Aono M, Seddiqui MH. Estimating a ranked list of human hereditary diseases for clinical phenotypes by using weighted bipartite network. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2015; 2013:3475-8. [PMID: 24110477 DOI: 10.1109/embc.2013.6610290] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
With the availability of the huge medical knowledge data on the Internet such as the human disease network, protein-protein interaction (PPI) network, and phenotypegene, gene-disease bipartite networks, it becomes practical to help doctors by suggesting plausible hereditary diseases for a set of clinical phenotypes. However, identifying candidate diseases that best explain a set of clinical phenotypes by considering various heterogeneous networks is still a challenging task. In this paper, we propose a new method for estimating a ranked list of plausible diseases by associating phenotypegene with gene-disease bipartite networks. Our approach is to count the frequency of all the paths from a phenotype to a disease through their associated causative genes, and link the phenotype to the disease with path frequency in a new phenotype-disease bipartite (PDB) network. After that, we generate the candidate weights for the edges of phenotypes with diseases in PDB network. We evaluate our proposed method in terms of Normalized Discounted Cumulative Gain (NDCG), and demonstrate that we outperform the previously known disease ranking method called Phenomizer.
Collapse
|