201
|
Emmert-Streib F, Dehmer M. Enhancing systems medicine beyond genotype data by dynamic patient signatures: having information and using it too. Front Genet 2013; 4:241. [PMID: 24312119 PMCID: PMC3832803 DOI: 10.3389/fgene.2013.00241] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2013] [Accepted: 10/24/2013] [Indexed: 01/08/2023] Open
Abstract
In order to establish systems medicine, based on the results and insights from basic biological research applicable for a medical and a clinical patient care, it is essential to measure patient-based data that represent the molecular and cellular state of the patient's pathology. In this paper, we discuss potential limitations of the sole usage of static genotype data, e.g., from next-generation sequencing, for translational research. The hypothesis advocated in this paper is that dynOmics data, i.e., high-throughput data that are capable of capturing dynamic aspects of the activity of samples from patients, are important for enabling personalized medicine by complementing genotype data.
Collapse
Affiliation(s)
- Frank Emmert-Streib
- Computational Biology and Machine Learning Laboratory, Faculty of Medicine, Health and Life Sciences, Center for Cancer Research and Cell Biology, School of Medicine, Dentistry and Biomedical Sciences, Queen's University BelfastBelfast, UK
| | - Matthias Dehmer
- Institute for Bioinformatics and Translational Research, UMITHall in Tyrol, Austria
| |
Collapse
|
202
|
Vlasblom J, Jin K, Kassir S, Babu M. Exploring mitochondrial system properties of neurodegenerative diseases through interactome mapping. J Proteomics 2013; 100:8-24. [PMID: 24262152 DOI: 10.1016/j.jprot.2013.11.008] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2013] [Revised: 10/08/2013] [Accepted: 11/06/2013] [Indexed: 12/20/2022]
Abstract
UNLABELLED Mitochondria are double membraned, dynamic organelles that are required for a large number of cellular processes, and defects in their function have emerged as causative factors for a growing number of human disorders and are highly associated with cancer, metabolic, and neurodegenerative (ND) diseases. Biochemical and genetic investigations have uncovered small numbers of candidate mitochondrial proteins (MPs) involved in ND disease, but given the diversity of processes affected by MP function and the difficulty of detecting interactions involving these proteins, many more likely remain unknown. However, high-throughput proteomic and genomic approaches developed in genetically tractable model prokaryotes and lower eukaryotes have proven to be effective tools for querying the physical (protein-protein) and functional (gene-gene) relationships between diverse types of proteins, including cytosolic and membrane proteins. In this review, we highlight how experimental and computational approaches developed recently by our group and others can be effectively used towards elucidating the mitochondrial interactome in an unbiased and systematic manner to uncover network-based connections. We discuss how the knowledge from the resulting interaction networks can effectively contribute towards the identification of new mitochondrial disease gene candidates, and thus further clarify the role of mitochondrial biology and the complex etiologies of ND disease. BIOLOGICAL SIGNIFICANCE Biochemical and genetic investigations have uncovered small numbers of candidate mitochondrial proteins (MPs) involved in neurodegenerative (ND) diseases, but given the diversity of processes affected by MP function and the difficulty of detecting interactions involving these proteins, many more likely remain unknown. Large-scale proteomic and genomic approaches developed in model prokaryotes and lower eukaryotes have proven to be effective tools for querying the physical (protein-protein) and functional (gene-gene) relationships between diverse types of proteins. Extension of this new framework to the mitochondrial sub-system in human will likewise provide a universally informative systems-level view of the physical and functional landscape for exploring the evolutionary principles underlying mitochondrial function. In this review, we highlight how experimental and computational approaches developed recently by our group and others can be effectively used towards elucidating the mitochondrial interactome in an unbiased and systematic manner to uncover network-based connections. We anticipate that the knowledge from these resulting interaction networks can effectively contribute towards the identification of new mitochondrial disease gene candidates, and thus foster a deeper molecular understanding of mitochondrial biology as well as the etiology of mitochondrial diseases. This article is part of a Special Issue: Can Proteomics Fill the Gap Between Genomics and Phenotypes?
Collapse
Affiliation(s)
- James Vlasblom
- Department of Biochemistry, Research and Innovation Centre, University of Regina, Regina, Saskatchewan S4S 0A2, Canada
| | - Ke Jin
- Department of Biochemistry, Research and Innovation Centre, University of Regina, Regina, Saskatchewan S4S 0A2, Canada; Banting and Best Department of Medical Research, University of Toronto, Toronto, Ontario M5S 3E1, Canada; Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario M5S 3E1, Canada
| | - Sandy Kassir
- Department of Biochemistry, Research and Innovation Centre, University of Regina, Regina, Saskatchewan S4S 0A2, Canada
| | - Mohan Babu
- Department of Biochemistry, Research and Innovation Centre, University of Regina, Regina, Saskatchewan S4S 0A2, Canada.
| |
Collapse
|
203
|
Systems biology as an integrated platform for bioinformatics, systems synthetic biology, and systems metabolic engineering. Cells 2013; 2:635-88. [PMID: 24709875 PMCID: PMC3972654 DOI: 10.3390/cells2040635] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2013] [Revised: 09/12/2013] [Accepted: 09/19/2013] [Indexed: 01/11/2023] Open
Abstract
Systems biology aims at achieving a system-level understanding of living organisms and applying this knowledge to various fields such as synthetic biology, metabolic engineering, and medicine. System-level understanding of living organisms can be derived from insight into: (i) system structure and the mechanism of biological networks such as gene regulation, protein interactions, signaling, and metabolic pathways; (ii) system dynamics of biological networks, which provides an understanding of stability, robustness, and transduction ability through system identification, and through system analysis methods; (iii) system control methods at different levels of biological networks, which provide an understanding of systematic mechanisms to robustly control system states, minimize malfunctions, and provide potential therapeutic targets in disease treatment; (iv) systematic design methods for the modification and construction of biological networks with desired behaviors, which provide system design principles and system simulations for synthetic biology designs and systems metabolic engineering. This review describes current developments in systems biology, systems synthetic biology, and systems metabolic engineering for engineering and biology researchers. We also discuss challenges and future prospects for systems biology and the concept of systems biology as an integrated platform for bioinformatics, systems synthetic biology, and systems metabolic engineering.
Collapse
|
204
|
Jacunski A, Tatonetti NP. Connecting the dots: applications of network medicine in pharmacology and disease. Clin Pharmacol Ther 2013; 94:659-69. [PMID: 23995266 DOI: 10.1038/clpt.2013.168] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2013] [Accepted: 08/16/2013] [Indexed: 11/09/2022]
Abstract
In 2011, >2.5 million people died from only 15 causes in the United States. Ten of these involved complex or infectious diseases for which there is insufficient knowledge or treatment, such as heart disease, influenza, and Alzheimer's disease.(1) Complex diseases have been difficult to understand due to their multifarious genetic and molecular fingerprints, while certain infectious agents have evolved to elude treatment and prophylaxis. Network medicine provides a macroscopic approach to understanding and treating such illnesses. It integrates experimental data on gene, protein, and metabolic interactions with clinical knowledge of disease and pharmacology in order to extend the understanding of diseases and their treatments. The resulting "big picture" allows for the development of computational and mathematical methods to identify novel disease pathways and predict patient drug response, among others. In this review, we discuss recent advances in network medicine.
Collapse
Affiliation(s)
- A Jacunski
- 1] Integrated Program in Cellular, Molecular and Biomedical Studies, Columbia University Medical Center, New York, New York, USA [2] Department of Biomedical Informatics, Columbia University Medical Center, New York, New York, USA [3] Department of Systems Biology, Columbia University Medical Center, New York, New York, USA [4] Department of Medicine, Columbia University Medical Center, New York, New York, USA
| | | |
Collapse
|
205
|
Ballouz S, Liu JY, George RA, Bains N, Liu A, Oti M, Gaeta B, Fatkin D, Wouters MA. Gentrepid V2.0: a web server for candidate disease gene prediction. BMC Bioinformatics 2013; 14:249. [PMID: 23947436 PMCID: PMC3844418 DOI: 10.1186/1471-2105-14-249] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2012] [Accepted: 08/13/2013] [Indexed: 01/06/2023] Open
Abstract
Background Candidate disease gene prediction is a rapidly developing area of bioinformatics research with the potential to deliver great benefits to human health. As experimental studies detecting associations between genetic intervals and disease proliferate, better bioinformatic techniques that can expand and exploit the data are required. Description Gentrepid is a web resource which predicts and prioritizes candidate disease genes for both Mendelian and complex diseases. The system can take input from linkage analysis of single genetic intervals or multiple marker loci from genome-wide association studies. The underlying database of the Gentrepid tool sources data from numerous gene and protein resources, taking advantage of the wealth of biological information available. Using known disease gene information from OMIM, the system predicts and prioritizes disease gene candidates that participate in the same protein pathways or share similar protein domains. Alternatively, using an ab initio approach, the system can detect enrichment of these protein annotations without prior knowledge of the phenotype. Conclusions The system aims to integrate the wealth of protein information currently available with known and novel phenotype/genotype information to acquire knowledge of biological mechanisms underpinning disease. We have updated the system to facilitate analysis of GWAS data and the study of complex diseases. Application of the system to GWAS data on hypertension using the ICBP data is provided as an example. An interesting prediction is a ZIP transporter additional to the one found by the ICBP analysis. The webserver URL is https://www.gentrepid.org/.
Collapse
Affiliation(s)
- Sara Ballouz
- School of Medicine, Deakin University, Geelong, VIC 3217, Australia.
| | | | | | | | | | | | | | | | | |
Collapse
|
206
|
Unraveling DNA damage response-signaling networks through systems approaches. Arch Toxicol 2013; 87:1635-48. [DOI: 10.1007/s00204-013-1106-5] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2013] [Accepted: 07/15/2013] [Indexed: 10/26/2022]
|
207
|
Ito YA, Walter MA. Genomics and anterior segment dysgenesis: a review. Clin Exp Ophthalmol 2013; 42:13-24. [PMID: 24433355 DOI: 10.1111/ceo.12152] [Citation(s) in RCA: 75] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2013] [Accepted: 05/05/2013] [Indexed: 01/16/2023]
Abstract
Anterior segment dysgenesis refers to a spectrum of disorders affecting structures in the anterior segment of the eye including the iris, cornea and trabecular meshwork. Approximately 50% of patients with anterior segment dysgenesis develop glaucoma. Traditional genetic methods using linkage analysis and family-based studies have identified numerous disease-causing genes such as PAX6, FOXC1 and PITX2. Despite these advances, phenotypic and genotypic heterogeneity pose continuing challenges to understand the mechanisms underlying the complexity of anterior segment dysgenesis disorders. Genomic methods, such as genome-wide association studies, are potentially an effective tool to understand anterior segment dysgenesis and the individual susceptibility to the development of glaucoma. In this review, we provide the rationale, as well as the challenges, to utilizing genomic methods to examine anterior segment dysgenesis disorders.
Collapse
Affiliation(s)
- Yoko A Ito
- Department of Medical Genetics, University of Alberta, Edmonton, Canada
| | | |
Collapse
|
208
|
Guo Y, Wei X, Das J, Grimson A, Lipkin S, Clark A, Yu H. Dissecting disease inheritance modes in a three-dimensional protein network challenges the "guilt-by-association" principle. Am J Hum Genet 2013; 93:78-89. [PMID: 23791107 DOI: 10.1016/j.ajhg.2013.05.022] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2012] [Revised: 05/02/2013] [Accepted: 05/23/2013] [Indexed: 10/26/2022] Open
Abstract
To better understand different molecular mechanisms by which mutations lead to various human diseases, we classified 82,833 disease-associated mutations according to their inheritance modes (recessive versus dominant) and molecular types (in-frame [missense point mutations and in-frame indels] versus truncating [nonsense mutations and frameshift indels]) and systematically examined the effects of different classes of disease mutations in a three-dimensional protein interactome network with the atomic-resolution interface resolved for each interaction. We found that although recessive mutations affecting the interaction interface of two interacting proteins tend to cause the same disease, this widely accepted "guilt-by-association" principle does not apply to dominant mutations. Furthermore, recessive truncating mutations in regions encoding the same interface are much more likely to cause the same disease, even for interfaces close to the N terminus of the protein. Conversely, dominant truncating mutations tend to be enriched in regions encoding areas between interfaces. These results suggest that a significant fraction of truncating mutations can generate functional protein products. For example, TRIM27, a known cancer-associated protein, interacts with three proteins (MID2, TRIM42, and SIRPA) through two different interfaces. A dominant truncating mutation (c.1024delT [p.Tyr342Thrfs*30]) associated with ovarian carcinoma is located between the regions encoding the two interfaces; the altered protein retains its interaction with MID2 and TRIM42 through the first interface but loses its interaction with SIRPA through the second interface. Our findings will help clarify the molecular mechanisms of thousands of disease-associated genes and their tens of thousands of mutations, especially for those carrying truncating mutations, often erroneously considered "knockout" alleles.
Collapse
|
209
|
Nie Y, Yu J. Mining breast cancer genes with a network based noise-tolerant approach. BMC SYSTEMS BIOLOGY 2013; 7:49. [PMID: 23799982 PMCID: PMC3702465 DOI: 10.1186/1752-0509-7-49] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/28/2012] [Accepted: 06/21/2013] [Indexed: 12/22/2022]
Abstract
BACKGROUND Mining novel breast cancer genes is an important task in breast cancer research. Many approaches prioritize candidate genes based on their similarity to known cancer genes, usually by integrating multiple data sources. However, different types of data often contain varying degrees of noise. For effective data integration, it's important to design methods that work robustly with respect to noise. RESULTS Gene Ontology (GO) annotations were often utilized in cancer gene mining works. However, the vast majority of GO annotations were computationally derived, thus not completely accurate. A set of genes annotated with breast cancer enriched GO terms was adopted here as a set of source data with realistic noise. A novel noise tolerant approach was proposed to rank candidate breast cancer genes using noisy source data within the framework of a comprehensive human Protein-Protein Interaction (PPI) network. Performance of the proposed method was quantitatively evaluated by comparing it with the more established random walk approach. Results showed that the proposed method exhibited better performance in ranking known breast cancer genes and higher robustness against data noise than the random walk approach. When noise started to increase, the proposed method was able to maintained relatively stable performance, while the random walk approach showed drastic performance decline; when noise increased to a large extent, the proposed method was still able to achieve better performance than random walk did. CONCLUSIONS A novel noise tolerant method was proposed to mine breast cancer genes. Compared to the well established random walk approach, it showed better performance in correctly ranking cancer genes and worked robustly with respect to noise within source data. To the best of our knowledge, it's the first such effort to quantitatively analyze noise tolerance between different breast cancer gene mining methods. The sorted gene list can be valuable for breast cancer research. The proposed quantitative noise analysis method may also prove useful for other data integration efforts. It is hoped that the current work can lead to more discussions about influence of data noise on different computational methods for mining disease genes.
Collapse
Affiliation(s)
- Yaling Nie
- National Key Laboratory of Biochemical Engineering, Institute of Process Engineering, Chinese Academy of Sciences, Beijing 100190, China
| | | |
Collapse
|
210
|
Prediction of novel genes associated with negative regulators of toll-like receptors-induced inflammation based on endotoxin tolerance. Inflammation 2013; 35:1889-99. [PMID: 22843012 DOI: 10.1007/s10753-012-9511-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
Abstract
Prior exposure of innate immune cells to lipopolysaccharide (LPS) has caused them to be refractory to further endotoxin stimulation, also termed endotoxin tolerance (ET). Bacterial LPS signals through Toll-like receptor (TLR) 4, which was thought to enable the innate immune system to deal with invasive pathogens and to restrain systemic inflammation efficiently. We established a robust model of ET and determined the level of TNF-α and IL-6 in cultured human monocytes. Then, microarray assay was applied to assess gene expression in this model. The results showed that 356 non-tolerizable genes were differentially expressed at a high level in tolerant monocytes. The genes selected were classified into several categories based on gene ontology (GO) and KEGG pathway database. And then literature annotations, protein-protein interaction (PPI) network, and functional consistency were applied to analyze the non-tolerizable genes. Finally, the microarray results were verified by quantitative real-time PCR of seven representative genes, including the two candidate genes, Spry2 and Smurf2, which were supposed to play a critical role in TLRs-induced inflammation based on literature retrieval. Our results would provide useful information for further analysis of regulating TLRs-induced inflammation, and would facilitate the study of associated mechanisms.
Collapse
|
211
|
Gupta VA, Hnia K, Smith LL, Gundry SR, McIntire JE, Shimazu J, Bass JR, Talbot EA, Amoasii L, Goldman NE, Laporte J, Beggs AH. Loss of catalytically inactive lipid phosphatase myotubularin-related protein 12 impairs myotubularin stability and promotes centronuclear myopathy in zebrafish. PLoS Genet 2013; 9:e1003583. [PMID: 23818870 PMCID: PMC3688503 DOI: 10.1371/journal.pgen.1003583] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2012] [Accepted: 05/07/2013] [Indexed: 01/08/2023] Open
Abstract
X-linked myotubular myopathy (XLMTM) is a congenital disorder caused by mutations of the myotubularin gene, MTM1. Myotubularin belongs to a large family of conserved lipid phosphatases that include both catalytically active and inactive myotubularin-related proteins (i.e., "MTMRs"). Biochemically, catalytically inactive MTMRs have been shown to form heteroligomers with active members within the myotubularin family through protein-protein interactions. However, the pathophysiological significance of catalytically inactive MTMRs remains unknown in muscle. By in vitro as well as in vivo studies, we have identified that catalytically inactive myotubularin-related protein 12 (MTMR12) binds to myotubularin in skeletal muscle. Knockdown of the mtmr12 gene in zebrafish resulted in skeletal muscle defects and impaired motor function. Analysis of mtmr12 morphant fish showed pathological changes with central nucleation, disorganized Triads, myofiber hypotrophy and whorled membrane structures similar to those seen in X-linked myotubular myopathy. Biochemical studies showed that deficiency of MTMR12 results in reduced levels of myotubularin protein in zebrafish and mammalian C2C12 cells. Loss of myotubularin also resulted in reduction of MTMR12 protein in C2C12 cells, mice and humans. Moreover, XLMTM mutations within the myotubularin interaction domain disrupted binding to MTMR12 in cell culture. Analysis of human XLMTM patient myotubes showed that mutations that disrupt the interaction between myotubularin and MTMR12 proteins result in reduction of both myotubularin and MTMR12. These studies strongly support the concept that interactions between myotubularin and MTMR12 are required for the stability of their functional protein complex in normal skeletal muscles. This work highlights an important physiological function of catalytically inactive phosphatases in the pathophysiology of myotubular myopathy and suggests a novel therapeutic approach through identification of drugs that could stabilize the myotubularin-MTMR12 complex and hence ameliorate this disorder.
Collapse
Affiliation(s)
- Vandana A. Gupta
- Genomics Program and Division of Genetics, The Manton Center for Orphan Disease Research, Boston Children's Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Karim Hnia
- Department of Translational Medicine and Neurogenetics, Institut de Génétique et de Biologie Moléculaire et Cellulaire, Inserm U964, CNRS UMR7104, Université de Strasbourg, Collège de France, Chaire de Génétique Humaine, Illkirch, France
| | - Laura L. Smith
- Genomics Program and Division of Genetics, The Manton Center for Orphan Disease Research, Boston Children's Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Stacey R. Gundry
- Genomics Program and Division of Genetics, The Manton Center for Orphan Disease Research, Boston Children's Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Jessica E. McIntire
- Genomics Program and Division of Genetics, The Manton Center for Orphan Disease Research, Boston Children's Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Junko Shimazu
- Genomics Program and Division of Genetics, The Manton Center for Orphan Disease Research, Boston Children's Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Jessica R. Bass
- Genomics Program and Division of Genetics, The Manton Center for Orphan Disease Research, Boston Children's Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Ethan A. Talbot
- Genomics Program and Division of Genetics, The Manton Center for Orphan Disease Research, Boston Children's Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Leonela Amoasii
- Department of Translational Medicine and Neurogenetics, Institut de Génétique et de Biologie Moléculaire et Cellulaire, Inserm U964, CNRS UMR7104, Université de Strasbourg, Collège de France, Chaire de Génétique Humaine, Illkirch, France
| | - Nathaniel E. Goldman
- Genomics Program and Division of Genetics, The Manton Center for Orphan Disease Research, Boston Children's Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Jocelyn Laporte
- Department of Translational Medicine and Neurogenetics, Institut de Génétique et de Biologie Moléculaire et Cellulaire, Inserm U964, CNRS UMR7104, Université de Strasbourg, Collège de France, Chaire de Génétique Humaine, Illkirch, France
| | - Alan H. Beggs
- Genomics Program and Division of Genetics, The Manton Center for Orphan Disease Research, Boston Children's Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
- * E-mail:
| |
Collapse
|
212
|
Zhang Z, Zhao Z, Liu B, Li D, Zhang D, Chen H, Liu D. Systems biomedicine: It’s your turn—Recent progress in systems biomedicine. QUANTITATIVE BIOLOGY 2013. [DOI: 10.1007/s40484-013-0009-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
213
|
Baranzini S, Khankhanian P, Patsopoulos N, Li M, Stankovich J, Cotsapas C, Søndergaard H, Ban M, Barizzone N, Bergamaschi L, Booth D, Buck D, Cavalla P, Celius E, Comabella M, Comi G, Compston A, Cournu-Rebeix I, D’alfonso S, Damotte V, Din L, Dubois B, Elovaara I, Esposito F, Fontaine B, Franke A, Goris A, Gourraud PA, Graetz C, Guerini F, Guillot-Noel L, Hafler D, Hakonarson H, Hall P, Hamsten A, Harbo H, Hemmer B, Hillert J, Kemppinen A, Kockum I, Koivisto K, Larsson M, Lathrop M, Leone M, Lill C, Macciardi F, Martin R, Martinelli V, Martinelli-Boneschi F, McCauley J, Myhr KM, Naldi P, Olsson T, Oturai A, Pericak-Vance M, Perla F, Reunanen M, Saarela J, Saker-Delye S, Salvetti M, Sellebjerg F, Sørensen P, Spurkland A, Stewart G, Taylor B, Tienari P, Winkelmann J, Zipp F, Ivinson A, Haines J, Sawcer S, DeJager P, Hauser S, Oksenberg J. Network-based multiple sclerosis pathway analysis with GWAS data from 15,000 cases and 30,000 controls. Am J Hum Genet 2013; 92:854-65. [PMID: 23731539 PMCID: PMC3958952 DOI: 10.1016/j.ajhg.2013.04.019] [Citation(s) in RCA: 121] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2013] [Revised: 04/04/2013] [Accepted: 04/23/2013] [Indexed: 02/03/2023] Open
Abstract
Multiple sclerosis (MS) is an inflammatory CNS disease with a substantial genetic component, originally mapped to only the human leukocyte antigen (HLA) region. In the last 5 years, a total of seven genome-wide association studies and one meta-analysis successfully identified 57 non-HLA susceptibility loci. Here, we merged nominal statistical evidence of association and physical evidence of interaction to conduct a protein-interaction-network-based pathway analysis (PINBPA) on two large genetic MS studies comprising a total of 15,317 cases and 29,529 controls. The distribution of nominally significant loci at the gene level matched the patterns of extended linkage disequilibrium in regions of interest. We found that products of genome-wide significantly associated genes are more likely to interact physically and belong to the same or related pathways. We next searched for subnetworks (modules) of genes (and their encoded proteins) enriched with nominally associated loci within each study and identified those modules in common between the two studies. We demonstrate that these modules are more likely to contain genes with bona fide susceptibility variants and, in addition, identify several high-confidence candidates (including BCL10, CD48, REL, TRAF3, and TEC). PINBPA is a powerful approach to gaining further insights into the biology of associated genes and to prioritizing candidates for subsequent genetic studies of complex traits.
Collapse
|
214
|
Csermely P, Korcsmáros T, Kiss HJM, London G, Nussinov R. Structure and dynamics of molecular networks: a novel paradigm of drug discovery: a comprehensive review. Pharmacol Ther 2013; 138:333-408. [PMID: 23384594 PMCID: PMC3647006 DOI: 10.1016/j.pharmthera.2013.01.016] [Citation(s) in RCA: 512] [Impact Index Per Article: 46.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2013] [Accepted: 01/22/2013] [Indexed: 02/02/2023]
Abstract
Despite considerable progress in genome- and proteome-based high-throughput screening methods and in rational drug design, the increase in approved drugs in the past decade did not match the increase of drug development costs. Network description and analysis not only give a systems-level understanding of drug action and disease complexity, but can also help to improve the efficiency of drug design. We give a comprehensive assessment of the analytical tools of network topology and dynamics. The state-of-the-art use of chemical similarity, protein structure, protein-protein interaction, signaling, genetic interaction and metabolic networks in the discovery of drug targets is summarized. We propose that network targeting follows two basic strategies. The "central hit strategy" selectively targets central nodes/edges of the flexible networks of infectious agents or cancer cells to kill them. The "network influence strategy" works against other diseases, where an efficient reconfiguration of rigid networks needs to be achieved by targeting the neighbors of central nodes/edges. It is shown how network techniques can help in the identification of single-target, edgetic, multi-target and allo-network drug target candidates. We review the recent boom in network methods helping hit identification, lead selection optimizing drug efficacy, as well as minimizing side-effects and drug toxicity. Successful network-based drug development strategies are shown through the examples of infections, cancer, metabolic diseases, neurodegenerative diseases and aging. Summarizing >1200 references we suggest an optimized protocol of network-aided drug development, and provide a list of systems-level hallmarks of drug quality. Finally, we highlight network-related drug development trends helping to achieve these hallmarks by a cohesive, global approach.
Collapse
Affiliation(s)
- Peter Csermely
- Department of Medical Chemistry, Semmelweis University, P.O. Box 260, H-1444 Budapest 8, Hungary.
| | | | | | | | | |
Collapse
|
215
|
Marcotte E, Boone C, Babu MM, Gavin AC. Network Biology editorial 2013. MOLECULAR BIOSYSTEMS 2013; 9:1557-8. [PMID: 23712464 DOI: 10.1039/c3mb90018e] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
216
|
Lehne B, Schlitt T. Breaking free from the chains of pathway annotation: de novo pathway discovery for the analysis of disease processes. Pharmacogenomics 2013; 13:1967-78. [PMID: 23215889 DOI: 10.2217/pgs.12.170] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023] Open
Abstract
Interpreting the biological implications of high-throughput experiments such as gene-expression studies, genome-wide association studies and large-scale sequencing studies is not trivial. Gene-set and pathway analyses are useful tools to support the interpretation of such experiments, but rely on curated pathways or gene sets. The recent development of de novo pathway discovery methods aims to overcome this limitation. This article provides an overview of the methods currently available and reviews the advantages and challenges of this approach. In detail, it highlights the particular issues of de novo pathway discovery based on genome-wide association studies data, for which multiple different strategies have been proposed.
Collapse
Affiliation(s)
- Benjamin Lehne
- Bioinformatics Group, Department of Medical & Molecular Genetics, 8th Floor Tower Wing Guy's Hospital, London SE1 9RT, UK
| | | |
Collapse
|
217
|
Kwon HS, Johnson TV, Tomarev SI. Myocilin stimulates osteogenic differentiation of mesenchymal stem cells through mitogen-activated protein kinase signaling. J Biol Chem 2013; 288:16882-16894. [PMID: 23629661 DOI: 10.1074/jbc.m112.422972] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
Myocilin is a secreted glycoprotein that is expressed in ocular and non-ocular tissues. Mutations in the MYOCILIN gene may lead to juvenile- and adult-onset primary open-angle glaucoma. Here we report that myocilin is expressed in bone marrow-derived mesenchymal stem cells (MSCs) and plays a role in their differentiation into osteoblasts in vitro and in osteogenesis in vivo. Expression of myocilin was detected in MSCs derived from mouse, rat, and human bone marrow, with human MSCs exhibiting the highest level of myocilin expression. Expression of myocilin rose during the course of human MSC differentiation into osteoblasts but not into adipocytes, and treatment with exogenous myocilin further enhanced osteogenesis. MSCs derived from Myoc-null mice had a reduced ability to differentiate into the osteoblastic lineage, which was partially rescued by exogenous extracellular myocilin treatment. Myocilin also stimulated osteogenic differentiation of wild-type MSCs, which was associated with activation of the p38, Erk1/2, and JNK MAP kinase signaling pathways as well as up-regulated expression of the osteogenic transcription factors Runx2 and Dlx5. Finally, cortical bone thickness and trabecular volume, as well as the expression level of osteopontin, a known factor of bone remodeling and osteoblast differentiation, were reduced dramatically in the femurs of Myoc-null mice compared with wild-type mice. These data suggest that myocilin should be considered as a target for improving the bone regenerative potential of MSCs and may identify a new role for myocilin in bone formation and/or maintenance in vivo.
Collapse
Affiliation(s)
- Heung Sun Kwon
- Section of Retinal Ganglion Cell Biology, Laboratory of Retinal Cell and Molecular Biology, National Eye Institute, National Institutes of Health, Bethesda, Maryland 20892
| | - Thomas V Johnson
- Section of Retinal Ganglion Cell Biology, Laboratory of Retinal Cell and Molecular Biology, National Eye Institute, National Institutes of Health, Bethesda, Maryland 20892
| | - Stanislav I Tomarev
- Section of Retinal Ganglion Cell Biology, Laboratory of Retinal Cell and Molecular Biology, National Eye Institute, National Institutes of Health, Bethesda, Maryland 20892.
| |
Collapse
|
218
|
Effective identification of Akt interacting proteins by two-step chemical crosslinking, co-immunoprecipitation and mass spectrometry. PLoS One 2013; 8:e61430. [PMID: 23613850 PMCID: PMC3629208 DOI: 10.1371/journal.pone.0061430] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2013] [Accepted: 03/12/2013] [Indexed: 11/19/2022] Open
Abstract
Akt is a critical protein for cell survival and known to interact with various proteins. However, Akt binding partners that modulate or regulate Akt activation have not been fully elucidated. Identification of Akt-interacting proteins has been customarily achieved by co-immunoprecipitation combined with western blot and/or MS analysis. An intrinsic problem of the method is loss of interacting proteins during procedures to remove non-specific proteins. Moreover, antibody contamination often interferes with the detection of less abundant proteins. Here, we developed a novel two-step chemical crosslinking strategy to overcome these problems which resulted in a dramatic improvement in identifying Akt interacting partners. Akt antibody was first immobilized on protein A/G beads using disuccinimidyl suberate and allowed to bind to cellular Akt along with its interacting proteins. Subsequently, dithiobis[succinimidylpropionate], a cleavable crosslinker, was introduced to produce stable complexes between Akt and binding partners prior to the SDS-PAGE and nanoLC-MS/MS analysis. This approach enabled identification of ten Akt partners from cell lysates containing as low as 1.5 mg proteins, including two new potential Akt interacting partners. None of these but one protein was detectable without crosslinking procedures. The present method provides a sensitive and effective tool to probe Akt-interacting proteins. This strategy should also prove useful for other protein interactions, particularly those involving less abundant or weakly associating partners.
Collapse
|
219
|
Zhu J, Qin Y, Liu T, Wang J, Zheng X. Prioritization of candidate disease genes by topological similarity between disease and protein diffusion profiles. BMC Bioinformatics 2013; 14 Suppl 5:S5. [PMID: 23734762 PMCID: PMC3622672 DOI: 10.1186/1471-2105-14-s5-s5] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Identification of gene-phenotype relationships is a fundamental challenge in human health clinic. Based on the observation that genes causing the same or similar phenotypes tend to correlate with each other in the protein-protein interaction network, a lot of network-based approaches were proposed based on different underlying models. A recent comparative study showed that diffusion-based methods achieve the state-of-the-art predictive performance. RESULTS In this paper, a new diffusion-based method was proposed to prioritize candidate disease genes. Diffusion profile of a disease was defined as the stationary distribution of candidate genes given a random walk with restart where similarities between phenotypes are incorporated. Then, candidate disease genes are prioritized by comparing their diffusion profiles with that of the disease. Finally, the effectiveness of our method was demonstrated through the leave-one-out cross-validation against control genes from artificial linkage intervals and randomly chosen genes. Comparative study showed that our method achieves improved performance compared to some classical diffusion-based methods. To further illustrate our method, we used our algorithm to predict new causing genes of 16 multifactorial diseases including Prostate cancer and Alzheimer's disease, and the top predictions were in good consistent with literature reports. CONCLUSIONS Our study indicates that integration of multiple information sources, especially the phenotype similarity profile data, and introduction of global similarity measure between disease and gene diffusion profiles are helpful for prioritizing candidate disease genes. AVAILABILITY Programs and data are available upon request.
Collapse
Affiliation(s)
- Jie Zhu
- Department of Mathematics, Shanghai Normal University, Shanghai, China
| | | | | | | | | |
Collapse
|
220
|
Emig D, Ivliev A, Pustovalova O, Lancashire L, Bureeva S, Nikolsky Y, Bessarabova M. Drug target prediction and repositioning using an integrated network-based approach. PLoS One 2013; 8:e60618. [PMID: 23593264 PMCID: PMC3617101 DOI: 10.1371/journal.pone.0060618] [Citation(s) in RCA: 143] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2013] [Accepted: 02/28/2013] [Indexed: 11/18/2022] Open
Abstract
The discovery of novel drug targets is a significant challenge in drug development. Although the human genome comprises approximately 30,000 genes, proteins encoded by fewer than 400 are used as drug targets in the treatment of diseases. Therefore, novel drug targets are extremely valuable as the source for first in class drugs. On the other hand, many of the currently known drug targets are functionally pleiotropic and involved in multiple pathologies. Several of them are exploited for treating multiple diseases, which highlights the need for methods to reliably reposition drug targets to new indications. Network-based methods have been successfully applied to prioritize novel disease-associated genes. In recent years, several such algorithms have been developed, some focusing on local network properties only, and others taking the complete network topology into account. Common to all approaches is the understanding that novel disease-associated candidates are in close overall proximity to known disease genes. However, the relevance of these methods to the prediction of novel drug targets has not yet been assessed. Here, we present a network-based approach for the prediction of drug targets for a given disease. The method allows both repositioning drug targets known for other diseases to the given disease and the prediction of unexploited drug targets which are not used for treatment of any disease. Our approach takes as input a disease gene expression signature and a high-quality interaction network and outputs a prioritized list of drug targets. We demonstrate the high performance of our method and highlight the usefulness of the predictions in three case studies. We present novel drug targets for scleroderma and different types of cancer with their underlying biological processes. Furthermore, we demonstrate the ability of our method to identify non-suspected repositioning candidates using diabetes type 1 as an example.
Collapse
Affiliation(s)
- Dorothea Emig
- IP & Science, Thomson Reuters, Carlsbad, California, United States of America
| | - Alexander Ivliev
- IP & Science, Thomson Reuters, Carlsbad, California, United States of America
| | - Olga Pustovalova
- IP & Science, Thomson Reuters, Carlsbad, California, United States of America
| | - Lee Lancashire
- IP & Science, Thomson Reuters, Carlsbad, California, United States of America
| | - Svetlana Bureeva
- IP & Science, Thomson Reuters, Carlsbad, California, United States of America
| | - Yuri Nikolsky
- IP & Science, Thomson Reuters, Carlsbad, California, United States of America
| | - Marina Bessarabova
- IP & Science, Thomson Reuters, Carlsbad, California, United States of America
- * E-mail:
| |
Collapse
|
221
|
Lee Y, Li H, Li J, Rebman E, Achour I, Regan KE, Gamazon ER, Chen JL, Yang XH, Cox NJ, Lussier YA. Network models of genome-wide association studies uncover the topological centrality of protein interactions in complex diseases. J Am Med Inform Assoc 2013; 20:619-29. [PMID: 23355459 PMCID: PMC3721168 DOI: 10.1136/amiajnl-2012-001519] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open
Abstract
BACKGROUND While genome-wide association studies (GWAS) of complex traits have revealed thousands of reproducible genetic associations to date, these loci collectively confer very little of the heritability of their respective diseases and, in general, have contributed little to our understanding the underlying disease biology. Physical protein interactions have been utilized to increase our understanding of human Mendelian disease loci but have yet to be fully exploited for complex traits. METHODS We hypothesized that protein interaction modeling of GWAS findings could highlight important disease-associated loci and unveil the role of their network topology in the genetic architecture of diseases with complex inheritance. RESULTS Network modeling of proteins associated with the intragenic single nucleotide polymorphisms of the National Human Genome Research Institute catalog of complex trait GWAS revealed that complex trait associated loci are more likely to be hub and bottleneck genes in available, albeit incomplete, networks (OR=1.59, Fisher's exact test p < 2.24 × 10(-12)). Network modeling also prioritized novel type 2 diabetes (T2D) genetic variations from the Finland-USA Investigation of Non-Insulin-Dependent Diabetes Mellitus Genetics and the Wellcome Trust GWAS data, and demonstrated the enrichment of hubs and bottlenecks in prioritized T2D GWAS genes. The potential biological relevance of the T2D hub and bottleneck genes was revealed by their increased number of first degree protein interactions with known T2D genes according to several independent sources (p<0.01, probability of being first interactors of known T2D genes). CONCLUSION Virtually all common diseases are complex human traits, and thus the topological centrality in protein networks of complex trait genes has implications in genetics, personal genomics, and therapy.
Collapse
Affiliation(s)
- Younghee Lee
- Center for Biomedical Informatics, The University of Chicago, Chicago, Illinois, USA
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
222
|
Linghu B, Franzosa EA, Xia Y. Construction of functional linkage gene networks by data integration. Methods Mol Biol 2013. [PMID: 23192549 DOI: 10.1007/978-1-62703-107-3_14] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Networks of functional associations between genes have recently been successfully used for gene function and disease-related research. A typical approach for constructing such functional linkage gene networks (FLNs) is based on the integration of diverse high-throughput functional genomics datasets. Data integration is a nontrivial task due to the heterogeneous nature of the different data sources and their variable accuracy and completeness. The presence of correlations between data sources also adds another layer of complexity to the integration process. In this chapter we discuss an approach for constructing a human FLN from data integration and a subsequent application of the FLN to novel disease gene discovery. Similar approaches can be applied to nonhuman species and other discovery tasks.
Collapse
Affiliation(s)
- Bolan Linghu
- Translational Sciences Department, Novartis Institutes for BioMedical Research, Cambridge, MA, USA.
| | | | | |
Collapse
|
223
|
Kim KJ, Hwang D, Kim WU. Systems Approach to Rheumatoid Arthritis. JOURNAL OF RHEUMATIC DISEASES 2013. [DOI: 10.4078/jrd.2013.20.6.348] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Affiliation(s)
- Ki-Jo Kim
- Division of Rheumatology, Department of Internal Medicine, St. Vincent's Hospital, The Catholic University of Korea, Suwon, Korea
| | - Daehee Hwang
- Center for Systems Biology of Plant Senescence and Life History, Daegu Gyeongbuk Institute of Science & Technology, Daegu, Korea
| | - Wan-Uk Kim
- Division of Rheumatology, Department of Internal Medicine, St. Vincent's Hospital, The Catholic University of Korea, Suwon, Korea
| |
Collapse
|
224
|
Abstract
Proteins do not function in isolation; it is their interactions with one another and also with other molecules (e.g. DNA, RNA) that mediate metabolic and signaling pathways, cellular processes, and organismal systems. Due to their central role in biological function, protein interactions also control the mechanisms leading to healthy and diseased states in organisms. Diseases are often caused by mutations affecting the binding interface or leading to biochemically dysfunctional allosteric changes in proteins. Therefore, protein interaction networks can elucidate the molecular basis of disease, which in turn can inform methods for prevention, diagnosis, and treatment. In this chapter, we will describe the computational approaches to predict and map networks of protein interactions and briefly review the experimental methods to detect protein interactions. We will describe the application of protein interaction networks as a translational approach to the study of human disease and evaluate the challenges faced by these approaches.
Collapse
Affiliation(s)
- Mileidy W. Gonzalez
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, United States of America
| | - Maricel G. Kann
- Biological Sciences, University of Maryland, Baltimore County, Baltimore, Maryland, United States of America
- * E-mail:
| |
Collapse
|
225
|
Affiliation(s)
- Xiaoke Ma
- School of Computer Science and Technology, Xidian University, No. 2 South TaiBai Road, Xi'an, Shaanxi 710071, P.R. China
| | | |
Collapse
|
226
|
Cheng TMK, Gulati S, Agius R, Bates PA. Understanding cancer mechanisms through network dynamics. Brief Funct Genomics 2012; 11:543-60. [PMID: 22811516 DOI: 10.1093/bfgp/els025] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/11/2024] Open
Abstract
Cancer is a complex, multifaceted disease. Cellular systems are perturbed both during the onset and development of cancer, and the behavioural change of tumour cells usually involves a broad range of dynamic variations. To an extent, the difficulty of monitoring the systemic change has been alleviated by recent developments in the high-throughput technologies. At both the genomic as well as proteomic levels, the technological advances in microarray and mass spectrometry, in conjunction with computational simulations and the construction of human interactome maps have facilitated the progress of identifying disease-associated genes. On a systems level, computational approaches developed for network analysis are becoming especially useful for providing insights into the mechanism behind tumour development and metastasis. This review emphasizes network approaches that have been developed to study cancer and provides an overview of our current knowledge of protein-protein interaction networks, and how their systemic perturbation can be analysed by two popular network simulation methods: Boolean network and ordinary differential equations.
Collapse
Affiliation(s)
- Tammy M K Cheng
- Biomolecular Modelling Laboratory, Cancer Research UK London Research Institute, Lincoln's Inn Fields, London WC2A 3LY, UK
| | | | | | | |
Collapse
|
227
|
Abstract
Molecular network data are increasingly becoming available, necessitating the development of well performing computational tools for their analyses. Such tools enabled conceptually different approaches for exploring human diseases to be undertaken, in particular, those that study the relationship between a multitude of biomolecules within a cell. Hence, a new field of network biology has emerged as part of systems biology, aiming to untangle the complexity of cellular network organization. We survey current network analysis methods that aim to give insight into human disease.
Collapse
Affiliation(s)
- Vuk Janjić
- Department of Computing, Imperial College London, 180 Queen's Gate, SW7 2AZ London, UK
| | | |
Collapse
|
228
|
Magger O, Waldman YY, Ruppin E, Sharan R. Enhancing the prioritization of disease-causing genes through tissue specific protein interaction networks. PLoS Comput Biol 2012; 8:e1002690. [PMID: 23028288 PMCID: PMC3459874 DOI: 10.1371/journal.pcbi.1002690] [Citation(s) in RCA: 115] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2011] [Accepted: 07/28/2012] [Indexed: 01/07/2023] Open
Abstract
The prioritization of candidate disease-causing genes is a fundamental challenge in the post-genomic era. Current state of the art methods exploit a protein-protein interaction (PPI) network for this task. They are based on the observation that genes causing phenotypically-similar diseases tend to lie close to one another in a PPI network. However, to date, these methods have used a static picture of human PPIs, while diseases impact specific tissues in which the PPI networks may be dramatically different. Here, for the first time, we perform a large-scale assessment of the contribution of tissue-specific information to gene prioritization. By integrating tissue-specific gene expression data with PPI information, we construct tissue-specific PPI networks for 60 tissues and investigate their prioritization power. We find that tissue-specific PPI networks considerably improve the prioritization results compared to those obtained using a generic PPI network. Furthermore, they allow predicting novel disease-tissue associations, pointing to sub-clinical tissue effects that may escape early detection.
Collapse
Affiliation(s)
- Oded Magger
- Blavatnik School of Computer Science, Tel Aviv University, Tel Aviv, Israel.
| | | | | | | |
Collapse
|
229
|
Guney E, Oliva B. Exploiting protein-protein interaction networks for genome-wide disease-gene prioritization. PLoS One 2012; 7:e43557. [PMID: 23028459 PMCID: PMC3448640 DOI: 10.1371/journal.pone.0043557] [Citation(s) in RCA: 78] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2012] [Accepted: 07/23/2012] [Indexed: 11/23/2022] Open
Abstract
Complex genetic disorders often involve products of multiple genes acting cooperatively. Hence, the pathophenotype is the outcome of the perturbations in the underlying pathways, where gene products cooperate through various mechanisms such as protein-protein interactions. Pinpointing the decisive elements of such disease pathways is still challenging. Over the last years, computational approaches exploiting interaction network topology have been successfully applied to prioritize individual genes involved in diseases. Although linkage intervals provide a list of disease-gene candidates, recent genome-wide studies demonstrate that genes not associated with any known linkage interval may also contribute to the disease phenotype. Network based prioritization methods help highlighting such associations. Still, there is a need for robust methods that capture the interplay among disease-associated genes mediated by the topology of the network. Here, we propose a genome-wide network-based prioritization framework named GUILD. This framework implements four network-based disease-gene prioritization algorithms. We analyze the performance of these algorithms in dozens of disease phenotypes. The algorithms in GUILD are compared to state-of-the-art network topology based algorithms for prioritization of genes. As a proof of principle, we investigate top-ranking genes in Alzheimer's disease (AD), diabetes and AIDS using disease-gene associations from various sources. We show that GUILD is able to significantly highlight disease-gene associations that are not used a priori. Our findings suggest that GUILD helps to identify genes implicated in the pathology of human disorders independent of the loci associated with the disorders.
Collapse
Affiliation(s)
- Emre Guney
- Structural Bioinformatics Group (GRIB), Universitat Pompeu Fabra, Barcelona Research Park of Biomedicine (PRBB), Barcelona, Catalonia, Spain
| | - Baldo Oliva
- Structural Bioinformatics Group (GRIB), Universitat Pompeu Fabra, Barcelona Research Park of Biomedicine (PRBB), Barcelona, Catalonia, Spain
- * E-mail:
| |
Collapse
|
230
|
Jordán F, Nguyen TP, Liu WC. Studying protein-protein interaction networks: a systems view on diseases. Brief Funct Genomics 2012; 11:497-504. [PMID: 22908210 DOI: 10.1093/bfgp/els035] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open
Abstract
In order to better understand several cellular processes, it is helpful to study how various components make up the system. This systems perspective is supported by several modelling tools including network analysis. Networks of protein-protein interactions (PPI networks) offer a way to depict, visualize and quantify the functioning and relative importance of particular proteins in cell function. The toolkit of network analysis ranges from the local indices describing individual proteins (as network nodes) to global indicators of system architecture, describing the total interaction system (as the whole network). We briefly introduce some of these network indices and present a case study where the connectedness and potential functional relationships between certain disease proteins are inferred. We argue that network analysis can be used, in general, to improve databases, to infer novel functions, to quantify positional importance and to support predictions in pathogenesis studies. The systems perspective and network analysis can be of particular importance in studying diseases with complex molecular processes.
Collapse
Affiliation(s)
- Ferenc Jordán
- The Microsoft Research-University of Trento Center for Computational and Systems Biology, Piazza Manifattura 1, 38068, Rovereto, TN, Italy.
| | | | | |
Collapse
|
231
|
Masoudi-Nejad A, Meshkin A, Haji-Eghrari B, Bidkhori G. RETRACTED ARTICLE: Candidate gene prioritization. Mol Genet Genomics 2012; 287:679-98. [DOI: 10.1007/s00438-012-0710-z] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2012] [Accepted: 07/12/2012] [Indexed: 01/16/2023]
|
232
|
Pradhan MP, Prasad NKA, Palakal MJ. A systems biology approach to the global analysis of transcription factors in colorectal cancer. BMC Cancer 2012; 12:331. [PMID: 22852817 PMCID: PMC3539921 DOI: 10.1186/1471-2407-12-331] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2011] [Accepted: 06/21/2012] [Indexed: 02/08/2023] Open
Abstract
Background Biological entities do not perform in isolation, and often, it is the nature and degree of interactions among numerous biological entities which ultimately determines any final outcome. Hence, experimental data on any single biological entity can be of limited value when considered only in isolation. To address this, we propose that augmenting individual entity data with the literature will not only better define the entity’s own significance but also uncover relationships with novel biological entities. To test this notion, we developed a comprehensive text mining and computational methodology that focused on discovering new targets of one class of molecular entities, transcription factors (TF), within one particular disease, colorectal cancer (CRC). Methods We used 39 molecular entities known to be associated with CRC along with six colorectal cancer terms as the bait list, or list of search terms, for mining the biomedical literature to identify CRC-specific genes and proteins. Using the literature-mined data, we constructed a global TF interaction network for CRC. We then developed a multi-level, multi-parametric methodology to identify TFs to CRC. Results The small bait list, when augmented with literature-mined data, identified a large number of biological entities associated with CRC. The relative importance of these TF and their associated modules was identified using functional and topological features. Additional validation of these highly-ranked TF using the literature strengthened our findings. Some of the novel TF that we identified were: SLUG, RUNX1, IRF1, HIF1A, ATF-2, ABL1, ELK-1 and GATA-1. Some of these TFs are associated with functional modules in known pathways of CRC, including the Beta-catenin/development, immune response, transcription, and DNA damage pathways. Conclusions Our methodology of using text mining data and a multi-level, multi-parameter scoring technique was able to identify both known and novel TF that have roles in CRC. Starting with just one TF (SMAD3) in the bait list, the literature mining process identified an additional 116 CRC-associated TFs. Our network-based analysis showed that these TFs all belonged to any of 13 major functional groups that are known to play important roles in CRC. Among these identified TFs, we obtained a novel six-node module consisting of ATF2-P53-JNK1-ELK1-EPHB2-HIF1A, from which the novel JNK1-ELK1 association could potentially be a significant marker for CRC.
Collapse
Affiliation(s)
- Meeta P Pradhan
- School of Informatics, Indiana University Purdue University Indianapolis, Indianapolis, IN 46202, USA
| | | | | |
Collapse
|
233
|
García-Alonso L, Alonso R, Vidal E, Amadoz A, de María A, Minguez P, Medina I, Dopazo J. Discovering the hidden sub-network component in a ranked list of genes or proteins derived from genomic experiments. Nucleic Acids Res 2012; 40:e158. [PMID: 22844098 PMCID: PMC3488210 DOI: 10.1093/nar/gks699] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Genomic experiments (e.g. differential gene expression, single-nucleotide polymorphism association) typically produce ranked list of genes. We present a simple but powerful approach which uses protein–protein interaction data to detect sub-networks within such ranked lists of genes or proteins. We performed an exhaustive study of network parameters that allowed us concluding that the average number of components and the average number of nodes per component are the parameters that best discriminate between real and random networks. A novel aspect that increases the efficiency of this strategy in finding sub-networks is that, in addition to direct connections, also connections mediated by intermediate nodes are considered to build up the sub-networks. The possibility of using of such intermediate nodes makes this approach more robust to noise. It also overcomes some limitations intrinsic to experimental designs based on differential expression, in which some nodes are invariant across conditions. The proposed approach can also be used for candidate disease-gene prioritization. Here, we demonstrate the usefulness of the approach by means of several case examples that include a differential expression analysis in Fanconi Anemia, a genome-wide association study of bipolar disorder and a genome-scale study of essentiality in cancer genes. An efficient and easy-to-use web interface (available at http://www.babelomics.org) based on HTML5 technologies is also provided to run the algorithm and represent the network.
Collapse
Affiliation(s)
- Luz García-Alonso
- Department of Bioinformatics, Centro de Investigación Príncipe Felipe (CIPF), Valencia, Spain
| | | | | | | | | | | | | | | |
Collapse
|
234
|
|
235
|
Shi M, Beauchamp RD, Zhang B. A network-based gene expression signature informs prognosis and treatment for colorectal cancer patients. PLoS One 2012; 7:e41292. [PMID: 22844451 PMCID: PMC3402487 DOI: 10.1371/journal.pone.0041292] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2012] [Accepted: 06/19/2012] [Indexed: 01/08/2023] Open
Abstract
Background Several studies have reported gene expression signatures that predict recurrence risk in stage II and III colorectal cancer (CRC) patients with minimal gene membership overlap and undefined biological relevance. The goal of this study was to investigate biological themes underlying these signatures, to infer genes of potential mechanistic importance to the CRC recurrence phenotype and to test whether accurate prognostic models can be developed using mechanistically important genes. Methods and Findings We investigated eight published CRC gene expression signatures and found no functional convergence in Gene Ontology enrichment analysis. Using a random walk-based approach, we integrated these signatures and publicly available somatic mutation data on a protein-protein interaction network and inferred 487 genes that were plausible candidate molecular underpinnings for the CRC recurrence phenotype. We named the list of 487 genes a NEM signature because it integrated information from Network, Expression, and Mutation. The signature showed significant enrichment in four biological processes closely related to cancer pathophysiology and provided good coverage of known oncogenes, tumor suppressors, and CRC-related signaling pathways. A NEM signature-based Survival Support Vector Machine prognostic model was trained using a microarray gene expression dataset and tested on an independent dataset. The model-based scores showed a 75.7% concordance with the real survival data and separated patients into two groups with significantly different relapse-free survival (p = 0.002). Similar results were obtained with reversed training and testing datasets (p = 0.007). Furthermore, adjuvant chemotherapy was significantly associated with prolonged survival of the high-risk patients (p = 0.006), but not beneficial to the low-risk patients (p = 0.491). Conclusions The NEM signature not only reflects CRC biology but also informs patient prognosis and treatment response. Thus, the network-based data integration method provides a convergence between biological relevance and clinical usefulness in gene signature development.
Collapse
Affiliation(s)
- Mingguang Shi
- Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America
| | - R. Daniel Beauchamp
- Department of Cancer Biology, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America
- Department of Surgery, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America
- Department of Cell and Developmental Biology, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America
- Vanderbilt-Ingram Cancer Center, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America
| | - Bing Zhang
- Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America
- Department of Cancer Biology, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America
- Vanderbilt-Ingram Cancer Center, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America
- * E-mail:
| |
Collapse
|
236
|
Zhao J, Chen J, Yang TH, Holme P. Insights into the pathogenesis of axial spondyloarthropathy from network and pathway analysis. BMC SYSTEMS BIOLOGY 2012; 6 Suppl 1:S4. [PMID: 23046677 PMCID: PMC3403611 DOI: 10.1186/1752-0509-6-s1-s4] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
Background Complex chronic diseases are usually not caused by changes in a single causal gene but by an unbalanced regulating network resulting from the dysfunctions of multiple genes or their products. Therefore, network based systems approach can be helpful for the identification of candidate genes related to complex diseases and their relationships. Axial spondyloarthropathy (SpA) is a group of chronic inflammatory joint diseases that mainly affect the spine and the sacroiliac joints. The pathogenesis of SpA remains largely unknown. Results In this paper, we conducted a network study of the pathogenesis of SpA. We integrated data related to SpA, from the OMIM database, proteomics and microarray experiments of SpA, to prioritize SpA candidate disease genes in the context of human protein interactome. Based on the top ranked SpA related genes, we constructed a SpA specific PPI network, identified potential pathways associated with SpA, and finally sketched an overview of biological processes involved in the development of SpA. Conclusions The protein-protein interaction (PPI) network and pathways reflect the link between the two pathological processes of SpA, i.e., immune mediated inflammation, as well as imbalanced bone modelling caused new boneformation and bone loss. We found that some known disease causative genes, such as TNFand ILs, play pivotal roles in this interaction.
Collapse
Affiliation(s)
- Jing Zhao
- Department of Mathematics, Logistical Engineering University, Chongqing, China.
| | | | | | | |
Collapse
|
237
|
Zaki N, Berengueres J, Efimov D. Detection of protein complexes using a protein ranking algorithm. Proteins 2012; 80:2459-68. [PMID: 22685080 DOI: 10.1002/prot.24130] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2012] [Revised: 05/31/2012] [Accepted: 06/01/2012] [Indexed: 12/24/2022]
Abstract
Detecting protein complexes from protein-protein interaction (PPI) network is becoming a difficult challenge in computational biology. There is ample evidence that many disease mechanisms involve protein complexes, and being able to predict these complexes is important to the characterization of the relevant disease for diagnostic and treatment purposes. This article introduces a novel method for detecting protein complexes from PPI by using a protein ranking algorithm (ProRank). ProRank quantifies the importance of each protein based on the interaction structure and the evolutionarily relationships between proteins in the network. A novel way of identifying essential proteins which are known for their critical role in mediating cellular processes and constructing protein complexes is proposed and analyzed. We evaluate the performance of ProRank using two PPI networks on two reference sets of protein complexes created from Munich Information Center for Protein Sequence, containing 81 and 162 known complexes, respectively. We compare the performance of ProRank to some of the well known protein complex prediction methods (ClusterONE, CMC, CFinder, MCL, MCode and Core) in terms of precision and recall. We show that ProRank predicts more complexes correctly at a competitive level of precision and recall. The level of the accuracy achieved using ProRank in comparison to other recent methods for detecting protein complexes is a strong argument in favor of the proposed method.
Collapse
Affiliation(s)
- Nazar Zaki
- Faculty of Information Technology, UAEU, Al Ain, UAE.
| | | | | |
Collapse
|
238
|
Dannenfelser R, Clark NR, Ma'ayan A. Genes2FANs: connecting genes through functional association networks. BMC Bioinformatics 2012; 13:156. [PMID: 22748121 PMCID: PMC3472228 DOI: 10.1186/1471-2105-13-156] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2011] [Accepted: 05/25/2012] [Indexed: 01/04/2023] Open
Abstract
Background Protein-protein, cell signaling, metabolic, and transcriptional interaction networks are useful for identifying connections between lists of experimentally identified genes/proteins. However, besides physical or co-expression interactions there are many ways in which pairs of genes, or their protein products, can be associated. By systematically incorporating knowledge on shared properties of genes from diverse sources to build functional association networks (FANs), researchers may be able to identify additional functional interactions between groups of genes that are not readily apparent. Results Genes2FANs is a web based tool and a database that utilizes 14 carefully constructed FANs and a large-scale protein-protein interaction (PPI) network to build subnetworks that connect lists of human and mouse genes. The FANs are created from mammalian gene set libraries where mouse genes are converted to their human orthologs. The tool takes as input a list of human or mouse Entrez gene symbols to produce a subnetwork and a ranked list of intermediate genes that are used to connect the query input list. In addition, users can enter any PubMed search term and then the system automatically converts the returned results to gene lists using GeneRIF. This gene list is then used as input to generate a subnetwork from the user’s PubMed query. As a case study, we applied Genes2FANs to connect disease genes from 90 well-studied disorders. We find an inverse correlation between the counts of links connecting disease genes through PPI and links connecting diseases genes through FANs, separating diseases into two categories. Conclusions Genes2FANs is a useful tool for interpreting the relationships between gene/protein lists in the context of their various functions and networks. Combining functional association interactions with physical PPIs can be useful for revealing new biology and help form hypotheses for further experimentation. Our finding that disease genes in many cancers are mostly connected through PPIs whereas other complex diseases, such as autism and type-2 diabetes, are mostly connected through FANs without PPIs, can guide better strategies for disease gene discovery. Genes2FANs is available at:
http://actin.pharm.mssm.edu/genes2FANs.
Collapse
Affiliation(s)
- Ruth Dannenfelser
- Department of Pharmacology and Systems Therapeutics, Systems Biology Center of New York, Mount Sinai School of Medicine, New York, NY 10029, USA
| | | | | |
Collapse
|
239
|
Kotlyar M, Fortney K, Jurisica I. Network-based characterization of drug-regulated genes, drug targets, and toxicity. Methods 2012; 57:499-507. [PMID: 22749929 DOI: 10.1016/j.ymeth.2012.06.003] [Citation(s) in RCA: 60] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2012] [Revised: 05/30/2012] [Accepted: 06/08/2012] [Indexed: 12/25/2022] Open
Abstract
Proteins do not exert their effects in isolation of one another, but interact together in complex networks. In recent years, sophisticated methods have been developed to leverage protein-protein interaction (PPI) network structure to improve several stages of the drug discovery process. Network-based methods have been applied to predict drug targets, drug side effects, and new therapeutic indications. In this paper we have two aims. First, we review the past contributions of network approaches and methods to drug discovery, and discuss their limitations and possible future directions. Second, we show how past work can be generalized to gain a more complete understanding of how drugs perturb networks. Previous network-based characterizations of drug effects focused on the small number of known drug targets, i.e., direct binding partners of drugs. However, drugs affect many more genes than their targets - they can profoundly affect the cell's transcriptome. For the first time, we use networks to characterize genes that are differentially regulated by drugs. We found that drug-regulated genes differed from drug targets in terms of functional annotations, cellular localizations, and topological properties. Drug targets mainly included receptors on the plasma membrane, down-regulated genes were largely in the nucleus and were enriched for DNA binding, and genes lacking drug relationships were enriched in the extracellular region. Network topology analysis indicated several significant graph properties, including high degree and betweenness for the drug targets and drug-regulated genes, though possibly due to network biases. Topological analysis also showed that proteins of down-regulated genes appear to be frequently involved in complexes. Analyzing network distances between regulated genes, we found that genes regulated by structurally similar drugs were significantly closer than genes regulated by dissimilar drugs. Finally, network centrality of a drug's differentially regulated genes correlated significantly with drug toxicity.
Collapse
Affiliation(s)
- Max Kotlyar
- The Campbell Family Institute for Cancer Research, Ontario Cancer Institute, University Health Network, IBM Life Sciences Discovery Centre, Toronto Medical Discovery Tower, 9-305, 101 College Street, Toronto, Ontario, M5G 1L7, Canada.
| | | | | |
Collapse
|
240
|
Zhang L, Li X, Tai J, Li W, Chen L. Predicting candidate genes based on combined network topological features: a case study in coronary artery disease. PLoS One 2012; 7:e39542. [PMID: 22761820 PMCID: PMC3382204 DOI: 10.1371/journal.pone.0039542] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2012] [Accepted: 05/22/2012] [Indexed: 11/26/2022] Open
Abstract
Predicting candidate genes using gene expression profiles and unbiased protein-protein interactions (PPI) contributes a lot in deciphering the pathogenesis of complex diseases. Recent studies showed that there are significant disparities in network topological features between non-disease and disease genes in protein-protein interaction settings. Integrated methods could consider their characteristics comprehensively in a biological network. In this study, we introduce a novel computational method, based on combined network topological features, to construct a combined classifier and then use it to predict candidate genes for coronary artery diseases (CAD). As a result, 276 novel candidate genes were predicted and were found to share similar functions to known disease genes. The majority of the candidate genes were cross-validated by other three methods. Our method will be useful in the search for candidate genes of other diseases.
Collapse
Affiliation(s)
- Liangcai Zhang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang Province, China
- * E-mail: (LCZ); (LC)
| | - Xu Li
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang Province, China
| | - Jingxie Tai
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang Province, China
| | - Wan Li
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang Province, China
| | - Lina Chen
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang Province, China
- * E-mail: (LCZ); (LC)
| |
Collapse
|
241
|
Affiliation(s)
- Ian W. Taylor
- Samuel Lunenfeld Research Institute; Mount Sinai Hospital; Toronto Ontario Canada
- Department of Molecular Genetics; University of Toronto; Toronto Ontario Canada
| | - Jeffrey L. Wrana
- Samuel Lunenfeld Research Institute; Mount Sinai Hospital; Toronto Ontario Canada
- Department of Molecular Genetics; University of Toronto; Toronto Ontario Canada
| |
Collapse
|
242
|
Barrenäs F, Chavali S, Alves AC, Coin L, Jarvelin MR, Jörnsten R, Langston MA, Ramasamy A, Rogers G, Wang H, Benson M. Highly interconnected genes in disease-specific networks are enriched for disease-associated polymorphisms. Genome Biol 2012; 13:R46. [PMID: 22703998 PMCID: PMC3446318 DOI: 10.1186/gb-2012-13-6-r46] [Citation(s) in RCA: 52] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2012] [Revised: 05/25/2012] [Accepted: 06/15/2012] [Indexed: 02/07/2023] Open
Abstract
Background Complex diseases are associated with altered interactions between thousands of genes. We developed a novel method to identify and prioritize disease genes, which was generally applicable to complex diseases. Results We identified modules of highly interconnected genes in disease-specific networks derived from integrating gene-expression and protein interaction data. We examined if those modules were enriched for disease-associated SNPs, and could be used to find novel genes for functional studies. First, we analyzed publicly available gene expression microarray and genome-wide association study (GWAS) data from 13, highly diverse, complex diseases. In each disease, highly interconnected genes formed modules, which were significantly enriched for genes harboring disease-associated SNPs. To test if such modules could be used to find novel genes for functional studies, we repeated the analyses using our own gene expression microarray and GWAS data from seasonal allergic rhinitis. We identified a novel gene, FGF2, whose relevance was supported by functional studies using combined small interfering RNA-mediated knock-down and gene expression microarrays. The modules in the 13 complex diseases analyzed here tended to overlap and were enriched for pathways related to oncological, metabolic and inflammatory diseases. This suggested that this union of the modules would be associated with a general increase in susceptibility for complex diseases. Indeed, we found that this union was enriched with GWAS genes for 145 other complex diseases. Conclusions Modules of highly interconnected complex disease genes were enriched for disease-associated SNPs, and could be used to find novel genes for functional studies.
Collapse
Affiliation(s)
- Fredrik Barrenäs
- The Centre for Individualized Medication, Linköping University Hospital, Linköping University, Linköping, Sweden
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
243
|
Doncheva NT, Kacprowski T, Albrecht M. Recent approaches to the prioritization of candidate disease genes. WILEY INTERDISCIPLINARY REVIEWS-SYSTEMS BIOLOGY AND MEDICINE 2012; 4:429-42. [PMID: 22689539 DOI: 10.1002/wsbm.1177] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Many efforts are still devoted to the discovery of genes involved with specific phenotypes, in particular, diseases. High-throughput techniques are thus applied frequently to detect dozens or even hundreds of candidate genes. However, the experimental validation of many candidates is often an expensive and time-consuming task. Therefore, a great variety of computational approaches has been developed to support the identification of the most promising candidates for follow-up studies. The biomedical knowledge already available about the disease of interest and related genes is commonly exploited to find new gene-disease associations and to prioritize candidates. In this review, we highlight recent methodological advances in this research field of candidate gene prioritization. We focus on approaches that use network information and integrate heterogeneous data sources. Furthermore, we discuss current benchmarking procedures for evaluating and comparing different prioritization methods.
Collapse
|
244
|
Eronen L, Toivonen H. Biomine: predicting links between biological entities using network models of heterogeneous databases. BMC Bioinformatics 2012; 13:119. [PMID: 22672646 PMCID: PMC3505483 DOI: 10.1186/1471-2105-13-119] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2011] [Accepted: 04/17/2012] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Biological databases contain large amounts of data concerning the functions and associations of genes and proteins. Integration of data from several such databases into a single repository can aid the discovery of previously unknown connections spanning multiple types of relationships and databases. RESULTS Biomine is a system that integrates cross-references from several biological databases into a graph model with multiple types of edges, such as protein interactions, gene-disease associations and gene ontology annotations. Edges are weighted based on their type, reliability, and informativeness. We present Biomine and evaluate its performance in link prediction, where the goal is to predict pairs of nodes that will be connected in the future, based on current data. In particular, we formulate protein interaction prediction and disease gene prioritization tasks as instances of link prediction. The predictions are based on a proximity measure computed on the integrated graph. We consider and experiment with several such measures, and perform a parameter optimization procedure where different edge types are weighted to optimize link prediction accuracy. We also propose a novel method for disease-gene prioritization, defined as finding a subset of candidate genes that cluster together in the graph. We experimentally evaluate Biomine by predicting future annotations in the source databases and prioritizing lists of putative disease genes. CONCLUSIONS The experimental results show that Biomine has strong potential for predicting links when a set of selected candidate links is available. The predictions obtained using the entire Biomine dataset are shown to clearly outperform ones obtained using any single source of data alone, when different types of links are suitably weighted. In the gene prioritization task, an established reference set of disease-associated genes is useful, but the results show that under favorable conditions, Biomine can also perform well when no such information is available.The Biomine system is a proof of concept. Its current version contains 1.1 million entities and 8.1 million relations between them, with focus on human genetics. Some of its functionalities are available in a public query interface at http://biomine.cs.helsinki.fi, allowing searching for and visualizing connections between given biological entities.
Collapse
Affiliation(s)
- Lauri Eronen
- Biocomputing Platforms Ltd, Innopoli 2, Tekniikantie 14, , FI-02150 Espoo, Finland.
| | | |
Collapse
|
245
|
Jaeger S, Aloy P. From protein interaction networks to novel therapeutic strategies. IUBMB Life 2012; 64:529-37. [DOI: 10.1002/iub.1040] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2012] [Accepted: 03/14/2012] [Indexed: 01/18/2023]
|
246
|
Garcia-Garcia J, Bonet J, Guney E, Fornes O, Planas J, Oliva B. Networks of ProteinProtein Interactions: From Uncertainty to Molecular Details. Mol Inform 2012; 31:342-62. [PMID: 27477264 DOI: 10.1002/minf.201200005] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2012] [Accepted: 03/09/2012] [Indexed: 11/08/2022]
Abstract
Proteins are the bricks and mortar of cells. The work of proteins is structural and functional, as they are the principal element of the organization of the cell architecture, but they also play a relevant role in its metabolism and regulation. To perform all these functions, proteins need to interact with each other and with other bio-molecules, either to form complexes or to recognize precise targets of their action. For instance, a particular transcription factor may activate one gene or another depending on its interactions with other proteins and not only with DNA. Hence, the ability of a protein to interact with other bio-molecules, and the partners they have at each particular time and location can be crucial to characterize the role of a protein. Proteins rarely act alone; they rather constitute a mingled network of physical interactions or other types of relationships (such as metabolic and regulatory) or signaling cascades. In this context, understanding the function of a protein implies to recognize the members of its neighborhood and to grasp how they associate, both at the systemic and atomic level. The network of physical interactions between the proteins of a system, cell or organism, is defined as the interactome. The purpose of this review is to deepen the description of interactomes at different levels of detail: from the molecular structure of complexes to the global topology of the network of interactions. The approaches and techniques applied experimentally and computationally to attain each level are depicted. The limits of each technique and its integration into a model network, the challenges and actual problems of completeness of an interactome, and the reliability of the interactions are reviewed and summarized. Finally, the application of the current knowledge of protein-protein interactions on modern network medicine and protein function annotation is also explored.
Collapse
Affiliation(s)
- Javier Garcia-Garcia
- Structural Bioinformatics Group, GRIB-IMIM, Universitat Pompeu Fabra, Barcelona Research Park of Biomedicine (PRBB), Catalonia, Spain
| | - Jaume Bonet
- Structural Bioinformatics Group, GRIB-IMIM, Universitat Pompeu Fabra, Barcelona Research Park of Biomedicine (PRBB), Catalonia, Spain
| | - Emre Guney
- Structural Bioinformatics Group, GRIB-IMIM, Universitat Pompeu Fabra, Barcelona Research Park of Biomedicine (PRBB), Catalonia, Spain
| | - Oriol Fornes
- Structural Bioinformatics Group, GRIB-IMIM, Universitat Pompeu Fabra, Barcelona Research Park of Biomedicine (PRBB), Catalonia, Spain
| | - Joan Planas
- Structural Bioinformatics Group, GRIB-IMIM, Universitat Pompeu Fabra, Barcelona Research Park of Biomedicine (PRBB), Catalonia, Spain
| | - Baldo Oliva
- Structural Bioinformatics Group, GRIB-IMIM, Universitat Pompeu Fabra, Barcelona Research Park of Biomedicine (PRBB), Catalonia, Spain.
| |
Collapse
|
247
|
Kou Y, Betancur C, Xu H, Buxbaum JD, Ma'ayan A. Network- and attribute-based classifiers can prioritize genes and pathways for autism spectrum disorders and intellectual disability. AMERICAN JOURNAL OF MEDICAL GENETICS PART C-SEMINARS IN MEDICAL GENETICS 2012; 160C:130-42. [PMID: 22499558 DOI: 10.1002/ajmg.c.31330] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Autism spectrum disorders (ASD) are a group of related neurodevelopmental disorders with significant combined prevalence (∼1%) and high heritability. Dozens of individually rare genes and loci associated with high-risk for ASD have been identified, which overlap extensively with genes for intellectual disability (ID). However, studies indicate that there may be hundreds of genes that remain to be identified. The advent of inexpensive massively parallel nucleotide sequencing can reveal the genetic underpinnings of heritable complex diseases, including ASD and ID. However, whole exome sequencing (WES) and whole genome sequencing (WGS) provides an embarrassment of riches, where many candidate variants emerge. It has been argued that genetic variation for ASD and ID will cluster in genes involved in distinct pathways and protein complexes. For this reason, computational methods that prioritize candidate genes based on additional functional information such as protein-protein interactions or association with specific canonical or empirical pathways, or other attributes, can be useful. In this study we applied several supervised learning approaches to prioritize ASD or ID disease gene candidates based on curated lists of known ASD and ID disease genes. We implemented two network-based classifiers and one attribute-based classifier to show that we can rank and classify known, and predict new, genes for these neurodevelopmental disorders. We also show that ID and ASD share common pathways that perturb an overlapping synaptic regulatory subnetwork. We also show that features relating to neuronal phenotypes in mouse knockouts can help in classifying neurodevelopmental genes. Our methods can be applied broadly to other diseases helping in prioritizing newly identified genetic variation that emerge from disease gene discovery based on WES and WGS.
Collapse
Affiliation(s)
- Yan Kou
- Department of Psychiatry, Mount Sinai School of Medicine, One Gustave L Levy Place, Box 1668, New York, NY 10029, USA
| | | | | | | | | |
Collapse
|
248
|
Le DH, Kwon YK. GPEC: a Cytoscape plug-in for random walk-based gene prioritization and biomedical evidence collection. Comput Biol Chem 2012; 37:17-23. [PMID: 22430954 DOI: 10.1016/j.compbiolchem.2012.02.004] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2011] [Revised: 01/10/2012] [Accepted: 02/20/2012] [Indexed: 11/18/2022]
Abstract
Finding genes associated with a disease is an important issue in the biomedical area and many gene prioritization methods have been proposed for this goal. Among these, network-based approaches are recently proposed and outperformed functional annotation-based ones. Here, we introduce a novel Cytoscape plug-in, GPEC, to help identify putative genes likely to be associated with specific diseases or pathways. In the plug-in, gene prioritization is performed through a random walk with restart algorithm, a state-of-the art network-based method, along with a gene/protein relationship network. The plug-in also allows users efficiently collect biomedical evidence for highly ranked candidate genes. A set of known genes, candidate genes and a gene/protein relationship network can be provided in a flexible way.
Collapse
Affiliation(s)
- Duc-Hau Le
- School of Computer Science and Engineering, Water Resources University, 175 Tay Son, Dong Da, Hanoi, Vietnam.
| | | |
Collapse
|
249
|
Joe MK, Kee C, Tomarev SI. Myocilin interacts with syntrophins and is member of dystrophin-associated protein complex. J Biol Chem 2012; 287:13216-27. [PMID: 22371502 DOI: 10.1074/jbc.m111.224063] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
Genetic studies have linked myocilin to open angle glaucoma, but the functions of the protein in the eye and other tissues have remained elusive. The purpose of this investigation was to elucidate myocilin function(s). We identified α1-syntrophin, a component of the dystrophin-associated protein complex (DAPC), as a myocilin-binding candidate. Myocilin interacted with α1-syntrophin via its N-terminal domain and co-immunoprecipitated with α1-syntrophin from C2C12 myotubes and mouse skeletal muscle. Expression of 15-fold higher levels of myocilin in the muscles of transgenic mice led to the elevated association of α1-syntrophin, neuronal nitric-oxide synthase, and α-dystroglycan with DAPC, which increased the binding of laminin to α-dystroglycan and Akt signaling. Phosphorylation of Akt and Forkhead box O-class 3, key regulators of muscle size, was increased more than 3-fold, whereas the expression of muscle-specific RING finger protein-1 and atrogin-1, muscle atrophy markers, was decreased by 79 and 88%, respectively, in the muscles of transgenic mice. Consequently, the average size of muscle fibers of the transgenic mice was increased by 36% relative to controls. We suggest that intracellular myocilin plays a role as a regulator of muscle hypertrophy pathways, acting through the components of DAPC.
Collapse
Affiliation(s)
- Myung Kuk Joe
- Retinal Ganglion Cell Biology Section, Laboratory of Retinal Cell and Molecular Biology, NEI, National Institutes of Health, Bethesda, Maryland 20892, USA
| | | | | |
Collapse
|
250
|
Ciriello G, Cerami E, Sander C, Schultz N. Mutual exclusivity analysis identifies oncogenic network modules. Genome Res 2012; 22:398-406. [PMID: 21908773 PMCID: PMC3266046 DOI: 10.1101/gr.125567.111] [Citation(s) in RCA: 475] [Impact Index Per Article: 39.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2011] [Accepted: 08/18/2011] [Indexed: 12/26/2022]
Abstract
Although individual tumors of the same clinical type have surprisingly diverse genomic alterations, these events tend to occur in a limited number of pathways, and alterations that affect the same pathway tend to not co-occur in the same patient. While pathway analysis has been a powerful tool in cancer genomics, our knowledge of oncogenic pathway modules is incomplete. To systematically identify such modules, we have developed a novel method, Mutual Exclusivity Modules in cancer (MEMo). The method uses correlation analysis and statistical tests to identify network modules by three criteria: (1) Member genes are recurrently altered across a set of tumor samples; (2) member genes are known to or are likely to participate in the same biological process; and (3) alteration events within the modules are mutually exclusive. Applied to data from the Cancer Genome Atlas (TCGA), the method identifies the principal known altered modules in glioblastoma (GBM) and highlights the striking mutual exclusivity of genomic alterations in the PI(3)K, p53, and Rb pathways. In serous ovarian cancer, we make the novel observation that inactivation of BRCA1 and BRCA2 is mutually exclusive of amplification of CCNE1 and inactivation of RB1, suggesting distinct alternative causes of genomic instability in this cancer type; and, we identify RBBP8 as a candidate oncogene involved in Rb-mediated cell cycle control. When applied to any cancer genomics data set, the algorithm can nominate oncogenic alterations that have a particularly strong selective effect and may also be useful in the design of therapeutic combinations in cases where mutual exclusivity reflects synthetic lethality.
Collapse
Affiliation(s)
- Giovanni Ciriello
- Computational Biology Center, Memorial Sloan-Kettering Cancer Center, New York, New York 10065, USA
| | - Ethan Cerami
- Computational Biology Center, Memorial Sloan-Kettering Cancer Center, New York, New York 10065, USA
- Tri-Institutional Training Program in Computational Biology and Medicine, New York, New York 10065, USA
| | - Chris Sander
- Computational Biology Center, Memorial Sloan-Kettering Cancer Center, New York, New York 10065, USA
| | - Nikolaus Schultz
- Computational Biology Center, Memorial Sloan-Kettering Cancer Center, New York, New York 10065, USA
| |
Collapse
|