1
|
Roux J, Bez N, Rochet P, Joo R, Mahévas S. Graphlet correlation distance to compare small graphs. PLoS One 2023; 18:e0281646. [PMID: 36791120 PMCID: PMC9931116 DOI: 10.1371/journal.pone.0281646] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2022] [Accepted: 01/28/2023] [Indexed: 02/16/2023] Open
Abstract
Graph models are standard for representing mutual relationships between sets of entities. Often, graphs deal with a large number of entities with a small number of connections (e.g. social media relationships, infectious disease spread). The distances or similarities between such large graphs are known to be well established by the Graphlet Correlation Distance (GCD). This paper deals with small graphs (with potentially high densities of connections) that have been somewhat neglected in the literature but that concern important fora like sociology, ecology and fisheries, to mention some examples. First, based on numerical experiments, we study the conditions under which Erdős-Rényi, Fitness Scale-Free, Watts-Strogatz small-world and geometric graphs can be distinguished by a specific GCD measure based on 11 orbits, the GCD11. This is done with respect to the density and the order (i.e. the number of nodes) of the graphs when comparing graphs with the same and different orders. Second, we develop a randomization statistical test based on the GCD11 to compare empirical graphs to the four possible null models used in this analysis and apply it to a fishing case study where graphs represent pairwise proximity between fishing vessels. The statistical test rules out independent pairing within the fleet studied which is a standard assumption in fisheries. It also illustrates the difficulty to identify similarities between real-world small graphs and graph models.
Collapse
Affiliation(s)
- Jérôme Roux
- UMR DECOD, IFREMER, BP 21105, Nantes Cedex, France
- * E-mail:
| | - Nicolas Bez
- MARBEC, IRD, Univ Montpellier, Ifremer, CNRS, INRAE, Sète, France
| | | | - Rocío Joo
- Global Fishing Watch, Washington, DC, United States of America
| | | |
Collapse
|
2
|
Bašić M, Arsić B, Obradović Z. Another estimation of Laplacian spectrum of the Kronecker product of graphs. Inf Sci (N Y) 2022. [DOI: 10.1016/j.ins.2022.07.082] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
|
3
|
Lotz-Havla AS, Woidy M, Guder P, Schmiesing J, Erdmann R, Waterham HR, Muntau AC, Gersting SW. Edgetic Perturbations Contribute to Phenotypic Variability in PEX26 Deficiency. Front Genet 2021; 12:726174. [PMID: 34804114 PMCID: PMC8600046 DOI: 10.3389/fgene.2021.726174] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2021] [Accepted: 10/18/2021] [Indexed: 12/11/2022] Open
Abstract
Peroxisomes share metabolic pathways with other organelles and peroxisomes are embedded into key cellular processes. However, the specific function of many peroxisomal proteins remains unclear and restricted knowledge of the peroxisomal protein interaction network limits a precise mapping of this network into the cellular metabolism. Inborn peroxisomal disorders are autosomal or X-linked recessive diseases that affect peroxisomal biogenesis (PBD) and/or peroxisomal metabolism. Pathogenic variants in the PEX26 gene lead to peroxisomal disorders of the full Zellweger spectrum continuum. To investigate the phenotypic complexity of PEX26 deficiency, we performed a combined organelle protein interaction screen and network medicine approach and 1) analyzed whether PEX26 establishes interactions with other peroxisomal proteins, 2) deciphered the PEX26 interaction network, 3) determined how PEX26 is involved in further processes of peroxisomal biogenesis and metabolism, and 4) showed how variant-specific disruption of protein-protein interactions (edgetic perturbations) may contribute to phenotypic variability in PEX26 deficient patients. The discovery of 14 novel protein-protein interactions for PEX26 revealed a hub position of PEX26 inside the peroxisomal interactome. Analysis of edgetic perturbations of PEX26 variants revealed a strong correlation between the number of affected protein-protein interactions and the molecular phenotype of matrix protein import. The role of PEX26 in peroxisomal biogenesis was expanded encompassing matrix protein import, division and proliferation, and membrane assembly. Moreover, the PEX26 interaction network intersects with cellular lipid metabolism at different steps. The results of this study expand the knowledge about the function of PEX26 and refine genotype-phenotype correlations, which may contribute to our understanding of the underlying disease mechanism of PEX26 deficiency.
Collapse
Affiliation(s)
- Amelie S Lotz-Havla
- Dr. von Hauner Children's Hospital, Ludwig-Maximilians-University, Munich, Germany
| | - Mathias Woidy
- University Children's Research, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
| | - Philipp Guder
- University Children's Research, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
| | - Jessica Schmiesing
- University Children's Research, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
| | - Ralf Erdmann
- Institut für Physiologische Chemie, Medizinische Fakultät der Ruhr-Universität Bochum, Bochum, Germany
| | - Hans R Waterham
- Laboratory Genetic Metabolic Diseases, Academic Medical Center, University of Amsterdam, Amsterdam, Netherlands
| | - Ania C Muntau
- Children's Hospital, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
| | - Søren W Gersting
- University Children's Research, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
| |
Collapse
|
4
|
Advances in protein-protein interaction network analysis for Parkinson's disease. Neurobiol Dis 2021; 155:105395. [PMID: 34022367 DOI: 10.1016/j.nbd.2021.105395] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2021] [Revised: 05/12/2021] [Accepted: 05/14/2021] [Indexed: 02/08/2023] Open
Abstract
Protein-protein interactions (PPIs) are a key component of the subcellular molecular networks which enable cells to function. Due to their importance in homeostasis, alterations to the networks can be detrimental, leading to cellular dysfunction and ultimately disease states. Parkinson's disease (PD) is a progressive neurodegenerative condition with multifactorial aetiology, spanning genetic variation and environmental modifiers. At a molecular and systems level, the characterisation of PD is the focus of extensive research, largely due to an unmet need for disease modifying therapies. PPI network analysis approaches are a valuable strategy to accelerate our understanding of the molecular crosstalk and biological processes underlying PD pathogenesis, especially due to the complex nature of this disease. In this review, we describe the utility of PPI network approaches in modelling complex systems, focusing on previous work in PD research. We discuss four principal strategies for using PPI network approaches: to infer PD related cellular functions, pathways and novel genes; to support genomics studies; to study the interactome of single PD related genes; and to compare the molecular basis of PD to other neurodegenerative disorders. This is an evolving area of research which is likely to further expand as omics data generation and availability increase. These approaches complement and bridge-the-gap between genetics and functional research to inform future investigations. In this review we outline several limitations that require consideration, acknowledging that ongoing challenges in this field continue to be addressed and the refinement of these approaches will facilitate further advances using PPI network analysis for understanding complex diseases.
Collapse
|
5
|
Zanin M, Aitya NA, Basilio J, Baumbach J, Benis A, Behera CK, Bucholc M, Castiglione F, Chouvarda I, Comte B, Dao TT, Ding X, Pujos-Guillot E, Filipovic N, Finn DP, Glass DH, Harel N, Iesmantas T, Ivanoska I, Joshi A, Boudjeltia KZ, Kaoui B, Kaur D, Maguire LP, McClean PL, McCombe N, de Miranda JL, Moisescu MA, Pappalardo F, Polster A, Prasad G, Rozman D, Sacala I, Sanchez-Bornot JM, Schmid JA, Sharp T, Solé-Casals J, Spiwok V, Spyrou GM, Stalidzans E, Stres B, Sustersic T, Symeonidis I, Tieri P, Todd S, Van Steen K, Veneva M, Wang DH, Wang H, Wang H, Watterson S, Wong-Lin K, Yang S, Zou X, Schmidt HH. An Early Stage Researcher's Primer on Systems Medicine Terminology. NETWORK AND SYSTEMS MEDICINE 2021; 4:2-50. [PMID: 33659919 PMCID: PMC7919422 DOI: 10.1089/nsm.2020.0003] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/27/2020] [Indexed: 12/19/2022] Open
Abstract
Background: Systems Medicine is a novel approach to medicine, that is, an interdisciplinary field that considers the human body as a system, composed of multiple parts and of complex relationships at multiple levels, and further integrated into an environment. Exploring Systems Medicine implies understanding and combining concepts coming from diametral different fields, including medicine, biology, statistics, modeling and simulation, and data science. Such heterogeneity leads to semantic issues, which may slow down implementation and fruitful interaction between these highly diverse fields. Methods: In this review, we collect and explain more than100 terms related to Systems Medicine. These include both modeling and data science terms and basic systems medicine terms, along with some synthetic definitions, examples of applications, and lists of relevant references. Results: This glossary aims at being a first aid kit for the Systems Medicine researcher facing an unfamiliar term, where he/she can get a first understanding of them, and, more importantly, examples and references for digging into the topic.
Collapse
Affiliation(s)
- Massimiliano Zanin
- Centro de Tecnología Biomédica, Universidad Politécnica de Madrid, Madrid, Spain
| | - Nadim A.A. Aitya
- Intelligent Systems Research Centre, School of Computing, Engineering and Intelligent Systems, Ulster University, Ulster, United Kingdom
| | - José Basilio
- Center for Physiology and Pharmacology, Institute of Vascular Biology and Thrombosis Research, Medical University of Vienna, Vienna, Austria
| | - Jan Baumbach
- TUM School of Life Sciences Weihenstephan, Technical University of Munich, Freising, Germany
| | - Arriel Benis
- Faculty of Technology Management, Holon Institute of Technology (HIT), Holon, Israel
| | - Chandan K. Behera
- Intelligent Systems Research Centre, School of Computing, Engineering and Intelligent Systems, Ulster University, Ulster, United Kingdom
| | - Magda Bucholc
- Intelligent Systems Research Centre, School of Computing, Engineering and Intelligent Systems, Ulster University, Ulster, United Kingdom
| | - Filippo Castiglione
- CNR National Research Council, IAC Institute for Applied Computing, Rome, Italy
| | - Ioanna Chouvarda
- Lab of Computing, Medical Informatics, and Biomedical Imaging Technologies, School of Medicine, Aristotle University of Thessaloniki, Thessaloniki, Greece
| | - Blandine Comte
- Université Clermont Auvergne, INRAE, UNH, Plateforme d'Exploration du Métabolisme, MetaboHUB Clermont, Clermont-Ferrand, France
| | - Tien-Tuan Dao
- Biomechanics and Bioengineering Laboratory (UMR CNRS 7338), Université de Technologie de Compiègne, Compiègne, France
- Labex MS2T “Control of Technological Systems-of-Systems,” CNRS and Université de Technologie de Compiègne, Compiègne, France
| | - Xuemei Ding
- Intelligent Systems Research Centre, School of Computing, Engineering and Intelligent Systems, Ulster University, Ulster, United Kingdom
| | - Estelle Pujos-Guillot
- Université Clermont Auvergne, INRAE, UNH, Plateforme d'Exploration du Métabolisme, MetaboHUB Clermont, Clermont-Ferrand, France
| | - Nenad Filipovic
- Faculty of Engineering, University of Kragujevac, Kragujevac, Serbia
- Bioengineering Research and Development Center (BioIRC), Kragujevac, Serbia
- Steinbeis Advanced Risk Technologies Institute doo Kragujevac, Kragujevac, Serbia
| | - David P. Finn
- Pharmacology and Therapeutics, School of Medicine, Galway Neuroscience Centre, National University of Ireland, Galway, Republic of Ireland
| | - David H. Glass
- School of Computing, Ulster University, Ulster, United Kingdom
| | - Nissim Harel
- Faculty of Sciences, Holon Institute of Technology (HIT), Holon, Israel
| | - Tomas Iesmantas
- Department of Mathematics and Natural Sciences, Kaunas University of Technology, Kaunas, Lithuania
| | - Ilinka Ivanoska
- Faculty of Computer Science and Engineering, Ss. Cyril and Methodius University, Skopje, Macedonia
| | - Alok Joshi
- Intelligent Systems Research Centre, School of Computing, Engineering and Intelligent Systems, Ulster University, Ulster, United Kingdom
| | - Karim Zouaoui Boudjeltia
- Laboratory of Experimental Medicine (ULB 222), Medicine Faculty, Université libre de Bruxelles, CHU de Charleroi, Charleroi, Belgium
| | - Badr Kaoui
- Biomechanics and Bioengineering Laboratory (UMR CNRS 7338), Université de Technologie de Compiègne, Compiègne, France
- Labex MS2T “Control of Technological Systems-of-Systems,” CNRS and Université de Technologie de Compiègne, Compiègne, France
| | - Daman Kaur
- Northern Ireland Centre for Stratified Medicine, Biomedical Sciences Research Institute, Ulster University, Ulster, United Kingdom
| | - Liam P. Maguire
- Intelligent Systems Research Centre, School of Computing, Engineering and Intelligent Systems, Ulster University, Ulster, United Kingdom
| | - Paula L. McClean
- Northern Ireland Centre for Stratified Medicine, Biomedical Sciences Research Institute, Ulster University, Ulster, United Kingdom
| | - Niamh McCombe
- Intelligent Systems Research Centre, School of Computing, Engineering and Intelligent Systems, Ulster University, Ulster, United Kingdom
| | - João Luís de Miranda
- Escola Superior de Tecnologia e Gestão, Instituto Politécnico de Portalegre, Portalegre, Portugal
- Centro de Recursos Naturais e Ambiente (CERENA), Instituto Superior Técnico, Universidade de Lisboa, Lisboa, Portugal
| | | | | | - Annikka Polster
- Centre for Molecular Medicine Norway (NCMM), Forskningparken, Oslo, Norway
| | - Girijesh Prasad
- Intelligent Systems Research Centre, School of Computing, Engineering and Intelligent Systems, Ulster University, Ulster, United Kingdom
| | - Damjana Rozman
- Centre for Functional Genomics and Bio-Chips, Institute of Biochemistry, Faculty of Medicine, University of Ljubljana, Ljubljana, Slovenia
| | - Ioan Sacala
- Faculty of Automatic Control and Computers, University Politehnica of Bucharest, Bucharest, Romania
| | - Jose M. Sanchez-Bornot
- Intelligent Systems Research Centre, School of Computing, Engineering and Intelligent Systems, Ulster University, Ulster, United Kingdom
| | - Johannes A. Schmid
- Center for Physiology and Pharmacology, Institute of Vascular Biology and Thrombosis Research, Medical University of Vienna, Vienna, Austria
| | - Trevor Sharp
- Department of Pharmacology, University of Oxford, Oxford, United Kingdom
| | - Jordi Solé-Casals
- Data and Signal Processing Research Group, University of Vic–Central University of Catalonia, Vic, Spain
- Department of Psychiatry, University of Cambridge, Cambridge, United Kingdom
- College of Artificial Intelligence, Nankai University, Tianjin, China
| | - Vojtěch Spiwok
- Department of Biochemistry and Microbiology, University of Chemistry and Technology, Prague, Czech Republic
| | - George M. Spyrou
- The Cyprus School of Molecular Medicine, The Cyprus Institute of Neurology and Genetics, Nicosia, Cyprus
| | - Egils Stalidzans
- Computational Systems Biology Group, Institute of Microbiology and Biotechnology, University of Latvia, Riga, Latvia
| | - Blaž Stres
- Department of Animal Science, Biotechnical Faculty, University of Ljubljana, Ljubljana, Slovenia
- Faculty of Civil and Geodetic Engineering, University of Ljubljana, Ljubljana, Slovenia
- Department of Automation, Biocybernetics and Robotics, Jozef Stefan Institute, Ljubljana, Slovenia
| | - Tijana Sustersic
- Faculty of Engineering, University of Kragujevac, Kragujevac, Serbia
- Bioengineering Research and Development Center (BioIRC), Kragujevac, Serbia
- Steinbeis Advanced Risk Technologies Institute doo Kragujevac, Kragujevac, Serbia
| | - Ioannis Symeonidis
- Center for Research and Technology Hellas, Hellenic Institute of Transport, Thessaloniki, Greece
| | - Paolo Tieri
- CNR National Research Council, IAC Institute for Applied Computing, Rome, Italy
| | - Stephen Todd
- Altnagelvin Area Hospital, Western Health and Social Care Trust, Altnagelvin, United Kingdom
| | - Kristel Van Steen
- BIO3-Systems Genetics, GIGA-R, University of Liege, Liege, Belgium
- BIO3-Systems Medicine, Department of Human Genetics, KU Leuven, Leuven, Belgium
| | | | - Da-Hui Wang
- State Key Laboratory of Cognitive Neuroscience and Learning, and School of Systems Science, Beijing Normal University, Beijing, China
| | - Haiying Wang
- School of Computing, Ulster University, Ulster, United Kingdom
| | - Hui Wang
- School of Computing, Ulster University, Ulster, United Kingdom
| | - Steven Watterson
- Northern Ireland Centre for Stratified Medicine, Ulster University, Londonderry, United Kingdom
| | - KongFatt Wong-Lin
- Intelligent Systems Research Centre, School of Computing, Engineering and Intelligent Systems, Ulster University, Ulster, United Kingdom
| | - Su Yang
- Intelligent Systems Research Centre, School of Computing, Engineering and Intelligent Systems, Ulster University, Ulster, United Kingdom
| | - Xin Zou
- Shanghai Centre for Systems Biomedicine, Key Laboratory of Systems Biomedicine (Ministry of Education), Shanghai Jiao Tong University, Shanghai, China
| | - Harald H.H.W. Schmidt
- Faculty of Health, Medicine & Life Science, Maastricht University, Maastricht, The Netherlands
| |
Collapse
|
6
|
Fischer MT, Arya D, Streeb D, Seebacher D, Keim DA, Worring M. Visual Analytics for Temporal Hypergraph Model Exploration. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS 2021; 27:550-560. [PMID: 33048721 DOI: 10.1109/tvcg.2020.3030408] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Many processes, from gene interaction in biology to computer networks to social media, can be modeled more precisely as temporal hypergraphs than by regular graphs. This is because hypergraphs generalize graphs by extending edges to connect any number of vertices, allowing complex relationships to be described more accurately and predict their behavior over time. However, the interactive exploration and seamless refinement of such hypergraph-based prediction models still pose a major challenge. We contribute Hyper-Matrix, a novel visual analytics technique that addresses this challenge through a tight coupling between machine-learning and interactive visualizations. In particular, the technique incorporates a geometric deep learning model as a blueprint for problem-specific models while integrating visualizations for graph-based and category-based data with a novel combination of interactions for an effective user-driven exploration of hypergraph models. To eliminate demanding context switches and ensure scalability, our matrix-based visualization provides drill-down capabilities across multiple levels of semantic zoom, from an overview of model predictions down to the content. We facilitate a focused analysis of relevant connections and groups based on interactive user-steering for filtering and search tasks, a dynamically modifiable partition hierarchy, various matrix reordering techniques, and interactive model feedback. We evaluate our technique in a case study and through formative evaluation with law enforcement experts using real-world internet forum communication data. The results show that our approach surpasses existing solutions in terms of scalability and applicability, enables the incorporation of domain knowledge, and allows for fast search-space traversal. With the proposed technique, we pave the way for the visual analytics of temporal hypergraphs in a wide variety of domains.
Collapse
|
7
|
Chanda P, Costa E, Hu J, Sukumar S, Van Hemert J, Walia R. Information Theory in Computational Biology: Where We Stand Today. ENTROPY (BASEL, SWITZERLAND) 2020; 22:E627. [PMID: 33286399 PMCID: PMC7517167 DOI: 10.3390/e22060627] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/30/2020] [Revised: 05/31/2020] [Accepted: 06/03/2020] [Indexed: 12/30/2022]
Abstract
"A Mathematical Theory of Communication" was published in 1948 by Claude Shannon to address the problems in the field of data compression and communication over (noisy) communication channels. Since then, the concepts and ideas developed in Shannon's work have formed the basis of information theory, a cornerstone of statistical learning and inference, and has been playing a key role in disciplines such as physics and thermodynamics, probability and statistics, computational sciences and biological sciences. In this article we review the basic information theory based concepts and describe their key applications in multiple major areas of research in computational biology-gene expression and transcriptomics, alignment-free sequence comparison, sequencing and error correction, genome-wide disease-gene association mapping, metabolic networks and metabolomics, and protein sequence, structure and interaction analysis.
Collapse
Affiliation(s)
- Pritam Chanda
- Corteva Agriscience™, Indianapolis, IN 46268, USA
- Computer and Information Science, Indiana University-Purdue University, Indianapolis, IN 46202, USA
| | - Eduardo Costa
- Corteva Agriscience™, Mogi Mirim, Sao Paulo 13801-540, Brazil
| | - Jie Hu
- Corteva Agriscience™, Indianapolis, IN 46268, USA
| | | | | | - Rasna Walia
- Corteva Agriscience™, Johnston, IA 50131, USA
| |
Collapse
|
8
|
Taoka M, Nobe Y, Yamaki Y, Sato K, Ishikawa H, Izumikawa K, Yamauchi Y, Hirota K, Nakayama H, Takahashi N, Isobe T. Landscape of the complete RNA chemical modifications in the human 80S ribosome. Nucleic Acids Res 2019; 46:9289-9298. [PMID: 30202881 PMCID: PMC6182160 DOI: 10.1093/nar/gky811] [Citation(s) in RCA: 214] [Impact Index Per Article: 42.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2018] [Accepted: 09/06/2018] [Indexed: 01/08/2023] Open
Abstract
During ribosome biogenesis, ribosomal RNAs acquire various chemical modifications that ensure the fidelity of translation, and dysregulation of the modification processes can cause proteome changes as observed in cancer and inherited human disorders. Here, we report the complete chemical modifications of all RNAs of the human 80S ribosome as determined with quantitative mass spectrometry. We assigned 228 sites with 14 different post-transcriptional modifications, most of which are located in functional regions of the ribosome. All modifications detected are typical of eukaryotic ribosomal RNAs, and no human-specific modifications were observed, in contrast to a recently reported cryo-electron microscopy analysis. While human ribosomal RNAs appeared to have little polymorphism regarding the post-transcriptional modifications, we found that pseudouridylation at two specific sites in 28S ribosomal RNA are significantly reduced in ribosomes of patients with familial dyskeratosis congenita, a genetic disease caused by a point mutation in the pseudouridine synthase gene DKC1. The landscape of the entire epitranscriptomic ribosomal RNA modifications provides a firm basis for understanding ribosome function and dysfunction associated with human disease.
Collapse
Affiliation(s)
- Masato Taoka
- Department of Chemistry, Graduate School of Science, Tokyo Metropolitan University, Minami-osawa 1-1, Hachioji-shi, Tokyo 192-0397, Japan
| | - Yuko Nobe
- Department of Chemistry, Graduate School of Science, Tokyo Metropolitan University, Minami-osawa 1-1, Hachioji-shi, Tokyo 192-0397, Japan
| | - Yuka Yamaki
- Department of Chemistry, Graduate School of Science, Tokyo Metropolitan University, Minami-osawa 1-1, Hachioji-shi, Tokyo 192-0397, Japan
| | - Ko Sato
- Department of Chemistry, Graduate School of Science, Tokyo Metropolitan University, Minami-osawa 1-1, Hachioji-shi, Tokyo 192-0397, Japan
| | - Hideaki Ishikawa
- Department of Applied Biological Science, Graduate School of Agriculture, Tokyo University of Agriculture and Technology, Saiwai-cho 3-5-8, Fuchu-shi, Tokyo 183-8509, Japan
| | - Keiichi Izumikawa
- Department of Applied Biological Science, Graduate School of Agriculture, Tokyo University of Agriculture and Technology, Saiwai-cho 3-5-8, Fuchu-shi, Tokyo 183-8509, Japan
| | - Yoshio Yamauchi
- Department of Chemistry, Graduate School of Science, Tokyo Metropolitan University, Minami-osawa 1-1, Hachioji-shi, Tokyo 192-0397, Japan
| | - Kouji Hirota
- Department of Chemistry, Graduate School of Science, Tokyo Metropolitan University, Minami-osawa 1-1, Hachioji-shi, Tokyo 192-0397, Japan
| | - Hiroshi Nakayama
- Biomolecular Characterization Unit, RIKEN Center for Sustainable Resource Science, 2-1 Hirosawa, Wako, Saitama 351-0198, Japan
| | - Nobuhiro Takahashi
- Department of Applied Biological Science, Graduate School of Agriculture, Tokyo University of Agriculture and Technology, Saiwai-cho 3-5-8, Fuchu-shi, Tokyo 183-8509, Japan
| | - Toshiaki Isobe
- Department of Chemistry, Graduate School of Science, Tokyo Metropolitan University, Minami-osawa 1-1, Hachioji-shi, Tokyo 192-0397, Japan
| |
Collapse
|
9
|
Stöcker BK, Köster J, Zamir E, Rahmann S. Modeling and simulating networks of interdependent protein interactions. Integr Biol (Camb) 2018; 10:290-305. [PMID: 29676773 DOI: 10.1039/c8ib00012c] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
Protein interactions are fundamental building blocks of biochemical reaction systems underlying cellular functions. The complexity and functionality of these systems emerge not only from the protein interactions themselves but also from the dependencies between these interactions, as generated by allosteric effects or mutual exclusion due to steric hindrance. Therefore, formal models for integrating and utilizing information about interaction dependencies are of high interest. Here, we describe an approach for endowing protein networks with interaction dependencies using propositional logic, thereby obtaining constrained protein interaction networks ("constrained networks"). The construction of these networks is based on public interaction databases as well as text-mined information about interaction dependencies. We present an efficient data structure and algorithm to simulate protein complex formation in constrained networks. The efficiency of the model allows fast simulation and facilitates the analysis of many proteins in large networks. In addition, this approach enables the simulation of perturbation effects, such as knockout of single or multiple proteins and changes of protein concentrations. We illustrate how our model can be used to analyze a constrained human adhesome protein network, which is responsible for the formation of diverse and dynamic cell-matrix adhesion sites. By comparing protein complex formation under known interaction dependencies versus without dependencies, we investigate how these dependencies shape the resulting repertoire of protein complexes. Furthermore, our model enables investigating how the interplay of network topology with interaction dependencies influences the propagation of perturbation effects across a large biochemical system. Our simulation software CPINSim (for Constrained Protein Interaction Network Simulator) is available under the MIT license at http://github.com/BiancaStoecker/cpinsim and as a Bioconda package (https://bioconda.github.io).
Collapse
Affiliation(s)
- Bianca K Stöcker
- Genome Informatics, Institute of Human Genetics, University Hospital Essen, University of Duisburg-Essen, 45147 Essen, Germany.
| | | | | | | |
Collapse
|
10
|
Huang L, Liao L, Wu CH. Completing sparse and disconnected protein-protein network by deep learning. BMC Bioinformatics 2018; 19:103. [PMID: 29566671 PMCID: PMC5863833 DOI: 10.1186/s12859-018-2112-7] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2017] [Accepted: 03/12/2018] [Indexed: 12/01/2022] Open
Abstract
Background Protein-protein interaction (PPI) prediction remains a central task in systems biology to achieve a better and holistic understanding of cellular and intracellular processes. Recently, an increasing number of computational methods have shifted from pair-wise prediction to network level prediction. Many of the existing network level methods predict PPIs under the assumption that the training network should be connected. However, this assumption greatly affects the prediction power and limits the application area because the current golden standard PPI networks are usually very sparse and disconnected. Therefore, how to effectively predict PPIs based on a training network that is sparse and disconnected remains a challenge. Results In this work, we developed a novel PPI prediction method based on deep learning neural network and regularized Laplacian kernel. We use a neural network with an autoencoder-like architecture to implicitly simulate the evolutionary processes of a PPI network. Neurons of the output layer correspond to proteins and are labeled with values (1 for interaction and 0 for otherwise) from the adjacency matrix of a sparse disconnected training PPI network. Unlike autoencoder, neurons at the input layer are given all zero input, reflecting an assumption of no a priori knowledge about PPIs, and hidden layers of smaller sizes mimic ancient interactome at different times during evolution. After the training step, an evolved PPI network whose rows are outputs of the neural network can be obtained. We then predict PPIs by applying the regularized Laplacian kernel to the transition matrix that is built upon the evolved PPI network. The results from cross-validation experiments show that the PPI prediction accuracies for yeast data and human data measured as AUC are increased by up to 8.4 and 14.9% respectively, as compared to the baseline. Moreover, the evolved PPI network can also help us leverage complementary information from the disconnected training network and multiple heterogeneous data sources. Tested by the yeast data with six heterogeneous feature kernels, the results show our method can further improve the prediction performance by up to 2%, which is very close to an upper bound that is obtained by an Approximate Bayesian Computation based sampling method. Conclusions The proposed evolution deep neural network, coupled with regularized Laplacian kernel, is an effective tool in completing sparse and disconnected PPI networks and in facilitating integration of heterogeneous data sources.
Collapse
Affiliation(s)
- Lei Huang
- Department of Computer and Information Sciences, University of Delaware, 18 Amstel Avenue, Newark, 19716, Delaware, USA
| | - Li Liao
- Department of Computer and Information Sciences, University of Delaware, 18 Amstel Avenue, Newark, 19716, Delaware, USA.
| | - Cathy H Wu
- Department of Computer and Information Sciences, University of Delaware, 18 Amstel Avenue, Newark, 19716, Delaware, USA.,Center for Bioinformatics and Computational Biology, University of Delaware, 15 Innovation Way, Newark, 19711, Delaware, USA
| |
Collapse
|
11
|
Vyas R, Bapat S, Goel P, Karthikeyan M, Tambe SS, Kulkarni BD. Application of Genetic Programming (GP) Formalism for Building Disease Predictive Models from Protein-Protein Interactions (PPI) Data. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2018; 15:27-37. [PMID: 28113781 DOI: 10.1109/tcbb.2016.2621042] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
Protein-protein interactions (PPIs) play a vital role in the biological processes involved in the cell functions and disease pathways. The experimental methods known to predict PPIs require tremendous efforts and the results are often hindered by the presence of a large number of false positives. Herein, we demonstrate the use of a new Genetic Programming (GP) based Symbolic Regression (SR) approach for predicting PPIs related to a disease. In a case study, a dataset consisting of one hundred and thirty five PPI complexes related to cancer was used to construct a generic PPI predicting model with good PPI prediction accuracy and generalization ability. A high correlation coefficient(CC) of 0.893, low root mean square error (RMSE) and mean absolute percentage error (MAPE) values of 478.221 and 0.239, respectively were achieved for both the training and test set outputs. To validate the discriminatory nature of the model, it was applied on a dataset of diabetes complexes where it yielded significantly low CC values. Thus, the GP model developed here serves a dual purpose: (a)a predictor of the binding energy of cancer related PPI complexes, and (b)a classifier for discriminating PPI complexes related to cancer from those of other diseases.
Collapse
|
12
|
Elmsallati A, Msalati A, Kalita J. Index-Based Network Aligner of Protein-Protein Interaction Networks. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2018; 15:330-336. [PMID: 28113986 DOI: 10.1109/tcbb.2016.2613098] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
Network Alignment over graph-structured data has received considerable attention in many recent applications. Global network alignment tries to uniquely find the best mapping for a node in one network to only one node in another network. The mapping is performed according to some matching criteria that depend on the nature of data. In molecular biology, functional orthologs, protein complexes, and evolutionary conserved pathways are some examples of information uncovered by global network alignment. Current techniques for global network alignment suffer from several drawbacks, e.g., poor performance and high memory requirements. We address these problems by proposing IBNAL, Indexes-Based Network ALigner, for better alignment quality and faster results. To accelerate the alignment step, IBNAL makes use of a novel clique-based index and is able to align large networks in seconds. IBNAL produces a higher topological quality alignment and comparable biological match in alignment relative to other state-of-the-art aligners even though topological fit is primarily used to match nodes. IBNAL's results confirm and give another evidence that homology information is more likely to be encoded in network topology than sequence information.
Collapse
|
13
|
Gundogdu D, Finnerty AN, Staiano J, Teso S, Passerini A, Pianesi F, Lepri B. Investigating the association between social interactions and personality states dynamics. ROYAL SOCIETY OPEN SCIENCE 2017; 4:170194. [PMID: 28989732 PMCID: PMC5627072 DOI: 10.1098/rsos.170194] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/01/2017] [Accepted: 08/21/2017] [Indexed: 06/07/2023]
Abstract
The recent personality psychology literature has coined the name of personality states to refer to states having the same behavioural, affective and cognitive content (described by adjectives) as the corresponding trait, but for a shorter duration. The variability in personality states may be the reaction to specific characteristics of situations. The aim of our study is to investigate whether specific situational factors, that is, different configurations of face-to-face interactions, are predictors of variability of personality states in a work environment. The obtained results provide evidence that within-person variability in personality is associated with variation in face-to-face interactions. Interestingly, the effects differ by type and level of the personality states: adaptation effects for Agreeableness and Emotional Stability, whereby the personality states of an individual trigger similar states in other people interacting with them and complementarity effects for Openness to Experience, whereby the personality states of an individual trigger opposite states in other people interacting with them. Overall, these findings encourage further research to characterize face-to-face and social interactions in terms of their relevance to personality states.
Collapse
Affiliation(s)
- Didem Gundogdu
- Fondazione Bruno Kessler, Trento, Italy
- Department of Information Engineering and Computer Science, University of Trento, Trento, Italy
- EIT Digital, Trento, Italy
| | - Ailbhe N. Finnerty
- Department of Clinical, Educational and Health Psychology, University College London, London, UK
| | | | - Stefano Teso
- Department of Information Engineering and Computer Science, University of Trento, Trento, Italy
| | - Andrea Passerini
- Department of Information Engineering and Computer Science, University of Trento, Trento, Italy
| | - Fabio Pianesi
- Fondazione Bruno Kessler, Trento, Italy
- EIT Digital, Trento, Italy
| | | |
Collapse
|
14
|
Zhang W, Xu J, Li Y, Zou X. A new two-stage method for revealing missing parts of edges in protein-protein interaction networks. PLoS One 2017; 12:e0177029. [PMID: 28493910 PMCID: PMC5426645 DOI: 10.1371/journal.pone.0177029] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2016] [Accepted: 04/20/2017] [Indexed: 12/24/2022] Open
Abstract
With the increasing availability of high-throughput data, various computational methods have recently been developed for understanding the cell through protein-protein interaction (PPI) networks at a systems level. However, due to the incompleteness of the original PPI networks those efforts have been significantly hindered. In this paper, we propose a two stage method to predict underlying links between two originally unlinked protein pairs. First, we measure gene expression and gene functional similarly between unlinked protein pairs on Saccharomyces cerevisiae benchmark network and obtain new constructed networks. Then, we select the significant part of the new predicted links by analyzing the difference between essential proteins that have been identified based on the new constructed networks and the original network. Furthermore, we validate the performance of the new method by using the reliable and comprehensive PPI dataset obtained from the STRING database and compare the new proposed method with four other random walk-based methods. Comparing the results indicates that the new proposed strategy performs well in predicting underlying links. This study provides a general paradigm for predicting new interactions between protein pairs and offers new insights into identifying essential proteins.
Collapse
Affiliation(s)
- Wei Zhang
- School of Science, East China Jiaotong University, Nanchang 330013, China
- * E-mail: (WZ); (XFZ)
| | - Jia Xu
- School of Mechatronic Engineering, East China Jiaotong University, Nanchang 330013, China
| | - Yuanyuan Li
- School of Mathematics and Statistics, Wuhan Institute of Technology in Wuhan, Wuhan, 430072, China
| | - Xiufen Zou
- School of Mathematics and Statistics, Wuhan University, Wuhan 430072, China
- * E-mail: (WZ); (XFZ)
| |
Collapse
|
15
|
Abstract
Paralleling the increasing availability of protein-protein interaction (PPI) network data, several network alignment methods have been proposed. Network alignments have been used to uncover functionally conserved network parts and to transfer annotations. However, due to the computational intractability of the network alignment problem, aligners are heuristics providing divergent solutions and no consensus exists on a gold standard, or which scoring scheme should be used to evaluate them. We comprehensively evaluate the alignment scoring schemes and global network aligners on large scale PPI data and observe that three methods, HUBALIGN, L-GRAAL and NATALIE, regularly produce the most topologically and biologically coherent alignments. We study the collective behaviour of network aligners and observe that PPI networks are almost entirely aligned with a handful of aligners that we unify into a new tool, Ulign. Ulign enables complete alignment of two networks, which traditional global and local aligners fail to do. Also, multiple mappings of Ulign define biologically relevant soft clusterings of proteins in PPI networks, which may be used for refining the transfer of annotations across networks. Hence, PPI networks are already well investigated by current aligners, so to gain additional biological insights, a paradigm shift is needed. We propose such a shift come from aligning all available data types collectively rather than any particular data type in isolation from others.
Collapse
|
16
|
Huang L, Liao L, Wu CH. Protein-protein interaction prediction based on multiple kernels and partial network with linear programming. BMC SYSTEMS BIOLOGY 2016. [PMCID: PMC4977483 DOI: 10.1186/s12918-016-0296-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 08/30/2023]
Abstract
Background Prediction of de novo protein-protein interaction is a critical step toward reconstructing PPI networks, which is a central task in systems biology. Recent computational approaches have shifted from making PPI prediction based on individual pairs and single data source to leveraging complementary information from multiple heterogeneous data sources and partial network structure. However, how to quickly learn weights for heterogeneous data sources remains a challenge. In this work, we developed a method to infer de novo PPIs by combining multiple data sources represented in kernel format and obtaining optimal weights based on random walk over the existing partial networks. Results Our proposed method utilizes Barker algorithm and the training data to construct a transition matrix which constrains how a random walk would traverse the partial network. Multiple heterogeneous features for the proteins in the network are then combined into the form of weighted kernel fusion, which provides a new "adjacency matrix" for the whole network that may consist of disconnected components but is required to comply with the transition matrix on the training subnetwork. This requirement is met by adjusting the weights to minimize the element-wise difference between the transition matrix and the weighted kernels. The minimization problem is solved by linear programming. The weighted kernel fusion is then transformed to regularized Laplacian (RL) kernel to infer missing or new edges in the PPI network, which can potentially connect the previously disconnected components. Conclusions The results on synthetic data demonstrated the soundness and robustness of the proposed algorithms under various conditions. And the results on real data show that the accuracies of PPI prediction for yeast data and human data measured as AUC are increased by up to 19 % and 11 % respectively, as compared to a control method without using optimal weights. Moreover, the weights learned by our method Weight Optimization by Linear Programming (WOLP) are very consistent with that learned by sampling, and can provide insights into the relations between PPIs and various feature kernel, thereby improving PPI prediction even for disconnected PPI networks.
Collapse
|
17
|
Zeigler AC, Richardson WJ, Holmes JW, Saucerman JJ. A computational model of cardiac fibroblast signaling predicts context-dependent drivers of myofibroblast differentiation. J Mol Cell Cardiol 2016; 94:72-81. [PMID: 27017945 DOI: 10.1016/j.yjmcc.2016.03.008] [Citation(s) in RCA: 61] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/29/2015] [Revised: 02/26/2016] [Accepted: 03/17/2016] [Indexed: 12/21/2022]
Abstract
Cardiac fibroblasts support heart function, and aberrant fibroblast signaling can lead to fibrosis and cardiac dysfunction. Yet how signaling molecules drive myofibroblast differentiation and fibrosis in the complex signaling environment of cardiac injury remains unclear. We developed a large-scale computational model of cardiac fibroblast signaling in order to identify regulators of fibrosis under diverse signaling contexts. The model network integrates 10 signaling pathways, including 91 nodes and 134 reactions, and it correctly predicted 80% of independent previous experiments. The model predicted key fibrotic signaling regulators (e.g. reactive oxygen species, tissue growth factor β (TGFβ) receptor), whose function varied depending on the extracellular environment. We characterized how network structure relates to function, identified functional modules, and predicted cross-talk between TGFβ and mechanical signaling, which was validated experimentally in adult cardiac fibroblasts. This study provides a systems framework for predicting key regulators of fibroblast signaling across diverse signaling contexts.
Collapse
Affiliation(s)
- A C Zeigler
- Department of Biomedical Engineering, University of Virginia, Charlottesville, VA 22908, USA
| | - W J Richardson
- Department of Biomedical Engineering, University of Virginia, Charlottesville, VA 22908, USA
| | - J W Holmes
- Department of Biomedical Engineering, University of Virginia, Charlottesville, VA 22908, USA
| | - J J Saucerman
- Department of Biomedical Engineering, University of Virginia, Charlottesville, VA 22908, USA.
| |
Collapse
|
18
|
Huang L, Liao L, Wu CH. Inference of protein-protein interaction networks from multiple heterogeneous data. EURASIP JOURNAL ON BIOINFORMATICS & SYSTEMS BIOLOGY 2016; 2016:8. [PMID: 26941784 PMCID: PMC4761017 DOI: 10.1186/s13637-016-0040-2] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/06/2015] [Accepted: 02/09/2016] [Indexed: 11/29/2022]
Abstract
Protein-protein interaction (PPI) prediction is a central task in achieving a better understanding of cellular and intracellular processes. Because high-throughput experimental methods are both expensive and time-consuming, and are also known of suffering from the problems of incompleteness and noise, many computational methods have been developed, with varied degrees of success. However, the inference of PPI network from multiple heterogeneous data sources remains a great challenge. In this work, we developed a novel method based on approximate Bayesian computation and modified differential evolution sampling (ABC-DEP) and regularized laplacian (RL) kernel. The method enables inference of PPI networks from topological properties and multiple heterogeneous features including gene expression and Pfam domain profiles, in forms of weighted kernels. The optimal weights are obtained by ABC-DEP, and the kernel fusion built based on optimal weights serves as input to RL to infer missing or new edges in the PPI network. Detailed comparisons with control methods have been made, and the results show that the accuracy of PPI prediction measured by AUC is increased by up to 23 %, as compared to a baseline without using optimal weights. The method can provide insights into the relations between PPIs and various feature kernels and demonstrates strong capability of predicting faraway interactions that cannot be well detected by traditional RL method.
Collapse
Affiliation(s)
- Lei Huang
- Department of Computer and Information Sciences, University of Delaware, 18 Amstel Avenue, Newark, 19716 DE USA
| | - Li Liao
- Department of Computer and Information Sciences, University of Delaware, 18 Amstel Avenue, Newark, 19716 DE USA
| | - Cathy H Wu
- Department of Computer and Information Sciences, University of Delaware, 18 Amstel Avenue, Newark, 19716 DE USA ; Center for Bioinformatics and Computational Biology, University of Delaware, 15 Innovation Way, Newark, 19711 DE USA
| |
Collapse
|
19
|
Computational Methods for Integration of Biological Data. Per Med 2016. [DOI: 10.1007/978-3-319-39349-0_8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
20
|
Gligorijević V, Malod-Dognin N, Pržulj N. Fuse: multiple network alignment via data fusion. Bioinformatics 2015; 32:1195-203. [DOI: 10.1093/bioinformatics/btv731] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2014] [Accepted: 10/09/2015] [Indexed: 02/07/2023] Open
|
21
|
Gligorijević V, Pržulj N. Methods for biological data integration: perspectives and challenges. J R Soc Interface 2015; 12:20150571. [PMID: 26490630 PMCID: PMC4685837 DOI: 10.1098/rsif.2015.0571] [Citation(s) in RCA: 157] [Impact Index Per Article: 17.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2015] [Accepted: 09/25/2015] [Indexed: 12/17/2022] Open
Abstract
Rapid technological advances have led to the production of different types of biological data and enabled construction of complex networks with various types of interactions between diverse biological entities. Standard network data analysis methods were shown to be limited in dealing with such heterogeneous networked data and consequently, new methods for integrative data analyses have been proposed. The integrative methods can collectively mine multiple types of biological data and produce more holistic, systems-level biological insights. We survey recent methods for collective mining (integration) of various types of networked biological data. We compare different state-of-the-art methods for data integration and highlight their advantages and disadvantages in addressing important biological problems. We identify the important computational challenges of these methods and provide a general guideline for which methods are suited for specific biological problems, or specific data types. Moreover, we propose that recent non-negative matrix factorization-based approaches may become the integration methodology of choice, as they are well suited and accurate in dealing with heterogeneous data and have many opportunities for further development.
Collapse
Affiliation(s)
| | - Nataša Pržulj
- Department of Computing, Imperial College London, London SW7 2AZ, UK
| |
Collapse
|
22
|
Abstract
The acquisition of mutations that activate oncogenes or inactivate tumor suppressors is a primary feature of most cancers. Mutations that directly alter protein sequence and structure drive the development of tumors through aberrant expression and modification of proteins, in many cases directly impacting components of signal transduction pathways and cellular architecture. Cancer-associated mutations may have direct or indirect effects on proteins and their interactions and while the effects of mutations on signaling pathways have been widely studied, how mutations alter underlying protein-protein interaction networks is much less well understood. Systematic mapping of oncoprotein protein interactions using proteomics techniques as well as computational network analyses is revealing how oncoprotein mutations perturb protein-protein interaction networks and drive the cancer phenotype.
Collapse
Affiliation(s)
- Emily Bowler
- Centre for Biological Sciences, University of Southampton, Southampton SO17 1BJ, UK
| | - Zhenghe Wang
- Department of Genetics and Genome Science, Case Western Reserve University, Cleveland, Ohio 44106, USA
| | - Rob M. Ewing
- Centre for Biological Sciences, University of Southampton, Southampton SO17 1BJ, UK
| |
Collapse
|
23
|
Hulovatyy Y, Chen H, Milenković T. Exploring the structure and function of temporal networks with dynamic graphlets. Bioinformatics 2015; 31:i171-80. [PMID: 26072480 PMCID: PMC4765862 DOI: 10.1093/bioinformatics/btv227] [Citation(s) in RCA: 62] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023] Open
Abstract
MOTIVATION With increasing availability of temporal real-world networks, how to efficiently study these data? One can model a temporal network as a single aggregate static network, or as a series of time-specific snapshots, each being an aggregate static network over the corresponding time window. Then, one can use established methods for static analysis on the resulting aggregate network(s), but losing in the process valuable temporal information either completely, or at the interface between different snapshots, respectively. Here, we develop a novel approach for studying a temporal network more explicitly, by capturing inter-snapshot relationships. RESULTS We base our methodology on well-established graphlets (subgraphs), which have been proven in numerous contexts in static network research. We develop new theory to allow for graphlet-based analyses of temporal networks. Our new notion of dynamic graphlets is different from existing dynamic network approaches that are based on temporal motifs (statistically significant subgraphs). The latter have limitations: their results depend on the choice of a null network model that is required to evaluate the significance of a subgraph, and choosing a good null model is non-trivial. Our dynamic graphlets overcome the limitations of the temporal motifs. Also, when we aim to characterize the structure and function of an entire temporal network or of individual nodes, our dynamic graphlets outperform the static graphlets. Clearly, accounting for temporal information helps. We apply dynamic graphlets to temporal age-specific molecular network data to deepen our limited knowledge about human aging. AVAILABILITY AND IMPLEMENTATION http://www.nd.edu/∼cone/DG.
Collapse
Affiliation(s)
- Y Hulovatyy
- Department of Computer Science and Engineering, Interdisciplinary Center for Network Science and Applications, and ECK Institute for Global Health, University of Notre Dame, Notre Dame, IN 46556, USA
| | - H Chen
- Department of Computer Science and Engineering, Interdisciplinary Center for Network Science and Applications, and ECK Institute for Global Health, University of Notre Dame, Notre Dame, IN 46556, USA
| | - T Milenković
- Department of Computer Science and Engineering, Interdisciplinary Center for Network Science and Applications, and ECK Institute for Global Health, University of Notre Dame, Notre Dame, IN 46556, USA
| |
Collapse
|
24
|
Malod-Dognin N, Pržulj N. L-GRAAL: Lagrangian graphlet-based network aligner. ACTA ACUST UNITED AC 2015; 31:2182-9. [PMID: 25725498 PMCID: PMC4481854 DOI: 10.1093/bioinformatics/btv130] [Citation(s) in RCA: 97] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2014] [Accepted: 02/25/2015] [Indexed: 12/31/2022]
Abstract
Motivation: Discovering and understanding patterns in networks of protein–protein interactions (PPIs) is a central problem in systems biology. Alignments between these networks aid functional understanding as they uncover important information, such as evolutionary conserved pathways, protein complexes and functional orthologs. A few methods have been proposed for global PPI network alignments, but because of NP-completeness of underlying sub-graph isomorphism problem, producing topologically and biologically accurate alignments remains a challenge. Results: We introduce a novel global network alignment tool, Lagrangian GRAphlet-based ALigner (L-GRAAL), which directly optimizes both the protein and the interaction functional conservations, using a novel alignment search heuristic based on integer programming and Lagrangian relaxation. We compare L-GRAAL with the state-of-the-art network aligners on the largest available PPI networks from BioGRID and observe that L-GRAAL uncovers the largest common sub-graphs between the networks, as measured by edge-correctness and symmetric sub-structures scores, which allow transferring more functional information across networks. We assess the biological quality of the protein mappings using the semantic similarity of their Gene Ontology annotations and observe that L-GRAAL best uncovers functionally conserved proteins. Furthermore, we introduce for the first time a measure of the semantic similarity of the mapped interactions and show that L-GRAAL also uncovers best functionally conserved interactions. In addition, we illustrate on the PPI networks of baker's yeast and human the ability of L-GRAAL to predict new PPIs. Finally, L-GRAAL's results are the first to show that topological information is more important than sequence information for uncovering functionally conserved interactions. Availability and implementation: L-GRAAL is coded in C++. Software is available at: http://bio-nets.doc.ic.ac.uk/L-GRAAL/. Contact:n.malod-dognin@imperial.ac.uk Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
| | - Nataša Pržulj
- Department of Computing, Imperial College London, London, UK
| |
Collapse
|
25
|
Bosque G, Folch-Fortuny A, Picó J, Ferrer A, Elena SF. Topology analysis and visualization of Potyvirus protein-protein interaction network. BMC SYSTEMS BIOLOGY 2014; 8:129. [PMID: 25409737 PMCID: PMC4251984 DOI: 10.1186/s12918-014-0129-8] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/28/2014] [Accepted: 11/05/2014] [Indexed: 11/25/2022]
Abstract
Background One of the central interests of Virology is the identification of host factors that contribute to virus infection. Despite tremendous efforts, the list of factors identified remains limited. With omics techniques, the focus has changed from identifying and thoroughly characterizing individual host factors to the simultaneous analysis of thousands of interactions, framing them on the context of protein-protein interaction networks and of transcriptional regulatory networks. This new perspective is allowing the identification of direct and indirect viral targets. Such information is available for several members of the Potyviridae family, one of the largest and more important families of plant viruses. Results After collecting information on virus protein-protein interactions from different potyviruses, we have processed it and used it for inferring a protein-protein interaction network. All proteins are connected into a single network component. Some proteins show a high degree and are highly connected while others are much less connected, with the network showing a significant degree of dissortativeness. We have attempted to integrate this virus protein-protein interaction network into the largest protein-protein interaction network of Arabidopsis thaliana, a susceptible laboratory host. To make the interpretation of data and results easier, we have developed a new approach for visualizing and analyzing the dynamic spread on the host network of the local perturbations induced by viral proteins. We found that local perturbations can reach the entire host protein-protein interaction network, although the efficiency of this spread depends on the particular viral proteins. By comparing the spread dynamics among viral proteins, we found that some proteins spread their effects fast and efficiently by attacking hubs in the host network while other proteins exert more local effects. Conclusions Our findings confirm that potyvirus protein-protein interaction networks are highly connected, with some proteins playing the role of hubs. Several topological parameters depend linearly on the protein degree. Some viral proteins focus their effect in only host hubs while others diversify its effect among several proteins at the first step. Future new data will help to refine our model and to improve our predictions. Electronic supplementary material The online version of this article (doi:10.1186/s12918-014-0129-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Gabriel Bosque
- Institut Universitari d'Automàtica i Informàtica Industrial, Universitat Politècnica de València, Camí de Vera s/n, 46022, València, Spain.
| | - Abel Folch-Fortuny
- Departamento de Estadística e Investigación Operativa Aplicadas y Calidad, Universitat Politècnica de València, Camí de Vera, s/n, Edificio 7A, 46022, València, Spain.
| | - Jesús Picó
- Institut Universitari d'Automàtica i Informàtica Industrial, Universitat Politècnica de València, Camí de Vera s/n, 46022, València, Spain.
| | - Alberto Ferrer
- Departamento de Estadística e Investigación Operativa Aplicadas y Calidad, Universitat Politècnica de València, Camí de Vera, s/n, Edificio 7A, 46022, València, Spain.
| | - Santiago F Elena
- Instituto de Biología Molecular y Celular de Plantas, Consejo Superior de Investigaciones Científicas-Universitat Politècnica de València, Campus UPV CPI 8E, Ingeniero Fausto Elio s/n, 46022, València, Spain. .,The Santa Fe Institute, Santa Fe, NM, 87501, USA.
| |
Collapse
|
26
|
Survey of network-based approaches to research of cardiovascular diseases. BIOMED RESEARCH INTERNATIONAL 2014; 2014:527029. [PMID: 24772427 PMCID: PMC3977459 DOI: 10.1155/2014/527029] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/26/2013] [Accepted: 02/07/2014] [Indexed: 01/08/2023]
Abstract
Cardiovascular diseases (CVDs) are the leading health problem worldwide. Investigating causes and mechanisms of CVDs calls for an integrative approach that would take into account its complex etiology. Biological networks generated from available data on biomolecular interactions are an excellent platform for understanding interconnectedness of all processes within a living cell, including processes that underlie diseases. Consequently, topology of biological networks has successfully been used for identifying genes, pathways, and modules that govern molecular actions underlying various complex diseases. Here, we review approaches that explore and use relationships between topological properties of biological networks and mechanisms underlying CVDs.
Collapse
|
27
|
Abstract
MOTIVATION Because susceptibility to diseases increases with age, studying aging gains importance. Analyses of gene expression or sequence data, which have been indispensable for investigating aging, have been limited to studying genes and their protein products in isolation, ignoring their connectivities. However, proteins function by interacting with other proteins, and this is exactly what biological networks (BNs) model. Thus, analyzing the proteins' BN topologies could contribute to the understanding of aging. Current methods for analyzing systems-level BNs deal with their static representations, even though cells are dynamic. For this reason, and because different data types can give complementary biological insights, we integrate current static BNs with aging-related gene expression data to construct dynamic age-specific BNs. Then, we apply sensitive measures of topology to the dynamic BNs to study cellular changes with age. RESULTS While global BN topologies do not significantly change with age, local topologies of a number of genes do. We predict such genes to be aging-related. We demonstrate credibility of our predictions by (i) observing significant overlap between our predicted aging-related genes and 'ground truth' aging-related genes; (ii) observing significant overlap between functions and diseases that are enriched in our aging-related predictions and those that are enriched in 'ground truth' aging-related data; (iii) providing evidence that diseases which are enriched in our aging-related predictions are linked to human aging; and (iv) validating our high-scoring novel predictions in the literature. AVAILABILITY AND IMPLEMENTATION Software executables are available upon request.
Collapse
Affiliation(s)
- Fazle E Faisal
- Department of Computer Science and Engineering, ECK Institute for Global Health and Interdisciplinary Center for Network Science and Applications, University of Notre Dame, Notre Dame, IN 46556, USA
| | - Tijana Milenković
- Department of Computer Science and Engineering, ECK Institute for Global Health and Interdisciplinary Center for Network Science and Applications, University of Notre Dame, Notre Dame, IN 46556, USA
| |
Collapse
|
28
|
Lei C, Tamim S, Bishop AJ, Ruan J. Fully automated protein complex prediction based on topological similarity and community structure. Proteome Sci 2013; 11:S9. [PMID: 24564887 PMCID: PMC3908383 DOI: 10.1186/1477-5956-11-s1-s9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
To understand the function of protein complexes and their association with biological processes, a lot of studies have been done towards analyzing the protein-protein interaction (PPI) networks. However, the advancement in high-throughput technology has resulted in a humongous amount of data for analysis. Moreover, high level of noise, sparseness, and skewness in degree distribution of PPI networks limits the performance of many clustering algorithms and further analysis of their interactions. In addressing and solving these problems we present a novel random walk based algorithm that converts the incomplete and binary PPI network into a protein-protein topological similarity matrix (PP-TS matrix). We believe that if two proteins share some high-order topological similarities they are likely to be interacting with each other. Using the obtained PP-TS matrix, we constructed and used weighted networks to further study and analyze the interaction among proteins. Specifically, we applied a fully automated community structure finding algorithm (Auto-HQcut) on the obtained weighted network to cluster protein complexes. We then analyzed the protein complexes for significance in biological processes. To help visualize and analyze these protein complexes we also developed an interface that displays the resulting complexes as well as the characteristics associated with each complex. Applying our approach to a yeast protein-protein interaction network, we found that the predicted protein-protein interaction pairs with high topological similarities have more significant biological relevance than the original protein-protein interactions pairs. When we compared our PPI network reconstruction algorithm with other existing algorithms using gene ontology and gene co-expression, our algorithm produced the highest similarity scores. Also, our predicted protein complexes showed higher accuracy measure compared to the other protein complex predictions.
Collapse
|
29
|
Winterbach W, Mieghem PV, Reinders M, Wang H, Ridder DD. Topology of molecular interaction networks. BMC SYSTEMS BIOLOGY 2013; 7:90. [PMID: 24041013 PMCID: PMC4231395 DOI: 10.1186/1752-0509-7-90] [Citation(s) in RCA: 75] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/19/2013] [Accepted: 08/01/2013] [Indexed: 12/23/2022]
Abstract
Molecular interactions are often represented as network models which have become the common language of many areas of biology. Graphs serve as convenient mathematical representations of network models and have themselves become objects of study. Their topology has been intensively researched over the last decade after evidence was found that they share underlying design principles with many other types of networks.Initial studies suggested that molecular interaction network topology is related to biological function and evolution. However, further whole-network analyses did not lead to a unified view on what this relation may look like, with conclusions highly dependent on the type of molecular interactions considered and the metrics used to study them. It is unclear whether global network topology drives function, as suggested by some researchers, or whether it is simply a byproduct of evolution or even an artefact of representing complex molecular interaction networks as graphs.Nevertheless, network biology has progressed significantly over the last years. We review the literature, focusing on two major developments. First, realizing that molecular interaction networks can be naturally decomposed into subsystems (such as modules and pathways), topology is increasingly studied locally rather than globally. Second, there is a move from a descriptive approach to a predictive one: rather than correlating biological network topology to generic properties such as robustness, it is used to predict specific functions or phenotypes.Taken together, this change in focus from globally descriptive to locally predictive points to new avenues of research. In particular, multi-scale approaches are developments promising to drive the study of molecular interaction networks further.
Collapse
Affiliation(s)
- Wynand Winterbach
- Network Architectures and Services, Department of Intelligent Systems, Faculty of
Electrical Engineering, Mathematics and Computer Science, Delft University of
Technology, P.O. Box 5031, 2600 GA Delft, The Netherlands
- Delft Bioinformatics Lab, Department of Intelligent Systems, Faculty of Electrical
Engineering, Mathematics and Computer Science, Delft University of Technology,
P.O. Box 5031, 2600 GA Delft, The Netherlands
| | - Piet Van Mieghem
- Network Architectures and Services, Department of Intelligent Systems, Faculty of
Electrical Engineering, Mathematics and Computer Science, Delft University of
Technology, P.O. Box 5031, 2600 GA Delft, The Netherlands
| | - Marcel Reinders
- Delft Bioinformatics Lab, Department of Intelligent Systems, Faculty of Electrical
Engineering, Mathematics and Computer Science, Delft University of Technology,
P.O. Box 5031, 2600 GA Delft, The Netherlands
- Netherlands Bioinformatics Center, 6500 HB Nijmegen, The Netherlands
- Kluyver Centre for Genomics of Industrial Fermentation, 2600 GA Delft, The
Netherlands
| | - Huijuan Wang
- Network Architectures and Services, Department of Intelligent Systems, Faculty of
Electrical Engineering, Mathematics and Computer Science, Delft University of
Technology, P.O. Box 5031, 2600 GA Delft, The Netherlands
| | - Dick de Ridder
- Delft Bioinformatics Lab, Department of Intelligent Systems, Faculty of Electrical
Engineering, Mathematics and Computer Science, Delft University of Technology,
P.O. Box 5031, 2600 GA Delft, The Netherlands
- Netherlands Bioinformatics Center, 6500 HB Nijmegen, The Netherlands
- Kluyver Centre for Genomics of Industrial Fermentation, 2600 GA Delft, The
Netherlands
| |
Collapse
|
30
|
Abstract
Large amounts of protein-protein interaction (PPI) data are available. The human PPI network currently contains over 56 000 interactions between 11 100 proteins. It has been demonstrated that the structure of this network is not random and that the same wiring patterns in it underlie the same biological processes and diseases. In this paper, we ask if there exists a subnetwork of the human PPI network such that its topology is the key to disease formation and hence should be the primary object of therapeutic intervention. We demonstrate that such a subnetwork exists and can be obtained purely computationally. In particular, by successively pruning the entire human PPI network, we are left with a "core" subnetwork that is not only topologically and functionally homogeneous, but is also enriched in disease genes, drug targets, and it contains genes that are known to drive disease formation. We call this subnetwork the Core Diseasome. Furthermore, we show that the topology of the Core Diseasome is unique in the human PPI network suggesting that it may be the wiring of this network that governs the mutagenesis that leads to disease. Explaining the mechanisms behind this phenomenon and exploiting them remains a challenge.
Collapse
Affiliation(s)
- Vuk Janjić
- Department of Computing, Imperial College London, London, SW7 2AZ, UK.
| | | |
Collapse
|
31
|
Cohen D, Kuperstein I, Barillot E, Zinovyev A, Calzone L. From a biological hypothesis to the construction of a mathematical model. Methods Mol Biol 2013; 1021:107-125. [PMID: 23715982 DOI: 10.1007/978-1-62703-450-0_6] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]
Abstract
Mathematical models serve to explain complex biological phenomena and provide predictions that can be tested experimentally. They can provide plausible scenarios of a complex biological behavior when intuition is not sufficient anymore. The process from a biological hypothesis to a mathematical model might be challenging for biologists that are not familiar with mathematical modeling. In this chapter we discuss a possible workflow that describes the steps to be taken starting from a biological hypothesis on a biochemical cellular mechanism to the construction of a mathematical model using the appropriate formalism. An important part of this workflow is formalization of biological knowledge, which can be facilitated by existing tools and standards developed by the systems biology community. This chapter aims at introducing modeling to experts in molecular biology that would like to convert their hypotheses into mathematical models.
Collapse
|
32
|
Levnajić Z. Evolutionary design of non-frustrated networks of phase-repulsive oscillators. Sci Rep 2012; 2:967. [PMID: 23243494 PMCID: PMC3522069 DOI: 10.1038/srep00967] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2012] [Accepted: 11/21/2012] [Indexed: 11/17/2022] Open
Abstract
Evolutionary optimisation algorithm is employed to design networks of phase-repulsive oscillators that achieve an anti-phase synchronised state. By introducing the link frustration, the evolutionary process is implemented by rewiring the links with probability proportional to their frustration, until the final network displaying a unique non-frustrated dynamical state is reached. Resulting networks are bipartite and with zero clustering. In addition, the designed non-frustrated anti-phase synchronised networks display a clear topological scale. This contrasts usually studied cases of networks with phase-attractive dynamics, whose performance towards full synchronisation is typically enhanced by the presence of a topological hierarchy.
Collapse
Affiliation(s)
- Zoran Levnajić
- Faculty of information studies in Novo mesto, Novo mesto, Slovenia.
| |
Collapse
|
33
|
Lei C, Ruan J. A novel link prediction algorithm for reconstructing protein-protein interaction networks by topological similarity. ACTA ACUST UNITED AC 2012; 29:355-64. [PMID: 23235927 DOI: 10.1093/bioinformatics/bts688] [Citation(s) in RCA: 75] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
MOTIVATION Recent advances in technology have dramatically increased the availability of protein-protein interaction (PPI) data and stimulated the development of many methods for improving the systems level understanding the cell. However, those efforts have been significantly hindered by the high level of noise, sparseness and highly skewed degree distribution of PPI networks. Here, we present a novel algorithm to reduce the noise present in PPI networks. The key idea of our algorithm is that two proteins sharing some higher-order topological similarities, measured by a novel random walk-based procedure, are likely interacting with each other and may belong to the same protein complex. RESULTS Applying our algorithm to a yeast PPI network, we found that the edges in the reconstructed network have higher biological relevance than in the original network, assessed by multiple types of information, including gene ontology, gene expression, essentiality, conservation between species and known protein complexes. Comparison with existing methods shows that the network reconstructed by our method has the highest quality. Using two independent graph clustering algorithms, we found that the reconstructed network has resulted in significantly improved prediction accuracy of protein complexes. Furthermore, our method is applicable to PPI networks obtained with different experimental systems, such as affinity purification, yeast two-hybrid (Y2H) and protein-fragment complementation assay (PCA), and evidence shows that the predicted edges are likely bona fide physical interactions. Finally, an application to a human PPI network increased the coverage of the network by at least 100%. AVAILABILITY www.cs.utsa.edu/∼jruan/RWS/.
Collapse
Affiliation(s)
- Chengwei Lei
- Department of Computer Science, The University of Texas at San Antonio, San Antonio, TX 78249, USA
| | | |
Collapse
|
34
|
Abstract
Molecular network data are increasingly becoming available, necessitating the development of well performing computational tools for their analyses. Such tools enabled conceptually different approaches for exploring human diseases to be undertaken, in particular, those that study the relationship between a multitude of biomolecules within a cell. Hence, a new field of network biology has emerged as part of systems biology, aiming to untangle the complexity of cellular network organization. We survey current network analysis methods that aim to give insight into human disease.
Collapse
Affiliation(s)
- Vuk Janjić
- Department of Computing, Imperial College London, 180 Queen's Gate, SW7 2AZ London, UK
| | | |
Collapse
|
35
|
Braun P. Interactome mapping for analysis of complex phenotypes: insights from benchmarking binary interaction assays. Proteomics 2012; 12:1499-518. [PMID: 22589225 DOI: 10.1002/pmic.201100598] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
Protein interactions mediate essentially all biological processes and analysis of protein-protein interactions using both large-scale and small-scale approaches has contributed fundamental insights to the understanding of biological systems. In recent years, interactome network maps have emerged as an important tool for analyzing and interpreting genetic data of complex phenotypes. Complementary experimental approaches to test for binary, direct interactions, and for membership in protein complexes are used to explore the interactome. The two approaches are not redundant but yield orthogonal perspectives onto the complex network of physical interactions by which proteins mediate biological processes. In recent years, several publications have demonstrated that interactions from high-throughput experiments can be equally reliable as the high quality subset of interactions identified in small-scale studies. Critical for this insight was the introduction of standardized experimental benchmarking of interaction and validation assays using reference sets. The data obtained in these benchmarking experiments have resulted in greater appreciation of the limitations and the complementary strengths of different assays. Moreover, benchmarking is a central element of a conceptual framework to estimate interactome sizes and thereby measure progress toward near complete network maps. These estimates have revealed that current large-scale data sets, although often of high quality, cover only a small fraction of a given interactome. Here, I review the findings of assay benchmarking and discuss implications for quality control, and for strategies toward obtaining a near-complete map of the interactome of an organism.
Collapse
Affiliation(s)
- Pascal Braun
- Department of Plant Systems Biology, Center of Life and Food Sciences, Technische Universität München, Freising, Germany.
| |
Collapse
|
36
|
De Las Rivas J, Fontanillo C. Protein-protein interaction networks: unraveling the wiring of molecular machines within the cell. Brief Funct Genomics 2012; 11:489-96. [PMID: 22908212 DOI: 10.1093/bfgp/els036] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open
Abstract
Mapping and understanding of the protein interaction networks with their key modules and hubs can provide deeper insights into the molecular machinery underlying complex phenotypes. In this article, we present the basic characteristics and definitions of protein networks, starting with a distinction of the different types of associations between proteins. We focus the review on protein-protein interactions (PPIs), a subset of associations defined as physical contacts between proteins that occur by selective molecular docking in a particular biological context. We present such definition as opposed to other types of protein associations derived from regulatory, genetic, structural or functional relations. To determine PPIs, a variety of binary and co-complex methods exist; however, not all the technologies provide the same information and data quality. A way of increasing confidence in a given protein interaction is to integrate orthogonal experimental evidences. The use of several complementary methods testing each single interaction assesses the accuracy of PPI data and tries to minimize the occurrence of false interactions. Following this approach there have been important efforts to unify primary databases of experimentally proven PPIs into integrated databases. These meta-databases provide a measure of the confidence of interactions based on the number of experimental proofs that report them. As a conclusion, we can state that integrated information allows the building of more reliable interaction networks. Identification of communities, cliques, modules and hubs by analysing the topological parameters and graph properties of the protein networks allows the discovery of central/critical nodes, which are candidates to regulate cellular flux and dynamics.
Collapse
Affiliation(s)
- Javier De Las Rivas
- Bioinformatics and Functional Genomics Research Group, Cancer Research Center (IBMCC, CSIC/USAL), Salamanca, Spain.
| | | |
Collapse
|
37
|
Saraç OS, Pancaldi V, Bähler J, Beyer A. Topology of functional networks predicts physical binding of proteins. ACTA ACUST UNITED AC 2012; 28:2137-45. [PMID: 22718785 DOI: 10.1093/bioinformatics/bts351] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
MOTIVATION It has been recognized that the topology of molecular networks provides information about the certainty and nature of individual interactions. Thus, network motifs have been used for predicting missing links in biological networks and for removing false positives. However, various different measures can be inferred from the structure of a given network and their predictive power varies depending on the task at hand. RESULTS Herein, we present a systematic assessment of seven different network features extracted from the topology of functional genetic networks and we quantify their ability to classify interactions into different types of physical protein associations. Using machine learning, we combine features based on network topology with non-network features and compare their importance of the classification of interactions. We demonstrate the utility of network features based on human and budding yeast networks; we show that network features can distinguish different sub-types of physical protein associations and we apply the framework to fission yeast, which has a much sparser known physical interactome than the other two species. Our analysis shows that network features are at least as predictive for the tasks we tested as non-network features. However, feature importance varies between species owing to different topological characteristics of the networks. The application to fission yeast shows that small maps of physical interactomes can be extended based on functional networks, which are often more readily available. AVAILABILITY AND IMPLEMENTATION The R-code for computing the network features is available from www.cellularnetworks.org
Collapse
Affiliation(s)
- Omer Sinan Saraç
- Biotechnology Center, Technische Universitt Dresden, D-01062 Dresden, Germany
| | | | | | | |
Collapse
|
38
|
Fung DCY, Li SS, Goel A, Hong SH, Wilkins MR. Visualization of the interactome: What are we looking at? Proteomics 2012; 12:1669-86. [DOI: 10.1002/pmic.201100454] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Affiliation(s)
- David C. Y. Fung
- New South Wales Systems Biology Initiative; and School of Biotechnology and Biomolecular Sciences; The University of New South Wales; New South Wales Australia
| | - Simone S. Li
- New South Wales Systems Biology Initiative; and School of Biotechnology and Biomolecular Sciences; The University of New South Wales; New South Wales Australia
| | - Apurv Goel
- New South Wales Systems Biology Initiative; and School of Biotechnology and Biomolecular Sciences; The University of New South Wales; New South Wales Australia
| | - Seok-Hee Hong
- School of Information Technologies; Faculty of Engineering and Information Technologies; The University of Sydney; New South Wales Australia
| | - Marc R. Wilkins
- New South Wales Systems Biology Initiative; and School of Biotechnology and Biomolecular Sciences; The University of New South Wales; New South Wales Australia
| |
Collapse
|
39
|
Topological analysis and interactive visualization of biological networks and protein structures. Nat Protoc 2012; 7:670-85. [PMID: 22422314 DOI: 10.1038/nprot.2012.004] [Citation(s) in RCA: 332] [Impact Index Per Article: 27.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
Computational analysis and interactive visualization of biological networks and protein structures are common tasks for gaining insight into biological processes. This protocol describes three workflows based on the NetworkAnalyzer and RINalyzer plug-ins for Cytoscape, a popular software platform for networks. NetworkAnalyzer has become a standard Cytoscape tool for comprehensive network topology analysis. In addition, RINalyzer provides methods for exploring residue interaction networks derived from protein structures. The first workflow uses NetworkAnalyzer to perform a topological analysis of biological networks. The second workflow applies RINalyzer to study protein structure and function and to compute network centrality measures. The third workflow combines NetworkAnalyzer and RINalyzer to compare residue networks. The full protocol can be completed in ∼2 h.
Collapse
|