1
|
Gil-Gomez A, Rest JS. Wiring Between Close Nodes in Molecular Networks Evolves More Quickly Than Between Distant Nodes. Mol Biol Evol 2024; 41:msae098. [PMID: 38768245 PMCID: PMC11136681 DOI: 10.1093/molbev/msae098] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2023] [Revised: 04/14/2024] [Accepted: 05/15/2024] [Indexed: 05/22/2024] Open
Abstract
As species diverge, a wide range of evolutionary processes lead to changes in protein-protein interaction (PPI) networks and metabolic networks. The rate at which molecular networks evolve is an important question in evolutionary biology. Previous empirical work has focused on interactomes from model organisms to calculate rewiring rates, but this is limited by the relatively small number of species and sparse nature of network data across species. We present a proxy for variation in network topology: variation in drug-drug interactions (DDIs), obtained by studying drug combinations (DCs) across taxa. Here, we propose the rate at which DDIs change across species as an estimate of the rate at which the underlying molecular network changes as species diverge. We computed the evolutionary rates of DDIs using previously published data from a high-throughput study in gram-negative bacteria. Using phylogenetic comparative methods, we found that DDIs diverge rapidly over short evolutionary time periods, but that divergence saturates over longer time periods. In parallel, we mapped drugs with known targets in PPI and cofunctional networks. We found that the targets of synergistic DDIs are closer in these networks than other types of DCs and that synergistic interactions have a higher evolutionary rate, meaning that nodes that are closer evolve at a faster rate. Future studies of network evolution may use DC data to gain larger-scale perspectives on the details of network evolution within and between species.
Collapse
Affiliation(s)
- Alejandro Gil-Gomez
- Department of Ecology and Evolution, Laufer Center for Physical and Quantitative Biology, Stony Brook University, 650 Life Sciences, Stony Brook, NY 11794-4254, USA
| | - Joshua S Rest
- Department of Ecology and Evolution, Laufer Center for Physical and Quantitative Biology, Stony Brook University, 650 Life Sciences, Stony Brook, NY 11794-4254, USA
| |
Collapse
|
2
|
Mansueto G, Fusco G, Colonna G. A Tiny Viral Protein, SARS-CoV-2-ORF7b: Functional Molecular Mechanisms. Biomolecules 2024; 14:541. [PMID: 38785948 PMCID: PMC11118181 DOI: 10.3390/biom14050541] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2024] [Revised: 04/01/2024] [Accepted: 04/17/2024] [Indexed: 05/25/2024] Open
Abstract
This study presents the interaction with the human host metabolism of SARS-CoV-2 ORF7b protein (43 aa), using a protein-protein interaction network analysis. After pruning, we selected from BioGRID the 51 most significant proteins among 2753 proven interactions and 1708 interactors specific to ORF7b. We used these proteins as functional seeds, and we obtained a significant network of 551 nodes via STRING. We performed topological analysis and calculated topological distributions by Cytoscape. By following a hub-and-spoke network architectural model, we were able to identify seven proteins that ranked high as hubs and an additional seven as bottlenecks. Through this interaction model, we identified significant GO-processes (5057 terms in 15 categories) induced in human metabolism by ORF7b. We discovered high statistical significance processes of dysregulated molecular cell mechanisms caused by acting ORF7b. We detected disease-related human proteins and their involvement in metabolic roles, how they relate in a distorted way to signaling and/or functional systems, in particular intra- and inter-cellular signaling systems, and the molecular mechanisms that supervise programmed cell death, with mechanisms similar to that of cancer metastasis diffusion. A cluster analysis showed 10 compact and significant functional clusters, where two of them overlap in a Giant Connected Component core of 206 total nodes. These two clusters contain most of the high-rank nodes. ORF7b acts through these two clusters, inducing most of the metabolic dysregulation. We conducted a co-regulation and transcriptional analysis by hub and bottleneck proteins. This analysis allowed us to define the transcription factors and miRNAs that control the high-ranking proteins and the dysregulated processes within the limits of the poor knowledge that these sectors still impose.
Collapse
Affiliation(s)
- Gelsomina Mansueto
- Dipartimento di Scienze Mediche e Chirurgiche Avanzate, Università della Campania, L. Vanvitelli, 80138 Naples, Italy;
| | - Giovanna Fusco
- Istituto Zooprofilattico Sperimentale del Mezzogiorno, 80055 Portici, Italy;
| | - Giovanni Colonna
- Medical Informatics AOU, Università della Campania, L. Vanvitelli, 80138 Naples, Italy
| |
Collapse
|
3
|
Caetano-Anollés G. Agency in evolution of biomolecular communication. Ann N Y Acad Sci 2023; 1525:88-103. [PMID: 37219369 DOI: 10.1111/nyas.15005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Biomolecular communication demands that interactions between parts of a molecular system act as scaffolds for message transmission. It also requires an organized system of signs-a communicative agency-for creating and transmitting meaning. The emergence of agency, the capacity to act in a given context and generate end-directed behaviors, has baffled evolutionary biologists for centuries. Here, I explore its emergence with knowledge grounded in over two decades of evolutionary genomic and bioinformatic exploration. Biphasic processes of growth and diversification exist that generate hierarchy and modularity in biological systems at widely ranging time scales. Similarly, a biphasic process exists in communication that constructs a message before it can be transmitted for interpretation. Transmission dissipates matter-energy and information and involves computation. Agency emerges when molecular machinery generates hierarchical layers of vocabularies in an entangled communication network clustered around the universal Turing machine of the ribosome. Computations canalize biological systems to perform biological functions in a dissipative quest to structure long-lived occurrents. This occurs within the confines of a "triangle of persistence" that maximizes invariance with trade-offs between economy, flexibility, and robustness. Thus, learning from previous historical and circumstantial experiences unifies modules in a hierarchy that expands the agency of systems.
Collapse
Affiliation(s)
- Gustavo Caetano-Anollés
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences and C. R. Woese Institute for Genomic Biology, University of Illinois, Urbana, Illinois, USA
| |
Collapse
|
4
|
Crawford-Kahrl P, Nerem RR, Cummins B, Gedeon T. Genetic Networks Encode Secrets of Their Past. J Theor Biol 2022; 541:111092. [DOI: 10.1016/j.jtbi.2022.111092] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2021] [Revised: 03/04/2022] [Accepted: 03/12/2022] [Indexed: 11/25/2022]
|
5
|
OUP accepted manuscript. Brief Funct Genomics 2022; 21:243-269. [DOI: 10.1093/bfgp/elac007] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2021] [Revised: 03/17/2022] [Accepted: 03/18/2022] [Indexed: 11/14/2022] Open
|
6
|
Chefaoui RM. Seasonal variations of waterbird ecological networks under different saltpans management. ECOL INFORM 2021. [DOI: 10.1016/j.ecoinf.2021.101364] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
7
|
Manrubia S, Cuesta JA, Aguirre J, Ahnert SE, Altenberg L, Cano AV, Catalán P, Diaz-Uriarte R, Elena SF, García-Martín JA, Hogeweg P, Khatri BS, Krug J, Louis AA, Martin NS, Payne JL, Tarnowski MJ, Weiß M. From genotypes to organisms: State-of-the-art and perspectives of a cornerstone in evolutionary dynamics. Phys Life Rev 2021; 38:55-106. [PMID: 34088608 DOI: 10.1016/j.plrev.2021.03.004] [Citation(s) in RCA: 36] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2020] [Accepted: 03/01/2021] [Indexed: 12/21/2022]
Abstract
Understanding how genotypes map onto phenotypes, fitness, and eventually organisms is arguably the next major missing piece in a fully predictive theory of evolution. We refer to this generally as the problem of the genotype-phenotype map. Though we are still far from achieving a complete picture of these relationships, our current understanding of simpler questions, such as the structure induced in the space of genotypes by sequences mapped to molecular structures, has revealed important facts that deeply affect the dynamical description of evolutionary processes. Empirical evidence supporting the fundamental relevance of features such as phenotypic bias is mounting as well, while the synthesis of conceptual and experimental progress leads to questioning current assumptions on the nature of evolutionary dynamics-cancer progression models or synthetic biology approaches being notable examples. This work delves with a critical and constructive attitude into our current knowledge of how genotypes map onto molecular phenotypes and organismal functions, and discusses theoretical and empirical avenues to broaden and improve this comprehension. As a final goal, this community should aim at deriving an updated picture of evolutionary processes soundly relying on the structural properties of genotype spaces, as revealed by modern techniques of molecular and functional analysis.
Collapse
Affiliation(s)
- Susanna Manrubia
- Department of Systems Biology, Centro Nacional de Biotecnología (CSIC), Madrid, Spain; Grupo Interdisciplinar de Sistemas Complejos (GISC), Madrid, Spain.
| | - José A Cuesta
- Grupo Interdisciplinar de Sistemas Complejos (GISC), Madrid, Spain; Departamento de Matemáticas, Universidad Carlos III de Madrid, Leganés, Spain; Instituto de Biocomputación y Física de Sistemas Complejos (BiFi), Universidad de Zaragoza, Spain; UC3M-Santander Big Data Institute (IBiDat), Getafe, Madrid, Spain
| | - Jacobo Aguirre
- Grupo Interdisciplinar de Sistemas Complejos (GISC), Madrid, Spain; Centro de Astrobiología, CSIC-INTA, ctra. de Ajalvir km 4, 28850 Torrejón de Ardoz, Madrid, Spain
| | - Sebastian E Ahnert
- Department of Chemical Engineering and Biotechnology, University of Cambridge, Philippa Fawcett Drive, Cambridge CB3 0AS, UK; The Alan Turing Institute, British Library, 96 Euston Road, London NW1 2DB, UK
| | | | - Alejandro V Cano
- Institute of Integrative Biology, ETH Zurich, Zurich, Switzerland; Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Pablo Catalán
- Grupo Interdisciplinar de Sistemas Complejos (GISC), Madrid, Spain; Departamento de Matemáticas, Universidad Carlos III de Madrid, Leganés, Spain
| | - Ramon Diaz-Uriarte
- Department of Biochemistry, Universidad Autónoma de Madrid, Madrid, Spain; Instituto de Investigaciones Biomédicas "Alberto Sols" (UAM-CSIC), Madrid, Spain
| | - Santiago F Elena
- Instituto de Biología Integrativa de Sistemas, I(2)SysBio (CSIC-UV), València, Spain; The Santa Fe Institute, Santa Fe, NM, USA
| | | | - Paulien Hogeweg
- Theoretical Biology and Bioinformatics Group, Utrecht University, the Netherlands
| | - Bhavin S Khatri
- The Francis Crick Institute, London, UK; Department of Life Sciences, Imperial College London, London, UK
| | - Joachim Krug
- Institute for Biological Physics, University of Cologne, Köln, Germany
| | - Ard A Louis
- Rudolf Peierls Centre for Theoretical Physics, University of Oxford, Oxford, UK
| | - Nora S Martin
- Theory of Condensed Matter Group, Cavendish Laboratory, University of Cambridge, Cambridge, UK; Sainsbury Laboratory, University of Cambridge, Cambridge, UK
| | - Joshua L Payne
- Institute of Integrative Biology, ETH Zurich, Zurich, Switzerland; Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | | | - Marcel Weiß
- Theory of Condensed Matter Group, Cavendish Laboratory, University of Cambridge, Cambridge, UK; Sainsbury Laboratory, University of Cambridge, Cambridge, UK
| |
Collapse
|
8
|
Wéber R, Huzsvár T, Hős C. Vulnerability analysis of water distribution networks to accidental pipe burst. WATER RESEARCH 2020; 184:116178. [PMID: 32707306 DOI: 10.1016/j.watres.2020.116178] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/02/2020] [Revised: 06/18/2020] [Accepted: 07/12/2020] [Indexed: 06/11/2023]
Abstract
Even the best-maintained water distribution network (WDN) might suffer pipe bursts occasionally, and the utility company must reconstruct the damaged sections of the system. The affected area must be segregated by closing the corresponding isolation valves; as a result, the required amount of drinking water might not be available. This paper explores the behaviour and topology of segments, especially their criticality from the viewpoint of the whole system. A novel, objective, dimensionless, segment-based quantity is proposed to evaluate the vulnerability of both the segments and the whole WDN against a single, incidental pipe break, computed as the product of the probability of failure within the segment and the amount of unserved consumption. 27 comprehensive real-life WDNs have been examined by means of the new metric and with the help of complex network theory, exploiting the concept of the degree distribution and topology-based structural properties (e.g. network diameter, clustering coefficient). It was found that metrics based purely on topology suggest different network behaviour as vulnerability analysis, which also includes the hydraulics. The investigation of the global network vulnerabilities has revealed several critically exposed systems, and the local distributions unveiled new properties of WDNs in the case of a random pipe break.
Collapse
Affiliation(s)
- Richárd Wéber
- Budapest University of Technology and Economics, Faculty of Mechanical Engineering,Department of Hydrodynamic Systems, Hungary.
| | - Tamás Huzsvár
- Budapest University of Technology and Economics, Faculty of Mechanical Engineering,Department of Hydrodynamic Systems, Hungary.
| | - Csaba Hős
- Budapest University of Technology and Economics, Faculty of Mechanical Engineering,Department of Hydrodynamic Systems, Hungary.
| |
Collapse
|
9
|
Correa Marrero M, Immink RGH, de Ridder D, van Dijk ADJ. Improved inference of intermolecular contacts through protein-protein interaction prediction using coevolutionary analysis. Bioinformatics 2020; 35:2036-2042. [PMID: 30398547 DOI: 10.1093/bioinformatics/bty924] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2018] [Revised: 10/11/2018] [Accepted: 11/05/2018] [Indexed: 01/09/2023] Open
Abstract
MOTIVATION Predicting residue-residue contacts between interacting proteins is an important problem in bioinformatics. The growing wealth of sequence data can be used to infer these contacts through correlated mutation analysis on multiple sequence alignments of interacting homologs of the proteins of interest. This requires correct identification of pairs of interacting proteins for many species, in order to avoid introducing noise (i.e. non-interacting sequences) in the analysis that will decrease predictive performance. RESULTS We have designed Ouroboros, a novel algorithm to reduce such noise in intermolecular contact prediction. Our method iterates between weighting proteins according to how likely they are to interact based on the correlated mutations signal, and predicting correlated mutations based on the weighted sequence alignment. We show that this approach accurately discriminates between protein interaction versus non-interaction and simultaneously improves the prediction of intermolecular contact residues compared to a naive application of correlated mutation analysis. This requires no training labels concerning interactions or contacts. Furthermore, the method relaxes the assumption of one-to-one interaction of previous approaches, allowing for the study of many-to-many interactions. AVAILABILITY AND IMPLEMENTATION Source code and test data are available at www.bif.wur.nl/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
| | - Richard G H Immink
- Laboratory of Molecular Biology, Department of Plant Sciences.,Bioscience, Wageningen Plant Research
| | | | - Aalt D J van Dijk
- Bioinformatics Group, Department of Plant Sciences.,Bioscience, Wageningen Plant Research.,Biometris, Department of Plant Sciences, Wageningen University & Research, Wageningen PB, The Netherlands
| |
Collapse
|
10
|
Dandage R, Landry CR. Paralog dependency indirectly affects the robustness of human cells. Mol Syst Biol 2019; 15:e8871. [PMID: 31556487 PMCID: PMC6757259 DOI: 10.15252/msb.20198871] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2019] [Revised: 08/26/2019] [Accepted: 08/28/2019] [Indexed: 12/19/2022] Open
Abstract
The protective redundancy of paralogous genes partly relies on the fact that they carry their functions independently. However, a significant fraction of paralogous proteins may form functionally dependent pairs, for instance, through heteromerization. As a consequence, one could expect these heteromeric paralogs to be less protective against deleterious mutations. To test this hypothesis, we examined the robustness landscape of gene loss-of-function by CRISPR-Cas9 in more than 450 human cell lines. This landscape shows regions of greater deleteriousness to gene inactivation as a function of key paralog properties. Heteromeric paralogs are more likely to occupy such regions owing to their high expression and large number of protein-protein interaction partners. Further investigation revealed that heteromers may also be under stricter dosage balance, which may also contribute to the higher deleteriousness upon gene inactivation. Finally, we suggest that physical dependency may contribute to the deleteriousness upon loss-of-function as revealed by the correlation between the strength of interactions between paralogs and their higher deleteriousness upon loss of function.
Collapse
Affiliation(s)
- Rohan Dandage
- Département de BiologieUniversité LavalQuébecQCCanada
- Département de Biochimie, Microbiologie et Bio‐InformatiqueUniversité LavalQuébecQCCanada
- Institut de Biologie Intégrative et des Systèmes (IBIS)Université LavalQuébecQCCanada
- The Québec Network for Research on Protein Function, Engineering, and Applications (PROTEO)Université LavalQuébecQCCanada
- Centre de Recherche en Données Massives (CRDM)Université LavalQuébecQCCanada
| | - Christian R Landry
- Département de BiologieUniversité LavalQuébecQCCanada
- Département de Biochimie, Microbiologie et Bio‐InformatiqueUniversité LavalQuébecQCCanada
- Institut de Biologie Intégrative et des Systèmes (IBIS)Université LavalQuébecQCCanada
- The Québec Network for Research on Protein Function, Engineering, and Applications (PROTEO)Université LavalQuébecQCCanada
- Centre de Recherche en Données Massives (CRDM)Université LavalQuébecQCCanada
| |
Collapse
|
11
|
Marchant A, Cisneros AF, Dubé AK, Gagnon-Arsenault I, Ascencio D, Jain H, Aubé S, Eberlein C, Evans-Yamamoto D, Yachie N, Landry CR. The role of structural pleiotropy and regulatory evolution in the retention of heteromers of paralogs. eLife 2019; 8:46754. [PMID: 31454312 PMCID: PMC6711710 DOI: 10.7554/elife.46754] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2019] [Accepted: 08/11/2019] [Indexed: 01/07/2023] Open
Abstract
Gene duplication is a driver of the evolution of new functions. The duplication of genes encoding homomeric proteins leads to the formation of homomers and heteromers of paralogs, creating new complexes after a single duplication event. The loss of these heteromers may be required for the two paralogs to evolve independent functions. Using yeast as a model, we find that heteromerization is frequent among duplicated homomers and correlates with functional similarity between paralogs. Using in silico evolution, we show that for homomers and heteromers sharing binding interfaces, mutations in one paralog can have structural pleiotropic effects on both interactions, resulting in highly correlated responses of the complexes to selection. Therefore, heteromerization could be preserved indirectly due to selection for the maintenance of homomers, thus slowing down functional divergence between paralogs. We suggest that paralogs can overcome the obstacle of structural pleiotropy by regulatory evolution at the transcriptional and post-translational levels.
Collapse
Affiliation(s)
- Axelle Marchant
- Département de biochimie, de microbiologie et de bio-informatique, Université Laval, Québec, Canada.,PROTEO, le réseau québécois de recherche sur la fonction, la structure et l'ingénierie des protéines, Université Laval, Québec, Canada.,Centre de Recherche en Données Massives (CRDM), Université Laval, Québec, Canada.,Département de biologie, Université Laval, Québec, Canada
| | - Angel F Cisneros
- Département de biochimie, de microbiologie et de bio-informatique, Université Laval, Québec, Canada.,PROTEO, le réseau québécois de recherche sur la fonction, la structure et l'ingénierie des protéines, Université Laval, Québec, Canada.,Centre de Recherche en Données Massives (CRDM), Université Laval, Québec, Canada
| | - Alexandre K Dubé
- Département de biochimie, de microbiologie et de bio-informatique, Université Laval, Québec, Canada.,PROTEO, le réseau québécois de recherche sur la fonction, la structure et l'ingénierie des protéines, Université Laval, Québec, Canada.,Centre de Recherche en Données Massives (CRDM), Université Laval, Québec, Canada.,Département de biologie, Université Laval, Québec, Canada
| | - Isabelle Gagnon-Arsenault
- Département de biochimie, de microbiologie et de bio-informatique, Université Laval, Québec, Canada.,PROTEO, le réseau québécois de recherche sur la fonction, la structure et l'ingénierie des protéines, Université Laval, Québec, Canada.,Centre de Recherche en Données Massives (CRDM), Université Laval, Québec, Canada.,Département de biologie, Université Laval, Québec, Canada
| | - Diana Ascencio
- Département de biochimie, de microbiologie et de bio-informatique, Université Laval, Québec, Canada.,PROTEO, le réseau québécois de recherche sur la fonction, la structure et l'ingénierie des protéines, Université Laval, Québec, Canada.,Centre de Recherche en Données Massives (CRDM), Université Laval, Québec, Canada.,Département de biologie, Université Laval, Québec, Canada
| | - Honey Jain
- Département de biochimie, de microbiologie et de bio-informatique, Université Laval, Québec, Canada.,PROTEO, le réseau québécois de recherche sur la fonction, la structure et l'ingénierie des protéines, Université Laval, Québec, Canada.,Centre de Recherche en Données Massives (CRDM), Université Laval, Québec, Canada.,Department of Biological Sciences, Birla Institute of Technology and Sciences, Pilani, India
| | - Simon Aubé
- Département de biochimie, de microbiologie et de bio-informatique, Université Laval, Québec, Canada.,PROTEO, le réseau québécois de recherche sur la fonction, la structure et l'ingénierie des protéines, Université Laval, Québec, Canada.,Centre de Recherche en Données Massives (CRDM), Université Laval, Québec, Canada
| | - Chris Eberlein
- PROTEO, le réseau québécois de recherche sur la fonction, la structure et l'ingénierie des protéines, Université Laval, Québec, Canada.,Centre de Recherche en Données Massives (CRDM), Université Laval, Québec, Canada.,Département de biologie, Université Laval, Québec, Canada
| | - Daniel Evans-Yamamoto
- Research Center for Advanced Science and Technology, University of Tokyo, Tokyo, Japan.,Institute for Advanced Biosciences, Keio University, Tsuruoka, Japan.,Graduate School of Media and Governance, Keio University, Fujisawa, Japan
| | - Nozomu Yachie
- Research Center for Advanced Science and Technology, University of Tokyo, Tokyo, Japan.,Institute for Advanced Biosciences, Keio University, Tsuruoka, Japan.,Graduate School of Media and Governance, Keio University, Fujisawa, Japan.,Department of Biological Sciences, Graduate School of Science, University of Tokyo, Tokyo, Japan
| | - Christian R Landry
- Département de biochimie, de microbiologie et de bio-informatique, Université Laval, Québec, Canada.,PROTEO, le réseau québécois de recherche sur la fonction, la structure et l'ingénierie des protéines, Université Laval, Québec, Canada.,Centre de Recherche en Données Massives (CRDM), Université Laval, Québec, Canada.,Département de biologie, Université Laval, Québec, Canada
| |
Collapse
|
12
|
Kwon D, Lee D, Kim J, Lee J, Sim M, Kim J. INTERSPIA: a web application for exploring the dynamics of protein-protein interactions among multiple species. Nucleic Acids Res 2019; 46:W89-W94. [PMID: 29746660 PMCID: PMC6031021 DOI: 10.1093/nar/gky378] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2018] [Accepted: 04/27/2018] [Indexed: 02/06/2023] Open
Abstract
Proteins perform biological functions through cascading interactions with each other by forming protein complexes. As a result, interactions among proteins, called protein-protein interactions (PPIs) are not completely free from selection constraint during evolution. Therefore, the identification and analysis of PPI changes during evolution can give us new insight into the evolution of functions. Although many algorithms, databases and websites have been developed to help the study of PPIs, most of them are limited to visualize the structure and features of PPIs in a chosen single species with limited functions in the visualization perspective. This leads to difficulties in the identification of different patterns of PPIs in different species and their functional consequences. To resolve these issues, we developed a web application, called INTER-Species Protein Interaction Analysis (INTERSPIA). Given a set of proteins of user's interest, INTERSPIA first discovers additional proteins that are functionally associated with the input proteins and searches for different patterns of PPIs in multiple species through a server-side pipeline, and second visualizes the dynamics of PPIs in multiple species using an easy-to-use web interface. INTERSPIA is freely available at http://bioinfo.konkuk.ac.kr/INTERSPIA/.
Collapse
Affiliation(s)
- Daehong Kwon
- Department of Biomedical Science and Engineering, Konkuk University, Seoul 05029, Korea
| | - Daehwan Lee
- Department of Biomedical Science and Engineering, Konkuk University, Seoul 05029, Korea
| | - Juyeon Kim
- Department of Biomedical Science and Engineering, Konkuk University, Seoul 05029, Korea
| | - Jongin Lee
- Department of Biomedical Science and Engineering, Konkuk University, Seoul 05029, Korea
| | - Mikang Sim
- Department of Biomedical Science and Engineering, Konkuk University, Seoul 05029, Korea
| | - Jaebum Kim
- Department of Biomedical Science and Engineering, Konkuk University, Seoul 05029, Korea
| |
Collapse
|
13
|
Identification of influential invaders in evolutionary populations. Sci Rep 2019; 9:7305. [PMID: 31086258 PMCID: PMC6514010 DOI: 10.1038/s41598-019-43853-9] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2019] [Accepted: 05/01/2019] [Indexed: 11/08/2022] Open
Abstract
The identification of the most influential nodes has been a vibrant subject of research across the whole of network science. Here we map this problem to structured evolutionary populations, where strategies and the interaction network are both subject to change over time based on social inheritance. We study cooperative communities, which cheaters can invade because they avoid the cost of contributions that are associated with cooperation. The question that we seek to answer is at which nodes cheaters invade most successfully. We propose the weighted degree decomposition to identify and rank the most influential invaders. More specifically, we distinguish two kinds of ranking based on the weighted degree decomposition. We show that a ranking strategy based on negative-weighted degree allows to successfully identify the most influential invaders in the case of weak selection, while a ranking strategy based on positive-weighted degree performs better when the selection is strong. Our research thus reveals how to identify the most influential invaders based on statistical measures in dynamically evolving cooperative communities.
Collapse
|
14
|
Topological evolution of coexpression networks by new gene integration maintains the hierarchical and modular structures in human ancestors. SCIENCE CHINA-LIFE SCIENCES 2019; 62:594-608. [PMID: 30919280 DOI: 10.1007/s11427-019-9483-6] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/15/2018] [Accepted: 11/05/2018] [Indexed: 12/23/2022]
Abstract
We analyze the global structure and evolution of human gene coexpression networks driven by new gene integration. When the Pearson correlation coefficient is greater than or equal to 0.5, we find that the coexpression network consists of 334 small components and one "giant" connected subnet comprising of 6317 interacting genes. This network shows the properties of power-law degree distribution and small-world. The average clustering coefficient of younger genes is larger than that of the elderly genes (0.6685 vs. 0.5762). Particularly, we find that the younger genes with a larger degree also show a property of hierarchical architecture. The younger genes play an important role in the overall pivotability of the network and this network contains few redundant duplicate genes. Moreover, we find that gene duplication and orphan genes are two dominant evolutionary forces in shaping this network. Both the duplicate genes and orphan genes develop new links through a "rich-gets-richer" mechanism. With the gradual integration of new genes into the ancestral network, most of the topological structure features of the network would gradually increase. However, the exponent of degree distribution and modularity coefficient of the whole network do not change significantly, which implies that the evolution of coexpression networks maintains the hierarchical and modular structures in human ancestors.
Collapse
|
15
|
Jabbari K, Heger P, Sharma R, Wiehe T. The Diverging Routes of BORIS and CTCF: An Interactomic and Phylogenomic Analysis. Life (Basel) 2018; 8:life8010004. [PMID: 29385718 PMCID: PMC5871936 DOI: 10.3390/life8010004] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2017] [Revised: 01/25/2018] [Accepted: 01/25/2018] [Indexed: 12/11/2022] Open
Abstract
The CCCTC-binding factor (CTCF) is multi-functional, ubiquitously expressed, and highly conserved from Drosophila to human. It has important roles in transcriptional insulation and the formation of a high-dimensional chromatin structure. CTCF has a paralog called “Brother of Regulator of Imprinted Sites” (BORIS) or “CTCF-like” (CTCFL). It binds DNA at sites similar to those of CTCF. However, the expression profiles of the two proteins are quite different. We investigated the evolutionary trajectories of the two proteins after the duplication event using a phylogenomic and interactomic approach. We find that CTCF has 52 direct interaction partners while CTCFL only has 19. Almost all interactors already existed before the emergence of CTCF and CTCFL. The unique secondary loss of CTCF from several nematodes is paralleled by a loss of two of its interactors, the polycomb repressive complex subunit SuZ12 and the multifunctional transcription factor TYY1. In contrast to earlier studies reporting the absence of BORIS from birds, we present evidence for a multigene synteny block containing CTCFL that is conserved in mammals, reptiles, and several species of birds, indicating that not the entire lineage of birds experienced a loss of CTCFL. Within this synteny block, BORIS and its genomic neighbors seem to be partitioned into two nested chromatin loops. The high expression of SPO11, RAE1, RBM38, and PMEPA1 in male tissues suggests a possible link between CTCFL, meiotic recombination, and fertility-associated phenotypes. Using the 65,700 exomes and the 1000 genomes data, we observed a higher number of intergenic, non-synonymous, and loss-of-function mutations in CTCFL than in CTCF, suggesting a reduced strength of purifying selection, perhaps due to less functional constraint.
Collapse
Affiliation(s)
- Kamel Jabbari
- Cologne Biocenter, Institute for Genetics, University of Cologne, Zülpicher Straße 47a, 50674 Köln, Germany.
| | - Peter Heger
- Cologne Biocenter, Institute for Genetics, University of Cologne, Zülpicher Straße 47a, 50674 Köln, Germany.
| | - Ranu Sharma
- Cologne Biocenter, Institute for Genetics, University of Cologne, Zülpicher Straße 47a, 50674 Köln, Germany.
| | - Thomas Wiehe
- Cologne Biocenter, Institute for Genetics, University of Cologne, Zülpicher Straße 47a, 50674 Köln, Germany.
| |
Collapse
|
16
|
Abstract
Sequence and structure space are nowadays sufficiently large that we can use computational methods to model the structure of proteins based on sequence similarity alone. Not only useful as a standalone tool, homology modelling has also had a transformative effect on the ease with which we can solve crystal structures and electron density maps. Another technique-molecular dynamics-aims to model protein structures from first principles and, thanks to increases in computational power, is slowly becoming a viable tool for studying protein complexes. Finally, the prediction of protein assembly pathways from three-dimensional structures of complexes is also now becoming possible.
Collapse
Affiliation(s)
- Jonathan N Wells
- MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, UK.
| | - L Therese Bergendahl
- MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, UK
| | - Joseph A Marsh
- MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, UK
| |
Collapse
|
17
|
Konini S, Janse van Rensburg EJ. Mean field analysis of algorithms for scale-free networks in molecular biology. PLoS One 2017; 12:e0189866. [PMID: 29272285 PMCID: PMC5741260 DOI: 10.1371/journal.pone.0189866] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2017] [Accepted: 12/04/2017] [Indexed: 11/25/2022] Open
Abstract
The sampling of scale-free networks in Molecular Biology is usually achieved by growing networks from a seed using recursive algorithms with elementary moves which include the addition and deletion of nodes and bonds. These algorithms include the Barabási-Albert algorithm. Later algorithms, such as the Duplication-Divergence algorithm, the Solé algorithm and the iSite algorithm, were inspired by biological processes underlying the evolution of protein networks, and the networks they produce differ essentially from networks grown by the Barabási-Albert algorithm. In this paper the mean field analysis of these algorithms is reconsidered, and extended to variant and modified implementations of the algorithms. The degree sequences of scale-free networks decay according to a powerlaw distribution, namely P(k) ∼ k−γ, where γ is a scaling exponent. We derive mean field expressions for γ, and test these by numerical simulations. Generally, good agreement is obtained. We also found that some algorithms do not produce scale-free networks (for example some variant Barabási-Albert and Solé networks).
Collapse
Affiliation(s)
- S. Konini
- Mathematics & Statistics, York University, Toronto, Ontario, M3J 1P3, Canada
| | | |
Collapse
|
18
|
Diss G, Gagnon-Arsenault I, Dion-Coté AM, Vignaud H, Ascencio DI, Berger CM, Landry CR. Gene duplication can impart fragility, not robustness, in the yeast protein interaction network. Science 2017; 355:630-634. [PMID: 28183979 DOI: 10.1126/science.aai7685] [Citation(s) in RCA: 80] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2016] [Accepted: 01/13/2017] [Indexed: 12/18/2022]
Abstract
The maintenance of duplicated genes is thought to protect cells from genetic perturbations, but the molecular basis of this robustness is largely unknown. By measuring the interaction of yeast proteins with their partners in wild-type cells and in cells lacking a paralog, we found that 22 out of 56 paralog pairs compensate for the lost interactions. An equivalent number of pairs exhibit the opposite behavior and require each other's presence for maintaining their interactions. These dependent paralogs generally interact physically, regulate each other's abundance, and derive from ancestral self-interacting proteins. This reveals that gene duplication may actually increase mutational fragility instead of robustness in a large number of cases.
Collapse
Affiliation(s)
- Guillaume Diss
- Département de Biologie, Université Laval, Québec, QC, Canada.,The Quebec Network for Research on Protein Function, Engineering, and Applications, Université Laval, Québec, QC, Canada.,Institut de Biologie Intégrative et des Systèmes, Université Laval, Québec, QC, Canada.,EMBL/CRG Systems Biology Research Unit, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology (BIST), Doctor Aiguader 88, 08003 Barcelona, Spain.,Universitat Pompeu Fabra (UPF), 08003 Barcelona, Spain
| | - Isabelle Gagnon-Arsenault
- Département de Biologie, Université Laval, Québec, QC, Canada.,The Quebec Network for Research on Protein Function, Engineering, and Applications, Université Laval, Québec, QC, Canada.,Institut de Biologie Intégrative et des Systèmes, Université Laval, Québec, QC, Canada
| | - Anne-Marie Dion-Coté
- Département de Biologie, Université Laval, Québec, QC, Canada.,The Quebec Network for Research on Protein Function, Engineering, and Applications, Université Laval, Québec, QC, Canada.,Institut de Biologie Intégrative et des Systèmes, Université Laval, Québec, QC, Canada
| | - Hélène Vignaud
- Département de Biologie, Université Laval, Québec, QC, Canada.,The Quebec Network for Research on Protein Function, Engineering, and Applications, Université Laval, Québec, QC, Canada.,Institut de Biologie Intégrative et des Systèmes, Université Laval, Québec, QC, Canada
| | - Diana I Ascencio
- Département de Biologie, Université Laval, Québec, QC, Canada.,The Quebec Network for Research on Protein Function, Engineering, and Applications, Université Laval, Québec, QC, Canada.,Institut de Biologie Intégrative et des Systèmes, Université Laval, Québec, QC, Canada.,Laboratorio Nacional de Genómica para la Biodiversidad (LANGEBIO), Centro de Investigación y de Estudios Avanzados del Instituto Politécnico Nacional, Irapuato, Guanajuato, Mexico
| | - Caroline M Berger
- Département de Biologie, Université Laval, Québec, QC, Canada.,The Quebec Network for Research on Protein Function, Engineering, and Applications, Université Laval, Québec, QC, Canada.,Institut de Biologie Intégrative et des Systèmes, Université Laval, Québec, QC, Canada
| | - Christian R Landry
- Département de Biologie, Université Laval, Québec, QC, Canada. .,The Quebec Network for Research on Protein Function, Engineering, and Applications, Université Laval, Québec, QC, Canada.,Institut de Biologie Intégrative et des Systèmes, Université Laval, Québec, QC, Canada
| |
Collapse
|
19
|
Holland DO, Shapiro BH, Xue P, Johnson ME. Protein-protein binding selectivity and network topology constrain global and local properties of interface binding networks. Sci Rep 2017; 7:5631. [PMID: 28717235 PMCID: PMC5514078 DOI: 10.1038/s41598-017-05686-2] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2017] [Accepted: 06/01/2017] [Indexed: 01/30/2023] Open
Abstract
Protein-protein interactions networks (PPINs) are known to share a highly conserved structure across all organisms. What is poorly understood, however, is the structure of the child interface interaction networks (IINs), which map the binding sites proteins use for each interaction. In this study we analyze four independently constructed IINs from yeast and humans and find a conserved structure of these networks with a unique topology distinct from the parent PPIN. Using an IIN sampling algorithm and a fitness function trained on the manually curated PPINs, we show that IIN topology can be mostly explained as a balance between limits on interface diversity and a need for physico-chemical binding complementarity. This complementarity must be optimized both for functional interactions and against mis-interactions, and this selectivity is encoded in the IIN motifs. To test whether the parent PPIN shapes IINs, we compared optimal IINs in biological PPINs versus random PPINs. We found that the hubs in biological networks allow for selective binding with minimal interfaces, suggesting that binding specificity is an additional pressure for a scale-free-like PPIN. We confirm through phylogenetic analysis that hub interfaces are strongly conserved and rewiring of interactions between proteins involved in endocytosis preserves interface binding selectivity.
Collapse
Affiliation(s)
- David O Holland
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, Maryland, USA
| | - Benjamin H Shapiro
- Department of Biophysics, Johns Hopkins University, Baltimore, Maryland, USA
| | - Pei Xue
- Department of Biophysics, Johns Hopkins University, Baltimore, Maryland, USA
| | - Margaret E Johnson
- Department of Biophysics, Johns Hopkins University, Baltimore, Maryland, USA.
| |
Collapse
|
20
|
Zhou H, Liu J, Li J, Duan W. A density-based approach for detecting complexes in weighted PPI networks by semantic similarity. PLoS One 2017; 12:e0180570. [PMID: 28704455 PMCID: PMC5507511 DOI: 10.1371/journal.pone.0180570] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2017] [Accepted: 06/16/2017] [Indexed: 11/23/2022] Open
Abstract
Protein complex detection in PPI networks plays an important role in analyzing biological processes. A new algorithm-DBGPWN-is proposed for predicting complexes in PPI networks. Firstly, a method based on gene ontology is used to measure semantic similarities between interacted proteins, and the similarity values are used as their weights. Then, a density-based graph partitioning algorithm is developed to find clusters in the weighted PPI networks, and the identified ones are considered to be dense and similar. Experimental results demonstrate that our approach achieves good performance as compared with such algorithms as MCL, CMC, MCODE, RNSC, CORE, ClusterOne and FGN.
Collapse
Affiliation(s)
- HongFang Zhou
- School of Computer Science and Engineering, Xi'an University of Technology, Xi’an, China
| | - Jie Liu
- School of Computer Science and Engineering, Xi'an University of Technology, Xi’an, China
| | - JunHuai Li
- School of Computer Science and Engineering, Xi'an University of Technology, Xi’an, China
| | - WenCong Duan
- School of Computer Science and Engineering, Xi'an University of Technology, Xi’an, China
| |
Collapse
|
21
|
Morrison ES, Badyaev AV. The Landscape of Evolution: Reconciling Structural and Dynamic Properties of Metabolic Networks in Adaptive Diversifications. Integr Comp Biol 2016; 56:235-46. [PMID: 27252203 DOI: 10.1093/icb/icw026] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
The network of the interactions among genes, proteins, and metabolites delineates a range of potential phenotypic diversifications in a lineage, and realized phenotypic changes are the result of differences in the dynamics of the expression of the elements and interactions in this deterministic network. Regulatory mechanisms, such as hormones, mediate the relationship between the structural and dynamic properties of networks by determining how and when the elements are expressed and form a functional unit or state. Changes in regulatory mechanisms lead to variable expression of functional states of a network within and among generations. Functional properties of network elements, and the magnitude and direction of evolutionary change they determine, depend on their location within a network. Here, we examine the relationship between network structure and the dynamic mechanisms that regulate flux through a metabolic network. We review the mechanisms that control metabolic flux in enzymatic reactions and examine structural properties of the network locations that are targets of flux control. We aim to establish a predictive framework to test the contributions of structural and dynamic properties of deterministic networks to evolutionary diversifications.
Collapse
Affiliation(s)
- Erin S Morrison
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721-0001, USA
| | - Alexander V Badyaev
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721-0001, USA
| |
Collapse
|
22
|
Ames RM, Talavera D, Williams SG, Robertson DL, Lovell SC. Binding interface change and cryptic variation in the evolution of protein-protein interactions. BMC Evol Biol 2016; 16:40. [PMID: 26892785 PMCID: PMC4758157 DOI: 10.1186/s12862-016-0608-1] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2015] [Accepted: 02/02/2016] [Indexed: 12/03/2022] Open
Abstract
Background Physical interactions between proteins are essential for almost all biological functions and systems. To understand the evolution of function it is therefore important to understand the evolution of molecular interactions. Of key importance is the evolution of binding specificity, the set of interactions made by a protein, since change in specificity can lead to “rewiring” of interaction networks. Unfortunately, the interfaces through which proteins interact are complex, typically containing many amino-acid residues that collectively must contribute to binding specificity as well as binding affinity, structural integrity of the interface and solubility in the unbound state. Results In order to study the relationship between interface composition and binding specificity, we make use of paralogous pairs of yeast proteins. Immediately after duplication these paralogues will have identical sequences and protein products that make an identical set of interactions. As the sequences diverge, we can correlate amino-acid change in the interface with any change in the specificity of binding. We show that change in interface regions correlates only weakly with change in specificity, and many variants in interfaces are functionally equivalent. We show that many of the residue replacements within interfaces are silent with respect to their contribution to binding specificity. Conclusions We conclude that such functionally-equivalent change has the potential to contribute to evolutionary plasticity in interfaces by creating cryptic variation, which in turn may provide the raw material for functional innovation and coevolution. Electronic supplementary material The online version of this article (doi:10.1186/s12862-016-0608-1) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Ryan M Ames
- Computational and Evolutionary Biology, Faculty of Life Sciences, University of Manchester, Oxford Road, Manchester, M13 9PT, UK. .,Current address: Wellcome Trust Centre for Biomedical Modelling and Analysis, University of Exeter, RILD Level 3, Exeter, EX2 5DW, UK.
| | - David Talavera
- Computational and Evolutionary Biology, Faculty of Life Sciences, University of Manchester, Oxford Road, Manchester, M13 9PT, UK. .,Current address: Institute of Cardiovascular Sciences, Faculty of Medical and Human Sciences, University of Manchester, Oxford Road, Manchester, M13 9PT, UK.
| | - Simon G Williams
- Computational and Evolutionary Biology, Faculty of Life Sciences, University of Manchester, Oxford Road, Manchester, M13 9PT, UK. .,Current address: Institute of Cardiovascular Sciences, Faculty of Medical and Human Sciences, University of Manchester, Oxford Road, Manchester, M13 9PT, UK.
| | - David L Robertson
- Computational and Evolutionary Biology, Faculty of Life Sciences, University of Manchester, Oxford Road, Manchester, M13 9PT, UK.
| | - Simon C Lovell
- Computational and Evolutionary Biology, Faculty of Life Sciences, University of Manchester, Oxford Road, Manchester, M13 9PT, UK.
| |
Collapse
|
23
|
Woods KN, Pfeffer J. Using THz Spectroscopy, Evolutionary Network Analysis Methods, and MD Simulation to Map the Evolution of Allosteric Communication Pathways in c-Type Lysozymes. Mol Biol Evol 2016; 33:40-61. [PMID: 26337549 PMCID: PMC4693973 DOI: 10.1093/molbev/msv178] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
It is now widely accepted that protein function is intimately tied with the navigation of energy landscapes. In this framework, a protein sequence is not described by a distinct structure but rather by an ensemble of conformations. And it is through this ensemble that evolution is able to modify a protein's function by altering its landscape. Hence, the evolution of protein functions involves selective pressures that adjust the sampling of the conformational states. In this work, we focus on elucidating the evolutionary pathway that shaped the function of individual proteins that make-up the mammalian c-type lysozyme subfamily. Using both experimental and computational methods, we map out specific intermolecular interactions that direct the sampling of conformational states and accordingly, also underlie shifts in the landscape that are directly connected with the formation of novel protein functions. By contrasting three representative proteins in the family we identify molecular mechanisms that are associated with the selectivity of enhanced antimicrobial properties and consequently, divergent protein function. Namely, we link the extent of localized fluctuations involving the loop separating helices A and B with shifts in the equilibrium of the ensemble of conformational states that mediate interdomain coupling and concurrently moderate substrate binding affinity. This work reveals unique insights into the molecular level mechanisms that promote the progression of interactions that connect the immune response to infection with the nutritional properties of lactation, while also providing a deeper understanding about how evolving energy landscapes may define present-day protein function.
Collapse
|
24
|
Scienski K, Fay JC, Conant GC. Patterns of Gene Conversion in Duplicated Yeast Histones Suggest Strong Selection on a Coadapted Macromolecular Complex. Genome Biol Evol 2015; 7:3249-58. [PMID: 26560339 PMCID: PMC4700949 DOI: 10.1093/gbe/evv216] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open
Abstract
We find evidence for interlocus gene conversion in five duplicated histone genes from six yeast species. The sequences of these duplicated genes, surviving from the ancient genome duplication, show phylogenetic patterns inconsistent with the well-resolved orthology relationships inferred from a likelihood model of gene loss after the genome duplication. Instead, these paralogous genes are more closely related to each other than any is to its nearest ortholog. In addition to simulations supporting gene conversion, we also present evidence for elevated rates of radical amino acid substitutions along the branches implicated in the conversion events. As these patterns are similar to those seen in ribosomal proteins that have undergone gene conversion, we speculate that in cases where duplicated genes code for proteins that are a part of tightly interacting complexes, selection may favor the fixation of gene conversion events in order to maintain high protein identities between duplicated copies.
Collapse
Affiliation(s)
- Kathy Scienski
- Division of Animal Sciences, University of Missouri, Columbia Present address: Genetics Graduate Program, Texas A&M University, College Station, TX
| | - Justin C Fay
- Department of Genetics, Washington University Center for Genome Sciences and Systems Biology, Washington University
| | - Gavin C Conant
- Division of Animal Sciences, University of Missouri, Columbia Informatics Institute, University of Missouri, Columbia
| |
Collapse
|
25
|
Delewski W, Paterkiewicz B, Manicki M, Schilke B, Tomiczek B, Ciesielski SJ, Nierzwicki L, Czub J, Dutkiewicz R, Craig EA, Marszalek J. Iron-Sulfur Cluster Biogenesis Chaperones: Evidence for Emergence of Mutational Robustness of a Highly Specific Protein-Protein Interaction. Mol Biol Evol 2015; 33:643-56. [PMID: 26545917 DOI: 10.1093/molbev/msv254] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
Biogenesis of iron-sulfur clusters (FeS) is a highly conserved process involving Hsp70 and J-protein chaperones. However, Hsp70 specialization differs among species. In most eukaryotes, including Schizosaccharomyces pombe, FeS biogenesis involves interaction between the J-protein Jac1 and the multifunctional Hsp70 Ssc1. But, in Saccharomyces cerevisiae and closely related species, Jac1 interacts with the specialized Hsp70 Ssq1, which emerged through duplication of SSC1. As little is known about how gene duplicates affect the robustness of their protein interaction partners, we analyzed the functional and evolutionary consequences of Ssq1 specialization on the ubiquitous J-protein cochaperone Jac1, by comparing S. cerevisiae and S. pombe. Although deletion of JAC1 is lethal in both species, alanine substitutions within the conserved His-Pro-Asp (HPD) motif, which is critical for Jac1:Hsp70 interaction, have species-specific effects. They are lethal in S. pombe, but not in S. cerevisiae. These in vivo differences correlated with in vitro biochemical measurements. Charged residues present in the J-domain of S. cerevisiae Jac1, but absent in S. pombe Jac1, are important for tolerance of S. cerevisiae Jac1 to HPD alterations. Moreover, Jac1 orthologs from species that encode Ssq1 have a higher sequence divergence. The simplest interpretation of our results is that Ssq1's coevolution with Jac1 resulted in expansion of their binding interface, thus increasing the efficiency of their interaction. Such an expansion could in turn compensate for negative effects of HPD substitutions. Thus, our results support the idea that the robustness of Jac1 emerged as consequence of its highly efficient and specific interaction with Ssq1.
Collapse
Affiliation(s)
- Wojciech Delewski
- Laboratory of Evolutionary Biochemistry, Intercollegiate Faculty of Biotechnology, University of Gdansk and Medical University of Gdansk, Gdansk, Poland
| | - Bogumiła Paterkiewicz
- Laboratory of Evolutionary Biochemistry, Intercollegiate Faculty of Biotechnology, University of Gdansk and Medical University of Gdansk, Gdansk, Poland
| | - Mateusz Manicki
- Laboratory of Evolutionary Biochemistry, Intercollegiate Faculty of Biotechnology, University of Gdansk and Medical University of Gdansk, Gdansk, Poland
| | - Brenda Schilke
- Department of Biochemistry, University of Wisconsin-Madison
| | - Bartłomiej Tomiczek
- Laboratory of Evolutionary Biochemistry, Intercollegiate Faculty of Biotechnology, University of Gdansk and Medical University of Gdansk, Gdansk, Poland
| | | | - Lukasz Nierzwicki
- Department of Physical Chemistry, Gdansk University of Technology, Gdansk, Poland
| | - Jacek Czub
- Department of Physical Chemistry, Gdansk University of Technology, Gdansk, Poland
| | - Rafal Dutkiewicz
- Laboratory of Evolutionary Biochemistry, Intercollegiate Faculty of Biotechnology, University of Gdansk and Medical University of Gdansk, Gdansk, Poland
| | | | - Jaroslaw Marszalek
- Laboratory of Evolutionary Biochemistry, Intercollegiate Faculty of Biotechnology, University of Gdansk and Medical University of Gdansk, Gdansk, Poland Department of Biochemistry, University of Wisconsin-Madison
| |
Collapse
|
26
|
Cai S, Liu Z, Lee HC. Mean field theory for biology inspired duplication-divergence network model. CHAOS (WOODBURY, N.Y.) 2015; 25:083106. [PMID: 26328557 DOI: 10.1063/1.4928212] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
The duplication-divergence network model is generally thought to incorporate key ingredients underlying the growth and evolution of protein-protein interaction networks. Properties of the model have been elucidated through numerous simulation studies. However, a comprehensive theoretical study of the model is lacking. Here, we derived analytic expressions for quantities describing key characteristics of the network-the average degree, the degree distribution, the clustering coefficient, and the neighbor connectivity-in the mean-field, large-N limit of an extended version of the model, duplication-divergence complemented with heterodimerization and addition. We carried out extensive simulations and verified excellent agreement between simulation and theory except for one partial case. All four quantities obeyed power-laws even at moderate network size ( N∼10(4)), except the degree distribution, which had an additional exponential factor observed to obey power-law. It is shown that our network model can lead to the emergence of scale-free property and hierarchical modularity simultaneously, reproducing the important topological properties of real protein-protein interaction networks.
Collapse
Affiliation(s)
- Shuiming Cai
- Faculty of Science, Jiangsu University, Zhenjiang 212013, China
| | - Zengrong Liu
- Institute of Systems Biology, Shanghai University, Shanghai 200444, China
| | - H C Lee
- Institute of Systems Biology and Bioinformatics, National Central University, Zhongli, 32001 Taiwan
| |
Collapse
|
27
|
Sudha G, Naveenkumar N, Srinivasan N. Evolutionary and structural analyses of heterodimeric proteins composed of subunits with same fold. Proteins 2015; 83:1766-86. [PMID: 26148218 DOI: 10.1002/prot.24849] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2015] [Revised: 05/30/2015] [Accepted: 06/21/2015] [Indexed: 11/10/2022]
Abstract
Heterodimeric proteins with homologous subunits of same fold are involved in various biological processes. The objective of this study is to understand the evolution of structural and functional features of such heterodimers. Using a non-redundant dataset of 70 such heterodimers of known 3D structure and an independent dataset of 173 heterodimers from yeast, we note that the mean sequence identity between interacting homologous subunits is only 23-24% suggesting that, generally, highly diverged paralogues assemble to form such a heterodimer. We also note that the functional roles of interacting subunits/domains are generally quite different. This suggests that, though the interacting subunits/domains are homologous, the high evolutionary divergence characterize their high functional divergence which contributes to a gross function for the heterodimer considered as a whole. The inverse relationship between sequence identity and RMSD of interacting homologues in heterodimers is not followed. We also addressed the question of formation of homodimers of the subunits of heterodimers by generating models of fictitious homodimers on the basis of the 3D structures of the heterodimers. Interaction energies associated with these homodimers suggests that, in overwhelming majority of the cases, such homodimers are unlikely to be stable. Majority of the homologues of heterodimers of known structures form heterodimers (51.8%) and a small proportion (14.6%) form homodimers. Comparison of 3D structures of heterodimers with homologous homodimers suggests that interfacial nature of residues is not well conserved. In over 90% of the cases we note that the interacting subunits of heterodimers are co-localized in the cell.
Collapse
Affiliation(s)
- Govindarajan Sudha
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, Karnataka, 560012, India
| | - Nagarajan Naveenkumar
- National Centre for Biological Sciences, Tata Institute of Fundamental Research, Bangalore, Karnataka, 560065, India.,Bharathidasan University, Tiruchirappalli, Tamil Nadu, 620024, India
| | | |
Collapse
|
28
|
Massey SE. Genetic code evolution reveals the neutral emergence of mutational robustness, and information as an evolutionary constraint. Life (Basel) 2015; 5:1301-32. [PMID: 25919033 PMCID: PMC4500140 DOI: 10.3390/life5021301] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2015] [Revised: 04/02/2015] [Accepted: 04/03/2015] [Indexed: 01/09/2023] Open
Abstract
The standard genetic code (SGC) is central to molecular biology and its origin and evolution is a fundamental problem in evolutionary biology, the elucidation of which promises to reveal much about the origins of life. In addition, we propose that study of its origin can also reveal some fundamental and generalizable insights into mechanisms of molecular evolution, utilizing concepts from complexity theory. The first is that beneficial traits may arise by non-adaptive processes, via a process of "neutral emergence". The structure of the SGC is optimized for the property of error minimization, which reduces the deleterious impact of point mutations. Via simulation, it can be shown that genetic codes with error minimization superior to the SGC can emerge in a neutral fashion simply by a process of genetic code expansion via tRNA and aminoacyl-tRNA synthetase duplication, whereby similar amino acids are added to codons related to that of the parent amino acid. This process of neutral emergence has implications beyond that of the genetic code, as it suggests that not all beneficial traits have arisen by the direct action of natural selection; we term these "pseudaptations", and discuss a range of potential examples. Secondly, consideration of genetic code deviations (codon reassignments) reveals that these are mostly associated with a reduction in proteome size. This code malleability implies the existence of a proteomic constraint on the genetic code, proportional to the size of the proteome (P), and that its reduction in size leads to an "unfreezing" of the codon - amino acid mapping that defines the genetic code, consistent with Crick's Frozen Accident theory. The concept of a proteomic constraint may be extended to propose a general informational constraint on genetic fidelity, which may be used to explain variously, differences in mutation rates in genomes with differing proteome sizes, differences in DNA repair capacity and genome GC content between organisms, a selective pressure in the evolution of sexual reproduction, and differences in translational fidelity. Lastly, the utility of the concept of an informational constraint to other diverse fields of research is explored.
Collapse
Affiliation(s)
- Steven E Massey
- Biology Department, PO Box 23360, University of Puerto Rico-Rio Piedras, San Juan, PR 00931, USA.
| |
Collapse
|
29
|
Networks and Hierarchies: Approaching Complexity in Evolutionary Theory. INTERDISCIPLINARY EVOLUTION RESEARCH 2015. [DOI: 10.1007/978-3-319-15045-1_6] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
|
30
|
Stolzer M, Wasserman L, Durand D. Robustness of birth-death and gain models for inferring evolutionary events. BMC Genomics 2014; 15 Suppl 6:S9. [PMID: 25572914 PMCID: PMC4239551 DOI: 10.1186/1471-2164-15-s6-s9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
Background Phylogenetic birth-death models are opening a new window on the processes of genome evolution in studies of the evolution of gene and protein families, protein-protein interaction networks, microRNAs, and copy number variation. Given a species tree and a set of genomic characters in present-day species, the birth-death approach estimates the most likely rates required to explain the observed data and returns the expected ancestral character states and the history of character state changes. Achieving a balance between model complexity and generalizability is a fundamental challenge in the application of birth-death models. While more parameters promise greater accuracy and more biologically realistic models, increasing model complexity can lead to overfitting and a heavy computational cost. Results Here we present a systematic, empirical investigation of these tradeoffs, using protein domain families in six metazoan genomes as a case study. We compared models of increasing complexity, implemented in the Count program, with respect to model fit, robustness, and stability. In addition, we used a bootstrapping procedure to assess estimator variability. The results show that the most complex model, which allows for both branch-specific and family-specific rate variation, achieves the best fit, without overfitting. Variance remains low with increasing complexity, except for family-specific loss rates. This variance is reduced when the number of discrete rate categories is increased. Model choice is of greatest concern when different models lead to fundamentally different outcomes. To investigate the extent to which model choice influences biological interpretation, ancestral states and expected events were inferred under each model. Disturbingly, the different models not only resulted in quantitatively different histories, but predicted qualitatively different patterns of domain family turnover and genome expansion and reduction. Conclusions The work presented here evaluates model choice for genomic birth-death models in a systematic way and presents the first use of bootstrapping to assess estimator variance in birth-death models. We find that a model incorporating both lineage and family rate variation yields more accurate estimators without sacrificing generality. Our results indicate that model choice can lead to fundamentally different evolutionary conclusions, emphasizing the importance of more biologically realistic and complex models.
Collapse
|
31
|
Hu J, Reinert K. LocalAli: an evolutionary-based local alignment approach to identify functionally conserved modules in multiple networks. ACTA ACUST UNITED AC 2014; 31:363-72. [PMID: 25282642 DOI: 10.1093/bioinformatics/btu652] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
MOTIVATION Sequences and protein interaction data are of significance to understand the underlying molecular mechanism of organisms. Local network alignment is one of key systematic ways for predicting protein functions, identifying functional modules and understanding the phylogeny from these data. Most of currently existing tools, however, encounter their limitations, which are mainly concerned with scoring scheme, speed and scalability. Therefore, there are growing demands for sophisticated network evolution models and efficient local alignment algorithms. RESULTS We developed a fast and scalable local network alignment tool called LocalAli for the identification of functionally conserved modules in multiple networks. In this algorithm, we firstly proposed a new framework to reconstruct the evolution history of conserved modules based on a maximum-parsimony evolutionary model. By relying on this model, LocalAli facilitates interpretation of resulting local alignments in terms of conserved modules, which have been evolved from a common ancestral module through a series of evolutionary events. A meta-heuristic method simulated annealing was used to search for the optimal or near-optimal inner nodes (i.e. ancestral modules) of the evolutionary tree. To evaluate the performance and the statistical significance, LocalAli were tested on 26 real datasets and 1040 randomly generated datasets. The results suggest that LocalAli outperforms all existing algorithms in terms of coverage, consistency and scalability, meanwhile retains a high precision in the identification of functionally coherent subnetworks. AVAILABILITY The source code and test datasets are freely available for download under the GNU GPL v3 license at https://code.google.com/p/localali/. CONTACT jialu.hu@fu-berlin.de or knut.reinert@fu-berlin.de. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Jialu Hu
- Department of Mathematics and Computer Science, Freie Universität Berlin, Takustrasse 9, 14195 Berlin, Germany
| | - Knut Reinert
- Department of Mathematics and Computer Science, Freie Universität Berlin, Takustrasse 9, 14195 Berlin, Germany
| |
Collapse
|
32
|
Zhang N, Huang T, Cai YD. Discriminating between deleterious and neutral non-frameshifting indels based on protein interaction networks and hybrid properties. Mol Genet Genomics 2014; 290:343-52. [PMID: 25248637 DOI: 10.1007/s00438-014-0922-5] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2014] [Accepted: 09/12/2014] [Indexed: 02/06/2023]
Abstract
More than ten thousand coding variants are contained in each human genome; however, our knowledge of the way genetic variants underlie phenotypic differences is far from complete. Small insertions and deletions (indels) are one of the most common types of human genetic variants, and indels play a significant role in human inherited disease. To date, we still lack a comprehensive understanding of how indels cause diseases. Therefore, identification and analysis of such deleterious variants is a key challenge and has been of great interest in the current research in genome biology. Increasing numbers of computational methods have been developed for discriminating between deleterious indels and neutral indels. However, most of the existing methods are based on traditional sequential or structural features, which cannot completely explain the association between indels and the resulting induced inherited disease. In this study, we establish a novel method to predict deleterious non-frameshifting indels based on features extracted from both protein interaction networks and traditional hybrid properties. Each indel was coded by 1,246 features. Using the maximum relevance minimum redundancy method and the incremental feature selection method, we obtained an optimal feature set containing 42 features, of which 21 features were derived from protein interaction networks. Based on the optimal feature set, an 88 % accuracy and a 0.76 MCC value were achieved by a Random Forest as evaluated by the Jackknife cross-validation test. This method outperformed existing methods of predicting deleterious indels, and can be applied in practice for deleterious non-frameshifting indel predictions in genome research. The analysis of the optimal features selected in the model revealed that network interactions play more important roles and could be informative for better illustrating an indel's function and disease associations than traditional sequential or structural features. These results could shed some light on the genetic basis of human genetic variations and human inherited diseases.
Collapse
Affiliation(s)
- Ning Zhang
- Department of Biomedical Engineering, Tianjin Key Lab of BME Measurement, Tianjin University, Tianjin, 300072, People's Republic of China
| | | | | |
Collapse
|
33
|
Sun JT, Ao B, Zhang S, Bing Z, Yang L. Evolving protein-protein interaction networks: A model based on duplication and mutation at different rates. J Theor Biol 2014; 350:32-6. [PMID: 24491255 DOI: 10.1016/j.jtbi.2014.01.034] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2013] [Revised: 01/23/2014] [Accepted: 01/24/2014] [Indexed: 11/24/2022]
Abstract
We present a model describing the evolution of protein-protein interaction networks. The model is based on gene duplication and mutation. Considering rates of gene duplication and mutation, the average node degree and cluster coefficient are calculated for different parameters. The predicted degree distribution and cluster coefficient are in good agreement with the observed statistical properties obtained from the analysis of the yeast proteome.
Collapse
Affiliation(s)
- Jin-Tu Sun
- Institute of Modern Physics, Chinese Academy of Sciences, Lanzhou 730000, China
| | - Bin Ao
- Institute of Modern Physics, Chinese Academy of Sciences, Lanzhou 730000, China
| | - Sheng Zhang
- Institute of Modern Physics, Chinese Academy of Sciences, Lanzhou 730000, China
| | - Zhitong Bing
- Institute of Modern Physics, Chinese Academy of Sciences, Lanzhou 730000, China
| | - Lei Yang
- Institute of Modern Physics, Chinese Academy of Sciences, Lanzhou 730000, China.
| |
Collapse
|
34
|
Choi YS, Yoon S, Kim KL, Yoo J, Song P, Kim M, Shin YE, Yang WJ, Noh JE, Cho HS, Kim S, Chung J, Ryu SH. Computational design of binding proteins to EGFR domain II. PLoS One 2014; 9:e92513. [PMID: 24710267 PMCID: PMC3977815 DOI: 10.1371/journal.pone.0092513] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2013] [Accepted: 02/24/2014] [Indexed: 12/03/2022] Open
Abstract
We developed a process to produce novel interactions between two previously unrelated proteins. This process selects protein scaffolds and designs protein interfaces that bind to a surface patch of interest on a target protein. Scaffolds with shapes complementary to the target surface patch were screened using an exhaustive computational search of the human proteome and optimized by directed evolution using phage display. This method was applied to successfully design scaffolds that bind to epidermal growth factor receptor (EGFR) domain II, the interface of EGFR dimerization, with high reactivity toward the target surface patch of EGFR domain II. One potential application of these tailor-made protein interactions is the development of therapeutic agents against specific protein targets.
Collapse
Affiliation(s)
- Yoon Sup Choi
- Cancer Research Institute, Seoul National University School of Medicine, Seoul, Republic of Korea
- School of Interdisciplinary Bioscience and Bioengineering (I-Bio), Pohang University of Science and Technology (POSTECH), Pohang, Republic of Korea
- KT Institute of Convergence Technology, Seocho-gu, Seoul, Korea
| | - Soomin Yoon
- Cancer Research Institute, Seoul National University School of Medicine, Seoul, Republic of Korea
- Department of Biochemistry and Molecular Biology, Seoul National University School of Medicine, Seoul, Republic of Korea
| | - Kyung-Lock Kim
- Division of Integrative Bioscience and Biotechnology, Pohang University of Science and Technology (POSTECH), Pohang, Republic of Korea
| | - Jiho Yoo
- Department of Systems Biology, College of Life Science and Biotechnology, Yonsei University, Seoul, Republic of Korea
| | - Parkyong Song
- Division of Integrative Bioscience and Biotechnology, Pohang University of Science and Technology (POSTECH), Pohang, Republic of Korea
| | - Minsoo Kim
- Cancer Research Institute, Seoul National University School of Medicine, Seoul, Republic of Korea
- Scripps Korea Antibody Institute, Chuncheon, Republic of Korea
| | - Young-Eun Shin
- Division of Integrative Bioscience and Biotechnology, Pohang University of Science and Technology (POSTECH), Pohang, Republic of Korea
| | - Won Jun Yang
- Cancer Research Institute, Seoul National University School of Medicine, Seoul, Republic of Korea
- Department of Biochemistry and Molecular Biology, Seoul National University School of Medicine, Seoul, Republic of Korea
| | - Jung-eun Noh
- Division of Integrative Bioscience and Biotechnology, Pohang University of Science and Technology (POSTECH), Pohang, Republic of Korea
| | - Hyun-soo Cho
- Department of Systems Biology, College of Life Science and Biotechnology, Yonsei University, Seoul, Republic of Korea
| | - Sanguk Kim
- School of Interdisciplinary Bioscience and Bioengineering (I-Bio), Pohang University of Science and Technology (POSTECH), Pohang, Republic of Korea
- Division of IT Convergence Engineering, Pohang University of Science and Technology (POSTECH), Pohang, Republic of Korea
- * E-mail: (SK); (JC); (SHR)
| | - Junho Chung
- Cancer Research Institute, Seoul National University School of Medicine, Seoul, Republic of Korea
- Department of Biochemistry and Molecular Biology, Seoul National University School of Medicine, Seoul, Republic of Korea
- * E-mail: (SK); (JC); (SHR)
| | - Sung Ho Ryu
- School of Interdisciplinary Bioscience and Bioengineering (I-Bio), Pohang University of Science and Technology (POSTECH), Pohang, Republic of Korea
- Division of Integrative Bioscience and Biotechnology, Pohang University of Science and Technology (POSTECH), Pohang, Republic of Korea
- * E-mail: (SK); (JC); (SHR)
| |
Collapse
|
35
|
Protein-protein interaction detection: methods and analysis. INTERNATIONAL JOURNAL OF PROTEOMICS 2014; 2014:147648. [PMID: 24693427 PMCID: PMC3947875 DOI: 10.1155/2014/147648] [Citation(s) in RCA: 384] [Impact Index Per Article: 38.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 10/26/2013] [Revised: 12/05/2013] [Accepted: 12/20/2013] [Indexed: 12/24/2022]
Abstract
Protein-protein interaction plays key role in predicting the protein function of target protein and drug ability of molecules. The majority of genes and proteins realize resulting phenotype functions as a set of interactions. The in vitro and in vivo methods like affinity purification, Y2H (yeast 2 hybrid), TAP (tandem affinity purification), and so forth have their own limitations like cost, time, and so forth, and the resultant data sets are noisy and have more false positives to annotate the function of drug molecules. Thus, in silico methods which include sequence-based approaches, structure-based approaches, chromosome proximity, gene fusion, in silico 2 hybrid, phylogenetic tree, phylogenetic profile, and gene expression-based approaches were developed. Elucidation of protein interaction networks also contributes greatly to the analysis of signal transduction pathways. Recent developments have also led to the construction of networks having all the protein-protein interactions using computational methods for signaling pathways and protein complex identification in specific diseases.
Collapse
|
36
|
Jafari M, Sadeghi M, Mirzaie M, Marashi SA, Rezaei-Tavirani M. Evolutionarily conserved motifs and modules in mitochondrial protein–protein interaction networks. Mitochondrion 2013; 13:668-75. [DOI: 10.1016/j.mito.2013.09.006] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2013] [Revised: 08/18/2013] [Accepted: 09/23/2013] [Indexed: 10/26/2022]
|
37
|
Baker CR, Hanson-Smith V, Johnson AD. Following gene duplication, paralog interference constrains transcriptional circuit evolution. Science 2013; 342:104-8. [PMID: 24092741 PMCID: PMC3911913 DOI: 10.1126/science.1240810] [Citation(s) in RCA: 82] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
Most models of gene duplication assume that the ancestral functions of the preduplication gene are independent and can therefore be neatly partitioned between descendant paralogs. However, many gene products, such as transcriptional regulators, are components within cooperative assemblies; here, we show that a natural consequence of duplication and divergence of such proteins can be competitive interference between the paralogs. Our example is based on the duplication of the essential MADS-box transcriptional regulator Mcm1, which is found in all fungi and regulates a large set of genes. We show that a set of historical amino acid sequence substitutions minimized paralog interference in contemporary species and, in doing so, increased the molecular complexity of this gene regulatory network. We propose that paralog interference is a common constraint on gene duplicate evolution, and its resolution, which can generate additional regulatory complexity, is needed to stabilize duplicated genes in the genome.
Collapse
Affiliation(s)
- Christopher R. Baker
- Department of Immunology and Microbiology, University of California, San Francisco, CA 94143, USA
- Department of Biochemistry and Biophysics, University of California, San Francisco, CA 94158, USA
| | - Victor Hanson-Smith
- Department of Immunology and Microbiology, University of California, San Francisco, CA 94143, USA
- Department of Biochemistry and Biophysics, University of California, San Francisco, CA 94158, USA
| | - Alexander D. Johnson
- Department of Immunology and Microbiology, University of California, San Francisco, CA 94143, USA
- Department of Biochemistry and Biophysics, University of California, San Francisco, CA 94158, USA
| |
Collapse
|
38
|
Abstract
The modern synthesis of evolutionary theory and genetics has enabled us to discover underlying molecular mechanisms of organismal evolution. We know that in order to maximize an organism's fitness in a particular environment, individual interactions among components of protein and nucleic acid networks need to be optimized by natural selection, or sometimes through random processes, as the organism responds to changes and/or challenges in the environment. Despite the significant role of molecular networks in determining an organism's adaptation to its environment, we still do not know how such inter- and intra-molecular interactions within networks change over time and contribute to an organism's evolvability while maintaining overall network functions. One way to address this challenge is to identify connections between molecular networks and their host organisms, to manipulate these connections, and then attempt to understand how such perturbations influence molecular dynamics of the network and thus influence evolutionary paths and organismal fitness. In the present review, we discuss how integrating evolutionary history with experimental systems that combine tools drawn from molecular evolution, synthetic biology and biochemistry allow us to identify the underlying mechanisms of organismal evolution, particularly from the perspective of protein interaction networks.
Collapse
|
39
|
Jost J, Scherrer K. Information theory, gene expression, and combinatorial regulation: a quantitative analysis. Theory Biosci 2013; 133:1-21. [PMID: 23674094 DOI: 10.1007/s12064-013-0182-7] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2011] [Accepted: 04/19/2013] [Indexed: 02/04/2023]
Abstract
According to a functional definition of the term "gene", a protein-coding gene corresponds to a polypeptide and, hence, a coding sequence. It is therefore as such not yet present at the DNA level, but assembled from possibly heterogeneous pieces in the course of RNA processing. Assembly and regulation of genes require, thus, information about when and in which quantity specific polypeptides are to be produced. To assess this, we draw upon precise biochemical data. On the basis of our conceptual framework, we also develop formal models for the coordinated expression of specific sets of genes through the interaction of transcripts and mRNAs and with proteins via a precise putative regulatory code. Thus, the nucleotides in transcripts and mRNA are not only arranged into amino acid-coding triplets, but at the same time may participate in regulatory oligomotifs that provide binding sites for specific proteins. We can then quantify and compare product and regulatory information involved in gene expression and regulation.
Collapse
|
40
|
Kaltenegger E, Eich E, Ober D. Evolution of homospermidine synthase in the convolvulaceae: a story of gene duplication, gene loss, and periods of various selection pressures. THE PLANT CELL 2013; 25:1213-27. [PMID: 23572540 PMCID: PMC3663263 DOI: 10.1105/tpc.113.109744] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/18/2013] [Revised: 03/11/2013] [Accepted: 03/19/2013] [Indexed: 05/03/2023]
Abstract
Homospermidine synthase (HSS), the first pathway-specific enzyme of pyrrolizidine alkaloid biosynthesis, is known to have its origin in the duplication of a gene encoding deoxyhypusine synthase. To study the processes that followed this gene duplication event and gave rise to HSS, we identified sequences encoding HSS and deoxyhypusine synthase from various species of the Convolvulaceae. We show that HSS evolved only once in this lineage. This duplication event was followed by several losses of a functional gene copy attributable to gene loss or pseudogenization. Statistical analyses of sequence data suggest that, in those lineages in which the gene copy was successfully recruited as HSS, the gene duplication event was followed by phases of various selection pressures, including purifying selection, relaxed functional constraints, and possibly positive Darwinian selection. Site-specific mutagenesis experiments have confirmed that the substitution of sites predicted to be under positive Darwinian selection is sufficient to convert a deoxyhypusine synthase into a HSS. In addition, analyses of transcript levels have shown that HSS and deoxyhypusine synthase have also diverged with respect to their regulation. The impact of protein-protein interaction on the evolution of HSS is discussed with respect to current models of enzyme evolution.
Collapse
Affiliation(s)
- Elisabeth Kaltenegger
- Botanisches Institut und Botanischer Garten, Universität Kiel, D-24098 Kiel, Germany
| | - Eckart Eich
- Institut für Pharmazie II, Pharmazeutische Biologie, Freie Universität Berlin, D-14195 Berlin, Germany
| | - Dietrich Ober
- Botanisches Institut und Botanischer Garten, Universität Kiel, D-24098 Kiel, Germany
| |
Collapse
|
41
|
Jin Y, Turaev D, Weinmaier T, Rattei T, Makse HA. The evolutionary dynamics of protein-protein interaction networks inferred from the reconstruction of ancient networks. PLoS One 2013; 8:e58134. [PMID: 23526967 PMCID: PMC3603955 DOI: 10.1371/journal.pone.0058134] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2012] [Accepted: 01/30/2013] [Indexed: 11/18/2022] Open
Abstract
Cellular functions are based on the complex interplay of proteins, therefore the structure and dynamics of these protein-protein interaction (PPI) networks are the key to the functional understanding of cells. In the last years, large-scale PPI networks of several model organisms were investigated. A number of theoretical models have been developed to explain both the network formation and the current structure. Favored are models based on duplication and divergence of genes, as they most closely represent the biological foundation of network evolution. However, studies are often based on simulated instead of empirical data or they cover only single organisms. Methodological improvements now allow the analysis of PPI networks of multiple organisms simultaneously as well as the direct modeling of ancestral networks. This provides the opportunity to challenge existing assumptions on network evolution. We utilized present-day PPI networks from integrated datasets of seven model organisms and developed a theoretical and bioinformatic framework for studying the evolutionary dynamics of PPI networks. A novel filtering approach using percolation analysis was developed to remove low confidence interactions based on topological constraints. We then reconstructed the ancient PPI networks of different ancestors, for which the ancestral proteomes, as well as the ancestral interactions, were inferred. Ancestral proteins were reconstructed using orthologous groups on different evolutionary levels. A stochastic approach, using the duplication-divergence model, was developed for estimating the probabilities of ancient interactions from today's PPI networks. The growth rates for nodes, edges, sizes and modularities of the networks indicate multiplicative growth and are consistent with the results from independent static analysis. Our results support the duplication-divergence model of evolution and indicate fractality and multiplicative growth as general properties of the PPI network structure and dynamics.
Collapse
Affiliation(s)
- Yuliang Jin
- Levich Institute and Physics Department, City College of New York, New York, New York, United States of America
| | - Dmitrij Turaev
- Department of Computational Systems Biology, University of Vienna, Vienna, Austria
| | - Thomas Weinmaier
- Department of Computational Systems Biology, University of Vienna, Vienna, Austria
| | - Thomas Rattei
- Department of Computational Systems Biology, University of Vienna, Vienna, Austria
| | - Hernán A. Makse
- Levich Institute and Physics Department, City College of New York, New York, New York, United States of America
| |
Collapse
|
42
|
Xin X, Gfeller D, Cheng J, Tonikian R, Sun L, Guo A, Lopez L, Pavlenco A, Akintobi A, Zhang Y, Rual JF, Currell B, Seshagiri S, Hao T, Yang X, Shen YA, Salehi-Ashtiani K, Li J, Cheng AT, Bouamalay D, Lugari A, Hill DE, Grimes ML, Drubin DG, Grant BD, Vidal M, Boone C, Sidhu SS, Bader GD. SH3 interactome conserves general function over specific form. Mol Syst Biol 2013; 9:652. [PMID: 23549480 PMCID: PMC3658277 DOI: 10.1038/msb.2013.9] [Citation(s) in RCA: 56] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2012] [Accepted: 02/20/2013] [Indexed: 12/20/2022] Open
Abstract
Src homology 3 (SH3) domains bind peptides to mediate protein-protein interactions that assemble and regulate dynamic biological processes. We surveyed the repertoire of SH3 binding specificity using peptide phage display in a metazoan, the worm Caenorhabditis elegans, and discovered that it structurally mirrors that of the budding yeast Saccharomyces cerevisiae. We then mapped the worm SH3 interactome using stringent yeast two-hybrid and compared it with the equivalent map for yeast. We found that the worm SH3 interactome resembles the analogous yeast network because it is significantly enriched for proteins with roles in endocytosis. Nevertheless, orthologous SH3 domain-mediated interactions are highly rewired. Our results suggest a model of network evolution where general function of the SH3 domain network is conserved over its specific form.
Collapse
Affiliation(s)
- Xiaofeng Xin
- The Donnelly Centre, University of Toronto, Toronto, Ontario, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
| | - David Gfeller
- The Donnelly Centre, University of Toronto, Toronto, Ontario, Canada
| | - Jackie Cheng
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, CA, USA
| | - Raffi Tonikian
- The Donnelly Centre, University of Toronto, Toronto, Ontario, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
| | - Lin Sun
- Department of Molecular Biology and Biochemistry, Rutgers University, Piscataway, NJ, USA
| | - Ailan Guo
- Cell Signaling Technology, Danvers, MA, USA
| | - Lianet Lopez
- The Donnelly Centre, University of Toronto, Toronto, Ontario, Canada
| | - Alevtina Pavlenco
- The Donnelly Centre, University of Toronto, Toronto, Ontario, Canada
| | - Adenrele Akintobi
- Department of Molecular Biology and Biochemistry, Rutgers University, Piscataway, NJ, USA
| | - Yingnan Zhang
- Department of Early Discovery Biochemistry, Genentech, South San Francisco, CA, USA
| | - Jean-François Rual
- Center for Cancer Systems Biology (CCSB) and Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA
- Department of Genetics, Harvard Medical School, Boston, MA, USA
| | - Bridget Currell
- Department of Molecular Biology, Genentech, South San Francisco, CA, USA
| | | | - Tong Hao
- Center for Cancer Systems Biology (CCSB) and Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA
- Department of Genetics, Harvard Medical School, Boston, MA, USA
| | - Xinping Yang
- Center for Cancer Systems Biology (CCSB) and Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA
- Department of Genetics, Harvard Medical School, Boston, MA, USA
| | - Yun A Shen
- Center for Cancer Systems Biology (CCSB) and Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA
- Department of Genetics, Harvard Medical School, Boston, MA, USA
| | - Kourosh Salehi-Ashtiani
- Center for Cancer Systems Biology (CCSB) and Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA
- Department of Genetics, Harvard Medical School, Boston, MA, USA
| | - Jingjing Li
- The Donnelly Centre, University of Toronto, Toronto, Ontario, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
| | - Aaron T Cheng
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, CA, USA
| | - Dryden Bouamalay
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, CA, USA
| | - Adrien Lugari
- IMR Laboratory, UPR 3243, Institut de Microbiologie de la Méditérannée, CNRS and Aix-Marseille Université, Marseille Cedex 20, France
| | - David E Hill
- Center for Cancer Systems Biology (CCSB) and Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA
- Department of Genetics, Harvard Medical School, Boston, MA, USA
| | - Mark L Grimes
- Division of Biological Sciences, Center for Structural and Functional Neuroscience, The University of Montana, Missoula, MT, USA
| | - David G Drubin
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, CA, USA
| | - Barth D Grant
- Department of Molecular Biology and Biochemistry, Rutgers University, Piscataway, NJ, USA
| | - Marc Vidal
- Center for Cancer Systems Biology (CCSB) and Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA
- Department of Genetics, Harvard Medical School, Boston, MA, USA
| | - Charles Boone
- The Donnelly Centre, University of Toronto, Toronto, Ontario, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
| | - Sachdev S Sidhu
- The Donnelly Centre, University of Toronto, Toronto, Ontario, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
| | - Gary D Bader
- The Donnelly Centre, University of Toronto, Toronto, Ontario, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
| |
Collapse
|
43
|
|
44
|
Halu A, Bianconi G. Monochromaticity in neutral evolutionary network models. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2012; 86:066101. [PMID: 23367998 DOI: 10.1103/physreve.86.066101] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/18/2012] [Revised: 11/07/2012] [Indexed: 06/01/2023]
Abstract
Recent studies on epistatic networks of model organisms have unveiled a certain type of modular property called monochromaticity in which the networks are clustered into functional modules that interact with each other through the same type of epistasis. Here, we propose and study three epistatic network models that are inspired by the duplication-divergence mechanism to gain insight into the evolutionary basis of monochromaticity and to test if it can be explained as the outcome of a neutral evolutionary hypothesis. We show that the epistatic networks formed by these stochastic evolutionary models have monochromaticity conflict distributions that are centered close to zero and are statistically significantly different from their randomized counterparts. In particular, the last model we propose yields a strictly monochromatic solution. Our results agree with the monochromaticity findings in real organisms and point toward the possible role of a neutral mechanism in the evolution of this phenomenon.
Collapse
Affiliation(s)
- Arda Halu
- Department of Physics, Northeastern University, Boston, Massachusetts 02115, USA
| | | |
Collapse
|
45
|
Kolář M, Meier J, Mustonen V, Lässig M, Berg J. GraphAlignment: Bayesian pairwise alignment of biological networks. BMC SYSTEMS BIOLOGY 2012; 6:144. [PMID: 23171476 PMCID: PMC3573967 DOI: 10.1186/1752-0509-6-144] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/10/2012] [Accepted: 11/07/2012] [Indexed: 11/10/2022]
Abstract
BACKGROUND With increased experimental availability and accuracy of bio-molecular networks, tools for their comparative and evolutionary analysis are needed. A key component for such studies is the alignment of networks. RESULTS We introduce the Bioconductor package GraphAlignment for pairwise alignment of bio-molecular networks. The alignment incorporates information both from network vertices and network edges and is based on an explicit evolutionary model, allowing inference of all scoring parameters directly from empirical data. We compare the performance of our algorithm to an alternative algorithm, Græmlin 2.0.On simulated data, GraphAlignment outperforms Græmlin 2.0 in several benchmarks except for computational complexity. When there is little or no noise in the data, GraphAlignment is slower than Græmlin 2.0. It is faster than Græmlin 2.0 when processing noisy data containing spurious vertex associations. Its typical case complexity grows approximately as O(N2.6).On empirical bacterial protein-protein interaction networks (PIN) and gene co-expression networks, GraphAlignment outperforms Græmlin 2.0 with respect to coverage and specificity, albeit by a small margin. On large eukaryotic PIN, Græmlin 2.0 outperforms GraphAlignment. CONCLUSIONS The GraphAlignment algorithm is robust to spurious vertex associations, correctly resolves paralogs, and shows very good performance in identification of homologous vertices defined by high vertex and/or interaction similarity. The simplicity and generality of GraphAlignment edge scoring makes the algorithm an appropriate choice for global alignment of networks.
Collapse
Affiliation(s)
- Michal Kolář
- Institut für Theoretische Physik, Universität zu Köln, Zülpicher Straße 77, D-50937 Köln, Germany
| | | | | | | | | |
Collapse
|
46
|
Mayo M, Abdelzaher AF, Perkins EJ, Ghosh P. Motif Participation by Genes in E. coli Transcriptional Networks. Front Physiol 2012; 3:357. [PMID: 23055976 PMCID: PMC3457071 DOI: 10.3389/fphys.2012.00357] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2012] [Accepted: 08/20/2012] [Indexed: 11/13/2022] Open
Abstract
Motifs are patterns of recurring connections among the genes of genetic networks that occur more frequently than would be expected from randomized networks with the same degree sequence. Although the abundance of certain three-node motifs, such as the feed-forward loop, is positively correlated with a networks’ ability to tolerate moderate disruptions to gene expression, little is known regarding the connectivity of individual genes participating in multiple motifs. Using the transcriptional network of the bacterium Escherichia coli, we investigate this feature by reconstructing the distribution of genes participating in feed-forward loop motifs from its largest connected network component. We contrast these motif participation distributions with those obtained from model networks built using the preferential attachment mechanism employed by many biological and man-made networks. We report that, although some of these model networks support a motif participation distribution that appears qualitatively similar to that obtained from the bacterium E. coli, the probability for a node to support a feed-forward loop motif may instead be strongly influenced by only a few master transcriptional regulators within the network. From these analyses we conclude that such master regulators may be a crucial ingredient to describe coupling among feed-forward loop motifs in transcriptional regulatory networks.
Collapse
Affiliation(s)
- Michael Mayo
- Environmental Laboratory, US Army Engineer Research and Development Center Vicksburg, MS, USA
| | | | | | | |
Collapse
|
47
|
Abstract
Background The development of high-throughput Microarray technologies has provided various opportunities to systematically characterize diverse types of computational biological networks. Co-expression network have become popular in the analysis of microarray data, such as for detecting functional gene modules. Results This paper presents a method to build a co-expression network (CEN) and to detect network modules from the built network. We use an effective gene expression similarity measure called NMRS (Normalized mean residue similarity) to construct the CEN. We have tested our method on five publicly available benchmark microarray datasets. The network modules extracted by our algorithm have been biologically validated in terms of Q value and p value. Conclusions Our results show that the technique is capable of detecting biologically significant network modules from the co-expression network. Biologist can use this technique to find groups of genes with similar functionality based on their expression information.
Collapse
|
48
|
A network synthesis model for generating protein interaction network families. PLoS One 2012; 7:e41474. [PMID: 22912671 PMCID: PMC3418285 DOI: 10.1371/journal.pone.0041474] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2011] [Accepted: 06/27/2012] [Indexed: 11/19/2022] Open
Abstract
In this work, we introduce a novel network synthesis model that can generate families of evolutionarily related synthetic protein-protein interaction (PPI) networks. Given an ancestral network, the proposed model generates the network family according to a hypothetical phylogenetic tree, where the descendant networks are obtained through duplication and divergence of their ancestors, followed by network growth using network evolution models. We demonstrate that this network synthesis model can effectively create synthetic networks whose internal and cross-network properties closely resemble those of real PPI networks. The proposed model can serve as an effective framework for generating comprehensive benchmark datasets that can be used for reliable performance assessment of comparative network analysis algorithms. Using this model, we constructed a large-scale network alignment benchmark, called NAPAbench, and evaluated the performance of several representative network alignment algorithms. Our analysis clearly shows the relative performance of the leading network algorithms, with their respective advantages and disadvantages. The algorithm and source code of the network synthesis model and the network alignment benchmark NAPAbench are publicly available at http://www.ece.tamu.edu/bjyoon/NAPAbench/.
Collapse
|
49
|
Peterson GJ, Pressé S, Peterson KS, Dill KA. Simulated evolution of protein-protein interaction networks with realistic topology. PLoS One 2012; 7:e39052. [PMID: 22768057 PMCID: PMC3387198 DOI: 10.1371/journal.pone.0039052] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2012] [Accepted: 05/15/2012] [Indexed: 02/02/2023] Open
Abstract
We model the evolution of eukaryotic protein-protein interaction (PPI) networks. In our model, PPI networks evolve by two known biological mechanisms: (1) Gene duplication, which is followed by rapid diversification of duplicate interactions. (2) Neofunctionalization, in which a mutation leads to a new interaction with some other protein. Since many interactions are due to simple surface compatibility, we hypothesize there is an increased likelihood of interacting with other proteins in the target protein’s neighborhood. We find good agreement of the model on 10 different network properties compared to high-confidence experimental PPI networks in yeast, fruit flies, and humans. Key findings are: (1) PPI networks evolve modular structures, with no need to invoke particular selection pressures. (2) Proteins in cells have on average about 6 degrees of separation, similar to some social networks, such as human-communication and actor networks. (3) Unlike social networks, which have a shrinking diameter (degree of maximum separation) over time, PPI networks are predicted to grow in diameter. (4) The model indicates that evolutionarily old proteins should have higher connectivities and be more centrally embedded in their networks. This suggests a way in which present-day proteomics data could provide insights into biological evolution.
Collapse
Affiliation(s)
- G Jack Peterson
- Biophysics Graduate Group, University of California San Francisco, San Francisco, California, United States of America.
| | | | | | | |
Collapse
|
50
|
Fokkens L, Hogeweg P, Snel B. Gene duplications contribute to the overrepresentation of interactions between proteins of a similar age. BMC Evol Biol 2012; 12:99. [PMID: 22732003 PMCID: PMC3457867 DOI: 10.1186/1471-2148-12-99] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2012] [Accepted: 06/07/2012] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The study of biological networks and how they have evolved is fundamental to our understanding of the cell. By investigating how proteins of different ages are connected in the protein interaction network, one can infer how that network has expanded in evolution, without the need for explicit reconstruction of ancestral networks. Studies that implement this approach show that proteins are often connected to proteins of a similar age, suggesting a simultaneous emergence of interacting proteins. There are several theories explaining this phenomenon, but despite the importance of gene duplication in genome evolution, none consider protein family dynamics as a contributing factor. RESULTS In an S. cerevisiae protein interaction network we investigate to what extent edges that arise from duplication events contribute to the observed tendency to interact with proteins of a similar age. We find that part of this tendency is explained by interactions between paralogs. Age is usually defined on the level of protein families, rather than individual proteins, hence paralogs have the same age. The major contribution however, is from interaction partners that are shared between paralogs. These interactions have most likely been conserved after a duplication event. To investigate to what extent a nearly neutral process of network growth can explain these results, we adjust a well-studied network growth model to incorporate protein families. Our model shows that the number of edges between paralogs can be amplified by subsequent duplication events, thus explaining the overrepresentation of interparalog edges in the data. The fact that interaction partners shared by paralogs are often of the same age as the paralogs does not arise naturally from our model and needs further investigation. CONCLUSION We amend previous theories that explain why proteins of a similar age prefer to interact by demonstrating that this observation can be partially explained by gene duplication events. There is an ongoing debate on whether the protein interaction network is predominantly shaped by duplication and subfunctionalization or whether network rewiring is most important. Our analyses of S. cerevisiae protein interaction networks demonstrate that duplications have influenced at least one property of the protein interaction network: how proteins of different ages are connected.
Collapse
Affiliation(s)
- Like Fokkens
- Theoretical Biology and Bioinformatics, Department of Biology, Faculty of Science, Utrecht University, Padualaan 8, 3584CH, Utrecht, The Netherlands
| | - Paulien Hogeweg
- Theoretical Biology and Bioinformatics, Department of Biology, Faculty of Science, Utrecht University, Padualaan 8, 3584CH, Utrecht, The Netherlands
| | - Berend Snel
- Theoretical Biology and Bioinformatics, Department of Biology, Faculty of Science, Utrecht University, Padualaan 8, 3584CH, Utrecht, The Netherlands
- Netherlands Consortium for Systems Biology (NCSB), c/o NISB Bureau, University of Amsterdam, Science Park 904, 1098XH, Amsterdam, The Netherlands
| |
Collapse
|