1
|
Kosoglu K, Aydin Z, Tuncbag N, Gursoy A, Keskin O. Structural coverage of the human interactome. Brief Bioinform 2023; 25:bbad496. [PMID: 38180828 PMCID: PMC10768791 DOI: 10.1093/bib/bbad496] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2023] [Revised: 11/16/2023] [Accepted: 11/30/2023] [Indexed: 01/07/2024] Open
Abstract
Complex biological processes in cells are embedded in the interactome, representing the complete set of protein-protein interactions. Mapping and analyzing the protein structures are essential to fully comprehending these processes' molecular details. Therefore, knowing the structural coverage of the interactome is important to show the current limitations. Structural modeling of protein-protein interactions requires accurate protein structures. In this study, we mapped all experimental structures to the reference human proteome. Later, we found the enrichment in structural coverage when complementary methods such as homology modeling and deep learning (AlphaFold) were included. We then collected the interactions from the literature and databases to form the reference human interactome, resulting in 117 897 non-redundant interactions. When we analyzed the structural coverage of the interactome, we found that the number of experimentally determined protein complex structures is scarce, corresponding to 3.95% of all binary interactions. We also analyzed known and modeled structures to potentially construct the structural interactome with a docking method. Our analysis showed that 12.97% of the interactions from HuRI and 73.62% and 32.94% from the filtered versions of STRING and HIPPIE could potentially be modeled with high structural coverage or accuracy, respectively. Overall, this paper provides an overview of the current state of structural coverage of the human proteome and interactome.
Collapse
Affiliation(s)
- Kayra Kosoglu
- Computational Sciences and Engineering, College of Engineering, Koc University, 34450 Istanbul, Turkey
| | - Zeynep Aydin
- Computational Sciences and Engineering, College of Engineering, Koc University, 34450 Istanbul, Turkey
| | - Nurcan Tuncbag
- School of Medicine, Koc University, 34450 Istanbul, Turkey
- Department of Chemical and Biological Engineering, College of Engineering, Koc University, 34450 Istanbul, Turkey
| | - Attila Gursoy
- Department of Computer Engineering, College of Engineering, Koc University, 34450 Istanbul, Turkey
| | - Ozlem Keskin
- Department of Chemical and Biological Engineering, College of Engineering, Koc University, 34450 Istanbul, Turkey
| |
Collapse
|
2
|
Yang M, Harrison BR, Promislow DEL. In search of a Drosophila core cellular network with single-cell transcriptome data. G3 GENES|GENOMES|GENETICS 2022; 12:6670625. [PMID: 35976114 PMCID: PMC9526075 DOI: 10.1093/g3journal/jkac212] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/26/2022] [Accepted: 08/03/2022] [Indexed: 11/29/2022]
Abstract
Along with specialized functions, cells of multicellular organisms also perform essential functions common to most if not all cells. Whether diverse cells do this by using the same set of genes, interacting in a fixed coordinated fashion to execute essential functions, or a subset of genes specific to certain cells, remains a central question in biology. Here, we focus on gene coexpression to search for a core cellular network across a whole organism. Single-cell RNA-sequencing measures gene expression of individual cells, enabling researchers to discover gene expression patterns that contribute to the diversity of cell functions. Current efforts to study cellular functions focus primarily on identifying differentially expressed genes across cells. However, patterns of coexpression between genes are probably more indicative of biological processes than are the expression of individual genes. We constructed cell-type-specific gene coexpression networks using single-cell transcriptome datasets covering diverse cell types from the fruit fly, Drosophila melanogaster. We detected a set of highly coordinated genes preserved across cell types and present this as the best estimate of a core cellular network. This core is very small compared with cell-type-specific gene coexpression networks and shows dense connectivity. Gene members of this core tend to be ancient genes and are enriched for those encoding ribosomal proteins. Overall, we find evidence for a core cellular network in diverse cell types of the fruit fly. The topological, structural, functional, and evolutionary properties of this core indicate that it accounts for only a minority of essential functions.
Collapse
Affiliation(s)
- Ming Yang
- Department of Laboratory Medicine and Pathology, University of Washington School of Medicine , Seattle, WA 98195, USA
| | - Benjamin R Harrison
- Department of Laboratory Medicine and Pathology, University of Washington School of Medicine , Seattle, WA 98195, USA
| | - Daniel E L Promislow
- Department of Laboratory Medicine and Pathology, University of Washington School of Medicine , Seattle, WA 98195, USA
- Department of Biology, University of Washington , Seattle, WA 98195, USA
| |
Collapse
|
3
|
Amacher JF, Brooks L, Hampton TH, Madden DR. Specificity in PDZ-peptide interaction networks: Computational analysis and review. JOURNAL OF STRUCTURAL BIOLOGY-X 2020; 4:100022. [PMID: 32289118 PMCID: PMC7138185 DOI: 10.1016/j.yjsbx.2020.100022] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/15/2020] [Revised: 02/26/2020] [Accepted: 02/29/2020] [Indexed: 01/03/2023]
Abstract
Globular PDZ domains typically serve as protein-protein interaction modules that regulate a wide variety of cellular functions via recognition of short linear motifs (SLiMs). Often, PDZ mediated-interactions are essential components of macromolecular complexes, and disruption affects the entire scaffold. Due to their roles as linchpins in trafficking and signaling pathways, PDZ domains are attractive targets: both for controlling viral pathogens, which bind PDZ domains and hijack cellular machinery, as well as for developing therapies to combat human disease. However, successful therapeutic interventions that avoid off-target effects are a challenge, because each PDZ domain interacts with a number of cellular targets, and specific binding preferences can be difficult to decipher. Over twenty-five years of research has produced a wealth of data on the stereochemical preferences of individual PDZ proteins and their binding partners. Currently the field lacks a central repository for this information. Here, we provide this important resource and provide a manually curated, comprehensive list of the 271 human PDZ domains. We use individual domain, as well as recent genomic and proteomic, data in order to gain a holistic view of PDZ domains and interaction networks, arguing this knowledge is critical to optimize targeting selectivity and to benefit human health.
Collapse
Affiliation(s)
- Jeanine F Amacher
- Department of Biochemistry and Cell Biology, Geisel School of Medicine at Dartmouth, Hanover, NH 03755, USA.,Department of Chemistry, Western Washington University, Bellingham, WA 98225, USA
| | - Lionel Brooks
- Department of Biology, Western Washington University, Bellingham, WA 98225, USA
| | - Thomas H Hampton
- Department of Microbiology and Immunology, Geisel School of Medicine at Dartmouth, Hanover, NH 03755, USA
| | - Dean R Madden
- Department of Biochemistry and Cell Biology, Geisel School of Medicine at Dartmouth, Hanover, NH 03755, USA
| |
Collapse
|
4
|
A Quantitative Genetic Interaction Map of HIV Infection. Mol Cell 2020; 78:197-209.e7. [PMID: 32084337 DOI: 10.1016/j.molcel.2020.02.004] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2019] [Revised: 01/10/2020] [Accepted: 02/02/2020] [Indexed: 12/16/2022]
Abstract
We have developed a platform for quantitative genetic interaction mapping using viral infectivity as a functional readout and constructed a viral host-dependency epistasis map (vE-MAP) of 356 human genes linked to HIV function, comprising >63,000 pairwise genetic perturbations. The vE-MAP provides an expansive view of the genetic dependencies underlying HIV infection and can be used to identify drug targets and study viral mutations. We found that the RNA deadenylase complex, CNOT, is a central player in the vE-MAP and show that knockout of CNOT1, 10, and 11 suppressed HIV infection in primary T cells by upregulating innate immunity pathways. This phenotype was rescued by deletion of IRF7, a transcription factor regulating interferon-stimulated genes, revealing a previously unrecognized host signaling pathway involved in HIV infection. The vE-MAP represents a generic platform that can be used to study the global effects of how different pathogens hijack and rewire the host during infection.
Collapse
|
5
|
Abstract
Macromolecular complexes play a key role in cellular function. Predicting the structure and dynamics of these complexes is one of the key challenges in structural biology. Docking applications have traditionally been used to predict pairwise interactions between proteins. However, few methods exist for modeling multi-protein assemblies. Here we present two methods, CombDock and DockStar, that can predict multi-protein assemblies starting from subunit structural models. CombDock can assemble subunits without any assumptions about the pairwise interactions between subunits, while DockStar relies on the interaction graph or, alternatively, a homology model or a cryo-electron microscopy (EM) density map of the entire complex. We demonstrate the two methods using RNA polymerase II with 12 subunits and TRiC/CCT chaperonin with 16 subunits.
Collapse
Affiliation(s)
- Dina Schneidman-Duhovny
- School of Computer Science and Engineering and the Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Haim J Wolfson
- Blavatnik School of Computer Science, Tel Aviv University, Tel Aviv, Israel.
| |
Collapse
|
6
|
Tutuncuoglu B, Krogan NJ. Mapping genetic interactions in cancer: a road to rational combination therapies. Genome Med 2019; 11:62. [PMID: 31640753 PMCID: PMC6805649 DOI: 10.1186/s13073-019-0680-4] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2019] [Accepted: 10/16/2019] [Indexed: 01/08/2023] Open
Abstract
The discovery of synthetic lethal interactions between poly (ADP-ribose) polymerase (PARP) inhibitors and BRCA genes, which are involved in homologous recombination, led to the approval of PARP inhibition as a monotherapy for patients with BRCA1/2-mutated breast or ovarian cancer. Studies following the initial observation of synthetic lethality demonstrated that the reach of PARP inhibitors is well beyond just BRCA1/2 mutants. Insights into the mechanisms of action of anticancer drugs are fundamental for the development of targeted monotherapies or rational combination treatments that will synergize to promote cancer cell death and overcome mechanisms of resistance. The development of targeted therapeutic agents is premised on mapping the physical and functional dependencies of mutated genes in cancer. An important part of this effort is the systematic screening of genetic interactions in a variety of cancer types. Until recently, genetic-interaction screens have relied either on the pairwise perturbations of two genes or on the perturbation of genes of interest combined with inhibition by commonly used anticancer drugs. Here, we summarize recent advances in mapping genetic interactions using targeted, genome-wide, and high-throughput genetic screens, and we discuss the therapeutic insights obtained through such screens. We further focus on factors that should be considered in order to develop a robust analysis pipeline. Finally, we discuss the integration of functional interaction data with orthogonal methods and suggest that such approaches will increase the reach of genetic-interaction screens for the development of rational combination therapies.
Collapse
Affiliation(s)
- Beril Tutuncuoglu
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, 16th Street, Mission Bay Campus, San Francisco, CA, 94158-2140, USA.,The J. David Gladstone Institutes, Owens Street, San Francisco, CA, 94158, USA.,Quantitative Biosciences Institute, University of California, San Francisco, 4th Street, San Francisco, CA, 94158, USA.,Cancer Cell Map Initiative (CCMI), La Jolla and San Francisco, CA, USA
| | - Nevan J Krogan
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, 16th Street, Mission Bay Campus, San Francisco, CA, 94158-2140, USA. .,The J. David Gladstone Institutes, Owens Street, San Francisco, CA, 94158, USA. .,Quantitative Biosciences Institute, University of California, San Francisco, 4th Street, San Francisco, CA, 94158, USA. .,Cancer Cell Map Initiative (CCMI), La Jolla and San Francisco, CA, USA.
| |
Collapse
|
7
|
Fragata I, Blanckaert A, Dias Louro MA, Liberles DA, Bank C. Evolution in the light of fitness landscape theory. Trends Ecol Evol 2019; 34:69-82. [DOI: 10.1016/j.tree.2018.10.009] [Citation(s) in RCA: 84] [Impact Index Per Article: 16.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2018] [Revised: 10/16/2018] [Accepted: 10/17/2018] [Indexed: 01/28/2023]
|
8
|
Rosell M, Fernández-Recio J. Hot-spot analysis for drug discovery targeting protein-protein interactions. Expert Opin Drug Discov 2018; 13:327-338. [PMID: 29376444 DOI: 10.1080/17460441.2018.1430763] [Citation(s) in RCA: 54] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
INTRODUCTION Protein-protein interactions are important for biological processes and pathological situations, and are attractive targets for drug discovery. However, rational drug design targeting protein-protein interactions is still highly challenging. Hot-spot residues are seen as the best option to target such interactions, but their identification requires detailed structural and energetic characterization, which is only available for a tiny fraction of protein interactions. Areas covered: In this review, the authors cover a variety of computational methods that have been reported for the energetic analysis of protein-protein interfaces in search of hot-spots, and the structural modeling of protein-protein complexes by docking. This can help to rationalize the discovery of small-molecule inhibitors of protein-protein interfaces of therapeutic interest. Computational analysis and docking can help to locate the interface, molecular dynamics can be used to find suitable cavities, and hot-spot predictions can focus the search for inhibitors of protein-protein interactions. Expert opinion: A major difficulty for applying rational drug design methods to protein-protein interactions is that in the majority of cases the complex structure is not available. Fortunately, computational docking can complement experimental data. An interesting aspect to explore in the future is the integration of these strategies for targeting PPIs with large-scale mutational analysis.
Collapse
Affiliation(s)
- Mireia Rosell
- a Department of Life Sciences , Barcelona Supercomputing Center (BSC) , Barcelona , Spain
| | - Juan Fernández-Recio
- a Department of Life Sciences , Barcelona Supercomputing Center (BSC) , Barcelona , Spain.,b Structural Biology Unit , Institut de Biologia Molecular de Barcelona (IBMB), CSIC , Barcelona , Spain
| |
Collapse
|
9
|
Barradas-Bautista D, Rosell M, Pallara C, Fernández-Recio J. Structural Prediction of Protein–Protein Interactions by Docking: Application to Biomedical Problems. PROTEIN-PROTEIN INTERACTIONS IN HUMAN DISEASE, PART A 2018; 110:203-249. [DOI: 10.1016/bs.apcsb.2017.06.003] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
|
10
|
DEEPN as an Approach for Batch Processing of Yeast 2-Hybrid Interactions. Cell Rep 2017; 17:303-315. [PMID: 27681439 DOI: 10.1016/j.celrep.2016.08.095] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2016] [Revised: 05/06/2016] [Accepted: 08/29/2016] [Indexed: 01/06/2023] Open
Abstract
We adapted the yeast 2-hybrid assay to simultaneously uncover multiple transient protein interactions within a single screen by using a strategy termed DEEPN (dynamic enrichment for evaluation of protein networks). This approach incorporates high-throughput DNA sequencing and computation to follow competition among a plasmid population encoding interacting partners. To demonstrate the capacity of DEEPN, we identify a wide range of ubiquitin-binding proteins, including interactors that we verify biochemically. To demonstrate the specificity of DEEPN, we show that DEEPN allows simultaneous comparison of candidate interactors across multiple bait proteins, allowing differential interactions to be identified. This feature was used to identify interactors that distinguish between GTP- and GDP-bound conformations of Rab5.
Collapse
|
11
|
Kazantsev F, Akberdin I, Lashin S, Ree N, Timonov V, Ratushny A, Khlebodarova T, Likhoshvai V. MAMMOTh: A new database for curated mathematical models of biomolecular systems. J Bioinform Comput Biol 2017; 16:1740010. [PMID: 29172865 DOI: 10.1142/s0219720017400108] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
MOTIVATION Living systems have a complex hierarchical organization that can be viewed as a set of dynamically interacting subsystems. Thus, to simulate the internal nature and dynamics of the entire biological system, we should use the iterative way for a model reconstruction, which is a consistent composition and combination of its elementary subsystems. In accordance with this bottom-up approach, we have developed the MAthematical Models of bioMOlecular sysTems (MAMMOTh) tool that consists of the database containing manually curated MAMMOTh fitted to the experimental data and a software tool that provides their further integration. RESULTS The MAMMOTh database entries are organized as building blocks in a way that the model parts can be used in different combinations to describe systems with higher organizational level (metabolic pathways and/or transcription regulatory networks). The tool supports export of a single model or their combinations in SBML or Mathematica standards. The database currently contains 110 mathematical sub-models for Escherichia coli elementary subsystems (enzymatic reactions and gene expression regulatory processes) that can be combined in at least 5100 complex/sophisticated models concerning more complex biological processes as de novo nucleotide biosynthesis, aerobic/anaerobic respiration and nitrate/nitrite utilization in E. coli. All models are functionally interconnected and sufficiently complement public model resources. AVAILABILITY http://mammoth.biomodelsgroup.ru.
Collapse
Affiliation(s)
- Fedor Kazantsev
- * Institute of Cytology and Genetics SB RAS, Lavrentyev Avenue., 10, Novosibirsk 630090, Russia.,† Novosibirsk State University, Pirogova str. 2, Novosibirsk 630090, Russia
| | - Ilya Akberdin
- * Institute of Cytology and Genetics SB RAS, Lavrentyev Avenue., 10, Novosibirsk 630090, Russia.,† Novosibirsk State University, Pirogova str. 2, Novosibirsk 630090, Russia.,¶ Biology Department, San Diego State University, San Diego, CA 92182-4614, USA
| | - Sergey Lashin
- * Institute of Cytology and Genetics SB RAS, Lavrentyev Avenue., 10, Novosibirsk 630090, Russia.,† Novosibirsk State University, Pirogova str. 2, Novosibirsk 630090, Russia
| | - Natalia Ree
- * Institute of Cytology and Genetics SB RAS, Lavrentyev Avenue., 10, Novosibirsk 630090, Russia
| | - Vladimir Timonov
- † Novosibirsk State University, Pirogova str. 2, Novosibirsk 630090, Russia
| | - Alexander Ratushny
- ‡ Center for Infectious Disease Research (Formerly Seattle, Biomedical Research Institute), Seattle, WA 98109, USA.,§ Institute for Systems Biology, Seattle, WA 98109-5234, USA
| | - Tamara Khlebodarova
- * Institute of Cytology and Genetics SB RAS, Lavrentyev Avenue., 10, Novosibirsk 630090, Russia
| | - Vitaly Likhoshvai
- * Institute of Cytology and Genetics SB RAS, Lavrentyev Avenue., 10, Novosibirsk 630090, Russia.,† Novosibirsk State University, Pirogova str. 2, Novosibirsk 630090, Russia
| |
Collapse
|
12
|
Barradas-Bautista D, Fernández-Recio J. Docking-based modeling of protein-protein interfaces for extensive structural and functional characterization of missense mutations. PLoS One 2017; 12:e0183643. [PMID: 28841721 PMCID: PMC5571915 DOI: 10.1371/journal.pone.0183643] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2017] [Accepted: 08/08/2017] [Indexed: 01/23/2023] Open
Abstract
Next-generation sequencing (NGS) technologies are providing genomic information for an increasing number of healthy individuals and patient populations. In the context of the large amount of generated genomic data that is being generated, understanding the effect of disease-related mutations at molecular level can contribute to close the gap between genotype and phenotype and thus improve prevention, diagnosis or treatment of a pathological condition. In order to fully characterize the effect of a pathological mutation and have useful information for prediction purposes, it is important first to identify whether the mutation is located at a protein-binding interface, and second to understand the effect on the binding affinity of the affected interaction/s. Computational methods, such as protein docking are currently used to complement experimental efforts and could help to build the human structural interactome. Here we have extended the original pyDockNIP method to predict the location of disease-associated nsSNPs at protein-protein interfaces, when there is no available structure for the protein-protein complex. We have applied this approach to the pathological interaction networks of six diseases with low structural data on PPIs. This approach can almost double the number of nsSNPs that can be characterized and identify edgetic effects in many nsSNPs that were previously unknown. This can help to annotate and interpret genomic data from large-scale population studies, and to achieve a better understanding of disease at molecular level.
Collapse
Affiliation(s)
| | - Juan Fernández-Recio
- Life Sciences Department, Barcelona Supercomputing Center (BSC), Barcelona, Spain
- * E-mail:
| |
Collapse
|
13
|
Kumar A, Pathak RK, Gupta SM, Gaur VS, Pandey D. Systems Biology for Smart Crops and Agricultural Innovation: Filling the Gaps between Genotype and Phenotype for Complex Traits Linked with Robust Agricultural Productivity and Sustainability. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2016; 19:581-601. [PMID: 26484978 DOI: 10.1089/omi.2015.0106] [Citation(s) in RCA: 63] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
In recent years, rapid developments in several omics platforms and next generation sequencing technology have generated a huge amount of biological data about plants. Systems biology aims to develop and use well-organized and efficient algorithms, data structure, visualization, and communication tools for the integration of these biological data with the goal of computational modeling and simulation. It studies crop plant systems by systematically perturbing them, checking the gene, protein, and informational pathway responses; integrating these data; and finally, formulating mathematical models that describe the structure of system and its response to individual perturbations. Consequently, systems biology approaches, such as integrative and predictive ones, hold immense potential in understanding of molecular mechanism of agriculturally important complex traits linked to agricultural productivity. This has led to identification of some key genes and proteins involved in networks of pathways involved in input use efficiency, biotic and abiotic stress resistance, photosynthesis efficiency, root, stem and leaf architecture, and nutrient mobilization. The developments in the above fields have made it possible to design smart crops with superior agronomic traits through genetic manipulation of key candidate genes.
Collapse
Affiliation(s)
| | - Rajesh Kumar Pathak
- 2 Department of Biotechnology, G. B. Pant Engineering College , Pauri Garhwal-246194, Uttarakhand, India
| | - Sanjay Mohan Gupta
- 3 Molecular Biology and Genetic Engineering Laboratory, Defence Institute of Bio-Energy Research , DRDO, Haldwani, Uttarakhand, India
| | - Vikram Singh Gaur
- 4 College of Agriculture , Waraseoni, Balaghat, Madhya Pradesh, India
| | | |
Collapse
|
14
|
Scott NE, Brown LM, Kristensen AR, Foster LJ. Development of a computational framework for the analysis of protein correlation profiling and spatial proteomics experiments. J Proteomics 2014; 118:112-29. [PMID: 25464368 DOI: 10.1016/j.jprot.2014.10.024] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2014] [Revised: 10/18/2014] [Accepted: 10/27/2014] [Indexed: 01/12/2023]
Abstract
UNLABELLED Standard approaches to studying an interactome do not easily allow conditional experiments but in recent years numerous groups have demonstrated the potential for co-fractionation/co-migration based approaches to assess an interactome at a similar sensitivity and specificity yet significantly lower cost and higher speed than traditional approaches. Unfortunately, there is as yet no implementation of the bioinformatics tools required to robustly analyze co-fractionation data in a way that can also integrate the valuable information contained in biological replicates. Here we have developed a freely available, integrated bioinformatics solution for the analysis of protein correlation profiling SILAC data. This modular solution allows the deconvolution of protein chromatograms into individual Gaussian curves enabling the use of these chromatography features to align replicates and assemble a consensus map of features observed across replicates; the chromatograms and individual curves are then used to quantify changes in protein interactions and construct the interactome. We have applied this workflow to the analysis of HeLa cells infected with a Salmonella enterica serovar Typhimurium infection model where we can identify specific interactions that are affected by the infection. These bioinformatics tools simplify the analysis of co-fractionation/co-migration data to the point where there is no specialized knowledge required to measure an interactome in this way. BIOLOGICAL SIGNIFICANCE We describe a set of software tools for the bioinformatics analysis of co-migration/co-fractionation data that integrates the results from multiple replicates to generate an interactome, including the impact on individual interactions of any external perturbation. This article is part of a Special Issue entitled: Protein dynamics in health and disease. Guest Editors: Pierre Thibault and Anne-Claude Gingras.
Collapse
Affiliation(s)
- Nichollas E Scott
- Centre for High-throughput Biology, University of British Columbia, Vancouver V6T 1Z4, British Columbia, Canada.
| | - Lyda M Brown
- Centre for High-throughput Biology, University of British Columbia, Vancouver V6T 1Z4, British Columbia, Canada
| | - Anders R Kristensen
- Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver V5Z 4S6, British Columbia, Canada
| | - Leonard J Foster
- Centre for High-throughput Biology, University of British Columbia, Vancouver V6T 1Z4, British Columbia, Canada.
| |
Collapse
|
15
|
Abstract
Thrombin has multiple functions in blood coagulation and its regulation is central to maintaining the balance between hemorrhage and thrombosis. Empirical and computational methods that capture thrombin generation can provide advancements to current clinical screening of the hemostatic balance at the level of the individual. In any individual, procoagulant and anticoagulant factor levels together act to generate a unique coagulation phenotype (net balance) that is reflective of the sum of its developmental, environmental, genetic, nutritional and pharmacological influences. Defining such thrombin phenotypes may provide a means to track disease progression pre-crisis. In this review we briefly describe thrombin function, methods for assessing thrombin dynamics as a phenotypic marker, computationally derived thrombin phenotypes versus determined clinical phenotypes, the boundaries of normal range thrombin generation using plasma composition based approaches and the feasibility of these approaches for predicting risk.
Collapse
|
16
|
Clancy T, Hovig E. From proteomes to complexomes in the era of systems biology. Proteomics 2014; 14:24-41. [PMID: 24243660 DOI: 10.1002/pmic.201300230] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2013] [Revised: 10/22/2013] [Accepted: 11/06/2013] [Indexed: 01/16/2023]
Abstract
Protein complexes carry out almost the entire signaling and functional processes in the cell. The protein complex complement of a cell, and its network of complex-complex interactions, is referred to here as the complexome. Computational methods to predict protein complexes from proteomics data, resulting in network representations of complexomes, have recently being developed. In addition, key advances have been made toward understanding the network and structural organization of complexomes. We review these bioinformatics advances, and their discovery-potential, as well as the merits of integrating proteomics data with emerging methods in systems biology to study protein complex signaling. It is envisioned that improved integration of proteomics and systems biology, incorporating the dynamics of protein complexes in space and time, may lead to more predictive models of cell signaling networks for effective modulation.
Collapse
Affiliation(s)
- Trevor Clancy
- Department of Tumor Biology, Institute for Cancer Research, The Norwegian Radium Hospital, Oslo University Hospital, Oslo, Norway
| | | |
Collapse
|
17
|
Abstract
The past decade has seen a dramatic expansion in the number and range of techniques available to obtain genome-wide information and to analyze this information so as to infer both the functions of individual molecules and how they interact to modulate the behavior of biological systems. Here, we review these techniques, focusing on the construction of physical protein-protein interaction networks, and highlighting approaches that incorporate protein structure, which is becoming an increasingly important component of systems-level computational techniques. We also discuss how network analyses are being applied to enhance our basic understanding of biological systems and their disregulation, as well as how these networks are being used in drug development.
Collapse
Affiliation(s)
- Donald Petrey
- Center for Computational Biology and Bioinformatics, Department of Systems Biology
| | | |
Collapse
|
18
|
Systematic exploration of ubiquitin sequence, E1 activation efficiency, and experimental fitness in yeast. J Mol Biol 2014; 426:2854-70. [PMID: 24862281 DOI: 10.1016/j.jmb.2014.05.019] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2014] [Revised: 05/13/2014] [Accepted: 05/18/2014] [Indexed: 01/26/2023]
Abstract
The complexity of biological interaction networks poses a challenge to understanding the function of individual connections in the overall network. To address this challenge, we developed a high-throughput reverse engineering strategy to analyze how thousands of specific perturbations (encompassing all point mutations in a central gene) impact both a specific edge (interaction to a directly connected node) and an overall network function. We analyzed the effects of ubiquitin mutations on activation by the E1 enzyme and compared these to effects on yeast growth rate. Using this approach, we delineated ubiquitin mutations that selectively impacted the ubiquitin-E1 edge. We find that the elasticity function relating the efficiency of ubiquitin-E1 interaction to growth rate is non-linear and that a greater than 50-fold decrease in E1 activation efficiency is required to reduce growth rate by 2-fold. Despite the robustness of fitness to decreases in E1 activation efficiency, the effects of most ubiquitin mutations on E1 activation paralleled the effects on growth rate. Our observations indicate that most ubiquitin mutations that disrupt E1 activation also disrupt other functions. The structurally characterized ubiquitin-E1 interface encompasses the interfaces of ubiquitin with most other known binding partners, and we propose that this enables E1 in wild-type cells to selectively activate ubiquitin protein molecules capable of binding to other partners from the cytoplasmic pool of ubiquitin protein that will include molecules with chemical damage and/or errors from transcription and translation.
Collapse
|
19
|
Hingorani KS, Gierasch LM. Comparing protein folding in vitro and in vivo: foldability meets the fitness challenge. Curr Opin Struct Biol 2014; 24:81-90. [PMID: 24434632 DOI: 10.1016/j.sbi.2013.11.007] [Citation(s) in RCA: 62] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2013] [Revised: 10/21/2013] [Accepted: 11/21/2013] [Indexed: 01/09/2023]
Abstract
In this review, we compare and contrast current knowledge about in vitro and in vivo protein folding. Major advances in understanding fundamental principles underlying protein folding in optimized in vitro conditions have yielded detailed physicochemical principles of folding landscapes for small, single domain proteins. In addition, there has been increased research focusing on the key features of protein folding in the cell that differentiate it from in vitro folding, such as co-translational folding, chaperone-facilitated folding, and folding in crowded conditions with many weak interactions. Yet these two research areas have not been bridged effectively in research carried out to date. This review points to gaps between the two that are ripe for future research. Moreover, we emphasize the biological selection pressures that impact protein folding in vivo and how fitness drives the evolution of protein sequences in ways that may place foldability in tension with other requirements on a given protein. We suggest that viewing the physicochemical process of protein folding through the lens of evolution will unveil new insights and pose novel challenges about in-cell folding landscapes.
Collapse
Affiliation(s)
- Karan S Hingorani
- Program in Molecular and Cellular Biology, University of Massachusetts, Amherst, Amherst, MA 01003, United States; Department of Biochemistry & Molecular Biology, University of Massachusetts, Amherst, Amherst, MA 01003, United States
| | - Lila M Gierasch
- Program in Molecular and Cellular Biology, University of Massachusetts, Amherst, Amherst, MA 01003, United States; Department of Biochemistry & Molecular Biology, University of Massachusetts, Amherst, Amherst, MA 01003, United States; Department of Chemistry, University of Massachusetts, Amherst, Amherst, MA 01003, United States.
| |
Collapse
|
20
|
Carter H, Hofree M, Ideker T. Genotype to phenotype via network analysis. Curr Opin Genet Dev 2013; 23:611-21. [PMID: 24238873 PMCID: PMC3866044 DOI: 10.1016/j.gde.2013.10.003] [Citation(s) in RCA: 85] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2013] [Revised: 10/04/2013] [Accepted: 10/09/2013] [Indexed: 02/06/2023]
Abstract
A prime objective of genomic medicine is the identification of disease-causing mutations and the mechanisms by which such events result in disease. As most disease phenotypes arise not from single genes and proteins but from a complex network of molecular interactions, a priori knowledge about the molecular network serves as a framework for biological inference and data mining. Here we review recent developments at the interface of biological networks and mutation analysis. We examine how mutations may be treated as a perturbation of the molecular interaction network and what insights may be gained from taking this perspective. We review work that aims to transform static networks into rich context-dependent networks and recent attempts to integrate non-coding RNAs into such analysis. Finally, we conclude with an overview of the many challenges and opportunities that lie ahead.
Collapse
Affiliation(s)
- Hannah Carter
- Institute for Genomic Medicine and Department of Medicine, University of California, San Diego, 9500 Gillman Drive, La Jolla, CA 92093, United States
| | | | | |
Collapse
|
21
|
Abstract
Proteins are not monolithic entities; rather, they can contain multiple domains that mediate distinct interactions, and their functionality can be regulated through post-translational modifications at multiple distinct sites. Traditionally, network biology has ignored such properties of proteins and has instead examined either the physical interactions of whole proteins or the consequences of removing entire genes. In this Review, we discuss experimental and computational methods to increase the resolution of protein-protein, genetic and drug-gene interaction studies to the domain and residue levels. Such work will be crucial for using interaction networks to connect sequence and structural information, and to understand the biological consequences of disease-associated mutations, which will hopefully lead to more effective therapeutic strategies.
Collapse
|
22
|
Hleap JS, Susko E, Blouin C. Defining structural and evolutionary modules in proteins: a community detection approach to explore sub-domain architecture. BMC STRUCTURAL BIOLOGY 2013; 13:20. [PMID: 24131821 PMCID: PMC4016585 DOI: 10.1186/1472-6807-13-20] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/20/2013] [Accepted: 10/11/2013] [Indexed: 12/23/2022]
Abstract
Background Assessing protein modularity is important to understand protein evolution. Still the question of the existence of a sub-domain modular architecture remains. We propose a graph-theory approach with significance and power testing to identify modules in protein structures. In the first step, clusters are determined by optimizing the partition that maximizes the modularity score. Second, each cluster is tested for significance. Significant clusters are referred to as modules. Evolutionary modules are identified by analyzing homologous structures. Dynamic modules are inferred from sets of snapshots of molecular simulations. We present here a methodology to identify sub-domain architecture robustly, biologically meaningful, and statistically supported. Results The robustness of this new method is tested using simulated data with known modularity. Modules are correctly identified even when there is a low correlation between landmarks within a module. We also analyzed the evolutionary modularity of a data set of α-amylase catalytic domain homologs, and the dynamic modularity of the Niemann-Pick C1 (NPC1) protein N-terminal domain. The α-amylase contains an (α/β)8 barrel (TIM barrel) with the polysaccharides cleavage site and a calcium-binding domain. In this data set we identified four robust evolutionary modules, one of which forms the minimal functional TIM barrel topology. The NPC1 protein is involved in the intracellular lipid metabolism coordinating sterol trafficking. NPC1 N-terminus is the first luminal domain which binds to cholesterol and its oxygenated derivatives. Our inferred dynamic modules in the protein NPC1 are also shown to match functional components of the protein related to the NPC1 disease. Conclusions A domain compartmentalization can be found and described in correlation space. To our knowledge, there is no other method attempting to identify sub-domain architecture from the correlation among residues. Most attempts made focus on sequence motifs of protein-protein interactions, binding sites, or sequence conservancy. We were able to describe functional/structural sub-domain architecture related to key residues for starch cleavage, calcium, and chloride binding sites in the α-amylase, and sterol opening-defining modules and disease-related residues in the NPC1. We also described the evolutionary sub-domain architecture of the α-amylase catalytic domain, identifying the already reported minimum functional TIM barrel.
Collapse
Affiliation(s)
- Jose Sergio Hleap
- Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, NS, B3H 4R2, Canada.
| | | | | |
Collapse
|
23
|
Kurata H, Maeda K, Onaka T, Takata T. BioFNet: biological functional network database for analysis and synthesis of biological systems. Brief Bioinform 2013; 15:699-709. [PMID: 23894104 DOI: 10.1093/bib/bbt048] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
In synthetic biology and systems biology, a bottom-up approach can be used to construct a complex, modular, hierarchical structure of biological networks. To analyze or design such networks, it is critical to understand the relationship between network structure and function, the mechanism through which biological parts or biomolecules are assembled into building blocks or functional networks. A functional network is defined as a subnetwork of biomolecules that performs a particular function. Understanding the mechanism of building functional networks would help develop a methodology for analyzing the structure of large-scale networks and design a robust biological circuit to perform a target function. We propose a biological functional network database, named BioFNet, which can cover the whole cell at the level of molecular interactions. The BioFNet takes an advantage in implementing the simulation program for the mathematical models of the functional networks, visualizing the simulated results. It presents a sound basis for rational design of biochemical networks and for understanding how functional networks are assembled to create complex high-level functions, which would reveal design principles underlying molecular architectures.
Collapse
|
24
|
Almo SC, Garforth SJ, Hillerich BS, Love JD, Seidel RD, Burley SK. Protein production from the structural genomics perspective: achievements and future needs. Curr Opin Struct Biol 2013; 23:335-44. [PMID: 23642905 PMCID: PMC4163025 DOI: 10.1016/j.sbi.2013.02.014] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2012] [Revised: 02/18/2013] [Accepted: 02/18/2013] [Indexed: 01/31/2023]
Abstract
Despite a multitude of recent technical breakthroughs speeding high-resolution structural analysis of biological macromolecules, production of sufficient quantities of well-behaved, active protein continues to represent the rate-limiting step in many structure determination efforts. These challenges are only amplified when considered in the context of ongoing structural genomics efforts, which are now contending with multi-domain eukaryotic proteins, secreted proteins, and ever-larger macromolecular assemblies. Exciting new developments in eukaryotic expression platforms, including insect and mammalian-based systems, promise enhanced opportunities for structural approaches to some of the most important biological problems. Development and implementation of automated eukaryotic expression techniques promises to significantly improve production of materials for structural, functional, and biomedical research applications.
Collapse
Affiliation(s)
- Steven C Almo
- Department of Biochemistry, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY 10461, United States.
| | | | | | | | | | | |
Collapse
|
25
|
Lim WA, Lee CM, Tang C. Design principles of regulatory networks: searching for the molecular algorithms of the cell. Mol Cell 2013; 49:202-12. [PMID: 23352241 DOI: 10.1016/j.molcel.2012.12.020] [Citation(s) in RCA: 101] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2012] [Accepted: 12/30/2012] [Indexed: 12/28/2022]
Abstract
A challenge in biology is to understand how complex molecular networks in the cell execute sophisticated regulatory functions. Here we explore the idea that there are common and general principles that link network structures to biological functions, principles that constrain the design solutions that evolution can converge upon for accomplishing a given cellular task. We describe approaches for classifying networks based on abstract architectures and functions, rather than on the specific molecular components of the networks. For any common regulatory task, can we define the space of all possible molecular solutions? Such inverse approaches might ultimately allow the assembly of a design table of core molecular algorithms that could serve as a guide for building synthetic networks and modulating disease networks.
Collapse
Affiliation(s)
- Wendell A Lim
- Center for Systems and Synthetic Biology, University of California, San Francisco, CA 94158, USA.
| | | | | |
Collapse
|
26
|
Gajendrarao P, Krishnamoorthy N, Kassem HS, Moharem-Elgamal S, Cecchi F, Olivotto I, Yacoub MH. Molecular modeling of disease causing mutations in domain C1 of cMyBP-C. PLoS One 2013; 8:e59206. [PMID: 23527136 PMCID: PMC3602012 DOI: 10.1371/journal.pone.0059206] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2012] [Accepted: 02/12/2013] [Indexed: 11/20/2022] Open
Abstract
Cardiac myosin binding protein-C (cMyBP-C) is a multi-domain (C0-C10) protein that regulates heart muscle contraction through interaction with myosin, actin and other sarcomeric proteins. Several mutations of this protein cause familial hypertrophic cardiomyopathy (HCM). Domain C1 of cMyBP-C plays a central role in protein interactions with actin and myosin. Here, we studied structure-function relationship of three disease causing mutations, Arg177His, Ala216Thr and Glu258Lys of the domain C1 using computational biology techniques with its available X-ray crystal structure. The results suggest that each mutation could affect structural properties of the domain C1, and hence it's structural integrity through modifying intra-molecular arrangements in a distinct mode. The mutations also change surface charge distributions, which could impact the binding of C1 with other sarcomeric proteins thereby affecting contractile function. These structural consequences of the C1 mutants could be valuable to understand the molecular mechanisms for the disease.
Collapse
Affiliation(s)
| | | | - Heba Sh Kassem
- BA-HCM National Programme at Aswan Heart Centre, Egypt
- Pathology Department and Clinical Genomics Centre, Alexandria Faculty of Medicine, Alexandria, Egypt
| | - Sarah Moharem-Elgamal
- BA-HCM National Programme at Aswan Heart Centre, Egypt
- National Heart Institute, Giza, Egypt
| | - Franco Cecchi
- Referral Center for Myocardial Diseases, Careggi University Hospital, Florence, Italy
| | - Iacopo Olivotto
- Referral Center for Myocardial Diseases, Careggi University Hospital, Florence, Italy
| | - Magdi H. Yacoub
- Qatar Cardiovascular Research Center, Qatar Foundation, Doha, Qatar
- BA-HCM National Programme at Aswan Heart Centre, Egypt
- National Heart and Lung Institute, Imperial College London, United Kingdom
| |
Collapse
|