1
|
Gupta SK, Ponte-Sucre A, Bencurova E, Dandekar T. An Ebola, Neisseria and Trypanosoma human protein interaction census reveals a conserved human protein cluster targeted by various human pathogens. Comput Struct Biotechnol J 2021; 19:5292-5308. [PMID: 34745452 PMCID: PMC8531761 DOI: 10.1016/j.csbj.2021.09.017] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2021] [Revised: 09/14/2021] [Accepted: 09/15/2021] [Indexed: 12/28/2022] Open
Abstract
Filovirus ebolavirus (ZE; Zaire ebolavirus, Bundibugyo ebolavirus), Neisseria meningitidis (NM), and Trypanosoma brucei (Tb) are serious infectious pathogens, spanning viruses, bacteria and protists and all may target the blood and central nervous system during their life cycle. NM and Tb are extracellular pathogens while ZE is obligatory intracellular, targetting immune privileged sites. By using interactomics and comparative evolutionary analysis we studied whether conserved human proteins are targeted by these pathogens. We examined 2797 unique pathogen-targeted human proteins. The information derived from orthology searches of experimentally validated protein-protein interactions (PPIs) resulted both in unique and shared PPIs for each pathogen. Comparing and analyzing conserved and pathogen-specific infection pathways for NM, TB and ZE, we identified human proteins predicted to be targeted in at least two of the compared host-pathogen networks. However, four proteins were common to all three host-pathogen interactomes: the elongation factor 1-alpha 1 (EEF1A1), the SWI/SNF complex subunit SMARCC2 (matrix-associated actin-dependent regulator of chromatin subfamily C), the dolichyl-diphosphooligosaccharide--protein glycosyltransferase subunit 1 (RPN1), and the tubulin beta-5 chain (TUBB). These four human proteins all are also involved in cytoskeleton and its regulation and are often addressed by various human pathogens. Specifically, we found (i) 56 human pathogenic bacteria and viruses that target these four proteins, (ii) the well researched new pandemic pathogen SARS-CoV-2 targets two of these four human proteins and (iii) nine human pathogenic fungi (yet another evolutionary distant organism group) target three of the conserved proteins by 130 high confidence interactions.
Collapse
Affiliation(s)
- Shishir K Gupta
- Functional Genomics & Systems Biology Group, Department of Bioinformatics, Biocenter, Am Hubland, University of Würzburg, 97074 Würzburg, Germany
- Evolutionary Genomics Group, Center for Computational and Theoretical Biology, University of Würzburg, 97078 Würzburg, Germany
| | - Alicia Ponte-Sucre
- Laboratorio de Fisiología Molecular, Instituto de Medicina Experimental, Escuela Luis Razetti, Universidad Central de Venezuela, Caracas, Venezuela
- Medical Mission Institute, Hermann-Schell-Str. 7, 97074 Würzburg, Germany
| | - Elena Bencurova
- Functional Genomics & Systems Biology Group, Department of Bioinformatics, Biocenter, Am Hubland, University of Würzburg, 97074 Würzburg, Germany
| | - Thomas Dandekar
- Functional Genomics & Systems Biology Group, Department of Bioinformatics, Biocenter, Am Hubland, University of Würzburg, 97074 Würzburg, Germany
- EMBL Heidelberg, BioComputing Unit, Meyerhofstraße 1, 69117 Heidelberg, Germany
| |
Collapse
|
2
|
Alborzi SZ, Ahmed Nacer A, Najjar H, Ritchie DW, Devignes MD. PPIDomainMiner: Inferring domain-domain interactions from multiple sources of protein-protein interactions. PLoS Comput Biol 2021; 17:e1008844. [PMID: 34370723 PMCID: PMC8376228 DOI: 10.1371/journal.pcbi.1008844] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2021] [Revised: 08/19/2021] [Accepted: 07/12/2021] [Indexed: 12/26/2022] Open
Abstract
Many biological processes are mediated by protein-protein interactions (PPIs). Because protein domains are the building blocks of proteins, PPIs likely rely on domain-domain interactions (DDIs). Several attempts exist to infer DDIs from PPI networks but the produced datasets are heterogeneous and sometimes not accessible, while the PPI interactome data keeps growing. We describe a new computational approach called “PPIDM” (Protein-Protein Interactions Domain Miner) for inferring DDIs using multiple sources of PPIs. The approach is an extension of our previously described “CODAC” (Computational Discovery of Direct Associations using Common neighbors) method for inferring new edges in a tripartite graph. The PPIDM method has been applied to seven widely used PPI resources, using as “Gold-Standard” a set of DDIs extracted from 3D structural databases. Overall, PPIDM has produced a dataset of 84,552 non-redundant DDIs. Statistical significance (p-value) is calculated for each source of PPI and used to classify the PPIDM DDIs in Gold (9,175 DDIs), Silver (24,934 DDIs) and Bronze (50,443 DDIs) categories. Dataset comparison reveals that PPIDM has inferred from the 2017 releases of PPI sources about 46% of the DDIs present in the 2020 release of the 3did database, not counting the DDIs present in the Gold-Standard. The PPIDM dataset contains 10,229 DDIs that are consistent with more than 13,300 PPIs extracted from the IMEx database, and nearly 23,300 DDIs (27.5%) that are consistent with more than 214,000 human PPIs extracted from the STRING database. Examples of newly inferred DDIs covering more than 10 PPIs in the IMEx database are provided. Further exploitation of the PPIDM DDI reservoir includes the inventory of possible partners of a protein of interest and characterization of protein interactions at the domain level in combination with other methods. The result is publicly available at http://ppidm.loria.fr/. We revisit at a large scale the question of inferring DDIs from PPIs. Compared to previous studies, we take a unified approach accross multiple sources of PPIs. This approach is a method for inferring new edges in a tripartite graph setting and can be compared to link prediction approaches in knowledge graphs. Aggregation of several sources is performed using an optimized weighted average of the individual scores calculated in each source. A huge dataset of over 84K DDIs is produced which far exceeds the previous datasets. We show that a significant portion of the PPIDM dataset covers a large number of PPIs from curated (IMEx) or non curated (STRING) databases. Such a reservoir of DDIs deserves further exploration and can be combined with high-throughput methods such as cross-linking mass spectrometry to identify plausible protein partners of proteins of interest.
Collapse
|
3
|
Armijos-Jaramillo V, Espinosa N, Vizcaíno K, Santander-Gordón D. A Novel In Silico Method for Molecular Mimicry Detection Finds a Formin with the Potential to Manipulate the Maize Cell Cytoskeleton. MOLECULAR PLANT-MICROBE INTERACTIONS : MPMI 2021; 34:815-825. [PMID: 33755496 DOI: 10.1094/mpmi-11-20-0332-r] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Molecular mimicry is one of the evolutionary strategies that parasites use to manipulate the host metabolism and perform an effective infection. This phenomenon has been observed in several animal and plant pathosystems. Despite the relevance of this mechanism in pathogenesis, little is known about it in fungus-plant interactions. For that reason, we performed an in silico method to select plausible mimicry candidates for the Ustilago maydis-maize interaction. Our methodology used a tripartite sequence comparison between the parasite, the host, and nonparasitic organisms' genomes. Furthermore, we used RNA sequencing information to identify gene coexpression, and we determined subcellular localization to detect potential cases of colocalization in the imitator-imitated pairs. With these approximations, we found a putative extracellular formin in U. maydis with the potential to rearrange the host cell cytoskeleton. In parallel, we detected at least two maize genes involved in the cytoskeleton rearrangement differentially expressed under U. maydis infection; thus, this find increases the expectation for the potential mimicry role of the fungal protein. The use of several sources of data led us to develop a strict and replicable in silico methodology to detect molecular mimicry in pathosystems with enough information available. Furthermore, this is the first time that a genomewide search has been performed to detect molecular mimicry in a U. maydis-maize system. Additionally, to allow the reproducibility of this experiment and the use of this pipeline, we created a Web server called Molecular Mimicry Finder.[Formula: see text] Copyright © 2021 The Author(s). This is an open access article distributed under the CC BY-NC-ND 4.0 International license.
Collapse
Affiliation(s)
- Vinicio Armijos-Jaramillo
- Carrera de Ingeniería en Biotecnología, Facultad de Ingeniería y Ciencias Aplicadas, Universidad de Las Américas, Quito, Ecuador
- Grupo de Bio-Quimioinformática, Universidad de Las Américas, Quito, Ecuador
| | - Nicole Espinosa
- Carrera de Ingeniería en Biotecnología, Facultad de Ingeniería y Ciencias Aplicadas, Universidad de Las Américas, Quito, Ecuador
| | - Karla Vizcaíno
- Carrera de Ingeniería en Biotecnología, Facultad de Ingeniería y Ciencias Aplicadas, Universidad de Las Américas, Quito, Ecuador
| | - Daniela Santander-Gordón
- Carrera de Ingeniería en Biotecnología, Facultad de Ingeniería y Ciencias Aplicadas, Universidad de Las Américas, Quito, Ecuador
| |
Collapse
|
4
|
Zhou Y, Chen H, Li S, Chen M. mPPI: a database extension to visualize structural interactome in a one-to-many manner. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2021; 2021:6307707. [PMID: 34156447 DOI: 10.1093/database/baab036] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/26/2021] [Revised: 05/10/2021] [Accepted: 05/28/2021] [Indexed: 01/02/2023]
Abstract
Protein-protein interaction (PPI) databases with structural information are useful to investigate biological functions at both systematic and atomic levels. However, most existing PPI databases only curate binary interactome. From the perspective of the display and function of PPI, as well as the structural binding interface, the related database and resources are summarized. We developed a database extension, named mPPI, for PPI structural visualization. Comparing with the existing structural interactomes that curate resolved PPI conformation in pairs, mPPI can visualize target protein and its multiple interactors simultaneously, which facilitates multi-target drug discovery and structure prediction of protein macro-complexes. By employing a protein-protein docking algorithm, mPPI largely extends the coverage of structural interactome from experimentally resolved complexes. mPPI is designed to be a customizable and convenient plugin for PPI databases. It possesses wide potential applications for various PPI databases, and it has been used for a neurodegenerative disease-related PPI database as demonstration. Scripts and implementation guidelines of mPPI are documented at the database tool website. Database URL http://bis.zju.edu.cn/mppi/.
Collapse
Affiliation(s)
- Yekai Zhou
- Department of Bioinformatics, College of Life Sciences, Zhejiang University, Hangzhou 310058, China.,Department of Computer Science, The University of Hong Kong, Hong Kong 999077, China
| | - Hongjun Chen
- Department of Bioinformatics, College of Life Sciences, Zhejiang University, Hangzhou 310058, China
| | - Sida Li
- Department of Bioinformatics, College of Life Sciences, Zhejiang University, Hangzhou 310058, China
| | - Ming Chen
- Department of Bioinformatics, College of Life Sciences, Zhejiang University, Hangzhou 310058, China.,Bioinformatics Center, The First Affiliated Hospital, School of Medicine, Zhejiang University, Hangzhou 310058, China
| |
Collapse
|
5
|
Pathogen and Host-Pathogen Protein Interactions Provide a Key to Identify Novel Drug Targets. SYSTEMS MEDICINE 2021. [DOI: 10.1016/b978-0-12-801238-3.11607-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023] Open
|
6
|
Kaushik AC, Mehmood A, Dai X, Wei DQ. WeiBI (web-based platform): Enriching integrated interaction network with increased coverage and functional proteins from genome-wide experimental OMICS data. Sci Rep 2020; 10:5618. [PMID: 32221380 PMCID: PMC7101429 DOI: 10.1038/s41598-020-62508-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2019] [Accepted: 03/10/2020] [Indexed: 12/27/2022] Open
Abstract
Many molecular system biology approaches recognize various interactions and functional associations of proteins that occur in cellular processing. Further understanding of the characterization technique reveals noteworthy information. These types of known and predicted interactions, gained through multiple resources, are thought to be important for experimental data to satisfy comprehensive and quality needs. The current work proposes the “WeiBI (WeiBiologicalInteractions)” database that clarifies direct and indirect partnerships associated with biological interactions. This database contains information concerning protein’s functional partnerships and interactions along with their integration into a statistical model that can be computationally predicted for humans. This novel approach in WeiBI version 1.0 collects information using an improved algorithm by transferring interactions between more than 115570 entries, allowing statistical analysis with the automated background for the given inputs for functional enrichment. This approach also allows the input of an entity’s list from a database along with the visualization of subsets as an interaction network and successful performance of the enrichment analysis for a gene set. This wisely improved algorithm is user-friendly, and its accessibility and higher accuracy make it the best database for exploring interactions among genomes’ network and reflects the importance of this study. The proposed server “WeiBI” is accessible at http://weislab.com/WeiDOCK/?page=PKPD.
Collapse
Affiliation(s)
- Aman Chandra Kaushik
- Wuxi School of Medicine, Jiangnan University, Wuxi, China. .,State Key Laboratory of Microbial Metabolism and School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China.
| | - Aamir Mehmood
- State Key Laboratory of Microbial Metabolism and School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Xiaofeng Dai
- Wuxi School of Medicine, Jiangnan University, Wuxi, China
| | - Dong-Qing Wei
- State Key Laboratory of Microbial Metabolism and School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China.
| |
Collapse
|
7
|
Gupta SK, Srivastava M, Osmanoglu Ö, Dandekar T. Genome-wide inference of the Camponotus floridanus protein-protein interaction network using homologous mapping and interacting domain profile pairs. Sci Rep 2020; 10:2334. [PMID: 32047225 PMCID: PMC7012867 DOI: 10.1038/s41598-020-59344-1] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2019] [Accepted: 01/22/2020] [Indexed: 12/18/2022] Open
Abstract
Apart from some model organisms, the interactome of most organisms is largely unidentified. High-throughput experimental techniques to determine protein-protein interactions (PPIs) are resource intensive and highly susceptible to noise. Computational methods of PPI determination can accelerate biological discovery by identifying the most promising interacting pairs of proteins and by assessing the reliability of identified PPIs. Here we present a first in-depth study describing a global view of the ant Camponotus floridanus interactome. Although several ant genomes have been sequenced in the last eight years, studies exploring and investigating PPIs in ants are lacking. Our study attempts to fill this gap and the presented interactome will also serve as a template for determining PPIs in other ants in future. Our C. floridanus interactome covers 51,866 non-redundant PPIs among 6,274 proteins, including 20,544 interactions supported by domain-domain interactions (DDIs), 13,640 interactions supported by DDIs and subcellular localization, and 10,834 high confidence interactions mediated by 3,289 proteins. These interactions involve and cover 30.6% of the entire C. floridanus proteome.
Collapse
Affiliation(s)
- Shishir K Gupta
- Functional Genomics and Systems Biology Group, Department of Bioinformatics, Biocenter, Am Hubland, D-97074, Würzburg, Germany.,Department of Microbiology, Biocenter, Am Hubland, D-97074, Würzburg, Germany
| | - Mugdha Srivastava
- Functional Genomics and Systems Biology Group, Department of Bioinformatics, Biocenter, Am Hubland, D-97074, Würzburg, Germany
| | - Özge Osmanoglu
- Functional Genomics and Systems Biology Group, Department of Bioinformatics, Biocenter, Am Hubland, D-97074, Würzburg, Germany
| | - Thomas Dandekar
- Functional Genomics and Systems Biology Group, Department of Bioinformatics, Biocenter, Am Hubland, D-97074, Würzburg, Germany. .,EMBL Heidelberg, BioComputing Unit, Meyerhofstraße 1, 69117, Heidelberg, Germany.
| |
Collapse
|
8
|
Reyna MA, Haan D, Paczkowska M, Verbeke LPC, Vazquez M, Kahraman A, Pulido-Tamayo S, Barenboim J, Wadi L, Dhingra P, Shrestha R, Getz G, Lawrence MS, Pedersen JS, Rubin MA, Wheeler DA, Brunak S, Izarzugaza JMG, Khurana E, Marchal K, von Mering C, Sahinalp SC, Valencia A, Reimand J, Stuart JM, Raphael BJ. Pathway and network analysis of more than 2500 whole cancer genomes. Nat Commun 2020; 11:729. [PMID: 32024854 PMCID: PMC7002574 DOI: 10.1038/s41467-020-14367-0] [Citation(s) in RCA: 60] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2018] [Accepted: 12/18/2019] [Indexed: 12/14/2022] Open
Abstract
The catalog of cancer driver mutations in protein-coding genes has greatly expanded in the past decade. However, non-coding cancer driver mutations are less well-characterized and only a handful of recurrent non-coding mutations, most notably TERT promoter mutations, have been reported. Here, as part of the ICGC/TCGA Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium, which aggregated whole genome sequencing data from 2658 cancer across 38 tumor types, we perform multi-faceted pathway and network analyses of non-coding mutations across 2583 whole cancer genomes from 27 tumor types compiled by the ICGC/TCGA PCAWG project that was motivated by the success of pathway and network analyses in prioritizing rare mutations in protein-coding genes. While few non-coding genomic elements are recurrently mutated in this cohort, we identify 93 genes harboring non-coding mutations that cluster into several modules of interacting proteins. Among these are promoter mutations associated with reduced mRNA expression in TP53, TLE4, and TCF4. We find that biological processes had variable proportions of coding and non-coding mutations, with chromatin remodeling and proliferation pathways altered primarily by coding mutations, while developmental pathways, including Wnt and Notch, altered by both coding and non-coding mutations. RNA splicing is primarily altered by non-coding mutations in this cohort, and samples containing non-coding mutations in well-known RNA splicing factors exhibit similar gene expression signatures as samples with coding mutations in these genes. These analyses contribute a new repertoire of possible cancer genes and mechanisms that are altered by non-coding mutations and offer insights into additional cancer vulnerabilities that can be investigated for potential therapeutic treatments.
Collapse
Affiliation(s)
- Matthew A Reyna
- Department of Computer Science, Princeton University, Princeton, NJ, 08540, USA
- Department of Biomedical Informatics, Emory University, Atlanta, GA, 30322, USA
| | - David Haan
- Department of Biomolecular Engineering and UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, 95060, USA
| | - Marta Paczkowska
- Computational Biology Program, Ontario Institute for Cancer Research, Toronto, Ontario, Canada
| | - Lieven P C Verbeke
- Department of Information Technology, IDLab, Ghent University, IMEC, Ghent, the Netherlands
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, the Netherlands
| | - Miguel Vazquez
- Barcelona Supercomputing Center (BSC), Barcelona, 08034, Spain
- Department of Clinical and Molecular Medicine, Faculty of Medicine and Health Sciences, Norwegian University of Science and Technology, Trondheim, Norway
| | - Abdullah Kahraman
- Institute of Molecular Life Sciences and Swiss Institute of Bioinformatics, University of Zurich, CH-8057, Zurich, Switzerland
- Department of Pathology and Molecular Pathology, University Hospital Zurich, CH-8091, Zurich, Switzerland
| | - Sergio Pulido-Tamayo
- Department of Information Technology, IDLab, Ghent University, IMEC, Ghent, the Netherlands
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, the Netherlands
| | - Jonathan Barenboim
- Computational Biology Program, Ontario Institute for Cancer Research, Toronto, Ontario, Canada
| | - Lina Wadi
- Computational Biology Program, Ontario Institute for Cancer Research, Toronto, Ontario, Canada
| | - Priyanka Dhingra
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, 10065, USA
| | - Raunak Shrestha
- Vancouver Prostate Centre, 2660 Oak Street, Vancouver, BC, V6H 3Z6, Canada
| | - Gad Getz
- The Broad Institute of MIT and Harvard, Cambridge, MA, 02124, USA
- Massachusetts General Hospital Center for Cancer Research, Charlestown, MA, 02129, USA
- Harvard Medical School, 250 Longwood Avenue, Boston, MA, 02115, USA
- Massachusetts General Hospital, Department of Pathology, Boston, MA, 02114, USA
| | - Michael S Lawrence
- The Broad Institute of MIT and Harvard, Cambridge, MA, 02124, USA
- Massachusetts General Hospital Center for Cancer Research, Charlestown, MA, 02129, USA
| | - Jakob Skou Pedersen
- Department of Molecular Medicine (MOMA), Aarhus University Hospital, Aarhus, Denmark
- Bioinformatics Research Centre (BiRC), Aarhus University, Aarhus, Denmark
| | - Mark A Rubin
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, 10065, USA
| | - David A Wheeler
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Søren Brunak
- DTU Bioinformatics, Department of Bio and Health Informatics, Technical University of Denmark, Kemitorvet, 2800, Kongens Lyngby, Denmark
- Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, 2200, Copenhagen, Denmark
| | - Jose M G Izarzugaza
- DTU Bioinformatics, Department of Bio and Health Informatics, Technical University of Denmark, Kemitorvet, 2800, Kongens Lyngby, Denmark
- Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, 2200, Copenhagen, Denmark
| | - Ekta Khurana
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, 10065, USA
| | - Kathleen Marchal
- Department of Information Technology, IDLab, Ghent University, IMEC, Ghent, the Netherlands
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, the Netherlands
| | - Christian von Mering
- Institute of Molecular Life Sciences and Swiss Institute of Bioinformatics, University of Zurich, CH-8057, Zurich, Switzerland
| | - S Cenk Sahinalp
- Vancouver Prostate Centre, 2660 Oak Street, Vancouver, BC, V6H 3Z6, Canada
- Department of Computer Science, Indiana University, Bloomington, IN, 47405, USA
| | - Alfonso Valencia
- Barcelona Supercomputing Center (BSC), Barcelona, 08034, Spain
- ICREA, Barcelona, 08010, Spain
| | - Jüri Reimand
- Computational Biology Program, Ontario Institute for Cancer Research, Toronto, Ontario, Canada.
- Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada.
| | - Joshua M Stuart
- Department of Biomolecular Engineering and UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, 95060, USA.
| | - Benjamin J Raphael
- Department of Computer Science, Princeton University, Princeton, NJ, 08540, USA.
| |
Collapse
|
9
|
Abstract
Drugs modulate disease states through their actions on targets in the body. Determining these targets aids the focused development of new treatments, and helps to better characterize those already employed. One means of accomplishing this is through the deployment of in silico methodologies, harnessing computational analytical and predictive power to produce educated hypotheses for experimental verification. Here, we provide an overview of the current state of the art, describe some of the well-established methods in detail, and reflect on how they, and emerging technologies promoting the incorporation of complex and heterogeneous data-sets, can be employed to improve our understanding of (poly)pharmacology.
Collapse
Affiliation(s)
- Ryan Byrne
- Department of Chemistry and Applied Biosciences, Swiss Federal Institute of Technology (ETH), Zurich, Switzerland
| | - Gisbert Schneider
- Department of Chemistry and Applied Biosciences, Swiss Federal Institute of Technology (ETH), Zurich, Switzerland.
| |
Collapse
|
10
|
Bencurova E, Gupta SK, Oskoueian E, Bhide M, Dandekar T. Omics and bioinformatics applied to vaccine development against Borrelia. Mol Omics 2018; 14:330-340. [PMID: 30113617 DOI: 10.1039/c8mo00130h] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
Borrelia burgdorferi is an extracellular spirochete that causes Lyme disease. Currently, no effective vaccine is available for humans and animals except for dogs. In the present study, an extensive bioinformatics pipeline was established to predict new candidates that can be used for vaccine development including building the protein-protein interaction network based on orthologues of experimentally verified protein-protein interaction networks, elucidation of the proteins involved in the immune response, selection of the topologically-interesting proteins and their prioritization based on their antigenicity. Proteomic network analysis yielded an interactome network with 120 nodes with 97 interactions. Proteins were selected to obtain a subnet containing only the borrelial membrane proteins and immune-related host proteins. This strategy resulted in the selection of 15 borrelial targets, which were subjected to extensive bioinformatics analysis to predict their antigenic properties. Based on the strategy applied in this study the proteins encoded by erpX (ErpX proteins, UniProt ID: H7C7L6), erpL (ErpL protein, UniProt ID: H7C7M3) and erpY (ErpY protein, UniProt ID: Q9S0D9) are suggested as a novel set of vaccine targets to control Lyme disease. Moreover, five different tools were used to validate their antigenicity regarding B-cells. The combination of all these proteins in a vaccine should allow improved protection against Borrelia infection.
Collapse
Affiliation(s)
- Elena Bencurova
- Department of Bioinformatics, Biocenter, Am Hubland, D-97074 Wuerzburg, Germany.
| | | | | | | | | |
Collapse
|
11
|
Identification of Antifungal Targets Based on Computer Modeling. J Fungi (Basel) 2018; 4:jof4030081. [PMID: 29973534 PMCID: PMC6162656 DOI: 10.3390/jof4030081] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2018] [Revised: 06/24/2018] [Accepted: 06/29/2018] [Indexed: 01/07/2023] Open
Abstract
Aspergillus fumigatus is a saprophytic, cosmopolitan fungus that attacks patients with a weak immune system. A rational solution against fungal infection aims to manipulate fungal metabolism or to block enzymes essential for Aspergillus survival. Here we discuss and compare different bioinformatics approaches to analyze possible targeting strategies on fungal-unique pathways. For instance, phylogenetic analysis reveals fungal targets, while domain analysis allows us to spot minor differences in protein composition between the host and fungi. Moreover, protein networks between host and fungi can be systematically compared by looking at orthologs and exploiting information from host⁻pathogen interaction databases. Further data—such as knowledge of a three-dimensional structure, gene expression data, or information from calculated metabolic fluxes—refine the search and rapidly put a focus on the best targets for antimycotics. We analyzed several of the best targets for application to structure-based drug design. Finally, we discuss general advantages and limitations in identification of unique fungal pathways and protein targets when applying bioinformatics tools.
Collapse
|
12
|
Computational and Experimental Approaches to Predict Host-Parasite Protein-Protein Interactions. Methods Mol Biol 2018; 1819:153-173. [PMID: 30421403 DOI: 10.1007/978-1-4939-8618-7_7] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
In host-parasite systems, protein-protein interactions are key to allow the pathogen to enter the host and persist within the host. The study of host-parasite molecular communication improves the understanding the mechanisms of infection, evasion of the host immune system and tropism across different tissues. Current trends in parasitology focus on unraveling host-parasite protein-protein interactions to aid the development of new strategies to combat pathogenic parasites with better treatments and prevention mechanisms. Due to the complexity of capturing experimentally these interactions, computational approaches integrating data from different sources (mainly "omics" data) become key to complement or support experimental approaches. Here, we focus on the application of experimental and computational methods in the prediction of host-parasite interactions and highlight the potential of each of these methods in specific contexts.
Collapse
|
13
|
Ptushkina M, Poolman T, Iqbal M, Ashe M, Petersen J, Woodburn J, Rattray M, Whetton A, Ray D. A non-transcriptional role for the glucocorticoid receptor in mediating the cell stress response. Sci Rep 2017; 7:12101. [PMID: 28935859 PMCID: PMC5608759 DOI: 10.1038/s41598-017-09722-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2017] [Accepted: 07/17/2017] [Indexed: 11/25/2022] Open
Abstract
The glucocorticoid receptor (GR) is essential for the stress response in mammals. We investigated potential non-transcriptional roles of GR in cellular stress response using fission yeast as a model.We surprisingly discovered marked heat stress resistance in yeast ectopically expressing human GR, which required expression of both the N-terminal transactivation domain, and the C-terminal ligand binding domain, but not the DNA-binding domain of the GR. This effect was not affected by GR ligand exposure, and occurred without significant GR nuclear accumulation. Mechanistically, the GR survival effect required Hsp104, and, indeed, GR expression increased Hsp104 expression. Proteomic analysis revealed GR binding to translasome components, including eIF3, a known partner for Sty1, a pattern of protein interaction which we confirmed using yeast two-hybrid studies.Taken together, we find evidence for a novel pathway conferring stress resistance in yeast that can be activated by the human GR, acting by protein-protein mechanisms in the cytoplasm. This suggests that in organisms where GR is natively expressed, GR likely contributes to stress responses through non-transcriptional mechanisms in addition to its well-established transcriptional responses.
Collapse
Affiliation(s)
- Marina Ptushkina
- School of Medical Sciences, Faculty of Biology, Medicine, and Health, University of Manchester, Manchester, M13 9PT, UK.
- Manchester Academic Health Sciences Centre, Manchester, M13 9PT, UK.
| | - Toryn Poolman
- School of Medical Sciences, Faculty of Biology, Medicine, and Health, University of Manchester, Manchester, M13 9PT, UK
- Manchester Academic Health Sciences Centre, Manchester, M13 9PT, UK
| | - Mudassar Iqbal
- School of Medical Sciences, Faculty of Biology, Medicine, and Health, University of Manchester, Manchester, M13 9PT, UK
- Manchester Academic Health Sciences Centre, Manchester, M13 9PT, UK
| | - Mark Ashe
- School of Medical Sciences, Faculty of Biology, Medicine, and Health, University of Manchester, Manchester, M13 9PT, UK
- Manchester Academic Health Sciences Centre, Manchester, M13 9PT, UK
| | - Janni Petersen
- School of Health Science, Flinders University, South Australia Sturt Road 5042, GPO Box 2100, Adelaide, Australia
| | - Joanna Woodburn
- School of Medical Sciences, Faculty of Biology, Medicine, and Health, University of Manchester, Manchester, M13 9PT, UK
- Manchester Academic Health Sciences Centre, Manchester, M13 9PT, UK
| | - Magnus Rattray
- School of Medical Sciences, Faculty of Biology, Medicine, and Health, University of Manchester, Manchester, M13 9PT, UK
- Manchester Academic Health Sciences Centre, Manchester, M13 9PT, UK
| | - Anthony Whetton
- Division of Cancer, School of Medical Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, M13 9PT, UK
- Manchester Academic Health Sciences Centre, Manchester, M13 9PT, UK
| | - David Ray
- School of Medical Sciences, Faculty of Biology, Medicine, and Health, University of Manchester, Manchester, M13 9PT, UK.
- Manchester Academic Health Sciences Centre, Manchester, M13 9PT, UK.
- Department of Endocrinology, Manchester Royal Infirmary, Manchester, M13 9WL, UK.
| |
Collapse
|
14
|
Mondal SI, Mahmud Z, Elahi M, Akter A, Jewel NA, Muzahidul Islam M, Ferdous S, Kikuchi T. Study of intra-inter species protein-protein interactions for potential drug targets identification and subsequent drug design for Escherichia coli O104:H4 C277-11. In Silico Pharmacol 2017; 5:1. [PMID: 28401513 DOI: 10.1007/s40203-017-0021-5] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2017] [Accepted: 03/20/2017] [Indexed: 11/29/2022] Open
Abstract
Protein-protein interaction (PPI) and host-pathogen interactions (HPI) proteomic analysis has been successfully practiced for potential drug target identification in pathogenic infections. In this research, we attempted to identify new drug target based on PPI and HPI computation approaches and subsequently design new drug against devastating enterohemorrhagic Escherichia coli O104:H4 C277-11 (Broad), which causes life-threatening food borne disease outbreak in Germany and other countries in Europe in 2011. Our systematic in silico analysis on PPI and HPI of E. coli O104:H4 was able to identify bacterial D-galactose-binding periplasmic and UDP-N-acetylglucosamine 1-carboxyvinyltransferase as attractive candidates for new drug targets. Furthermore, computational three-dimensional structure modeling and subsequent molecular docking finally proposed [3-(5-Amino-7-Hydroxy-[1,2,3]Triazolo[4,5-D]Pyrimidin-2-Yl)-N-(3,5-Dichlorobenzyl)-Benzamide)] and (6-amino-2-[(1-naphthylmethyl)amino]-3,7-dihydro-8H-imidazo[4,5-g]quinazolin-8-one) as promising candidate drugs for further evaluation and development for E. coli O104:H4 mediated diseases. Identification of new drug target would be of great utility for humanity as the demand for designing new drugs to fight infections is increasing due to the developing resistance and side effects of current treatments. This research provided the basis for computer aided drug design which might be useful for new drug target identification and subsequent drug design for other infectious organisms.
Collapse
Affiliation(s)
- Shakhinur Islam Mondal
- Department of Genetic Engineering and Biotechnology, Shahjalal University of Science and Technology, Sylhet, 3114, Bangladesh. .,Division of Microbiology, Department of Infectious Diseases, Faculty of Medicine, University of Miyazaki, 5200 Kihara, Kiyotake, Miyazaki, 889-1692, Japan.
| | - Zabed Mahmud
- Department of Genetic Engineering and Biotechnology, Shahjalal University of Science and Technology, Sylhet, 3114, Bangladesh
| | - Montasir Elahi
- Department of Diagnosis, Prevention and Treatment of Dementia, Juntendo University Graduate School of Medicine, Bunkyō, Tokyo, Japan
| | - Arzuba Akter
- Division of Microbiology, Department of Infectious Diseases, Faculty of Medicine, University of Miyazaki, 5200 Kihara, Kiyotake, Miyazaki, 889-1692, Japan.,Department of Biochemistry and Molecular Biology, Shahjalal University of Science and Technology, Sylhet, 3114, Bangladesh
| | - Nurnabi Azad Jewel
- Department of Genetic Engineering and Biotechnology, Shahjalal University of Science and Technology, Sylhet, 3114, Bangladesh
| | - Md Muzahidul Islam
- Department of Genetic Engineering and Biotechnology, Shahjalal University of Science and Technology, Sylhet, 3114, Bangladesh
| | - Sabiha Ferdous
- Department of Genetic Engineering and Biotechnology, Shahjalal University of Science and Technology, Sylhet, 3114, Bangladesh
| | - Taisei Kikuchi
- Division of Parasitology, Faculty of Medicine, University of Miyazaki, Miyazaki, 889-1692, Japan
| |
Collapse
|
15
|
Kaltdorf M, Srivastava M, Gupta SK, Liang C, Binder J, Dietl AM, Meir Z, Haas H, Osherov N, Krappmann S, Dandekar T. Systematic Identification of Anti-Fungal Drug Targets by a Metabolic Network Approach. Front Mol Biosci 2016; 3:22. [PMID: 27379244 PMCID: PMC4911368 DOI: 10.3389/fmolb.2016.00022] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2015] [Accepted: 05/24/2016] [Indexed: 11/13/2022] Open
Abstract
New antimycotic drugs are challenging to find, as potential target proteins may have close human orthologs. We here focus on identifying metabolic targets that are critical for fungal growth and have minimal similarity to targets among human proteins. We compare and combine here: (I) direct metabolic network modeling using elementary mode analysis and flux estimates approximations using expression data, (II) targeting metabolic genes by transcriptome analysis of condition-specific highly expressed enzymes, and (III) analysis of enzyme structure, enzyme interconnectedness ("hubs"), and identification of pathogen-specific enzymes using orthology relations. We have identified 64 targets including metabolic enzymes involved in vitamin synthesis, lipid, and amino acid biosynthesis including 18 targets validated from the literature, two validated and five currently examined in own genetic experiments, and 38 further promising novel target proteins which are non-orthologous to human proteins, involved in metabolism and are highly ranked drug targets from these pipelines.
Collapse
Affiliation(s)
- Martin Kaltdorf
- Department of Bioinformatics, Biocenter, University of Würzburg Würzburg, Germany
| | - Mugdha Srivastava
- Department of Bioinformatics, Biocenter, University of Würzburg Würzburg, Germany
| | - Shishir K Gupta
- Department of Bioinformatics, Biocenter, University of Würzburg Würzburg, Germany
| | - Chunguang Liang
- Department of Bioinformatics, Biocenter, University of Würzburg Würzburg, Germany
| | - Jasmin Binder
- Microbiology Institute - Clinical Microbiology, Immunology and Hygiene, Friedrich-Alexander University Erlangen-Nürnberg, University Hospital of Erlangen Erlangen, Germany
| | - Anna-Maria Dietl
- Division of Molecular Biology/Biocenter, Medical University Innsbruck Innsbruck, Austria
| | - Zohar Meir
- Aspergillus and Antifungal Research Laboratory, Department of Clinical Microbiology and Immunology, Sackler School of Medicine, Tel Aviv University Tel-Aviv, Israel
| | - Hubertus Haas
- Division of Molecular Biology/Biocenter, Medical University Innsbruck Innsbruck, Austria
| | - Nir Osherov
- Aspergillus and Antifungal Research Laboratory, Department of Clinical Microbiology and Immunology, Sackler School of Medicine, Tel Aviv University Tel-Aviv, Israel
| | - Sven Krappmann
- Microbiology Institute - Clinical Microbiology, Immunology and Hygiene, Friedrich-Alexander University Erlangen-Nürnberg, University Hospital of Erlangen Erlangen, Germany
| | - Thomas Dandekar
- Department of Bioinformatics, Biocenter, University of Würzburg Würzburg, Germany
| |
Collapse
|
16
|
Keskin O, Tuncbag N, Gursoy A. Predicting Protein–Protein Interactions from the Molecular to the Proteome Level. Chem Rev 2016; 116:4884-909. [DOI: 10.1021/acs.chemrev.5b00683] [Citation(s) in RCA: 207] [Impact Index Per Article: 25.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Affiliation(s)
| | - Nurcan Tuncbag
- Graduate
School of Informatics, Department of Health Informatics, Middle East Technical University, 06800 Ankara, Turkey
| | | |
Collapse
|
17
|
Surfing the Protein-Protein Interaction Surface Using Docking Methods: Application to the Design of PPI Inhibitors. Molecules 2015; 20:11569-603. [PMID: 26111183 PMCID: PMC6272567 DOI: 10.3390/molecules200611569] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2015] [Revised: 06/02/2015] [Accepted: 06/15/2015] [Indexed: 02/06/2023] Open
Abstract
Blocking protein-protein interactions (PPI) using small molecules or peptides modulates biochemical pathways and has therapeutic significance. PPI inhibition for designing drug-like molecules is a new area that has been explored extensively during the last decade. Considering the number of available PPI inhibitor databases and the limited number of 3D structures available for proteins, docking and scoring methods play a major role in designing PPI inhibitors as well as stabilizers. Docking methods are used in the design of PPI inhibitors at several stages of finding a lead compound, including modeling the protein complex, screening for hot spots on the protein-protein interaction interface and screening small molecules or peptides that bind to the PPI interface. There are three major challenges to the use of docking on the relatively flat surfaces of PPI. In this review we will provide some examples of the use of docking in PPI inhibitor design as well as its limitations. The combination of experimental and docking methods with improved scoring function has thus far resulted in few success stories of PPI inhibitors for therapeutic purposes. Docking algorithms used for PPI are in the early stages, however, and as more data are available docking will become a highly promising area in the design of PPI inhibitors or stabilizers.
Collapse
|
18
|
Subramanian N, Torabi-Parizi P, Gottschalk RA, Germain RN, Dutta B. Network representations of immune system complexity. WILEY INTERDISCIPLINARY REVIEWS-SYSTEMS BIOLOGY AND MEDICINE 2015; 7:13-38. [PMID: 25625853 PMCID: PMC4339634 DOI: 10.1002/wsbm.1288] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/25/2014] [Revised: 12/09/2014] [Accepted: 12/11/2014] [Indexed: 12/25/2022]
Abstract
The mammalian immune system is a dynamic multiscale system composed of a hierarchically organized set of molecular, cellular, and organismal networks that act in concert to promote effective host defense. These networks range from those involving gene regulatory and protein–protein interactions underlying intracellular signaling pathways and single‐cell responses to increasingly complex networks of in vivo cellular interaction, positioning, and migration that determine the overall immune response of an organism. Immunity is thus not the product of simple signaling events but rather nonlinear behaviors arising from dynamic, feedback‐regulated interactions among many components. One of the major goals of systems immunology is to quantitatively measure these complex multiscale spatial and temporal interactions, permitting development of computational models that can be used to predict responses to perturbation. Recent technological advances permit collection of comprehensive datasets at multiple molecular and cellular levels, while advances in network biology support representation of the relationships of components at each level as physical or functional interaction networks. The latter facilitate effective visualization of patterns and recognition of emergent properties arising from the many interactions of genes, molecules, and cells of the immune system. We illustrate the power of integrating ‘omics’ and network modeling approaches for unbiased reconstruction of signaling and transcriptional networks with a focus on applications involving the innate immune system. We further discuss future possibilities for reconstruction of increasingly complex cellular‐ and organism‐level networks and development of sophisticated computational tools for prediction of emergent immune behavior arising from the concerted action of these networks. WIREs Syst Biol Med 2015, 7:13–38. doi: 10.1002/wsbm.1288 This article is categorized under:
Analytical and Computational Methods > Computational Methods Laboratory Methods and Technologies > Macromolecular Interactions, Methods
Collapse
Affiliation(s)
- Naeha Subramanian
- Institute for Systems Biology, Seattle, WA, USA; Laboratory of Systems Biology, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, USA
| | | | | | | | | |
Collapse
|
19
|
Szklarczyk D, Franceschini A, Wyder S, Forslund K, Heller D, Huerta-Cepas J, Simonovic M, Roth A, Santos A, Tsafou KP, Kuhn M, Bork P, Jensen LJ, von Mering C. STRING v10: protein-protein interaction networks, integrated over the tree of life. Nucleic Acids Res 2014; 43:D447-52. [PMID: 25352553 PMCID: PMC4383874 DOI: 10.1093/nar/gku1003] [Citation(s) in RCA: 7226] [Impact Index Per Article: 722.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
The many functional partnerships and interactions that occur between proteins are at the core of cellular processing and their systematic characterization helps to provide context in molecular systems biology. However, known and predicted interactions are scattered over multiple resources, and the available data exhibit notable differences in terms of quality and completeness. The STRING database (http://string-db.org) aims to provide a critical assessment and integration of protein–protein interactions, including direct (physical) as well as indirect (functional) associations. The new version 10.0 of STRING covers more than 2000 organisms, which has necessitated novel, scalable algorithms for transferring interaction information between organisms. For this purpose, we have introduced hierarchical and self-consistent orthology annotations for all interacting proteins, grouping the proteins into families at various levels of phylogenetic resolution. Further improvements in version 10.0 include a completely redesigned prediction pipeline for inferring protein–protein associations from co-expression data, an API interface for the R computing environment and improved statistical analysis for enrichment tests in user-provided networks.
Collapse
Affiliation(s)
- Damian Szklarczyk
- Institute of Molecular Life Sciences and Swiss Institute of Bioinformatics, University of Zurich, 8057 Zurich, Switzerland
| | - Andrea Franceschini
- Institute of Molecular Life Sciences and Swiss Institute of Bioinformatics, University of Zurich, 8057 Zurich, Switzerland
| | - Stefan Wyder
- Institute of Molecular Life Sciences and Swiss Institute of Bioinformatics, University of Zurich, 8057 Zurich, Switzerland
| | | | - Davide Heller
- Institute of Molecular Life Sciences and Swiss Institute of Bioinformatics, University of Zurich, 8057 Zurich, Switzerland
| | | | - Milan Simonovic
- Institute of Molecular Life Sciences and Swiss Institute of Bioinformatics, University of Zurich, 8057 Zurich, Switzerland
| | - Alexander Roth
- Institute of Molecular Life Sciences and Swiss Institute of Bioinformatics, University of Zurich, 8057 Zurich, Switzerland
| | - Alberto Santos
- Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, 2200 Copenhagen N, Denmark
| | - Kalliopi P Tsafou
- Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, 2200 Copenhagen N, Denmark
| | - Michael Kuhn
- Biotechnology Center, Technische Universität Dresden, 01062 Dresden, Germany Max Planck Institute of Molecular Cell Biology and Genetics, 01062 Dresden, Germany
| | - Peer Bork
- European Molecular Biology Laboratory, 69117 Heidelberg, Germany
| | - Lars J Jensen
- Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, 2200 Copenhagen N, Denmark
| | - Christian von Mering
- Institute of Molecular Life Sciences and Swiss Institute of Bioinformatics, University of Zurich, 8057 Zurich, Switzerland
| |
Collapse
|
20
|
Begum T, Ghosh TC. Elucidating the genotype-phenotype relationships and network perturbations of human shared and specific disease genes from an evolutionary perspective. Genome Biol Evol 2014; 6:2741-53. [PMID: 25287147 PMCID: PMC4224346 DOI: 10.1093/gbe/evu220] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
To date, numerous studies have been attempted to determine the extent of variation in evolutionary rates between human disease and nondisease (ND) genes. In our present study, we have considered human autosomal monogenic (Mendelian) disease genes, which were classified into two groups according to the number of phenotypic defects, that is, specific disease (SPD) gene (one gene: one defect) and shared disease (SHD) gene (one gene: multiple defects). Here, we have compared the evolutionary rates of these two groups of genes, that is, SPD genes and SHD genes with respect to ND genes. We observed that the average evolutionary rates are slow in SHD group, intermediate in SPD group, and fast in ND group. Group-to-group evolutionary rate differences remain statistically significant regardless of their gene expression levels and number of defects. We demonstrated that disease genes are under strong selective constraint if they emerge through edgetic perturbation or drug-induced perturbation of the interactome network, show tissue-restricted expression, and are involved in transmembrane transport. Among all the factors, our regression analyses interestingly suggest the independent effects of 1) drug-induced perturbation and 2) the interaction term of expression breadth and transmembrane transport on protein evolutionary rates. We reasoned that the drug-induced network disruption is a combination of several edgetic perturbations and, thus, has more severe effect on gene phenotypes.
Collapse
Affiliation(s)
- Tina Begum
- Bioinformatics Centre, Bose Institute, Kolkata, West Bengal, India
| | | |
Collapse
|
21
|
Palmeri A, Ferrè F, Helmer-Citterich M. Exploiting holistic approaches to model specificity in protein phosphorylation. Front Genet 2014; 5:315. [PMID: 25324856 PMCID: PMC4179730 DOI: 10.3389/fgene.2014.00315] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2014] [Accepted: 08/21/2014] [Indexed: 12/27/2022] Open
Abstract
Phosphate plays a chemically unique role in shaping cellular signaling of all current living systems, especially eukaryotes. Protein phosphorylation has been studied at several levels, from the near-site context, both in sequence and structure, to the crowded cellular environment, and ultimately to the systems-level perspective. Despite the tremendous advances in mass spectrometry and efforts dedicated to the development of ad hoc highly sophisticated methods, phosphorylation site inference and associated kinase identification are still unresolved problems in kinome biology. The sequence and structure of the substrate near-site context are not sufficient alone to model the in vivo phosphorylation rules, and they should be integrated with orthogonal information in all possible applications. Here we provide an overview of the different contexts that contribute to protein phosphorylation, discussing their potential impact in phosphorylation site annotation and in predicting kinase-substrate specificity.
Collapse
Affiliation(s)
- Antonio Palmeri
- Department of Biology, Centre for Molecular Bioinformatics, University of Rome Tor Vergata Rome, Italy
| | - Fabrizio Ferrè
- Department of Biology, Centre for Molecular Bioinformatics, University of Rome Tor Vergata Rome, Italy
| | - Manuela Helmer-Citterich
- Department of Biology, Centre for Molecular Bioinformatics, University of Rome Tor Vergata Rome, Italy
| |
Collapse
|
22
|
Galperin MY, Koonin EV. Comparative Genomics Approaches to Identifying Functionally Related Genes. ALGORITHMS FOR COMPUTATIONAL BIOLOGY 2014. [DOI: 10.1007/978-3-319-07953-0_1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
|
23
|
Waaijers S, Koorman T, Kerver J, Boxem M. Identification of human protein interaction domains using an ORFeome-based yeast two-hybrid fragment library. J Proteome Res 2013; 12:3181-92. [PMID: 23718855 DOI: 10.1021/pr400047p] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
Abstract
Physical interactions between proteins are essential for biological processes. Hence, there have been major efforts to elucidate the complete networks of protein-protein interactions, or "interactomes", of various organisms. Detailed descriptions of protein interaction networks should include information on the discrete domains that mediate these interactions, yet most large-scale efforts model interactions between whole proteins only. We previously developed a yeast two-hybrid-based strategy to systematically map interaction domains and generated a domain-based interactome network for 750 proteins involved in C. elegans early embryonic development. Here, we expand the concept of Y2H-based interaction domain mapping to the genome-wide level. We generated a human fragment library by randomly fragmenting the full-length open reading frames (ORFs) present in the human ORFeome collection. Screens using several proteins required for cell division or polarity establishment as baits demonstrate the ability to accurately identify interaction domains for human proteins using this approach, while the experimental quality of the Y2H data was independently verified in coaffinity purification assays. The library generation strategy can easily be adapted to generate libraries from full-length ORF collections of other organisms.
Collapse
Affiliation(s)
- Selma Waaijers
- Department of Biology, Utrecht University, Padualaan 8, 3584 CH Utrecht, The Netherlands
| | | | | | | |
Collapse
|
24
|
Abstract
UNLABELLED Protein interaction networks are important for the understanding of regulatory mechanisms, for the explanation of experimental data and for the prediction of protein functions. Unfortunately, most interaction data is available only for model organisms. As a possible remedy, the transfer of interactions to organisms of interest is common practice, but it is not clear when interactions can be transferred from one organism to another and, thus, the confidence in the derived interactions is low. Here, we propose to use a rich set of features to train Random Forests in order to score transferred interactions. We evaluated the transfer from a range of eukaryotic organisms to S. cerevisiae using orthologs. Directly transferred interactions to S. cerevisiae are on average only 24% consistent with the current S. cerevisiae interaction network. By using commonly applied filter approaches the transfer precision can be improved, but at the cost of a large decrease in the number of transferred interactions. Our Random Forest approach uses various features derived from both the target and the source network as well as the ortholog annotations to assign confidence values to transferred interactions. Thereby, we could increase the average transfer consistency to 85%, while still transferring almost 70% of all correctly transferable interactions. We tested our approach for the transfer of interactions to other species and showed that our approach outperforms competing methods for the transfer of interactions to species where no experimental knowledge is available. Finally, we applied our predictor to score transferred interactions to 83 targets species and we were able to extend the available interactome of B. taurus, M. musculus and G. gallus with over 40,000 interactions each. Our transferred interaction networks are publicly available via our web interface, which allows to inspect and download transferred interaction sets of different sizes, for various species, and at specified expected precision levels. AVAILABILITY http://services.bio.ifi.lmu.de/coin-db/.
Collapse
Affiliation(s)
- Robert Pesch
- Institute for Informatics, Ludwig-Maximilians-Universität München, Munich, Germany
- * E-mail:
| | - Ralf Zimmer
- Institute for Informatics, Ludwig-Maximilians-Universität München, Munich, Germany
| |
Collapse
|
25
|
Biophysical and computational fragment-based approaches to targeting protein-protein interactions: applications in structure-guided drug discovery. Q Rev Biophys 2012; 45:383-426. [PMID: 22971516 DOI: 10.1017/s0033583512000108] [Citation(s) in RCA: 70] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Drug discovery has classically targeted the active sites of enzymes or ligand-binding sites of receptors and ion channels. In an attempt to improve selectivity of drug candidates, modulation of protein-protein interfaces (PPIs) of multiprotein complexes that mediate conformation or colocation of components of cell-regulatory pathways has become a focus of interest. However, PPIs in multiprotein systems continue to pose significant challenges, as they are generally large, flat and poor in distinguishing features, making the design of small molecule antagonists a difficult task. Nevertheless, encouragement has come from the recognition that a few amino acids - so-called hotspots - may contribute the majority of interaction-free energy. The challenges posed by protein-protein interactions have led to a wellspring of creative approaches, including proteomimetics, stapled α-helical peptides and a plethora of antibody inspired molecular designs. Here, we review a more generic approach: fragment-based drug discovery. Fragments allow novel areas of chemical space to be explored more efficiently, but the initial hits have low affinity. This means that they will not normally disrupt PPIs, unless they are tethered, an approach that has been pioneered by Wells and co-workers. An alternative fragment-based approach is to stabilise the uncomplexed components of the multiprotein system in solution and employ conventional fragment-based screening. Here, we describe the current knowledge of the structures and properties of protein-protein interactions and the small molecules that can modulate them. We then describe the use of sensitive biophysical methods - nuclear magnetic resonance, X-ray crystallography, surface plasmon resonance, differential scanning fluorimetry or isothermal calorimetry - to screen and validate fragment binding. Fragment hits can subsequently be evolved into larger molecules with higher affinity and potency. These may provide new leads for drug candidates that target protein-protein interactions and have therapeutic value.
Collapse
|
26
|
Arnold R, Boonen K, Sun MG, Kim PM. Computational analysis of interactomes: current and future perspectives for bioinformatics approaches to model the host-pathogen interaction space. Methods 2012; 57:508-18. [PMID: 22750305 PMCID: PMC7128575 DOI: 10.1016/j.ymeth.2012.06.011] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2012] [Revised: 06/20/2012] [Accepted: 06/21/2012] [Indexed: 11/05/2022] Open
Abstract
Bacterial and viral pathogens affect their eukaryotic host partly by interacting with proteins of the host cell. Hence, to investigate infection from a systems' perspective we need to construct complete and accurate host-pathogen protein-protein interaction networks. Because of the paucity of available data and the cost associated with experimental approaches, any construction and analysis of such a network in the near future has to rely on computational predictions. Specifically, this challenge consists of a number of sub-problems: First, prediction of possible pathogen interactors (e.g. effector proteins) is necessary for bacteria and protozoa. Second, the prospective host binding partners have to be determined and finally, the impact on the host cell analyzed. This review gives an overview of current bioinformatics approaches to obtain and understand host-pathogen interactions. As an application example of the methods covered, we predict host-pathogen interactions of Salmonella and discuss the value of these predictions as a prospective for further research.
Collapse
Affiliation(s)
- Roland Arnold
- Terrence Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, ON, Canada M5S 3E1
| | - Kurt Boonen
- Terrence Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, ON, Canada M5S 3E1
| | - Mark G.F. Sun
- Terrence Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, ON, Canada M5S 3E1
| | - Philip M. Kim
- Terrence Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, ON, Canada M5S 3E1
- Banting and Best Department of Medical Research, University of Toronto, Toronto, ON, Canada M5S 3E1
- Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada M5S 3E1
- Department of Computer Science, University of Toronto, Toronto, ON, Canada M5S 3E1
| |
Collapse
|
27
|
Chen TW, Gan RRC, Wu TH, Lin WC, Tang P. VIP DB--a viral protein domain usage and distribution database. Genomics 2012; 100:149-56. [PMID: 22735743 DOI: 10.1016/j.ygeno.2012.06.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2012] [Revised: 06/13/2012] [Accepted: 06/15/2012] [Indexed: 11/19/2022]
Abstract
During the viral infection and replication processes, viral proteins are highly regulated and may interact with host proteins. However, the functions and interaction partners of many viral proteins have yet to be explored. Here, we compiled a VIral Protein domain DataBase (VIP DB) to associate viral proteins with putative functions and interaction partners. We systematically assign domains and infer the functions of proteins and their protein interaction partners from their domain annotations. A total of 2,322 unique domains that were identified from 2,404 viruses are used as a starting point to correlate GO classification, KEGG metabolic pathway annotation and domain-domain interactions. Of the unique domains, 42.7% have GO records, 39.6% have at least one domain-domain interaction record and 26.3% can also be found in either mammals or plants. This database provides a resource to help virologists identify potential roles for viral protein. All of the information is available at http://vipdb.cgu.edu.tw.
Collapse
Affiliation(s)
- Ting-Wen Chen
- Molecular Medicine Research Center, Chang Gung University, Taoyuan, Taiwan.
| | | | | | | | | |
Collapse
|
28
|
Kim Y, Min B, Yi GS. IDDI: integrated domain-domain interaction and protein interaction analysis system. Proteome Sci 2012; 10 Suppl 1:S9. [PMID: 22759586 PMCID: PMC3380739 DOI: 10.1186/1477-5956-10-s1-s9] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022] Open
Abstract
Background Deciphering protein-protein interaction (PPI) in domain level enriches valuable information about binding mechanism and functional role of interacting proteins. The 3D structures of complex proteins are reliable source of domain-domain interaction (DDI) but the number of proven structures is very limited. Several resources for the computationally predicted DDI have been generated but they are scattered in various places and their prediction show erratic performances. A well-organized PPI and DDI analysis system integrating these data with fair scoring system is necessary. Method We integrated three structure-based DDI datasets and twenty computationally predicted DDI datasets and constructed an interaction analysis system, named IDDI, which enables to browse protein and domain interactions with their relationships. To integrate heterogeneous DDI information, a novel scoring scheme is introduced to determine the reliability of DDI by considering the prediction scores of each DDI and the confidence levels of each prediction method in the datasets, and independencies between predicted datasets. In addition, we connected this DDI information to the comprehensive PPI information and developed a unified interface for the interaction analysis exploring interaction networks at both protein and domain level. Result IDDI provides 204,705 DDIs among total 7,351 Pfam domains in the current version. The result presents that total number of DDIs is increased eight times more than that of previous studies. Due to the increment of data, 50.4% of PPIs could be correlated with DDIs which is more than twice of previous resources. Newly designed scoring scheme outperformed the previous system in its accuracy too. User interface of IDDI system provides interactive investigation of proteins and domains in interactions with interconnected way. A specific example is presented to show the efficiency of the systems to acquire the comprehensive information of target protein with PPI and DDI relationships. IDDI is freely available at http://pcode.kaist.ac.kr/iddi/.
Collapse
Affiliation(s)
- Yul Kim
- Department of Bio and Brain Engineering, KAIST, Daejeon 305-701, South Korea
| | - Bumki Min
- Department of Bio and Brain Engineering, KAIST, Daejeon 305-701, South Korea
| | - Gwan-Su Yi
- Department of Bio and Brain Engineering, KAIST, Daejeon 305-701, South Korea
| |
Collapse
|
29
|
Kufareva I, Ilatovskiy AV, Abagyan R. Pocketome: an encyclopedia of small-molecule binding sites in 4D. Nucleic Acids Res 2012; 40:D535-40. [PMID: 22080553 PMCID: PMC3245087 DOI: 10.1093/nar/gkr825] [Citation(s) in RCA: 116] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2011] [Accepted: 09/18/2011] [Indexed: 11/22/2022] Open
Abstract
The importance of binding site plasticity in protein-ligand interactions is well-recognized, and so are the difficulties in predicting the nature and the degree of this plasticity by computational means. To assist in understanding the flexible protein-ligand interactions, we constructed the Pocketome, an encyclopedia of about one thousand experimentally solved conformational ensembles of druggable binding sites in proteins, grouped by location and consistent chain/cofactor composition. The multiplicity of pockets within the ensembles adds an extra, fourth dimension to the Pocketome entry data. Within each ensemble, the pockets were carefully classified by the degree of their pairwise similarity and compatibility with different ligands. The core of the Pocketome is derived regularly and automatically from the current releases of the Protein Data Bank and the Uniprot Knowledgebase; this core is complemented by entries built from manually provided seed ligand locations. The Pocketome website (www.pocketome.org) allows searching for the sites of interest, analysis of conformational clusters, important residues, binding compatibility matrices and interactive visualization of the ensembles using the ActiveICM web browser plugin. The Pocketome collection can be used to build multi-conformational docking and 3D activity models as well as to design cross-docking and virtual ligand screening benchmarks.
Collapse
Affiliation(s)
- Irina Kufareva
- UCSD Skaggs School of Pharmacy and Pharmaceutical Sciences, La Jolla, CA, 92093, USA, Division of Molecular and Radiation Biophysics, Petersburg Nuclear Physics Institute, Russian Academy of Sciences, Gatchina, 188300 and Research and Education Center ‘Biophysics’, PNPI RAS and St. Petersburg State Polytechnical University, St. Petersburg, 194064, Russia
| | - Andrey V. Ilatovskiy
- UCSD Skaggs School of Pharmacy and Pharmaceutical Sciences, La Jolla, CA, 92093, USA, Division of Molecular and Radiation Biophysics, Petersburg Nuclear Physics Institute, Russian Academy of Sciences, Gatchina, 188300 and Research and Education Center ‘Biophysics’, PNPI RAS and St. Petersburg State Polytechnical University, St. Petersburg, 194064, Russia
| | - Ruben Abagyan
- UCSD Skaggs School of Pharmacy and Pharmaceutical Sciences, La Jolla, CA, 92093, USA, Division of Molecular and Radiation Biophysics, Petersburg Nuclear Physics Institute, Russian Academy of Sciences, Gatchina, 188300 and Research and Education Center ‘Biophysics’, PNPI RAS and St. Petersburg State Polytechnical University, St. Petersburg, 194064, Russia
| |
Collapse
|
30
|
Lees J, Yeats C, Perkins J, Sillitoe I, Rentzsch R, Dessailly BH, Orengo C. Gene3D: a domain-based resource for comparative genomics, functional annotation and protein network analysis. Nucleic Acids Res 2011; 40:D465-71. [PMID: 22139938 PMCID: PMC3245158 DOI: 10.1093/nar/gkr1181] [Citation(s) in RCA: 67] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
Gene3D http://gene3d.biochem.ucl.ac.uk is a comprehensive database of protein domain assignments for sequences from the major sequence databases. Domains are directly mapped from structures in the CATH database or predicted using a library of representative profile HMMs derived from CATH superfamilies. As previously described, Gene3D integrates many other protein family and function databases. These facilitate complex associations of molecular function, structure and evolution. Gene3D now includes a domain functional family (FunFam) level below the homologous superfamily level assignments. Additions have also been made to the interaction data. More significantly, to help with the visualization and interpretation of multi-genome scale data sets, we have developed a new, revamped website. Searching has been simplified with more sophisticated filtering of results, along with new tools based on Cytoscape Web, for visualizing protein–protein interaction networks, differences in domain composition between genomes and the taxonomic distribution of individual superfamilies.
Collapse
Affiliation(s)
- Jonathan Lees
- Institute of Structural and Molecular Biology, University College London, Darwin Building, Gower St, London WC1E 6BT, UK.
| | | | | | | | | | | | | |
Collapse
|
31
|
Lees JG, Heriche JK, Morilla I, Ranea JA, Orengo CA. Systematic computational prediction of protein interaction networks. Phys Biol 2011; 8:035008. [PMID: 21572181 DOI: 10.1088/1478-3975/8/3/035008] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
Determining the network of physical protein associations is an important first step in developing mechanistic evidence for elucidating biological pathways. Despite rapid advances in the field of high throughput experiments to determine protein interactions, the majority of associations remain unknown. Here we describe computational methods for significantly expanding protein association networks. We describe methods for integrating multiple independent sources of evidence to obtain higher quality predictions and we compare the major publicly available resources available for experimentalists to use.
Collapse
Affiliation(s)
- J G Lees
- Research Department of Structural & Molecular Biology, University College London, London, UK.
| | | | | | | | | |
Collapse
|
32
|
Galperin MY, Cochrane GR. The 2011 Nucleic Acids Research Database Issue and the online Molecular Biology Database Collection. Nucleic Acids Res 2011; 39:D1-6. [PMID: 21177655 PMCID: PMC3013748 DOI: 10.1093/nar/gkq1243] [Citation(s) in RCA: 58] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open
Abstract
The current 18th Database Issue of Nucleic Acids Research features descriptions of 96 new and 83 updated online databases covering various areas of molecular biology. It includes two editorials, one that discusses COMBREX, a new exciting project aimed at figuring out the functions of the ‘conserved hypothetical’ proteins, and one concerning BioDBcore, a proposed description of the ‘minimal information about a biological database’. Papers from the members of the International Nucleotide Sequence Database collaboration (INSDC) describe each of the participating databases, DDBJ, ENA and GenBank, principles of data exchange within the collaboration, and the recently established Sequence Read Archive. A testament to the longevity of databases, this issue includes updates on the RNA modification database, Definition of Secondary Structure of Proteins (DSSP) and Homology-derived Secondary Structure of Proteins (HSSP) databases, which have not been featured here in >12 years. There is also a block of papers describing recent progress in protein structure databases, such as Protein DataBank (PDB), PDB in Europe (PDBe), CATH, SUPERFAMILY and others, as well as databases on protein structure modeling, protein–protein interactions and the organization of inter-protein contact sites. Other highlights include updates of the popular gene expression databases, GEO and ArrayExpress, several cancer gene databases and a detailed description of the UK PubMed Central project. The Nucleic Acids Research online Database Collection, available at: http://www.oxfordjournals.org/nar/database/a/, now lists 1330 carefully selected molecular biology databases. The full content of the Database Issue is freely available online at the Nucleic Acids Research web site (http://nar.oxfordjournals.org/).
Collapse
Affiliation(s)
- Michael Y Galperin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.
| | | |
Collapse
|