1
|
Das S, Scholes HM, Sen N, Orengo C. CATH functional families predict functional sites in proteins. Bioinformatics 2021; 37:1099-1106. [PMID: 33135053 PMCID: PMC8150129 DOI: 10.1093/bioinformatics/btaa937] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2020] [Revised: 09/30/2020] [Accepted: 10/27/2020] [Indexed: 01/12/2023] Open
Abstract
MOTIVATION Identification of functional sites in proteins is essential for functional characterization, variant interpretation and drug design. Several methods are available for predicting either a generic functional site, or specific types of functional site. Here, we present FunSite, a machine learning predictor that identifies catalytic, ligand-binding and protein-protein interaction functional sites using features derived from protein sequence and structure, and evolutionary data from CATH functional families (FunFams). RESULTS FunSite's prediction performance was rigorously benchmarked using cross-validation and a holdout dataset. FunSite outperformed other publicly available functional site prediction methods. We show that conserved residues in FunFams are enriched in functional sites. We found FunSite's performance depends greatly on the quality of functional site annotations and the information content of FunFams in the training data. Finally, we analyze which structural and evolutionary features are most predictive for functional sites. AVAILABILITYAND IMPLEMENTATION https://github.com/UCL/cath-funsite-predictor. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Sayoni Das
- PrecisionLife Ltd., Long Hanborough, OX29 8LJ Oxford, UK
| | - Harry M Scholes
- Institute of Structural and Molecular Biology, University College London, WC1E 6BT, London, UK
| | - Neeladri Sen
- Institute of Structural and Molecular Biology, University College London, WC1E 6BT, London, UK
| | - Christine Orengo
- Institute of Structural and Molecular Biology, University College London, WC1E 6BT, London, UK
| |
Collapse
|
2
|
Campbell C, Albert R. Edgetic perturbations to eliminate fixed-point attractors in Boolean regulatory networks. CHAOS (WOODBURY, N.Y.) 2019; 29:023130. [PMID: 30823730 DOI: 10.1063/1.5083060] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/27/2018] [Accepted: 01/29/2019] [Indexed: 06/09/2023]
Abstract
The dynamics of complex biological networks may be modeled in a Boolean framework, where the state of each system component is either abundant (ON) or scarce/absent (OFF), and each component's dynamic trajectory is determined by a logical update rule involving the state(s) of its regulator(s). It is possible to encode the update rules in the topology of the so-called expanded graph, analysis of which reveals the long-term behavior, or attractors, of the network. Here, we develop an algorithm to perturb the expanded graph (or, equivalently, the logical update rules) to eliminate stable motifs: subgraphs that cause a subset of components to stabilize to one state. Depending on the topology of the expanded graph, these perturbations lead to the modification or loss of the corresponding attractor. While most perturbations of biological regulatory networks in the literature involve the knockout (fixing to OFF) or constitutive activation (fixing to ON) of one or more nodes, we here consider edgetic perturbations, where a node's update rule is modified such that one or more of its regulators is viewed as ON or OFF regardless of its actual state. We apply the methodology to two biological networks. In a network representing T-LGL leukemia, we identify edgetic perturbations that eliminate the cancerous attractor, leaving only the healthy attractor representing cell death. In a network representing drought-induced closure of plant stomata, we identify edgetic perturbations that modify the single attractor such that stomata, instead of being fixed in the closed state, oscillates between the open and closed states.
Collapse
Affiliation(s)
- Colin Campbell
- Department of Physics, Washington College, Chestertown, Maryland 21620, USA
| | - Réka Albert
- Department of Physics, Pennsylvania State University, University Park, Pennsylvania 16802, USA
| |
Collapse
|
3
|
Elongation factor Tu is a multifunctional and processed moonlighting protein. Sci Rep 2017; 7:11227. [PMID: 28894125 PMCID: PMC5593925 DOI: 10.1038/s41598-017-10644-z] [Citation(s) in RCA: 62] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2017] [Accepted: 08/10/2017] [Indexed: 01/10/2023] Open
Abstract
Many bacterial moonlighting proteins were originally described in medically, agriculturally, and commercially important members of the low G + C Firmicutes. We show Elongation factor Tu (Ef-Tu) moonlights on the surface of the human pathogens Staphylococcus aureus (SaEf-Tu) and Mycoplasma pneumoniae (MpnEf-Tu), and the porcine pathogen Mycoplasma hyopneumoniae (MhpEf-Tu). Ef-Tu is also a target of multiple processing events on the cell surface and these were characterised using an N-terminomics pipeline. Recombinant MpnEf-Tu bound strongly to a diverse range of host molecules, and when bound to plasminogen, was able to convert plasminogen to plasmin in the presence of plasminogen activators. Fragments of Ef-Tu retain binding capabilities to host proteins. Bioinformatics and structural modelling studies indicate that the accumulation of positively charged amino acids in short linear motifs (SLiMs), and protein processing promote multifunctional behaviour. Codon bias engendered by an A + T rich genome may influence how positively-charged residues accumulate in SLiMs.
Collapse
|
4
|
Exploring Protein-Protein Interactions as Drug Targets for Anti-cancer Therapy with In Silico Workflows. Methods Mol Biol 2017; 1647:221-236. [PMID: 28809006 DOI: 10.1007/978-1-4939-7201-2_15] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
Abstract
We describe a computational protocol to aid the design of small molecule and peptide drugs that target protein-protein interactions, particularly for anti-cancer therapy. To achieve this goal, we explore multiple strategies, including finding binding hot spots, incorporating chemical similarity and bioactivity data, and sampling similar binding sites from homologous protein complexes. We demonstrate how to combine existing interdisciplinary resources with examples of semi-automated workflows. Finally, we discuss several major problems, including the occurrence of drug-resistant mutations, drug promiscuity, and the design of dual-effect inhibitors.
Collapse
|
5
|
Scott DE, Bayly AR, Abell C, Skidmore J. Small molecules, big targets: drug discovery faces the protein–protein interaction challenge. Nat Rev Drug Discov 2016; 15:533-50. [DOI: 10.1038/nrd.2016.29] [Citation(s) in RCA: 625] [Impact Index Per Article: 78.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
|
6
|
Kuang X, Dhroso A, Han JG, Shyu CR, Korkin D. DOMMINO 2.0: integrating structurally resolved protein-, RNA-, and DNA-mediated macromolecular interactions. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2016; 2016:bav114. [PMID: 26827237 PMCID: PMC4733329 DOI: 10.1093/database/bav114] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/01/2015] [Accepted: 11/16/2015] [Indexed: 11/14/2022]
Abstract
Macromolecular interactions are formed between proteins, DNA and RNA molecules. Being a principle building block in macromolecular assemblies and pathways, the interactions underlie most of cellular functions. Malfunctioning of macromolecular interactions is also linked to a number of diseases. Structural knowledge of the macromolecular interaction allows one to understand the interaction's mechanism, determine its functional implications and characterize the effects of genetic variations, such as single nucleotide polymorphisms, on the interaction. Unfortunately, until now the interactions mediated by different types of macromolecules, e.g. protein-protein interactions or protein-DNA interactions, are collected into individual and unrelated structural databases. This presents a significant obstacle in the analysis of macromolecular interactions. For instance, the homogeneous structural interaction databases prevent scientists from studying structural interactions of different types but occurring in the same macromolecular complex. Here, we introduce DOMMINO 2.0, a structural Database Of Macro-Molecular INteractiOns. Compared to DOMMINO 1.0, a comprehensive database on protein-protein interactions, DOMMINO 2.0 includes the interactions between all three basic types of macromolecules extracted from PDB files. DOMMINO 2.0 is automatically updated on a weekly basis. It currently includes ∼1,040,000 interactions between two polypeptide subunits (e.g. domains, peptides, termini and interdomain linkers), ∼43,000 RNA-mediated interactions, and ∼12,000 DNA-mediated interactions. All protein structures in the database are annotated using SCOP and SUPERFAMILY family annotation. As a result, protein-mediated interactions involving protein domains, interdomain linkers, C- and N- termini, and peptides are identified. Our database provides an intuitive web interface, allowing one to investigate interactions at three different resolution levels: whole subunit network, binary interaction and interaction interface. Database URL: http://dommino.org.
Collapse
Affiliation(s)
- Xingyan Kuang
- Informatics Institute, University of Missouri, Columbia, MO, USA
| | - Andi Dhroso
- Department of Computer Science and Bioinformatics and Computational Biology Program, Worcester Polytechnic Institute, Worcester, MA, USA
| | - Jing Ginger Han
- Informatics Institute, University of Missouri, Columbia, MO, USA
| | - Chi-Ren Shyu
- Informatics Institute, University of Missouri, Columbia, MO, USA, Department of Electrical and Computer Engineering, Department of Computer Science, University of Missouri, Columbia, MO, USA
| | - Dmitry Korkin
- Department of Computer Science and Bioinformatics and Computational Biology Program, Worcester Polytechnic Institute, Worcester, MA, USA,
| |
Collapse
|
7
|
Composition of Overlapping Protein-Protein and Protein-Ligand Interfaces. PLoS One 2015; 10:e0140965. [PMID: 26517868 PMCID: PMC4627654 DOI: 10.1371/journal.pone.0140965] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2015] [Accepted: 10/03/2015] [Indexed: 11/19/2022] Open
Abstract
Protein-protein interactions (PPIs) play a major role in many biological processes and they represent an important class of targets for therapeutic intervention. However, targeting PPIs is challenging because often no convenient natural substrates are available as starting point for small-molecule design. Here, we explored the characteristics of protein interfaces in five non-redundant datasets of 174 protein-protein (PP) complexes, and 161 protein-ligand (PL) complexes from the ABC database, 436 PP complexes, and 196 PL complexes from the PIBASE database and a dataset of 89 PL complexes from the Timbal database. In all cases, the small molecule ligands must bind at the respective PP interface. We observed similar amino acid frequencies in all three datasets. Remarkably, also the characteristics of PP contacts and overlapping PL contacts are highly similar.
Collapse
|
8
|
Sikosek T, Chan HS. Biophysics of protein evolution and evolutionary protein biophysics. J R Soc Interface 2015; 11:20140419. [PMID: 25165599 DOI: 10.1098/rsif.2014.0419] [Citation(s) in RCA: 150] [Impact Index Per Article: 16.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
The study of molecular evolution at the level of protein-coding genes often entails comparing large datasets of sequences to infer their evolutionary relationships. Despite the importance of a protein's structure and conformational dynamics to its function and thus its fitness, common phylogenetic methods embody minimal biophysical knowledge of proteins. To underscore the biophysical constraints on natural selection, we survey effects of protein mutations, highlighting the physical basis for marginal stability of natural globular proteins and how requirement for kinetic stability and avoidance of misfolding and misinteractions might have affected protein evolution. The biophysical underpinnings of these effects have been addressed by models with an explicit coarse-grained spatial representation of the polypeptide chain. Sequence-structure mappings based on such models are powerful conceptual tools that rationalize mutational robustness, evolvability, epistasis, promiscuous function performed by 'hidden' conformational states, resolution of adaptive conflicts and conformational switches in the evolution from one protein fold to another. Recently, protein biophysics has been applied to derive more accurate evolutionary accounts of sequence data. Methods have also been developed to exploit sequence-based evolutionary information to predict biophysical behaviours of proteins. The success of these approaches demonstrates a deep synergy between the fields of protein biophysics and protein evolution.
Collapse
Affiliation(s)
- Tobias Sikosek
- Department of Biochemistry, University of Toronto, Toronto, Ontario, Canada M5S 1A8 Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada M5S 1A8 Department of Physics, University of Toronto, Toronto, Ontario, Canada M5S 1A8
| | - Hue Sun Chan
- Department of Biochemistry, University of Toronto, Toronto, Ontario, Canada M5S 1A8 Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada M5S 1A8 Department of Physics, University of Toronto, Toronto, Ontario, Canada M5S 1A8
| |
Collapse
|
9
|
Kruger FA, Gaulton A, Nowotka M, Overington JP. PPDMs-a resource for mapping small molecule bioactivities from ChEMBL to Pfam-A protein domains. Bioinformatics 2014; 31:776-8. [PMID: 25348214 PMCID: PMC4341065 DOI: 10.1093/bioinformatics/btu711] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open
Abstract
Summary: PPDMs is a resource that maps small molecule bioactivities to protein domains from the Pfam-A collection of protein families. Small molecule bioactivities mapped to protein domains add important precision to approaches that use protein sequence searches alignments to assist applications in computational drug discovery and systems and chemical biology. We have previously proposed a mapping heuristic for a subset of bioactivities stored in ChEMBL with the Pfam-A domain most likely to mediate small molecule binding. We have since refined this mapping using a manual procedure. Here, we present a resource that provides up-to-date mappings and the possibility to review assigned mappings as well as to participate in their assignment and curation. We also describe how mappings provided through the PPDMs resource are made accessible through the main schema of the ChEMBL database. Availability and implementation: The PPDMs resource and curation interface is available at https://www.ebi.ac.uk/chembl/research/ppdms/pfam_maps. The source-code for PPDMs is available under the Apache license at https://github.com/chembl/pfam_maps. Source code is available at https://github.com/chembl/pfam_map_loader to demonstrate the integration process with the main schema of ChEMBL. Contact:jpo@ebi.ac.uk
Collapse
Affiliation(s)
- Felix A Kruger
- ChEMBL group, EMBL-EBI, Wellcome Trust Genome Campus, Hinxton, UK
| | - Anna Gaulton
- ChEMBL group, EMBL-EBI, Wellcome Trust Genome Campus, Hinxton, UK
| | - Michal Nowotka
- ChEMBL group, EMBL-EBI, Wellcome Trust Genome Campus, Hinxton, UK
| | | |
Collapse
|
10
|
Sudha G, Nussinov R, Srinivasan N. An overview of recent advances in structural bioinformatics of protein-protein interactions and a guide to their principles. PROGRESS IN BIOPHYSICS AND MOLECULAR BIOLOGY 2014; 116:141-50. [PMID: 25077409 DOI: 10.1016/j.pbiomolbio.2014.07.004] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/16/2014] [Accepted: 07/13/2014] [Indexed: 12/20/2022]
Abstract
Rich data bearing on the structural and evolutionary principles of protein-protein interactions are paving the way to a better understanding of the regulation of function in the cell. This is particularly the case when these interactions are considered in the framework of key pathways. Knowledge of the interactions may provide insights into the mechanisms of crucial 'driver' mutations in oncogenesis. They also provide the foundation toward the design of protein-protein interfaces and inhibitors that can abrogate their formation or enhance them. The main features to learn from known 3-D structures of protein-protein complexes and the extensive literature which analyzes them computationally and experimentally include the interaction details which permit undertaking structure-based drug discovery, the evolution of complexes and their interactions, the consequences of alterations such as post-translational modifications, ligand binding, disease causing mutations, host pathogen interactions, oligomerization, aggregation and the roles of disorder, dynamics, allostery and more to the protein and the cell. This review highlights some of the recent advances in these areas, including design, inhibition and prediction of protein-protein complexes. The field is broad, and much work has been carried out in these areas, making it challenging to cover it in its entirety. Much of this is due to the fast increase in the number of molecules whose structures have been determined experimentally and the vast increase in computational power. Here we provide a concise overview.
Collapse
Affiliation(s)
- Govindarajan Sudha
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore 560012, India.
| | - Ruth Nussinov
- Cancer and Inflammation Program, Frederick National Laboratory for Cancer Research, Leidos Biomedical Research, Inc., National Cancer Institute, Frederick, MD 21702, USA; Sackler Institute of Molecular Medicine, Department of Human Genetics and Molecular Medicine, Sackler School of Medicine, Tel Aviv University, Tel Aviv 69978, Israel.
| | | |
Collapse
|
11
|
Villoutreix BO, Kuenemann MA, Poyet JL, Bruzzoni-Giovanelli H, Labbé C, Lagorce D, Sperandio O, Miteva MA. Drug-Like Protein-Protein Interaction Modulators: Challenges and Opportunities for Drug Discovery and Chemical Biology. Mol Inform 2014; 33:414-437. [PMID: 25254076 PMCID: PMC4160817 DOI: 10.1002/minf.201400040] [Citation(s) in RCA: 84] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2014] [Accepted: 04/21/2014] [Indexed: 12/13/2022]
Abstract
[Formula: see text] Fundamental processes in living cells are largely controlled by macromolecular interactions and among them, protein-protein interactions (PPIs) have a critical role while their dysregulations can contribute to the pathogenesis of numerous diseases. Although PPIs were considered as attractive pharmaceutical targets already some years ago, they have been thus far largely unexploited for therapeutic interventions with low molecular weight compounds. Several limiting factors, from technological hurdles to conceptual barriers, are known, which, taken together, explain why research in this area has been relatively slow. However, this last decade, the scientific community has challenged the dogma and became more enthusiastic about the modulation of PPIs with small drug-like molecules. In fact, several success stories were reported both, at the preclinical and clinical stages. In this review article, written for the 2014 International Summer School in Chemoinformatics (Strasbourg, France), we discuss in silico tools (essentially post 2012) and databases that can assist the design of low molecular weight PPI modulators (these tools can be found at www.vls3d.com). We first introduce the field of protein-protein interaction research, discuss key challenges and comment recently reported in silico packages, protocols and databases dedicated to PPIs. Then, we illustrate how in silico methods can be used and combined with experimental work to identify PPI modulators.
Collapse
Affiliation(s)
- Bruno O Villoutreix
- Université Paris Diderot, Sorbonne Paris Cité, UMRS 973 InsermParis 75013, France
- Inserm, U973Paris 75013, France
- CDithem, Faculté de Pharmacie, 1 rue du Prof Laguesse59000 Lille, France
| | - Melaine A Kuenemann
- Université Paris Diderot, Sorbonne Paris Cité, UMRS 973 InsermParis 75013, France
- Inserm, U973Paris 75013, France
| | - Jean-Luc Poyet
- Université Paris Diderot, Sorbonne Paris Cité, UMRS 973 InsermParis 75013, France
- Inserm, U973Paris 75013, France
- IUH, Hôpital Saint-LouisParis, France
- CDithem, Faculté de Pharmacie, 1 rue du Prof Laguesse59000 Lille, France
| | - Heriberto Bruzzoni-Giovanelli
- Université Paris Diderot, Sorbonne Paris Cité, UMRS 973 InsermParis 75013, France
- Inserm, U973Paris 75013, France
- CIC, Clinical investigation center, Hôpital Saint-LouisParis, France
| | - Céline Labbé
- Université Paris Diderot, Sorbonne Paris Cité, UMRS 973 InsermParis 75013, France
- Inserm, U973Paris 75013, France
| | - David Lagorce
- Université Paris Diderot, Sorbonne Paris Cité, UMRS 973 InsermParis 75013, France
- Inserm, U973Paris 75013, France
| | - Olivier Sperandio
- Université Paris Diderot, Sorbonne Paris Cité, UMRS 973 InsermParis 75013, France
- Inserm, U973Paris 75013, France
- CDithem, Faculté de Pharmacie, 1 rue du Prof Laguesse59000 Lille, France
| | - Maria A Miteva
- Université Paris Diderot, Sorbonne Paris Cité, UMRS 973 InsermParis 75013, France
- Inserm, U973Paris 75013, France
| |
Collapse
|
12
|
Finn RD, Miller BL, Clements J, Bateman A. iPfam: a database of protein family and domain interactions found in the Protein Data Bank. Nucleic Acids Res 2013; 42:D364-73. [PMID: 24297255 PMCID: PMC3965099 DOI: 10.1093/nar/gkt1210] [Citation(s) in RCA: 117] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
The database iPfam, available at http://ipfam.org, catalogues Pfam domain interactions based on known 3D structures that are found in the Protein Data Bank, providing interaction data at the molecular level. Previously, the iPfam domain–domain interaction data was integrated within the Pfam database and website, but it has now been migrated to a separate database. This allows for independent development, improving data access and giving clearer separation between the protein family and interactions datasets. In addition to domain–domain interactions, iPfam has been expanded to include interaction data for domain bound small molecule ligands. Functional annotations are provided from source databases, supplemented by the incorporation of Wikipedia articles where available. iPfam (version 1.0) contains >9500 domain–domain and 15 500 domain–ligand interactions. The new website provides access to this data in a variety of ways, including interactive visualizations of the interaction data.
Collapse
Affiliation(s)
- Robert D Finn
- HHMI Janelia Farm Research Campus, 19700 Helix Drive, Ashburn VA 20147 USA and European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | | | | | | |
Collapse
|
13
|
Khazanov NA, Carlson HA. Exploring the composition of protein-ligand binding sites on a large scale. PLoS Comput Biol 2013; 9:e1003321. [PMID: 24277997 PMCID: PMC3836696 DOI: 10.1371/journal.pcbi.1003321] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2013] [Accepted: 09/23/2013] [Indexed: 12/21/2022] Open
Abstract
The residue composition of a ligand binding site determines the interactions available for diffusion-mediated ligand binding, and understanding general composition of these sites is of great importance if we are to gain insight into the functional diversity of the proteome. Many structure-based drug design methods utilize such heuristic information for improving prediction or characterization of ligand-binding sites in proteins of unknown function. The Binding MOAD database if one of the largest curated sets of protein-ligand complexes, and provides a source of diverse, high-quality data for establishing general trends of residue composition from currently available protein structures. We present an analysis of 3,295 non-redundant proteins with 9,114 non-redundant binding sites to identify residues over-represented in binding regions versus the rest of the protein surface. The Binding MOAD database delineates biologically-relevant “valid” ligands from “invalid” small-molecule ligands bound to the protein. Invalids are present in the crystallization medium and serve no known biological function. Contacts are found to differ between these classes of ligands, indicating that residue composition of biologically relevant binding sites is distinct not only from the rest of the protein surface, but also from surface regions capable of opportunistic binding of non-functional small molecules. To confirm these trends, we perform a rigorous analysis of the variation of residue propensity with respect to the size of the dataset and the content bias inherent in structure sets obtained from a large protein structure database. The optimal size of the dataset for establishing general trends of residue propensities, as well as strategies for assessing the significance of such trends, are suggested for future studies of binding-site composition. Describing the general structure of protein binding sites is fundamentally important for guiding drug design and better understanding structure-function relationships. Here, we analyze small molecules bound to proteins within our large database, Binding MOAD (Mother of All Databases, pronounced like “mode” as a pun referring to ligand-binding modes). We focus on different contacts across the residues in the binding sites, and we normalize the data relative to the protein's entire surface. A key feature of this study is the use of a “control” where we compare real, functional binding sites to the random contacts seen for crystallographic additives against the protein surface. Controls are required in experimental biology, but they are ill-defined in many computational approaches. This allows us to describe how true binding sites are unique on the protein surface and distinct from random patches that attract common, small molecules.
Collapse
Affiliation(s)
- Nickolay A. Khazanov
- Bioinformatics Graduate Program, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Heather A. Carlson
- Bioinformatics Graduate Program, University of Michigan, Ann Arbor, Michigan, United States of America
- Department of Medicinal Chemistry, University of Michigan, Ann Arbor, Michigan, United States of America
- * E-mail:
| |
Collapse
|
14
|
London N, Raveh B, Schueler-Furman O. Druggable protein-protein interactions--from hot spots to hot segments. Curr Opin Chem Biol 2013; 17:952-9. [PMID: 24183815 DOI: 10.1016/j.cbpa.2013.10.011] [Citation(s) in RCA: 151] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2013] [Revised: 10/05/2013] [Accepted: 10/07/2013] [Indexed: 11/24/2022]
Abstract
Protein-Protein Interactions (PPIs) mediate numerous biological functions. As such, the inhibition of specific PPIs has tremendous therapeutic value. The notion that these interactions are 'undruggable' has petered out with the emergence of more and more successful examples of PPI inhibitors, expanding considerably the scope of potential drug targets. The accumulated data on successes in the inhibition of PPIs allow us to analyze the features that are required for such inhibition. Whereas it has been suggested and shown that targeting hot spots at PPI interfaces is a good strategy to achieve inhibition, in this review we focus on the notion that the most amenable interactions for inhibition are those that are mediated by a 'hot segment', a continuous epitope that contributes the majority of the binding energy. This criterion is both useful in guiding future target selection efforts, and in suggesting immediate inhibitory candidates--the dominant peptidic segment that mediates the targeted interaction.
Collapse
Affiliation(s)
- Nir London
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA 94158, USA
| | | | | |
Collapse
|
15
|
Fornili A, Pandini A, Lu HC, Fraternali F. Specialized Dynamical Properties of Promiscuous Residues Revealed by Simulated Conformational Ensembles. J Chem Theory Comput 2013; 9:5127-5147. [PMID: 24250278 PMCID: PMC3827836 DOI: 10.1021/ct400486p] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2013] [Indexed: 12/13/2022]
Abstract
![]()
The
ability to interact with different partners is one of the most
important features in proteins. Proteins that bind a large number
of partners (hubs) have been often associated with intrinsic disorder.
However, many examples exist of hubs with an ordered structure, and
evidence of a general mechanism promoting promiscuity in ordered proteins
is still elusive. An intriguing hypothesis is that promiscuous binding
sites have specific dynamical properties, distinct from the rest of
the interface and pre-existing in the protein isolated state. Here,
we present the first comprehensive study of the intrinsic dynamics
of promiscuous residues in a large protein data set. Different computational
methods, from coarse-grained elastic models to geometry-based sampling
methods and to full-atom Molecular Dynamics simulations, were used
to generate conformational ensembles for the isolated proteins. The
flexibility and dynamic correlations of interface residues with a
different degree of binding promiscuity were calculated and compared
considering side chain and backbone motions, the latter both on a
local and on a global scale. The study revealed that (a) promiscuous
residues tend to be more flexible than nonpromiscuous ones, (b) this
additional flexibility has a higher degree of organization, and (c)
evolutionary conservation and binding promiscuity have opposite effects
on intrinsic dynamics. Findings on simulated ensembles were also validated
on ensembles of experimental structures extracted from the Protein
Data Bank (PDB). Additionally, the low occurrence of single nucleotide
polymorphisms observed for promiscuous residues indicated a tendency
to preserve binding diversity at these positions. A case study on
two ubiquitin-like proteins exemplifies how binding promiscuity in
evolutionary related proteins can be modulated by the fine-tuning
of the interface dynamics. The interplay between promiscuity and flexibility
highlighted here can inspire new directions in protein–protein
interaction prediction and design methods.
Collapse
Affiliation(s)
- Arianna Fornili
- Randall Division of Cell and Molecular Biophysics, King's College London , New Hunt's House, London SE1 1UL, United Kingdom
| | | | | | | |
Collapse
|
16
|
Raj M, Bullock BN, Arora PS. Plucking the high hanging fruit: a systematic approach for targeting protein-protein interactions. Bioorg Med Chem 2013; 21:4051-7. [PMID: 23267671 PMCID: PMC3622812 DOI: 10.1016/j.bmc.2012.11.023] [Citation(s) in RCA: 51] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2012] [Revised: 11/08/2012] [Accepted: 11/14/2012] [Indexed: 10/27/2022]
Abstract
Development of specific ligands for protein targets that help decode the complexities of protein-protein interaction networks is a key goal for the field of chemical biology. Despite the emergence of powerful in silico and experimental high-throughput screening strategies, the discovery of synthetic ligands that selectively modulate protein-protein interactions remains a challenge for bioorganic and medicinal chemists. This Perspective discusses emerging principles for the rational design of PPI inhibitors. Fundamentally, the approach seeks to adapt nature's protein recognition principles for the design of suitable secondary structure mimetics.
Collapse
Affiliation(s)
- Monika Raj
- Department of Chemistry, New York University, NY 10003, USA
| | | | | |
Collapse
|
17
|
Wang L, Hou Y, Quan H, Xu W, Bao Y, Li Y, Fu Y, Zou S. A compound-based computational approach for the accurate determination of hot spots. Protein Sci 2013; 22:1060-70. [PMID: 23776011 DOI: 10.1002/pro.2296] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2013] [Revised: 05/27/2013] [Accepted: 06/01/2013] [Indexed: 12/21/2022]
Abstract
A plethora of both experimental and computational methods have been proposed in the past 20 years for the identification of hot spots at a protein-protein interface. The experimental determination of a protein-protein complex followed by alanine scanning mutagenesis, though able to determine hot spots with much precision, is expensive and has no guarantee of success while the accuracy of the current computational methods for hot-spot identification remains low. Here, we present a novel structure-based computational approach that accurately determines hot spots through docking into a set of proteins homologous to only one of the two interacting partners of a compound capable of disrupting the protein-protein interaction (PPI). This approach has been applied to identify the hot spots of human activin receptor type II (ActRII) critical for its binding toward Cripto-I. The subsequent experimental confirmation of the computationally identified hot spots portends a potentially accurate method for hot-spot determination in silico given a compound capable of disrupting the PPI in question. The hot spots of human ActRII first reported here may well become the focal points for the design of small molecule drugs that target the PPI. The determination of their interface may have significant biological implications in that it suggests that Cripto-I plays an important role in both activin and nodal signal pathways.
Collapse
Affiliation(s)
- Lincong Wang
- The College of Computer Science and Technology, Jilin University, Changchun, Jilin, China.
| | | | | | | | | | | | | | | |
Collapse
|
18
|
Villoutreix BO, Labbé CM, Lagorce D, Laconde G, Sperandio O. A leap into the chemical space of protein-protein interaction inhibitors. Curr Pharm Des 2013; 18:4648-67. [PMID: 22650260 DOI: 10.2174/138161212802651571] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2012] [Accepted: 04/16/2012] [Indexed: 11/22/2022]
Abstract
Protein-protein interactions (PPI) are involved in vital cellular processes and are therefore associated to a growing number of diseases. But working with them as therapeutic targets comes with some major hurdles that require substantial mutations from our way to design drugs on historical targets such as enzymes and G-Protein Coupled Receptor (GPCR). Among the numerous ways we could improve our methodologies to maximize the potential of developing new chemical entities on PPI targets, is the fundamental question of what type of compounds should we use to identify the first hits and among which chemical space should we navigate to optimize them to the drug candidate stage. In this review article, we cover different aspects on PPI but with the aim to gain some insights into the specific nature of the chemical space of PPI inhibitors. We describe the work of different groups to highlight such properties and discuss their respective approach. We finally discuss a case study in which we describe the properties of a set of 115 PPI inhibitors that we compare to a reference set of 1730 enzyme inhibitors. This case study highlights interesting properties such as the unfortunate price that still needs to be paid by PPI inhibitors in terms of molecular weight, hydrophobicity, and aromaticity in order to reach a critical level of activity. But it also shows that not all PPI targets are equivalent, and that some PPI targets can demonstrate a better druggability by illustrating the better drug likeness of their associated inhibitors.
Collapse
|
19
|
Walter P, Metzger J, Thiel C, Helms V. Predicting where small molecules bind at protein-protein interfaces. PLoS One 2013; 8:e58583. [PMID: 23505538 PMCID: PMC3591369 DOI: 10.1371/journal.pone.0058583] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2012] [Accepted: 02/07/2013] [Indexed: 11/25/2022] Open
Abstract
Small molecules that bind at protein-protein interfaces may either block or stabilize protein-protein interactions in cells. Thus, some of these binding interfaces may turn into prospective targets for drug design. Here, we collected 175 pairs of protein-protein (PP) complexes and protein-ligand (PL) complexes with known three-dimensional structures for which (1) one protein from the PP complex shares at least 40% sequence identity with the protein from the PL complex, and (2) the interface regions of these proteins overlap at least partially with each other. We found that those residues of the interfaces that may bind the other protein as well as the small molecule are evolutionary more conserved on average, have a higher tendency of being located in pockets and expose a smaller fraction of their surface area to the solvent than the remaining protein-protein interface region. Based on these findings we derived a statistical classifier that predicts patches at binding interfaces that have a higher tendency to bind small molecules. We applied this new prediction method to more than 10 000 interfaces from the protein data bank. For several complexes related to apoptosis the predicted binding patches were in direct contact to co-crystallized small molecules.
Collapse
Affiliation(s)
- Peter Walter
- Center for Bioinformatics, Saarland University, Saarbrücken, Germany
| | - Jennifer Metzger
- Center for Bioinformatics, Saarland University, Saarbrücken, Germany
| | - Christoph Thiel
- Center for Bioinformatics, Saarland University, Saarbrücken, Germany
| | - Volkhard Helms
- Center for Bioinformatics, Saarland University, Saarbrücken, Germany
- * E-mail:
| |
Collapse
|
20
|
McCaffrey G, Davis TP. Physiology and pathophysiology of the blood-brain barrier: P-glycoprotein and occludin trafficking as therapeutic targets to optimize central nervous system drug delivery. J Investig Med 2012; 60:10.231/JIM.0b013e318276de79. [PMID: 23138008 PMCID: PMC3851303 DOI: 10.231/jim.0b013e318276de79] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
The blood-brain barrier (BBB) is a physical and metabolic barrier that separates the central nervous system from the peripheral circulation. Central nervous system drug delivery across the BBB is challenging, primarily because of the physical restriction of paracellular diffusion between the endothelial cells that comprise the microvessels of the BBB and the activity of efflux transporters that quickly expel back into the capillary lumen a wide variety of xenobiotics. Therapeutic manipulation of protein trafficking is emerging as a novel means of modulating protein function, and in this minireview, the targeting of the trafficking of 2 key BBB proteins, P-glycoprotein and occludin, is presented as a novel, reversible means of optimizing central nervous system drug delivery.
Collapse
Affiliation(s)
- Gwen McCaffrey
- Department of Medical Pharmacology, University of Arizona College of Medicine, 1501 N. Campbell Ave, Tucson, AZ 85745
| | - Thomas P. Davis
- Department of Medical Pharmacology, University of Arizona College of Medicine, 1501 N. Campbell Ave, Tucson, AZ 85745
| |
Collapse
|
21
|
McCaffrey G, Davis TP. Physiology and pathophysiology of the blood-brain barrier: P-glycoprotein and occludin trafficking as therapeutic targets to optimize central nervous system drug delivery. J Investig Med 2012; 60:1131-40. [PMID: 23138008 PMCID: PMC3851303 DOI: 10.2310/jim.0b013e318276de79] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
The blood-brain barrier (BBB) is a physical and metabolic barrier that separates the central nervous system from the peripheral circulation. Central nervous system drug delivery across the BBB is challenging, primarily because of the physical restriction of paracellular diffusion between the endothelial cells that comprise the microvessels of the BBB and the activity of efflux transporters that quickly expel back into the capillary lumen a wide variety of xenobiotics. Therapeutic manipulation of protein trafficking is emerging as a novel means of modulating protein function, and in this minireview, the targeting of the trafficking of 2 key BBB proteins, P-glycoprotein and occludin, is presented as a novel, reversible means of optimizing central nervous system drug delivery.
Collapse
Affiliation(s)
- Gwen McCaffrey
- Department of Medical Pharmacology, University of Arizona College of Medicine, Tucson, AZ 85745, USA.
| | | |
Collapse
|
22
|
Biophysical and computational fragment-based approaches to targeting protein-protein interactions: applications in structure-guided drug discovery. Q Rev Biophys 2012; 45:383-426. [PMID: 22971516 DOI: 10.1017/s0033583512000108] [Citation(s) in RCA: 70] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Drug discovery has classically targeted the active sites of enzymes or ligand-binding sites of receptors and ion channels. In an attempt to improve selectivity of drug candidates, modulation of protein-protein interfaces (PPIs) of multiprotein complexes that mediate conformation or colocation of components of cell-regulatory pathways has become a focus of interest. However, PPIs in multiprotein systems continue to pose significant challenges, as they are generally large, flat and poor in distinguishing features, making the design of small molecule antagonists a difficult task. Nevertheless, encouragement has come from the recognition that a few amino acids - so-called hotspots - may contribute the majority of interaction-free energy. The challenges posed by protein-protein interactions have led to a wellspring of creative approaches, including proteomimetics, stapled α-helical peptides and a plethora of antibody inspired molecular designs. Here, we review a more generic approach: fragment-based drug discovery. Fragments allow novel areas of chemical space to be explored more efficiently, but the initial hits have low affinity. This means that they will not normally disrupt PPIs, unless they are tethered, an approach that has been pioneered by Wells and co-workers. An alternative fragment-based approach is to stabilise the uncomplexed components of the multiprotein system in solution and employ conventional fragment-based screening. Here, we describe the current knowledge of the structures and properties of protein-protein interactions and the small molecules that can modulate them. We then describe the use of sensitive biophysical methods - nuclear magnetic resonance, X-ray crystallography, surface plasmon resonance, differential scanning fluorimetry or isothermal calorimetry - to screen and validate fragment binding. Fragment hits can subsequently be evolved into larger molecules with higher affinity and potency. These may provide new leads for drug candidates that target protein-protein interactions and have therapeutic value.
Collapse
|
23
|
Ashford P, Moss DS, Alex A, Yeap SK, Povia A, Nobeli I, Williams MA. Visualisation of variable binding pockets on protein surfaces by probabilistic analysis of related structure sets. BMC Bioinformatics 2012; 13:39. [PMID: 22417279 PMCID: PMC3359218 DOI: 10.1186/1471-2105-13-39] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2011] [Accepted: 03/14/2012] [Indexed: 11/30/2022] Open
Abstract
Background Protein structures provide a valuable resource for rational drug design. For a protein with no known ligand, computational tools can predict surface pockets that are of suitable size and shape to accommodate a complementary small-molecule drug. However, pocket prediction against single static structures may miss features of pockets that arise from proteins' dynamic behaviour. In particular, ligand-binding conformations can be observed as transiently populated states of the apo protein, so it is possible to gain insight into ligand-bound forms by considering conformational variation in apo proteins. This variation can be explored by considering sets of related structures: computationally generated conformers, solution NMR ensembles, multiple crystal structures, homologues or homology models. It is non-trivial to compare pockets, either from different programs or across sets of structures. For a single structure, difficulties arise in defining particular pocket's boundaries. For a set of conformationally distinct structures the challenge is how to make reasonable comparisons between them given that a perfect structural alignment is not possible. Results We have developed a computational method, Provar, that provides a consistent representation of predicted binding pockets across sets of related protein structures. The outputs are probabilities that each atom or residue of the protein borders a predicted pocket. These probabilities can be readily visualised on a protein using existing molecular graphics software. We show how Provar simplifies comparison of the outputs of different pocket prediction algorithms, of pockets across multiple simulated conformations and between homologous structures. We demonstrate the benefits of use of multiple structures for protein-ligand and protein-protein interface analysis on a set of complexes and consider three case studies in detail: i) analysis of a kinase superfamily highlights the conserved occurrence of surface pockets at the active and regulatory sites; ii) a simulated ensemble of unliganded Bcl2 structures reveals extensions of a known ligand-binding pocket not apparent in the apo crystal structure; iii) visualisations of interleukin-2 and its homologues highlight conserved pockets at the known receptor interfaces and regions whose conformation is known to change on inhibitor binding. Conclusions Through post-processing of the output of a variety of pocket prediction software, Provar provides a flexible approach to the analysis and visualization of the persistence or variability of pockets in sets of related protein structures.
Collapse
Affiliation(s)
- Paul Ashford
- Institute of Structural and Molecular Biology, Department of Biological Sciences, Birkbeck, University of London, Malet Street, London WC1E 7HX, UK
| | | | | | | | | | | | | |
Collapse
|
24
|
Kinjo AR, Nakamura H. Composite structural motifs of binding sites for delineating biological functions of proteins. PLoS One 2012; 7:e31437. [PMID: 22347478 PMCID: PMC3275580 DOI: 10.1371/journal.pone.0031437] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2011] [Accepted: 01/08/2012] [Indexed: 11/19/2022] Open
Abstract
Most biological processes are described as a series of interactions between proteins and other molecules, and interactions are in turn described in terms of atomic structures. To annotate protein functions as sets of interaction states at atomic resolution, and thereby to better understand the relation between protein interactions and biological functions, we conducted exhaustive all-against-all atomic structure comparisons of all known binding sites for ligands including small molecules, proteins and nucleic acids, and identified recurring elementary motifs. By integrating the elementary motifs associated with each subunit, we defined composite motifs that represent context-dependent combinations of elementary motifs. It is demonstrated that function similarity can be better inferred from composite motif similarity compared to the similarity of protein sequences or of individual binding sites. By integrating the composite motifs associated with each protein function, we define meta-composite motifs each of which is regarded as a time-independent diagrammatic representation of a biological process. It is shown that meta-composite motifs provide richer annotations of biological processes than sequence clusters. The present results serve as a basis for bridging atomic structures to higher-order biological phenomena by classification and integration of binding site structures.
Collapse
Affiliation(s)
- Akira R Kinjo
- Institute for Protein Research, Osaka University, Suita, Osaka, Japan.
| | | |
Collapse
|
25
|
Koes DR, Camacho CJ. Small-molecule inhibitor starting points learned from protein-protein interaction inhibitor structure. ACTA ACUST UNITED AC 2011; 28:784-91. [PMID: 22210869 PMCID: PMC3307105 DOI: 10.1093/bioinformatics/btr717] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
MOTIVATION Protein-protein interactions (PPIs) are a promising, but challenging target for pharmaceutical intervention. One approach for addressing these difficult targets is the rational design of small-molecule inhibitors that mimic the chemical and physical properties of small clusters of key residues at the protein-protein interface. The identification of appropriate clusters of interface residues provides starting points for inhibitor design and supports an overall assessment of the susceptibility of PPIs to small-molecule inhibition. RESULTS We extract Small-Molecule Inhibitor Starting Points (SMISPs) from protein-ligand and protein-protein complexes in the Protein Data Bank (PDB). These SMISPs are used to train two distinct classifiers, a support vector machine and an easy to interpret exhaustive rule classifier. Both classifiers achieve better than 70% leave-one-complex-out cross-validation accuracy and correctly predict SMISPs of known PPI inhibitors not in the training set. A PDB-wide analysis suggests that nearly half of all PPIs may be susceptible to small-molecule inhibition.
Collapse
Affiliation(s)
- David Ryan Koes
- Department of Computational and Systems Biology, University of Pittsburgh, Pittsburgh, PA 15260, USA.
| | | |
Collapse
|
26
|
Modulating protein-protein interactions with small molecules: the importance of binding hotspots. J Mol Biol 2011; 415:443-53. [PMID: 22198293 DOI: 10.1016/j.jmb.2011.12.026] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2011] [Revised: 11/23/2011] [Accepted: 12/12/2011] [Indexed: 12/30/2022]
Abstract
The modulation of protein-protein interactions (PPIs) by small drug-like molecules is a relatively new area of research and has opened up new opportunities in drug discovery. However, the progress made in this area is limited to a handful of known cases of small molecules that target specific diseases. With the increasing availability of protein structure complexes, it is highly important to devise strategies exploiting homologous structure space on a large scale for discovering putative PPIs that could be attractive drug targets. Here, we propose a scheme that allows performing large-scale screening of all protein complexes and finding putative small-molecule and/or peptide binding sites overlapping with protein-protein binding sites (so-called "multibinding sites"). We find more than 600 nonredundant proteins from 60 protein families with multibinding sites. Moreover, we show that the multibinding sites are mostly observed in transient complexes, largely overlap with the binding hotspots and are more evolutionarily conserved than other interface sites. We investigate possible mechanisms of how small molecules may modulate protein-protein binding and discuss examples of new candidates for drug design.
Collapse
|
27
|
Ito JI, Tabei Y, Shimizu K, Tsuda K, Tomii K. PoSSuM: a database of similar protein-ligand binding and putative pockets. Nucleic Acids Res 2011; 40:D541-8. [PMID: 22135290 PMCID: PMC3245044 DOI: 10.1093/nar/gkr1130] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Numerous potential ligand-binding sites are available today, along with hundreds of thousands of known binding sites observed in the PDB. Exhaustive similarity search for such vastly numerous binding site pairs is useful to predict protein functions and to enable rapid screening of target proteins for drug design. Existing databases of ligand-binding sites offer databases of limited scale. For example, SitesBase covers only ~33,000 known binding sites. Inferring protein function and drug discovery purposes, however, demands a much more comprehensive database including known and putative-binding sites. Using a novel algorithm, we conducted a large-scale all-pairs similarity search for 1.8 million known and potential binding sites in the PDB, and discovered over 14 million similar pairs of binding sites. Here, we present the results as a relational database Pocket Similarity Search using Multiple-sketches (PoSSuM) including all the discovered pairs with annotations of various types. PoSSuM enables rapid exploration of similar binding sites among structures with different global folds as well as similar ones. Moreover, PoSSuM is useful for predicting the binding ligand for unbound structures, which provides important clues for characterizing protein structures with unclear functions. The PoSSuM database is freely available at http://possum.cbrc.jp/PoSSuM/.
Collapse
Affiliation(s)
- Jun-Ichi Ito
- Department of Computational Biology, Graduate School of Frontier Sciences, The University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa, Chiba 277-8568, Japan
| | | | | | | | | |
Collapse
|
28
|
Geppert T, Bauer S, Hiss JA, Conrad E, Reutlinger M, Schneider P, Weisel M, Pfeiffer B, Altmann KH, Waibler Z, Schneider G. Immunosuppressive small molecule discovered by structure-based virtual screening for inhibitors of protein-protein interactions. Angew Chem Int Ed Engl 2011; 51:258-61. [PMID: 22095772 DOI: 10.1002/anie.201105901] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2011] [Indexed: 01/30/2023]
Affiliation(s)
- Tim Geppert
- Eidgenössische Technische Hochschule (ETH) Zürich, Department of Chemistry and Applied Biosciences, Wolfgang-Pauli-Strasse 10, 8093 Zürich, Switzerland
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
29
|
Geppert T, Bauer S, Hiss JA, Conrad E, Reutlinger M, Schneider P, Weisel M, Pfeiffer B, Altmann KH, Waibler Z, Schneider G. Identifizierung eines immunsuppressiven Wirkstoffmoleküls durch strukturbasiertes virtuelles Screening nach Inhibitoren von Protein-Protein-Wechselwirkungen. Angew Chem Int Ed Engl 2011. [DOI: 10.1002/ange.201105901] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
30
|
Wass MN, David A, Sternberg MJE. Challenges for the prediction of macromolecular interactions. Curr Opin Struct Biol 2011; 21:382-90. [DOI: 10.1016/j.sbi.2011.03.013] [Citation(s) in RCA: 65] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2010] [Revised: 03/04/2011] [Accepted: 03/24/2011] [Indexed: 12/14/2022]
|
31
|
Stein A, Mosca R, Aloy P. Three-dimensional modeling of protein interactions and complexes is going 'omics. Curr Opin Struct Biol 2011; 21:200-8. [PMID: 21320770 DOI: 10.1016/j.sbi.2011.01.005] [Citation(s) in RCA: 68] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2010] [Revised: 01/11/2011] [Accepted: 01/13/2011] [Indexed: 10/18/2022]
Abstract
High-throughput interaction discovery initiatives have revealed the existence of hundreds of multiprotein complexes whose functions are regulated through thousands of protein-protein interactions (PPIs). However, the structural details of these interactions, often necessary to understand their function, are only available for a tiny fraction, and the experimental difficulties surrounding complex structure determination make computational modeling techniques paramount. In this manuscript, we critically review some of the most recent developments in the field of structural bioinformatics applied to the modeling of protein interactions and complexes, from large macromolecular machines to domain-domain and peptide-mediated interactions. In particular, we place a special emphasis on those methods that can be applied in a proteome-wide manner, and discuss how they will help in the ultimate objective of building 3D interactome networks.
Collapse
Affiliation(s)
- Amelie Stein
- Institute for Research in Biomedicine (IRB Barcelona), Joint IRB-BSC Program in Computational Biology, c/Baldiri i Reixac 10-12, 08028 Barcelona, Spain
| | | | | |
Collapse
|
32
|
Zhou P, Tian F, Ren Y, Shang Z. Systematic classification and analysis of themes in protein-DNA recognition. J Chem Inf Model 2010; 50:1476-88. [PMID: 20726602 DOI: 10.1021/ci100145d] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Protein-DNA recognition plays a central role in the regulation of gene expression. With the rapidly increasing number of protein-DNA complex structures available at atomic resolution in recent years, a systematic, complete, and intuitive framework to clarify the intrinsic relationship between the global binding modes of these complexes is needed. In this work, we modified, extended, and applied previously defined RNA-recognition themes to describe protein-DNA recognition and used a protocol that incorporates automatic methods into manual inspection to plant a comprehensive classification tree for currently available high-quality protein-DNA structures. Further, a nonredundant (representative) data set consisting of 200 thematically diverse complexes was extracted from the leaves of the classification tree by using a locally sensitive interface comparison algorithm. On the basis of the representative data set, various physical and chemical properties associated with protein-DNA interactions were analyzed using empirical or semiempirical methods. We also examined the individual energetic components involved in protein-DNA interactions and highlighted the importance of conformational entropy, which has been almost completely ignored in previous studies of protein-DNA binding energy.
Collapse
Affiliation(s)
- Peng Zhou
- Department of Chemistry, Zhejiang University, Hangzhou 310027, China, College of Bioengineering, Chongqing University, Chongqing 400044, China
| | | | | | | |
Collapse
|
33
|
Davis FP. Proteome-wide prediction of overlapping small molecule and protein binding sites using structure. MOLECULAR BIOSYSTEMS 2010; 7:545-57. [PMID: 21103609 DOI: 10.1039/c0mb00200c] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Small molecules that modulate protein-protein interactions are of great interest for chemical biology and therapeutics. Here I present a structure-based approach to predict 'bi-functional' sites able to bind both small molecule ligands and proteins, in proteins of unknown structure. First, I develop a homology-based annotation method that transfers binding sites of known three-dimensional structure onto protein sequences, predicting residues in ligand and protein binding sites with estimated true positive rates of 98% and 88%, respectively, at 1% false positive rates. Applying this method to the human proteome predicts 8463 proteins with bi-functional residues and correctly recovers the targets of known interaction modulators. Proteins with significantly (p < 0.01) more bi-functional residues than expected were found to be enriched in regulatory and depleted in metabolism functions. Finally, I demonstrate the utility of the method by describing examples of predicted overlap and evidence of their biological and therapeutic relevance. The results suggest that combining the structures of known binding sites with established fold detection algorithms can predict regions of protein-protein interfaces that are amenable to small molecule modulation. Open-source software and the results for several complete proteomes are available at http://pibase.janelia.org/homolobind.
Collapse
Affiliation(s)
- Fred P Davis
- Howard Hughes Medical Institute, Janelia Farm Research Campus, 19700 Helix Dr, Ashburn, VA 20147, USA.
| |
Collapse
|
34
|
Abstract
Domain Interaction MAp (DIMA, available at http://webclu.bio.wzw.tum.de/dima) is a database of predicted and known interactions between protein domains. It integrates 5807 structurally known interactions imported from the iPfam and 3did databases and 46 900 domain interactions predicted by four computational methods: domain phylogenetic profiling, domain pair exclusion algorithm correlated mutations and domain interaction prediction in a discriminative way. Additionally predictions are filtered to exclude those domain pairs that are reported as non-interacting by the Negatome database. The DIMA Web site allows to calculate domain interaction networks either for a domain of interest or for entire organisms, and to explore them interactively using the Flash-based Cytoscape Web software.
Collapse
Affiliation(s)
- Qibin Luo
- Department of Genome Oriented Bioinformatics, Technische Universität München, Wissenschaftszentrum Weihenstephan, 85350 Freising, Germany
| | | | | | | |
Collapse
|
35
|
Jochim AL, Arora PS. Systematic analysis of helical protein interfaces reveals targets for synthetic inhibitors. ACS Chem Biol 2010; 5:919-23. [PMID: 20712375 DOI: 10.1021/cb1001747] [Citation(s) in RCA: 143] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Synthetic inhibitors of protein-protein interactions are being discovered despite the inherent challenge in targeting large contact surfaces with small molecules. An analysis of available examples identifies common features of complexes that make them tractable for small molecules. We deduced that relative disposition and energetic contributions of "hot spot" residues provide a predictive scale for the potential of protein-protein interactions to be inhibited by small molecules. On the basis of this model, we analyzed the full set of helical protein interfaces in the Protein Data Bank to identify those that are potentially suitable candidates for synthetic ligands.
Collapse
Affiliation(s)
- Andrea L. Jochim
- Department of Chemistry, New York University, New York, New York 10003
| | - Paramjit S. Arora
- Department of Chemistry, New York University, New York, New York 10003
| |
Collapse
|
36
|
Itzhaki Z, Akiva E, Margalit H. Preferential use of protein domain pairs as interaction mediators: order and transitivity. Bioinformatics 2010; 26:2564-70. [DOI: 10.1093/bioinformatics/btq495] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
|