1
|
Aldossari RM, Ali A, Rehman MU, Rashid S, Ahmad SB. Computational Approaches for Identification of Potential Plant Bioactives as Novel G6PD Inhibitors Using Advanced Tools and Databases. Molecules 2023; 28:molecules28073018. [PMID: 37049781 PMCID: PMC10096328 DOI: 10.3390/molecules28073018] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2023] [Revised: 02/26/2023] [Accepted: 03/06/2023] [Indexed: 03/31/2023] Open
Abstract
In glucose metabolism, the pentose phosphate pathway (PPP) is the major metabolic pathway that plays a crucial role in cancer growth and metastasis. Although it has been pointed out that blockade of the PPP is a promising approach against cancer, in the clinical setting, effective anti-PPP agents are still not available. Dysfunction of the G6PD enzyme in this pathway leads to cancer development as this enzyme possesses oncogenic activity. In the present study, an attempt was made to identify bioactive compounds that can be developed as potential G6PD inhibitors. In the present study, 11 natural compounds and a controlled drug were taken. The physicochemical and toxicity properties of the compounds were determined via ADMET and ProTox-II analysis. In the present study, the findings of docking studies revealed that staurosporine was the most effective compound with the highest binding energy of −9.2 kcal/mol when docked against G6PD. Homology modeling revealed that 97.56% of the residues were occupied in the Ramachandran-favored region. The modeled protein gave a quality Z-score of −10.13 by ProSA tool. iMODS server provided significant insights into the mobility, stability and flexibility of the G6PD protein that described the collective functional protein motion. In the present study, the physical and functional interactions between proteins were determined by STRING. CASTp server determined the topological and geometric properties of the G6PD protein. The findings of the present study revealed that staurosporine could be developed as a potential G6PD inhibitor; however, further in vivo and in vitro studies are needed for further validation of these results.
Collapse
Affiliation(s)
- Rana M. Aldossari
- Department of Pharmacology & Toxicology, College of Pharmacy, Prince Sattam Bin Abdulaziz University, P.O. Box 173, Al-Kharj 11942, Saudi Arabia
| | - Aarif Ali
- Division of Veterinary Biochemistry, Faculty of Veterinary Science and Animal Husbandry, SKUAST-Kashmir, Alustang, Shuhama 190006, Jammu & Kashmir, India
| | - Muneeb U. Rehman
- Department of Clinical Pharmacy, College of Pharmacy, King Saud University, P.O. Box 2457, Riyadh 11451, Saudi Arabia
| | - Summya Rashid
- Department of Pharmacology & Toxicology, College of Pharmacy, Prince Sattam Bin Abdulaziz University, P.O. Box 173, Al-Kharj 11942, Saudi Arabia
- Correspondence:
| | - Sheikh Bilal Ahmad
- Division of Veterinary Biochemistry, Faculty of Veterinary Science and Animal Husbandry, SKUAST-Kashmir, Alustang, Shuhama 190006, Jammu & Kashmir, India
| |
Collapse
|
2
|
Dey S, Prilusky J, Levy ED. QSalignWeb: A Server to Predict and Analyze Protein Quaternary Structure. Front Mol Biosci 2022; 8:787510. [PMID: 35071324 PMCID: PMC8769216 DOI: 10.3389/fmolb.2021.787510] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Accepted: 12/02/2021] [Indexed: 11/16/2022] Open
Abstract
The identification of physiologically relevant quaternary structures (QSs) in crystal lattices is challenging. To predict the physiological relevance of a particular QS, QSalign searches for homologous structures in which subunits interact in the same geometry. This approach proved accurate but was limited to structures already present in the Protein Data Bank (PDB). Here, we introduce a webserver (www.QSalign.org) allowing users to submit homo-oligomeric structures of their choice to the QSalign pipeline. Given a user-uploaded structure, the sequence is extracted and used to search homologs based on sequence similarity and PFAM domain architecture. If structural conservation is detected between a homolog and the user-uploaded QS, physiological relevance is inferred. The web server also generates alternative QSs with PISA and processes them the same way as the query submitted to widen the predictions. The result page also shows representative QSs in the protein family of the query, which is informative if no QS conservation was detected or if the protein appears monomeric. These representative QSs can also serve as a starting point for homology modeling.
Collapse
Affiliation(s)
- Sucharita Dey
- Department of Chemical and Structural Biology, Weizmann Institute of Science, Rehovot, Israel
| | - Jaime Prilusky
- Department of Life Sciences and Core Facilities, Weizmann Institute of Science, Rehovot, Israel
| | - Emmanuel D. Levy
- Department of Chemical and Structural Biology, Weizmann Institute of Science, Rehovot, Israel
| |
Collapse
|
3
|
PDB-wide identification of physiological hetero-oligomeric assemblies based on conserved quaternary structure geometry. Structure 2021; 29:1303-1311.e3. [PMID: 34520740 PMCID: PMC8575123 DOI: 10.1016/j.str.2021.07.012] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2020] [Revised: 03/22/2021] [Accepted: 07/23/2021] [Indexed: 11/21/2022]
Abstract
An accurate understanding of biomolecular mechanisms and diseases requires information on protein quaternary structure (QS). A critical challenge in inferring QS information from crystallography data is distinguishing biological interfaces from fortuitous crystal-packing contacts. Here, we employ QS conservation across homologs to infer the biological relevance of hetero-oligomers. We compare the structures and compositions of hetero-oligomers, which allow us to annotate 7,810 complexes as physiologically relevant, 1,060 as likely errors, and 1,432 with comparative information on subunit stoichiometry and composition. Excluding immunoglobulins, these annotations encompass over 51% of hetero-oligomers in the PDB. We curate a dataset of 577 hetero-oligomeric complexes to benchmark these annotations, which reveals an accuracy >94%. When homology information is not available, we compare QS across repositories (PDB, PISA, and EPPIC) to derive confidence estimates. This work provides high-quality annotations along with a large benchmark dataset of hetero-assemblies.
Collapse
|
4
|
Biological vs. Crystallographic Protein Interfaces: An Overview of Computational Approaches for Their Classification. CRYSTALS 2020. [DOI: 10.3390/cryst10020114] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Complexes between proteins are at the basis of almost every process in cells. Their study, from a structural perspective, has a pivotal role in understanding biological functions and, importantly, in drug development. X-ray crystallography represents the broadest source for the experimental structural characterization of protein-protein complexes. Correctly identifying the biologically relevant interface from the crystallographic ones is, however, not trivial and can be prone to errors. Over the past two decades, computational methodologies have been developed to study the differences of those interfaces and automatically classify them as biological or crystallographic. Overall, protein-protein interfaces show differences in terms of composition, energetics and evolutionary conservation between biological and crystallographic ones. Based on those observations, a number of computational methods have been developed for this classification problem, which can be grouped into three main categories: Energy-, empirical knowledge- and machine learning-based approaches. In this review, we give a comprehensive overview of the training datasets and methods so far implemented, providing useful links and a brief description of each method.
Collapse
|
5
|
Kumari N, Yadav S. Modulation of protein oligomerization: An overview. PROGRESS IN BIOPHYSICS AND MOLECULAR BIOLOGY 2019; 149:99-113. [DOI: 10.1016/j.pbiomolbio.2019.03.003] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/30/2018] [Revised: 03/05/2019] [Accepted: 03/06/2019] [Indexed: 12/21/2022]
|
6
|
Accurate Classification of Biological and non-Biological Interfaces in Protein Crystal Structures using Subtle Covariation Signals. Sci Rep 2019; 9:12603. [PMID: 31471543 PMCID: PMC6717244 DOI: 10.1038/s41598-019-48913-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2017] [Accepted: 08/14/2019] [Indexed: 11/08/2022] Open
Abstract
Proteins often work as oligomers or multimers in vivo. Therefore, elucidating their oligomeric or multimeric form (quaternary structure) is crucially important to ascertain their function. X-ray crystal structures of numerous proteins have been accumulated, providing information related to their biological units. Extracting information of biological units from protein crystal structures represents a meaningful task for modern biology. Nevertheless, although many methods have been proposed for identifying biological units appearing in protein crystal structures, it is difficult to distinguish biological protein-protein interfaces from crystallographic ones. Therefore, our simple but highly accurate classifier was developed to infer biological units in protein crystal structures using large amounts of protein sequence information and a modern contact prediction method to exploit covariation signals (CSs) in proteins. We demonstrate that our proposed method is promising even for weak signals of biological interfaces. We also discuss the relation between classification accuracy and conservation of biological units, and illustrate how the selection of sequences included in multiple sequence alignments as sources for obtaining CSs affects the results. With increased amounts of sequence data, the proposed method is expected to become increasingly useful.
Collapse
|
7
|
Waterhouse A, Bertoni M, Bienert S, Studer G, Tauriello G, Gumienny R, Heer FT, de Beer TAP, Rempfer C, Bordoli L, Lepore R, Schwede T. SWISS-MODEL: homology modelling of protein structures and complexes. Nucleic Acids Res 2019; 46:W296-W303. [PMID: 29788355 PMCID: PMC6030848 DOI: 10.1093/nar/gky427] [Citation(s) in RCA: 7317] [Impact Index Per Article: 1463.4] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2018] [Accepted: 05/07/2018] [Indexed: 11/13/2022] Open
Abstract
Homology modelling has matured into an important technique in structural biology, significantly contributing to narrowing the gap between known protein sequences and experimentally determined structures. Fully automated workflows and servers simplify and streamline the homology modelling process, also allowing users without a specific computational expertise to generate reliable protein models and have easy access to modelling results, their visualization and interpretation. Here, we present an update to the SWISS-MODEL server, which pioneered the field of automated modelling 25 years ago and been continuously further developed. Recently, its functionality has been extended to the modelling of homo- and heteromeric complexes. Starting from the amino acid sequences of the interacting proteins, both the stoichiometry and the overall structure of the complex are inferred by homology modelling. Other major improvements include the implementation of a new modelling engine, ProMod3 and the introduction a new local model quality estimation method, QMEANDisCo. SWISS-MODEL is freely available at https://swissmodel.expasy.org.
Collapse
Affiliation(s)
- Andrew Waterhouse
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Martino Bertoni
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Stefan Bienert
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Gabriel Studer
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Gerardo Tauriello
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Rafal Gumienny
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Florian T Heer
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Tjaart A P de Beer
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Christine Rempfer
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Lorenza Bordoli
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Rosalba Lepore
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Torsten Schwede
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| |
Collapse
|
8
|
Principles and characteristics of biological assemblies in experimentally determined protein structures. Curr Opin Struct Biol 2019; 55:34-49. [PMID: 30965224 DOI: 10.1016/j.sbi.2019.03.006] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2019] [Accepted: 03/01/2019] [Indexed: 12/27/2022]
Abstract
More than half of all structures in the PDB are assemblies of two or more proteins, including both homooligomers and heterooligomers. Structural information on these assemblies comes from X-ray crystallography, NMR, and cryo-EM spectroscopy. The correct assembly in an X-ray structure is often ambiguous, and computational methods have been developed to identify the most likely biologically relevant assembly based on physical properties of assemblies and sequence conservation in interfaces. Taking advantage of the large number of structures now available, some of the most recent methods have relied on similarity of interfaces and assemblies across structures of homologous proteins.
Collapse
|
9
|
Elez K, Bonvin AMJJ, Vangone A. Distinguishing crystallographic from biological interfaces in protein complexes: role of intermolecular contacts and energetics for classification. BMC Bioinformatics 2018; 19:438. [PMID: 30497368 PMCID: PMC6266931 DOI: 10.1186/s12859-018-2414-9] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
BACKGROUND Study of macromolecular assemblies is fundamental to understand functions in cells. X-ray crystallography is the most common technique to solve their 3D structure at atomic resolution. In a crystal, however, both biologically-relevant interfaces and non-specific interfaces resulting from crystallographic packing are observed. Due to the complexity of the biological assemblies currently tackled, classifying those interfaces, i.e. distinguishing biological from crystal lattice interfaces, is not trivial and often prone to errors. In this context, analyzing the physico-chemical characteristics of biological/crystal interfaces can help researchers identify possible features that distinguish them and gain a better understanding of the systems. RESULTS In this work, we are providing new insights into the differences between biological and crystallographic complexes by focusing on "pair-properties" of interfaces that have not yet been fully investigated. We investigated properties such intermolecular residue-residue contacts (already successfully applied to the prediction of binding affinities) and interaction energies (electrostatic, Van der Waals and desolvation). By using the XtalMany and BioMany interface datasets, we show that interfacial residue contacts, classified as a function of their physico-chemical properties, can distinguish between biological and crystallographic interfaces. The energetic terms show, on average, higher values for crystal interfaces, reflecting a less stable interface due to crystal packing compared to biological interfaces. By using a variety of machine learning approaches, we trained a new interface classification predictor based on contacts and interaction energetic features. Our predictor reaches an accuracy in classifying biological vs crystal interfaces of 0.92, compared to 0.88 for EPPIC (one of the main state-of-the-art classifiers reporting same performance as PISA). CONCLUSION In this work we have gained insights into the nature of intermolecular contacts and energetics terms distinguishing biological from crystallographic interfaces. Our findings might have a broader applicability in structural biology, for example for the identification of near native poses in docking. We implemented our classification approach into an easy-to-use and fast software, freely available to the scientific community from http://github.com/haddocking/interface-classifier .
Collapse
Affiliation(s)
- Katarina Elez
- Bijvoet Center for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Padualaan 8, 3584 CH, Utrecht, The Netherlands
- Present address: University of Bologna, Via Selmi 3, 40126, Bologna, Italy
| | - Alexandre M J J Bonvin
- Bijvoet Center for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Padualaan 8, 3584 CH, Utrecht, The Netherlands.
| | - Anna Vangone
- Bijvoet Center for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Padualaan 8, 3584 CH, Utrecht, The Netherlands.
- present address: Pharma Research and Early Development, Large Molecule Research, Roche Innovation Center Munich, Nonnenwald 2, Penzberg, Germany.
| |
Collapse
|
10
|
Hu J, Liu HF, Sun J, Wang J, Liu R. Integrating co-evolutionary signals and other properties of residue pairs to distinguish biological interfaces from crystal contacts. Protein Sci 2018; 27:1723-1735. [PMID: 29931702 DOI: 10.1002/pro.3448] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2018] [Revised: 04/21/2018] [Accepted: 05/16/2018] [Indexed: 12/25/2022]
Abstract
It remains challenging to accurately discriminate between biological and crystal interfaces. Most existing analyses and algorithms focused on the features derived from a single side of the interface. However, less attention has been paid to the properties of residue pairs across protein interfaces. To address this problem, we defined a novel co-evolutionary feature for homodimers through integrating direct coupling analysis and image processing techniques. The residue pairs across biological homodimeric interfaces were significantly enriched in co-evolving residues compared to those across crystal contacts, resulting in a promising classification accuracy with area under the curves (AUCs) of >0.85. Considering the availability of co-evolutionary feature, we also designed other residue pair based features that were useful for both homodimers and heterodimers. The most informative residue pairs were identified to reflect the interaction preferences across protein interfaces. Regarding the other extant properties, we designed the new descriptors at the interface residue level as well as at the pairwise contact level. Extensive validation showed that these single properties can be used to identify biological interfaces with AUCs ranging from 0.60 to 0.88. By integrating co-evolutionary feature with other residue pair based properties, our final prediction model output excellent performance with AUCs of >0.91 on different datasets. Compared to existing methods, our algorithm not only yielded better or comparable results but also provided complementary information. An easy-to-use web server is freely accessible at http://liulab.hzau.edu.cn/RPAIAnalyst.
Collapse
Affiliation(s)
- Jian Hu
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, 430070, P. R. China.,College of Biomedical Engineering, South-Central University for Nationalities, Wuhan, 430074, P. R. China
| | - Hui-Fang Liu
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, 430070, P. R. China
| | - Jun Sun
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, 430070, P. R. China
| | - Jia Wang
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, 430070, P. R. China
| | - Rong Liu
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, 430070, P. R. China
| |
Collapse
|
11
|
Brunger AT, Leitz J, Zhou Q, Choi UB, Lai Y. Ca 2+-Triggered Synaptic Vesicle Fusion Initiated by Release of Inhibition. Trends Cell Biol 2018; 28:631-645. [PMID: 29706534 PMCID: PMC6056330 DOI: 10.1016/j.tcb.2018.03.004] [Citation(s) in RCA: 37] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2018] [Revised: 03/17/2018] [Accepted: 03/26/2018] [Indexed: 12/20/2022]
Abstract
Recent structural and functional studies of the synaptic vesicle fusion machinery suggest an inhibited tripartite complex consisting of neuronal soluble N-ethylmaleimide sensitive factor attachment protein receptors (SNAREs), synaptotagmin, and complexin prior to Ca2+-triggered synaptic vesicle fusion. We speculate that Ca2+-triggered fusion commences with the release of inhibition by Ca2+ binding to synaptotagmin C2 domains. Subsequently, fusion is assisted by SNARE complex zippering and by active membrane remodeling properties of synaptotagmin. This additional, inhibitory role of synaptotagmin may be a general principle since other recent studies suggest that Ca2+ binding to extended synaptotagmin C2 domains enables lipid transport by releasing an inhibited state of the system, and that Munc13 may nominally be in an inhibited state, which is released upon Ca2+ binding to one of its C2 domains.
Collapse
Affiliation(s)
- Axel T Brunger
- Department of Molecular and Cellular Physiology, Stanford University, Stanford, CA, USA; Department of Neurology and Neurological Sciences, Stanford University, Stanford, CA, USA; Department of Structural Biology, Stanford University, Stanford, CA, USA; Department of Photon Science, Stanford University, Stanford, CA, USA; Howard Hughes Medical Institute, Stanford University, Stanford, CA, USA.
| | - Jeremy Leitz
- Department of Molecular and Cellular Physiology, Stanford University, Stanford, CA, USA; Department of Neurology and Neurological Sciences, Stanford University, Stanford, CA, USA; Department of Structural Biology, Stanford University, Stanford, CA, USA; Department of Photon Science, Stanford University, Stanford, CA, USA; Howard Hughes Medical Institute, Stanford University, Stanford, CA, USA
| | - Qiangjun Zhou
- Department of Molecular and Cellular Physiology, Stanford University, Stanford, CA, USA; Department of Neurology and Neurological Sciences, Stanford University, Stanford, CA, USA; Department of Structural Biology, Stanford University, Stanford, CA, USA; Department of Photon Science, Stanford University, Stanford, CA, USA; Howard Hughes Medical Institute, Stanford University, Stanford, CA, USA
| | - Ucheor B Choi
- Department of Molecular and Cellular Physiology, Stanford University, Stanford, CA, USA; Department of Neurology and Neurological Sciences, Stanford University, Stanford, CA, USA; Department of Structural Biology, Stanford University, Stanford, CA, USA; Department of Photon Science, Stanford University, Stanford, CA, USA; Howard Hughes Medical Institute, Stanford University, Stanford, CA, USA
| | - Ying Lai
- Department of Molecular and Cellular Physiology, Stanford University, Stanford, CA, USA; Department of Neurology and Neurological Sciences, Stanford University, Stanford, CA, USA; Department of Structural Biology, Stanford University, Stanford, CA, USA; Department of Photon Science, Stanford University, Stanford, CA, USA; Howard Hughes Medical Institute, Stanford University, Stanford, CA, USA
| |
Collapse
|
12
|
Waterhouse A, Bertoni M, Bienert S, Studer G, Tauriello G, Gumienny R, Heer FT, de Beer TAP, Rempfer C, Bordoli L, Lepore R, Schwede T. SWISS-MODEL: homology modelling of protein structures and complexes. Nucleic Acids Res 2018. [PMID: 29788355 DOI: 10.1093/nar/gky427.pmid:29788355] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/25/2023] Open
Abstract
Homology modelling has matured into an important technique in structural biology, significantly contributing to narrowing the gap between known protein sequences and experimentally determined structures. Fully automated workflows and servers simplify and streamline the homology modelling process, also allowing users without a specific computational expertise to generate reliable protein models and have easy access to modelling results, their visualization and interpretation. Here, we present an update to the SWISS-MODEL server, which pioneered the field of automated modelling 25 years ago and been continuously further developed. Recently, its functionality has been extended to the modelling of homo- and heteromeric complexes. Starting from the amino acid sequences of the interacting proteins, both the stoichiometry and the overall structure of the complex are inferred by homology modelling. Other major improvements include the implementation of a new modelling engine, ProMod3 and the introduction a new local model quality estimation method, QMEANDisCo. SWISS-MODEL is freely available at https://swissmodel.expasy.org.
Collapse
Affiliation(s)
- Andrew Waterhouse
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Martino Bertoni
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Stefan Bienert
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Gabriel Studer
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Gerardo Tauriello
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Rafal Gumienny
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Florian T Heer
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Tjaart A P de Beer
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Christine Rempfer
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Lorenza Bordoli
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Rosalba Lepore
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Torsten Schwede
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| |
Collapse
|
13
|
Dey S, Levy ED. Inferring and Using Protein Quaternary Structure Information from Crystallographic Data. Methods Mol Biol 2018; 1764:357-375. [PMID: 29605927 DOI: 10.1007/978-1-4939-7759-8_23] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
A precise knowledge of the quaternary structure of proteins is essential to illuminate both their function and their evolution. The major part of our knowledge on quaternary structure is inferred from X-ray crystallography data, but this inference process is hard and error-prone. The difficulty lies in discriminating fortuitous protein contacts, which make up the lattice of protein crystals, from biological protein contacts that exist in the native cellular environment. Here, we review methods devised to discriminate between both types of contacts and describe resources for downloading protein quaternary structure information and identifying high-confidence quaternary structures. The use of high-confidence datasets of quaternary structures will be critical for the analysis of structural, functional, and evolutionary properties of proteins.
Collapse
Affiliation(s)
- Sucharita Dey
- Department of Structural Biology, Weizmann Institute of Science, Rehovot, Israel
| | - Emmanuel D Levy
- Department of Structural Biology, Weizmann Institute of Science, Rehovot, Israel.
| |
Collapse
|
14
|
PDB-wide identification of biological assemblies from conserved quaternary structure geometry. Nat Methods 2017; 15:67-72. [DOI: 10.1038/nmeth.4510] [Citation(s) in RCA: 50] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2017] [Accepted: 10/17/2017] [Indexed: 02/07/2023]
|
15
|
Fernández-Borges N, Parra B, Vidal E, Eraña H, Sánchez-Martín MA, de Castro J, Elezgarai SR, Pumarola M, Mayoral T, Castilla J. Unraveling the key to the resistance of canids to prion diseases. PLoS Pathog 2017; 13:e1006716. [PMID: 29131852 PMCID: PMC5703577 DOI: 10.1371/journal.ppat.1006716] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2017] [Revised: 11/27/2017] [Accepted: 10/28/2017] [Indexed: 01/08/2023] Open
Abstract
One of the characteristics of prions is their ability to infect some species but not others and prion resistant species have been of special interest because of their potential in deciphering the determinants for susceptibility. Previously, we developed different in vitro and in vivo models to assess the susceptibility of species that were erroneously considered resistant to prion infection, such as members of the Leporidae and Equidae families. Here we undertake in vitro and in vivo approaches to understand the unresolved low prion susceptibility of canids. Studies based on the amino acid sequence of the canine prion protein (PrP), together with a structural analysis in silico, identified unique key amino acids whose characteristics could orchestrate its high resistance to prion disease. Cell- and brain-based PMCA studies were performed highlighting the relevance of the D163 amino acid in proneness to protein misfolding. This was also investigated by the generation of a novel transgenic mouse model carrying this substitution and these mice showed complete resistance to disease despite intracerebral challenge with three different mouse prion strains (RML, 22L and 301C) known to cause disease in wild-type mice. These findings suggest that dog D163 amino acid is primarily, if not totally, responsible for the prion resistance of canids. Detection of individuals or whole species resistant to any infectious disease is vital to understand the determinants of susceptibility and to develop appropriate therapeutic and preventative strategies. Canids have long been considered resistant to prion infection given the absence of clinical disease despite exposure to the causal agent. Through extensive analysis of the canine prion protein we have detected a key amino acid that might be responsible for their universal resistance to prion disease. Using in vitro and in vivo models we demonstrated that the presence of this residue confers resistance to prion infection when introduced to susceptible animals, opening the way to develop a new therapeutic approach against these, at present, untreatable disorders.
Collapse
Affiliation(s)
| | - Beatriz Parra
- Laboratorio Central de Veterinaria (LCV), Madrid, Spain
| | - Enric Vidal
- Centre de Recerca en Sanitat Animal (CReSA), UAB-IRTA, Campus de la Universitat Autònoma de Barcelona, Bellaterra, Barcelona, Spain
| | - Hasier Eraña
- CIC bioGUNE, Parque tecnológico de Bizkaia, Derio, Bizkaia, Spain
| | - Manuel A. Sánchez-Martín
- Servicio de Transgénesis, Nucleus, Universidad de Salamanca, Salamanca, Spain
- IBSAL, Instituto de Investigación Biomédica de Salamanca, Salamanca, Spain
| | - Jorge de Castro
- Department of Infectology, Scripps Florida, Jupiter, Florida, United States of America
| | | | - Martí Pumarola
- Department of Animal Medicine and Surgery, Veterinary faculty, Universitat Autònoma de Barcelona (UAB), Bellaterra (Cerdanyola del Vallès), Barcelona, Spain
- Centro de Investigación Biomédica en Red en Bioingeniería, Biomateriales y Nanomedicina (CIBER-BBN), Madrid, Spain
| | - Tomás Mayoral
- Laboratorio Central de Veterinaria (LCV), Madrid, Spain
| | - Joaquín Castilla
- CIC bioGUNE, Parque tecnológico de Bizkaia, Derio, Bizkaia, Spain
- IKERBASQUE, Basque Foundation for Science, Bilbao, Bizkaia, Spain
- * E-mail:
| |
Collapse
|
16
|
Safari MS, Byington MC, Conrad JC, Vekilov PG. Polymorphism of Lysozyme Condensates. J Phys Chem B 2017; 121:9091-9101. [DOI: 10.1021/acs.jpcb.7b05425] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Affiliation(s)
- Mohammad S. Safari
- Department
of Chemical and Biomolecular Engineering, University of Houston, 4726 Calhoun Road, Houston, Texas 77204-4004, United States
| | - Michael C. Byington
- Department
of Chemical and Biomolecular Engineering, University of Houston, 4726 Calhoun Road, Houston, Texas 77204-4004, United States
| | - Jacinta C. Conrad
- Department
of Chemical and Biomolecular Engineering, University of Houston, 4726 Calhoun Road, Houston, Texas 77204-4004, United States
| | - Peter G. Vekilov
- Department
of Chemical and Biomolecular Engineering, University of Houston, 4726 Calhoun Road, Houston, Texas 77204-4004, United States
- Department
of Chemistry, University of Houston, 3585 Cullen Blvd., Houston, Texas 77204-5003, United States
| |
Collapse
|
17
|
Bertoni M, Kiefer F, Biasini M, Bordoli L, Schwede T. Modeling protein quaternary structure of homo- and hetero-oligomers beyond binary interactions by homology. Sci Rep 2017; 7:10480. [PMID: 28874689 PMCID: PMC5585393 DOI: 10.1038/s41598-017-09654-8] [Citation(s) in RCA: 492] [Impact Index Per Article: 70.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2017] [Accepted: 07/28/2017] [Indexed: 01/01/2023] Open
Abstract
Cellular processes often depend on interactions between proteins and the formation of macromolecular complexes. The impairment of such interactions can lead to deregulation of pathways resulting in disease states, and it is hence crucial to gain insights into the nature of macromolecular assemblies. Detailed structural knowledge about complexes and protein-protein interactions is growing, but experimentally determined three-dimensional multimeric assemblies are outnumbered by complexes supported by non-structural experimental evidence. Here, we aim to fill this gap by modeling multimeric structures by homology, only using amino acid sequences to infer the stoichiometry and the overall structure of the assembly. We ask which properties of proteins within a family can assist in the prediction of correct quaternary structure. Specifically, we introduce a description of protein-protein interface conservation as a function of evolutionary distance to reduce the noise in deep multiple sequence alignments. We also define a distance measure to structurally compare homologous multimeric protein complexes. This allows us to hierarchically cluster protein structures and quantify the diversity of alternative biological assemblies known today. We find that a combination of conservation scores, structural clustering, and classical interface descriptors, can improve the selection of homologous protein templates leading to reliable models of protein complexes.
Collapse
Affiliation(s)
- Martino Bertoni
- SIB Swiss Institute of Bioinformatics, Basel, Switzerland.,Biozentrum, University of Basel, Klingelbergstrasse 50/70, 4056, Basel, Switzerland
| | - Florian Kiefer
- SIB Swiss Institute of Bioinformatics, Basel, Switzerland.,Biozentrum, University of Basel, Klingelbergstrasse 50/70, 4056, Basel, Switzerland
| | - Marco Biasini
- SIB Swiss Institute of Bioinformatics, Basel, Switzerland.,Biozentrum, University of Basel, Klingelbergstrasse 50/70, 4056, Basel, Switzerland
| | - Lorenza Bordoli
- SIB Swiss Institute of Bioinformatics, Basel, Switzerland.,Biozentrum, University of Basel, Klingelbergstrasse 50/70, 4056, Basel, Switzerland
| | - Torsten Schwede
- SIB Swiss Institute of Bioinformatics, Basel, Switzerland. .,Biozentrum, University of Basel, Klingelbergstrasse 50/70, 4056, Basel, Switzerland.
| |
Collapse
|
18
|
Uddin MS, Naider F, Becker JM. Dynamic roles for the N-terminus of the yeast G protein-coupled receptor Ste2p. BIOCHIMICA ET BIOPHYSICA ACTA-BIOMEMBRANES 2017; 1859:2058-2067. [PMID: 28754538 DOI: 10.1016/j.bbamem.2017.07.014] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/27/2017] [Revised: 07/13/2017] [Accepted: 07/24/2017] [Indexed: 12/11/2022]
Abstract
The Saccharomyces cerevisiae α-factor receptor Ste2p has been used extensively as a model to understand the molecular mechanism of signal transduction by G protein-coupled receptors (GPCRs). Single and double cysteine mutants of Ste2p were created and served as surrogates to detect intramolecular interactions and dimerization of Ste2p using disulfide cross-linking methodology. When a mutation was introduced into the phylogenetically conserved tyrosine residue at position 26 (Y26C) in the N-terminus of Ste2p, dimerization was increased greatly. The amount of dimer formed by this Y26C mutant was greatly reduced by ligand binding even though the ligand binding site is far removed from the N-terminus; the lowering of the dimer formation was consistent with a conformational change in the N-terminus of the receptor upon activation. Dimerization was decreased by double mutations Y26C/V109C or Y26C/T114C indicating that Y26 is in close proximity to V109 and T114 of extracellular loop 1 in native Ste2p. Combined with earlier studies, these results indicate previously unrecognized roles for the N-terminus of Ste2p, and perhaps of GPCRs in general, and reveal a specific N-terminus residue or region, that is involved in GPCR signaling, intrareceptor interactions, and receptor dimerization.
Collapse
Affiliation(s)
- M Seraj Uddin
- Department of Microbiology, University of Tennessee, Knoxville, Tennessee 37996, United States
| | - Fred Naider
- Department of Chemistry and Macromolecular Assemblies Institute, College of Staten Island, CUNY, New York, New York 10314, United States; Ph.D. Programs in Biochemistry and Chemistry, The Graduate Center of the City University of New York, New York, NY 10016, United States
| | - Jeffrey M Becker
- Department of Microbiology, University of Tennessee, Knoxville, Tennessee 37996, United States.
| |
Collapse
|
19
|
The "Sticky Patch" Model of Crystallization and Modification of Proteins for Enhanced Crystallizability. Methods Mol Biol 2017; 1607:77-115. [PMID: 28573570 DOI: 10.1007/978-1-4939-7000-1_4] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
Crystallization of macromolecules has long been perceived as a stochastic process, which cannot be predicted or controlled. This is consistent with another popular notion that the interactions of molecules within the crystal, i.e., crystal contacts, are essentially random and devoid of specific physicochemical features. In contrast, functionally relevant surfaces, such as oligomerization interfaces and specific protein-protein interaction sites, are under evolutionary pressures so their amino acid composition, structure, and topology are distinct. However, current theoretical and experimental studies are significantly changing our understanding of the nature of crystallization. The increasingly popular "sticky patch" model, derived from soft matter physics, describes crystallization as a process driven by interactions between select, specific surface patches, with properties thermodynamically favorable for cohesive interactions. Independent support for this model comes from various sources including structural studies and bioinformatics. Proteins that are recalcitrant to crystallization can be modified for enhanced crystallizability through chemical or mutational modification of their surface to effectively engineer "sticky patches" which would drive crystallization. Here, we discuss the current state of knowledge of the relationship between the microscopic properties of the target macromolecule and its crystallizability, focusing on the "sticky patch" model. We discuss state-of-the-art in silico methods that evaluate the propensity of a given target protein to form crystals based on these relationships, with the objective to design variants with modified molecular surface properties and enhanced crystallization propensity. We illustrate this discussion with specific cases where these approaches allowed to generate crystals suitable for structural analysis.
Collapse
|
20
|
Jackson EL, Shahmoradi A, Spielman SJ, Jack BR, Wilke CO. Intermediate divergence levels maximize the strength of structure-sequence correlations in enzymes and viral proteins. Protein Sci 2016; 25:1341-53. [PMID: 26971720 PMCID: PMC4918415 DOI: 10.1002/pro.2920] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2015] [Accepted: 03/04/2016] [Indexed: 12/16/2022]
Abstract
Structural properties such as solvent accessibility and contact number predict site-specific sequence variability in many proteins. However, the strength and significance of these structure-sequence relationships vary widely among different proteins, with absolute correlation strengths ranging from 0 to 0.8. In particular, two recent works have made contradictory observations. Yeh et al. (Mol. Biol. Evol. 31:135-139, 2014) found that both relative solvent accessibility (RSA) and weighted contact number (WCN) are good predictors of sitewise evolutionary rate in enzymes, with WCN clearly out-performing RSA. Shahmoradi et al. (J. Mol. Evol. 79:130-142, 2014) considered these same predictors (as well as others) in viral proteins and found much weaker correlations and no clear advantage of WCN over RSA. Because these two studies had substantial methodological differences, however, a direct comparison of their results is not possible. Here, we reanalyze the datasets of the two studies with one uniform analysis pipeline, and we find that many apparent discrepancies between the two analyses can be attributed to the extent of sequence divergence in individual alignments. Specifically, the alignments of the enzyme dataset are much more diverged than those of the virus dataset, and proteins with higher divergence exhibit, on average, stronger structure-sequence correlations. However, the highest structure-sequence correlations are observed at intermediate divergence levels, where both highly conserved and highly variable sites are present in the same alignment.
Collapse
Affiliation(s)
- Eleisha L Jackson
- Department of Integrative Biology, The University of Texas at Austin, Austin, Texas, 78712
- Center for Computational Biology and Bioinformatics, The University of Texas at Austin, Austin, Texas, 78712
- Institute for Cellular and Molecular Biology, The University of Texas at Austin, Austin, Texas, 78712
| | - Amir Shahmoradi
- Center for Computational Biology and Bioinformatics, The University of Texas at Austin, Austin, Texas, 78712
- Institute for Cellular and Molecular Biology, The University of Texas at Austin, Austin, Texas, 78712
- Department of Physics, The University of Texas at Austin, Austin, Texas, 78712
| | - Stephanie J Spielman
- Department of Integrative Biology, The University of Texas at Austin, Austin, Texas, 78712
- Center for Computational Biology and Bioinformatics, The University of Texas at Austin, Austin, Texas, 78712
- Institute for Cellular and Molecular Biology, The University of Texas at Austin, Austin, Texas, 78712
| | - Benjamin R Jack
- Department of Integrative Biology, The University of Texas at Austin, Austin, Texas, 78712
- Center for Computational Biology and Bioinformatics, The University of Texas at Austin, Austin, Texas, 78712
- Institute for Cellular and Molecular Biology, The University of Texas at Austin, Austin, Texas, 78712
| | - Claus O Wilke
- Department of Integrative Biology, The University of Texas at Austin, Austin, Texas, 78712
- Center for Computational Biology and Bioinformatics, The University of Texas at Austin, Austin, Texas, 78712
- Institute for Cellular and Molecular Biology, The University of Texas at Austin, Austin, Texas, 78712
| |
Collapse
|
21
|
Sudha G, Srinivasan N. Comparative analyses of quaternary arrangements in homo-oligomeric proteins in superfamilies: Functional implications. Proteins 2016; 84:1190-202. [PMID: 27177429 DOI: 10.1002/prot.25065] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2016] [Revised: 05/03/2016] [Accepted: 05/08/2016] [Indexed: 11/08/2022]
Abstract
A comprehensive analysis of the quaternary features of distantly related homo-oligomeric proteins is the focus of the current study. This study has been performed at the levels of quaternary state, symmetry, and quaternary structure. Quaternary state and quaternary structure refers to the number of subunits and spatial arrangements of subunits, respectively. Using a large dataset of available 3D structures of biologically relevant assemblies, we show that only 53% of the distantly related homo-oligomeric proteins have the same quaternary state. Considering these homologous homo-oligomers with the same quaternary state, conservation of quaternary structures is observed only in 38% of the pairs. In 36% of the pairs of distantly related homo-oligomers with different quaternary states the larger assembly in a pair shows high structural similarity with the entire quaternary structure of the related protein with lower quaternary state and it is referred as "Russian doll effect." The differences in quaternary state and structure have been suggested to contribute to the functional diversity. Detailed investigations show that even though the gross functions of many distantly related homo-oligomers are the same, finer level differences in molecular functions are manifested by differences in quaternary states and structures. Comparison of structures of biological assemblies in distantly and closely related homo-oligomeric proteins throughout the study differentiates the effects of sequence divergence on the quaternary structures and function. Knowledge inferred from this study can provide insights for improved protein structure classification and function prediction of homo-oligomers. Proteins 2016; 84:1190-1202. © 2016 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Govindarajan Sudha
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, 560012, India
| | | |
Collapse
|
22
|
Nepal R, Spencer J, Bhogal G, Nedunuri A, Poelman T, Kamath T, Chung E, Kantardjieff K, Gottlieb A, Lustig B. Logistic regression models to predict solvent accessible residues using sequence- and homology-based qualitative and quantitative descriptors applied to a domain-complete X-ray structure learning set. J Appl Crystallogr 2015; 48:1976-1984. [PMID: 26664348 PMCID: PMC4665666 DOI: 10.1107/s1600576715018531] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2014] [Accepted: 10/03/2015] [Indexed: 11/11/2022] Open
Abstract
A working example of relative solvent accessibility (RSA) prediction for proteins is presented. Novel logistic regression models with various qualitative descriptors that include amino acid type and quantitative descriptors that include 20- and six-term sequence entropy have been built and validated. A domain-complete learning set of over 1300 proteins is used to fit initial models with various sequence homology descriptors as well as query residue qualitative descriptors. Homology descriptors are derived from BLASTp sequence alignments, whereas the RSA values are determined directly from the crystal structure. The logistic regression models are fitted using dichotomous responses indicating buried or accessible solvent, with binary classifications obtained from the RSA values. The fitted models determine binary predictions of residue solvent accessibility with accuracies comparable to other less computationally intensive methods using the standard RSA threshold criteria 20 and 25% as solvent accessible. When an additional non-homology descriptor describing Lobanov-Galzitskaya residue disorder propensity is included, incremental improvements in accuracy are achieved with 25% threshold accuracies of 76.12 and 74.79% for the Manesh-215 and CASP(8+9) test sets, respectively. Moreover, the described software and the accompanying learning and validation sets allow students and researchers to explore the utility of RSA prediction with simple, physically intuitive models in any number of related applications.
Collapse
Affiliation(s)
- Reecha Nepal
- Department of Chemistry, San Jose State University, San Jose, CA 95192-0101, USA
| | - Joanna Spencer
- Department of Mathematics and Statistics, San Jose State University, San Jose, CA 95192-0101, USA
| | - Guneet Bhogal
- Department of Biomedical, Chemical and Materials Engineering, San Jose State University, San Jose, CA 95192-0101, USA
| | - Amulya Nedunuri
- Department of General Engineering, San Jose State University, San Jose, CA 95192-0101, USA
| | - Thomas Poelman
- Department of Chemistry and Biochemistry, Cal Poly San Luis Obispo, San Luis Obispo, CA 93407, USA
| | - Thejas Kamath
- Department of Bioengineering, University of California, San Diego, San Diego, CA 92093-0412, USA
| | - Edwin Chung
- Department of Biomedical, Chemical and Materials Engineering, San Jose State University, San Jose, CA 95192-0101, USA
| | - Katherine Kantardjieff
- College of Science and Mathematics, California State University San Marcos, San Marcos, CA 92096-0001, USA
| | - Andrea Gottlieb
- Department of Mathematics and Statistics, San Jose State University, San Jose, CA 95192-0101, USA
| | - Brooke Lustig
- Department of Chemistry, San Jose State University, San Jose, CA 95192-0101, USA
| |
Collapse
|
23
|
Capitani G, Duarte JM, Baskaran K, Bliven S, Somody JC. Understanding the fabric of protein crystals: computational classification of biological interfaces and crystal contacts. Bioinformatics 2015; 32:481-9. [PMID: 26508758 PMCID: PMC4743631 DOI: 10.1093/bioinformatics/btv622] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2015] [Accepted: 10/16/2015] [Indexed: 11/20/2022] Open
Abstract
Modern structural biology still draws the vast majority of information from crystallography, a technique where the objects being investigated are embedded in a crystal lattice. Given the complexity and variety of those objects, it becomes fundamental to computationally assess which of the interfaces in the lattice are biologically relevant and which are simply crystal contacts. Since the mid-1990s, several approaches have been applied to obtain high-accuracy classification of crystal contacts and biological protein–protein interfaces. This review provides an overview of the concepts and main approaches to protein interface classification: thermodynamic estimation of interface stability, evolutionary approaches based on conservation of interface residues, and co-occurrence of the interface across different crystal forms. Among the three categories, evolutionary approaches offer the strongest promise for improvement, thanks to the incessant growth in sequence knowledge. Importantly, protein interface classification algorithms can also be used on multimeric structures obtained using other high-resolution techniques or for protein assembly design or validation purposes. A key issue linked to protein interface classification is the identification of the biological assembly of a crystal structure and the analysis of its symmetry. Here, we highlight the most important concepts and problems to be overcome in assembly prediction. Over the next few years, tools and concepts of interface classification will probably become more frequently used and integrated in several areas of structural biology and structural bioinformatics. Among the main challenges for the future are better addressing of weak interfaces and the application of interface classification concepts to prediction problems like protein–protein docking. Supplementary information: Supplementary data are available at Bioinformatics online. Contact:guido.capitani@psi.ch
Collapse
Affiliation(s)
- Guido Capitani
- Laboratory of Biomolecular Research, Paul Scherrer Institute, OFLC/110, 5232 Villigen PSI, Department of Biology, ETH Zurich, 8093 Zurich, Switzerland
| | - Jose M Duarte
- Laboratory of Biomolecular Research, Paul Scherrer Institute, OFLC/110, 5232 Villigen PSI, Department of Biology, ETH Zurich, 8093 Zurich, Switzerland
| | - Kumaran Baskaran
- Laboratory of Biomolecular Research, Paul Scherrer Institute, OFLC/110, 5232 Villigen PSI
| | - Spencer Bliven
- Laboratory of Biomolecular Research, Paul Scherrer Institute, OFLC/110, 5232 Villigen PSI, Bioinformatics and Systems Biology Program, UC San Diego, La Jolla, CA 92093, National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA and
| | - Joseph C Somody
- Laboratory of Biomolecular Research, Paul Scherrer Institute, OFLC/110, 5232 Villigen PSI, Department of Computer Science, ETH Zurich, 8092 Zurich, Switzerland
| |
Collapse
|
24
|
Da Silva F, Desaphy J, Bret G, Rognan D. IChemPIC: A Random Forest Classifier of Biological and Crystallographic Protein–Protein Interfaces. J Chem Inf Model 2015; 55:2005-14. [DOI: 10.1021/acs.jcim.5b00190] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Affiliation(s)
- Franck Da Silva
- Laboratoire d’Innovation
Thérapeutique, UMR 7200 CNRS−Université de Strasbourg, 67400 Illkirch, France
| | - Jérémy Desaphy
- Laboratoire d’Innovation
Thérapeutique, UMR 7200 CNRS−Université de Strasbourg, 67400 Illkirch, France
| | - Guillaume Bret
- Laboratoire d’Innovation
Thérapeutique, UMR 7200 CNRS−Université de Strasbourg, 67400 Illkirch, France
| | - Didier Rognan
- Laboratoire d’Innovation
Thérapeutique, UMR 7200 CNRS−Université de Strasbourg, 67400 Illkirch, France
| |
Collapse
|
25
|
Barik A, Nithin C, Karampudi NBR, Mukherjee S, Bahadur RP. Probing binding hot spots at protein-RNA recognition sites. Nucleic Acids Res 2015; 44:e9. [PMID: 26365245 PMCID: PMC4737170 DOI: 10.1093/nar/gkv876] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2015] [Accepted: 08/23/2015] [Indexed: 01/30/2023] Open
Abstract
We use evolutionary conservation derived from structure alignment of polypeptide sequences along with structural and physicochemical attributes of protein–RNA interfaces to probe the binding hot spots at protein–RNA recognition sites. We find that the degree of conservation varies across the RNA binding proteins; some evolve rapidly compared to others. Additionally, irrespective of the structural class of the complexes, residues at the RNA binding sites are evolutionary better conserved than those at the solvent exposed surfaces. For recognitions involving duplex RNA, residues interacting with the major groove are better conserved than those interacting with the minor groove. We identify multi-interface residues participating simultaneously in protein–protein and protein–RNA interfaces in complexes where more than one polypeptide is involved in RNA recognition, and show that they are better conserved compared to any other RNA binding residues. We find that the residues at water preservation site are better conserved than those at hydrated or at dehydrated sites. Finally, we develop a Random Forests model using structural and physicochemical attributes for predicting binding hot spots. The model accurately predicts 80% of the instances of experimental ΔΔG values in a particular class, and provides a stepping-stone towards the engineering of protein–RNA recognition sites with desired affinity.
Collapse
Affiliation(s)
- Amita Barik
- Computational Structural Biology Laboratory, Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur-721302, India
| | - Chandran Nithin
- Computational Structural Biology Laboratory, Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur-721302, India
| | | | - Sunandan Mukherjee
- Computational Structural Biology Laboratory, Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur-721302, India
| | - Ranjit Prasad Bahadur
- Computational Structural Biology Laboratory, Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur-721302, India Advanced Technology Development Centre, Indian Institute of Technology Kharagpur, Kharagpur-721302, India
| |
Collapse
|
26
|
Andreani J, Guerois R. Evolution of protein interactions: From interactomes to interfaces. Arch Biochem Biophys 2014; 554:65-75. [DOI: 10.1016/j.abb.2014.05.010] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2014] [Revised: 04/28/2014] [Accepted: 05/12/2014] [Indexed: 12/16/2022]
|
27
|
Landry C, Levy E, Abd Rabbo D, Tarassov K, Michnick S. Extracting Insight from Noisy Cellular Networks. Cell 2013; 155:983-9. [DOI: 10.1016/j.cell.2013.11.003] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2013] [Indexed: 01/25/2023]
|
28
|
Lin CY, Chen YC, Lo YS, Yang JM. Inferring homologous protein-protein interactions through pair position specific scoring matrix. BMC Bioinformatics 2013; 14 Suppl 2:S11. [PMID: 23367879 PMCID: PMC3549806 DOI: 10.1186/1471-2105-14-s2-s11] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open
Abstract
Background The protein-protein interaction (PPI) is one of the most important features to understand biological processes. For a PPI, the physical domain-domain interaction (DDI) plays the key role for biology functions. In the post-genomic era, to rapidly identify homologous PPIs for analyzing the contact residue pairs of their interfaces within DDIs on a genomic scale is essential to determine PPI networks and the PPI interface evolution across multiple species. Results In this study, we proposed "pair Position Specific Scoring Matrix (pairPSSM)" to identify homologous PPIs. The pairPSSM can successfully distinguish the true protein complexes from unreasonable protein pairs with about 90% accuracy. For the test set including 1,122 representative heterodimers and 2,708,746 non-interacting protein pairs, the mean average precision and mean false positive rate of pairPSSM were 0.42 and 0.31, respectively. Moreover, we applied pairPSSM to identify ~450,000 homologous PPIs with their interacting domains and residues in seven common organisms (e.g. Homo sapiens, Mus musculus, Saccharomyces cerevisiae and Escherichia coli). Conclusions Our pairPSSM is able to provide statistical significance of residue pairs using evolutionary profiles and a scoring system for inferring homologous PPIs. According to our best knowledge, the pairPSSM is the first method for searching homologous PPIs across multiple species using pair position specific scoring matrix and a 3D dimer as the template to map interacting domain pairs of these PPIs. We believe that pairPSSM is able to provide valuable insights for the PPI evolution and networks across multiple species.
Collapse
Affiliation(s)
- Chun-Yu Lin
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, 300, Taiwan
| | | | | | | |
Collapse
|
29
|
Potapov V, Edelman M, Sobolev V. Residue-residue contacts: application to analysis of secondary structure interactions. Methods Mol Biol 2013; 932:159-173. [PMID: 22987352 DOI: 10.1007/978-1-62703-065-6_10] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]
Abstract
Protein structures and their complexes are formed and stabilized by interactions, both inside and outside of the protein. Analysis of such interactions helps in understanding different levels of structures (secondary, super-secondary, and oligomeric states). It can also assist molecular biologists in understanding structural consequences of modifying proteins and/or ligands. In this chapter, our definition of atom-atom and residue-residue contacts is described and applied to analysis of protein-protein interactions in dimeric β-sandwich proteins.
Collapse
|
30
|
Levy ED, Teichmann S. Structural, evolutionary, and assembly principles of protein oligomerization. PROGRESS IN MOLECULAR BIOLOGY AND TRANSLATIONAL SCIENCE 2013; 117:25-51. [PMID: 23663964 DOI: 10.1016/b978-0-12-386931-9.00002-7] [Citation(s) in RCA: 89] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/02/2022]
Abstract
In the protein universe, 30-50% of proteins self-assemble to form symmetrical complexes consisting of multiple copies of themselves, called homomers. The prevalence of homomers motivates us to review many of their properties. In Section 1, we describe the methods and challenges associated with quaternary structure inference-these methods are indeed at the basis of any analysis on homomers. In Section 2, we describe the morphological properties of homomers, as well as the database 3DComplex, which provides a taxonomy for both homomeric and heteromeric protein complexes. In Section 3, we review interface properties of homomeric complexes. In Section 4, we then present recent findings on the evolution of homomer interfaces, which we link in Section 5 to the evolution of homomers as entire entities. In Section 6, we discuss mechanisms involved in their assembly and how these mechanisms can be linked to evolution.
Collapse
Affiliation(s)
- Emmanuel D Levy
- Department of Structural Biology, Weizmann Institute of Science, Rehovot, Israel.
| | | |
Collapse
|
31
|
Duarte JM, Srebniak A, Schärer MA, Capitani G. Protein interface classification by evolutionary analysis. BMC Bioinformatics 2012; 13:334. [PMID: 23259833 PMCID: PMC3556496 DOI: 10.1186/1471-2105-13-334] [Citation(s) in RCA: 112] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2012] [Accepted: 12/15/2012] [Indexed: 01/01/2023] Open
Abstract
Background Distinguishing biologically relevant interfaces from lattice contacts in protein crystals is a fundamental problem in structural biology. Despite efforts towards the computational prediction of interface character, many issues are still unresolved. Results We present here a protein-protein interface classifier that relies on evolutionary data to detect the biological character of interfaces. The classifier uses a simple geometric measure, number of core residues, and two evolutionary indicators based on the sequence entropy of homolog sequences. Both aim at detecting differential selection pressure between interface core and rim or rest of surface. The core residues, defined as fully buried residues (>95% burial), appear to be fundamental determinants of biological interfaces: their number is in itself a powerful discriminator of interface character and together with the evolutionary measures it is able to clearly distinguish evolved biological contacts from crystal ones. We demonstrate that this definition of core residues leads to distinctively better results than earlier definitions from the literature. The stringent selection and quality filtering of structural and sequence data was key to the success of the method. Most importantly we demonstrate that a more conservative selection of homolog sequences - with relatively high sequence identities to the query - is able to produce a clearer signal than previous attempts. Conclusions An evolutionary approach like the one presented here is key to the advancement of the field, which so far was missing an effective method exploiting the evolutionary character of protein interfaces. Its coverage and performance will only improve over time thanks to the incessant growth of sequence databases. Currently our method reaches an accuracy of 89% in classifying interfaces of the Ponstingl 2003 datasets and it lends itself to a variety of useful applications in structural biology and bioinformatics. We made the corresponding software implementation available to the community as an easy-to-use graphical web interface at http://www.eppic-web.org.
Collapse
Affiliation(s)
- Jose M Duarte
- Paul Scherrer Institut, Villigen, CH-5232, Switzerland
| | | | | | | |
Collapse
|
32
|
Kastritis PL, Bonvin AMJJ. On the binding affinity of macromolecular interactions: daring to ask why proteins interact. J R Soc Interface 2012; 10:20120835. [PMID: 23235262 PMCID: PMC3565702 DOI: 10.1098/rsif.2012.0835] [Citation(s) in RCA: 276] [Impact Index Per Article: 23.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open
Abstract
Interactions between proteins are orchestrated in a precise and time-dependent manner, underlying cellular function. The binding affinity, defined as the strength of these interactions, is translated into physico-chemical terms in the dissociation constant (Kd), the latter being an experimental measure that determines whether an interaction will be formed in solution or not. Predicting binding affinity from structural models has been a matter of active research for more than 40 years because of its fundamental role in drug development. However, all available approaches are incapable of predicting the binding affinity of protein–protein complexes from coordinates alone. Here, we examine both theoretical and experimental limitations that complicate the derivation of structure–affinity relationships. Most work so far has concentrated on binary interactions. Systems of increased complexity are far from being understood. The main physico-chemical measure that relates to binding affinity is the buried surface area, but it does not hold for flexible complexes. For the latter, there must be a significant entropic contribution that will have to be approximated in the future. We foresee that any theoretical modelling of these interactions will have to follow an integrative approach considering the biology, chemistry and physics that underlie protein–protein recognition.
Collapse
Affiliation(s)
- Panagiotis L Kastritis
- Bijvoet Center for Biomolecular Research, Faculty of Science, Chemistry, Utrecht University, , Padualaan 8, Utrecht, The Netherlands
| | | |
Collapse
|
33
|
Winter C, Henschel A, Tuukkanen A, Schroeder M. Protein interactions in 3D: From interface evolution to drug discovery. J Struct Biol 2012; 179:347-58. [DOI: 10.1016/j.jsb.2012.04.009] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2011] [Revised: 03/27/2012] [Accepted: 04/18/2012] [Indexed: 11/25/2022]
|
34
|
White BR, Carlson JCT, Kerns JL, Wagner CR. Protein interface remodeling in a chemically induced protein dimer. J Mol Recognit 2012; 25:393-403. [DOI: 10.1002/jmr.2196] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Affiliation(s)
- Brian R. White
- Department of Medicinal Chemistry, College of Pharmacy; University of Minnesota; Minneapolis; MN; 55455; USA
| | - Jonathan C. T. Carlson
- Department of Medicinal Chemistry, College of Pharmacy; University of Minnesota; Minneapolis; MN; 55455; USA
| | - Jessie L. Kerns
- Department of Medicinal Chemistry, College of Pharmacy; University of Minnesota; Minneapolis; MN; 55455; USA
| | - Carston R. Wagner
- Department of Medicinal Chemistry, College of Pharmacy; University of Minnesota; Minneapolis; MN; 55455; USA
| |
Collapse
|
35
|
Illingworth CJR, Chintipalli SV, Serapian SA, Miller AD, Veverka V, Carr MD, Reynolds CA. The statistical significance of selected sense-antisense peptide interactions. J Comput Chem 2012; 33:1440-7. [PMID: 22488506 DOI: 10.1002/jcc.22977] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2012] [Revised: 02/28/2012] [Accepted: 03/01/2012] [Indexed: 11/11/2022]
Abstract
Sense and antisense peptides, encoded by sense and corresponding antisense DNA strands, are capable of specific interactions that could be a driving force to mediate protein-protein or protein-peptide binding associations. The complementary residue hypothesis suggests that these interactions are founded upon the sum of pairwise interactions between amino acids encoded by corresponding sense and antisense codons. Despite many successful experimental results obtained with the hypothesis, however, the physicochemical basis for these interactions is poorly understood. We examined the potential of the hypothesis for general identification of protein-protein interaction sites, and the possible role of the hypothesis in determining folding in a broad set of protein structures. In addition, we performed a structural study to investigate the binding of a complementary peptide to IL-1F2. Our results suggest that complementary residue pairs are no more frequent or conserved than average in protein-protein interfaces, and are statistically under-represented amongst contacting residue pairs in folded protein structures. Although our structural results matched experimental observations of binding between the peptide and IL-1F2, complementary residue interactions do not appear to be dominant in the bound structure. Overall, our data do not allow us to conclude that the complementary residue hypothesis accounts for specific sense-antisense peptide interactions.
Collapse
|
36
|
Liu JY, Li Z, Li H, Zhang JT. Critical residue that promotes protein dimerization: a story of partially exposed Phe25 in 14-3-3σ. J Chem Inf Model 2011; 51:2612-25. [PMID: 21870863 DOI: 10.1021/ci200212y] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Many proteins exist and function as oligomers. While hydrophobic interactions have been recognized as the major driving force for oligomerization, detailed molecular mechanisms for the assembly are unknown. Here, we used 14-3-3σ as a model protein and investigated the role of hydrophobic residues at the dimeric interface using MD simulations and coimmunoprecipitations. We found that a half-exposed and half-buried residue in the interface, Phe(25), plays a more important role in promoting homodimerization than the hydrophobic core residues by organizing both favorable hydrophobic and hydrophilic interactions. Phe(25) is critical in packing and stabilizing hydrophobic core residues. We conclude that the structural stability of hydrophobic cores is critical for a stable homodimer complex and this stable property can be bestowed by residues outside of hydrophobic core. The important organizing activity of Phe(25) for homodimerization of 14-3-3σ originates from its unique physical location, rigidity, size, and hydrophobicity. Thus, hydrophobic residues that are not deeply buried at the oligomeric interface may play important but different roles from the buried core residues and they may promote oligomerization by organizing co-operativity of core and other residues for favorable hydrophobic and electrostatic interactions.
Collapse
Affiliation(s)
- Jing-Yuan Liu
- Department of Pharmacology and Toxicology, Indiana University School of Medicine, Indianapolis, Indiana 46202, United States
| | | | | | | |
Collapse
|
37
|
Determinants, discriminants, conserved residues--a heuristic approach to detection of functional divergence in protein families. PLoS One 2011; 6:e24382. [PMID: 21931701 PMCID: PMC3171465 DOI: 10.1371/journal.pone.0024382] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2011] [Accepted: 08/08/2011] [Indexed: 11/19/2022] Open
Abstract
In this work, belonging to the field of comparative analysis of protein sequences, we focus on detection of functional specialization on the residue level. As the input, we take a set of sequences divided into groups of orthologues, each group known to be responsible for a different function. This provides two independent pieces of information: within group conservation and overlap in amino acid type across groups. We build our discussion around the set of scoring functions that keep the two separated and the source of the signal easy to trace back to its source.We propose a heuristic description of functional divergence that includes residue type exchangeability, both in the conservation and in the overlap measure, and does not make any assumptions on the rate of evolution in the groups other than the one under consideration. Residue types acceptable at a certain position within an orthologous group are described as a distribution which evolves in time, starting from a single ancestral type, and is subject to constraints that can be inferred only indirectly. To estimate the strength of the constraints, we compare the observed degrees of conservation and overlap with those expected in the hypothetical case of a freely evolving distribution.Our description matches the experiment well, but we also conclude that any attempt to capture the evolutionary behavior of specificity determining residues in terms of a scalar function will be tentative, because no single model can cover the variety of evolutionary behavior such residues exhibit. Especially, models expecting the same type of evolutionary behavior across functionally divergent groups tend to miss a portion of information otherwise retrievable by the conservation and overlap measures they use.
Collapse
|
38
|
Hayat S, Walter P, Park Y, Helms V. Prediction of the exposure status of transmembrane beta barrel residues from protein sequence. J Bioinform Comput Biol 2011; 9:43-65. [PMID: 21328706 DOI: 10.1142/s0219720011005240] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2010] [Revised: 09/21/2010] [Accepted: 09/22/2010] [Indexed: 11/18/2022]
Abstract
We present BTMX (Beta barrel TransMembrane eXposure), a computational method to predict the exposure status (i.e. exposed to the bilayer or hidden in the protein structure) of transmembrane residues in transmembrane beta barrel proteins (TMBs). BTMX predicts the exposure status of known TM residues with an accuracy of 84.2% over 2,225 residues and provides a confidence score for all predictions. Predictions made are in concert with the fact that hydrophobic residues tend to be more exposed to the bilayer. The biological relevance of the input parameters is also discussed. The highest prediction accuracy is obtained when a sliding window comprising three residues with similar C(α)-C(β) vector orientations is employed. The prediction accuracy of the BTMX method on a separate unseen non-redundant test dataset is 78.1%. By employing out-pointing residues that are exposed to the bilayer, we have identified various physico-chemical properties that show statistically significant differences between the beta strands located at the oligomeric interfaces compared to the non-oligomeric strands. The BTMX web server generates colored, annotated snake-plots as part of the prediction results and is available under the BTMX tab at http://service.bioinformatik.uni-saarland.de/tmx-site/. Exposure status prediction of TMB residues may be useful in 3D structure prediction of TMBs.
Collapse
Affiliation(s)
- Sikander Hayat
- Center for Bioinformatics, Saarland University, Germany.
| | | | | | | |
Collapse
|
39
|
Fernández‐Recio J. Prediction of protein binding sites and hot spots. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE 2011. [DOI: 10.1002/wcms.45] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
40
|
Kumar R, Bhakuni V. A functionally active dimer of mycobacterium tuberculosis malate synthase G. EUROPEAN BIOPHYSICS JOURNAL : EBJ 2010; 39:1557-1562. [PMID: 20306314 DOI: 10.1007/s00249-010-0598-7] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/25/2010] [Accepted: 03/02/2010] [Indexed: 05/29/2023]
Abstract
Malate synthase G is an important housekeeping enzyme of glyoxylate shunt in mycobacterium. The pleotropic function of this protein by virtue of its intracellular/extracellular localization and its behavior as an adhesin and virulence factor is quite enigmatic. Despite its importance in mycobacterium persistence, we do not know much about its biophysical and biochemical properties. Earlier reports suggest that the enzyme exists only as a monomer in prokaryotes; however, we observed the existence of both active monomer and dimer forms of the enzyme under physiological conditions. The dimeric form of the enzymes is more stable as compared to the monomeric form as evident from various biophysical parameters. In addition, the dimeric enzyme also shows enhanced stability against proteolysis than the monomers. Based on these studies, it seems that dimerization is an important factor in regulating stability. The differential localization and diverse functions of malate synthase other than its enzymatic role might be triggering the stabilization of the enzyme dimer and modulation of activity and stability in vivo.
Collapse
Affiliation(s)
- Ranjeet Kumar
- Division of Molecular and Structural Biology, Central Drug Research Institute, Lucknow 226 001, India.
| | | |
Collapse
|
41
|
Illingworth CJR, Scott PD, Parkes KEB, Snell CR, Campbell MP, Reynolds CA. Connectivity and binding-site recognition: Applications relevant to drug design. J Comput Chem 2010; 31:2677-88. [DOI: 10.1002/jcc.21561] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
42
|
Bell RE, Ben-Tal N. In silico identification of functional protein interfaces. Comp Funct Genomics 2010; 4:420-3. [PMID: 18629079 PMCID: PMC2447364 DOI: 10.1002/cfg.309] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2003] [Revised: 06/03/2003] [Accepted: 06/03/2003] [Indexed: 12/02/2022] Open
Abstract
Proteins perform many of their biological roles through protein–protein, protein–DNA or protein–ligand interfaces. The identification of the amino acids comprising
these interfaces often enhances our understanding of the biological function of
the proteins. Many methods for the detection of functional interfaces have been developed,
and large-scale analyses have provided assessments of their accuracy. Among
them are those that consider the size of the protein interface, its amino acid composition
and its physicochemical and geometrical properties. Other methods to this
effect use statistical potential functions of pairwise interactions, and evolutionary
information. The rationale of the evolutionary approach is that functional and structural
constraints impose selective pressure; hence, biologically important interfaces
often evolve at a slower pace than do other external regions of the protein. Recently,
an algorithm, Rate4Site, and a web-server, ConSurf (http://consurf.tau.ac.il/), for
the identification of functional interfaces based on the evolutionary relations among
homologous proteins as reflected in phylogenetic trees, were developed in our laboratory.
The explicit use of the tree topology and branch lengths makes the method
remarkably accurate and sensitive. Here we demonstrate its potency in the identification
of the functional interfaces of a hypothetical protein, the structure of which was
determined as part of the international structural genomics effort. Finally, we propose
to combine complementary procedures, in order to enhance the overall performance
of methods for the identification of functional interfaces in proteins.
Collapse
Affiliation(s)
- Rachel E Bell
- Department of Biochemistry, The George S. Wise Faculty of Life Sciences, Tel Aviv University, Ramat Aviv 69978, Israel
| | | |
Collapse
|
43
|
Martin J. Beauty is in the eye of the beholder: proteins can recognize binding sites of homologous proteins in more than one way. PLoS Comput Biol 2010; 6:e1000821. [PMID: 20585553 PMCID: PMC2887470 DOI: 10.1371/journal.pcbi.1000821] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2010] [Accepted: 05/18/2010] [Indexed: 11/18/2022] Open
Abstract
Understanding the mechanisms of protein-protein interaction is a fundamental problem with many practical applications. The fact that different proteins can bind similar partners suggests that convergently evolved binding interfaces are reused in different complexes. A set of protein complexes composed of non-homologous domains interacting with homologous partners at equivalent binding sites was collected in 2006, offering an opportunity to investigate this point. We considered 433 pairs of protein-protein complexes from the ABAC database (AB and AC binary protein complexes sharing a homologous partner A) and analyzed the extent of physico-chemical similarity at the atomic and residue level at the protein-protein interface. Homologous partners of the complexes were superimposed using Multiprot, and similar atoms at the interface were quantified using a five class grouping scheme and a distance cut-off. We found that the number of interfacial atoms with similar properties is systematically lower in the non-homologous proteins than in the homologous ones. We assessed the significance of the similarity by bootstrapping the atomic properties at the interfaces. We found that the similarity of binding sites is very significant between homologous proteins, as expected, but generally insignificant between the non-homologous proteins that bind to homologous partners. Furthermore, evolutionarily conserved residues are not colocalized within the binding sites of non-homologous proteins. We could only identify a limited number of cases of structural mimicry at the interface, suggesting that this property is less generic than previously thought. Our results support the hypothesis that different proteins can interact with similar partners using alternate strategies, but do not support convergent evolution.
Collapse
Affiliation(s)
- Juliette Martin
- Université de Lyon, Lyon, France; Université Lyon 1, IFR 128, CNRS, UMR 5086 Institut de Biologie et Chimie des Protéines (IBCP), Lyon, France.
| |
Collapse
|
44
|
Schärer MA, Grütter MG, Capitani G. CRK: An evolutionary approach for distinguishing biologically relevant interfaces from crystal contacts. Proteins 2010; 78:2707-13. [DOI: 10.1002/prot.22787] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
45
|
Guharoy M, Chakrabarti P. Conserved residue clusters at protein-protein interfaces and their use in binding site identification. BMC Bioinformatics 2010; 11:286. [PMID: 20507585 PMCID: PMC2894039 DOI: 10.1186/1471-2105-11-286] [Citation(s) in RCA: 68] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2010] [Accepted: 05/27/2010] [Indexed: 12/30/2022] Open
Abstract
BACKGROUND Biological evolution conserves protein residues that are important for structure and function. Both protein stability and function often require a certain degree of structural co-operativity between spatially neighboring residues and it has previously been shown that conserved residues occur clustered together in protein tertiary structures, enzyme active sites and protein-DNA interfaces. Residues comprising protein interfaces are often more conserved compared to those occurring elsewhere on the protein surface. We investigate the extent to which conserved residues within protein-protein interfaces are clustered together in three-dimensions. RESULTS Out of 121 and 392 interfaces in homodimers and heterocomplexes, 96.7 and 86.7%, respectively, have the conserved positions clustered within the overall interface region. The significance of this clustering was established in comparison to what is seen for the subsets of the same size of randomly selected residues from the interface. Conserved residues occurring in larger interfaces could often be sub-divided into two or more distinct sub-clusters. These structural cluster(s) comprising conserved residues indicate functionally important regions within the protein-protein interface that can be targeted for further structural and energetic analysis by experimental scanning mutagenesis. Almost 60% of experimental hot spot residues (with DeltaDeltaG > 2 kcal/mol) were localized to these conserved residue clusters. An analysis of the residue types that are enriched within these conserved subsets compared to the overall interface showed that hydrophobic and aromatic residues are favored, but charged residues (both positive and negative) are less common. The potential use of this method for discriminating binding sites (interfaces) versus random surface patches was explored by comparing the clustering of conserved residues within each of these regions--in about 50% cases the true interface is ranked among the top 10% of all surface patches. CONCLUSIONS Protein-protein interaction sites are much larger than small molecule biding sites, but still conserved residues are not randomly distributed over the whole interface and are distinctly clustered. The clustered nature of evolutionarily conserved residues within interfaces as compared to those within other surface patches not involved in binding has important implications for the identification of protein-protein binding sites and would have applications in docking studies.
Collapse
Affiliation(s)
- Mainak Guharoy
- Bioinformatics Centre, Bose Institute, P-1/12 CIT Scheme VIIM, Kolkata, India
| | | |
Collapse
|
46
|
Choi YS, Han SK, Kim J, Yang JS, Jeon J, Ryu SH, Kim S. ConPlex: a server for the evolutionary conservation analysis of protein complex structures. Nucleic Acids Res 2010; 38:W450-6. [PMID: 20435678 PMCID: PMC2896159 DOI: 10.1093/nar/gkq328] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Evolutionary conservation analyses are important for the identification of protein-protein interactions. For protein complex structures, sequence conservation has been applied to determine protein oligomerization states, to characterize native interfaces from non-specific crystal contacts, and to discriminate near-native structures from docking artifacts. However, a user-friendly web-based service for evolutionary conservation analysis of protein complexes has not been available. Therefore, we developed ConPlex (http://sbi.postech.ac.kr/ConPlex/) a web application that enables evolutionary conservation analyses of protein interactions within protein quaternary structures. Users provide protein complex structures; ConPlex automatically identifies protein interfaces and carries out evolutionary conservation analyses for the interface regions. Moreover, ConPlex allows the results of the residue-specific conservation analysis to be displayed on the protein complex structure and provides several options to customize the display output to fit each user's needs. We believe that ConPlex offers a convenient platform to analyze protein complex structures based on evolutionary conservation of protein-protein interface residues.
Collapse
Affiliation(s)
- Yoon Sup Choi
- School of Interdisciplinary Bioscience and Bioengineering, Pohang University of Science and Technology, Pohang, 790-784, Republic of Korea
| | | | | | | | | | | | | |
Collapse
|
47
|
Abstract
The assembly of subunits in protein oligomers is an important topic to study as a vast number of proteins exists as stable or transient oligomer and because it is a mechanism used by some protein oligomers for killing cells (e.g., perforin from the human immune system, pore-forming toxins from bacteria, phage, amoeba, protein misfolding diseases, etc.). Only a few of the amino acids that constitute a protein oligomer seem to regulate the capacity of the protein to assemble (to form interfaces), and some of these amino acids are localized at the interfaces that link the different chains. The identification of the residues of these interfaces is rather difficult. We have developed a series of programs, under the common name of Gemini, that can select the subset of the residues that is involved in the interfaces of a protein oligomer of known atomic structure, and generate a 2D interaction network (or graph) of the subset. The graphs generated for several oligomers demonstrate the accuracy of the selection of subsets that are involved in the geometrical and the chemical properties of interfaces. The results of the Gemini programs are in good agreement with those of similar programs with an advantage that Gemini programs can perform the residue selection much more rapidly. Moreover, Gemini programs can also perform on a single protein oligomer without the need of comparison partners. The graphs are extremely useful for comparative studies that would help in addressing questions not only on the sequence specificity of protein interfaces but also on the mechanism of the assembly of unrelated protein oligomers.
Collapse
Affiliation(s)
- Giovanni Feverati
- Laboratoire de physique théorique LAPTH, CNRS, UMR 5108 associé à l'Université de Savoie, BP 110, Annecy le Vieux, France
| | - Claire Lesieur
- Laboratoire de physique théorique LAPTH, CNRS, UMR 5108 associé à l'Université de Savoie, BP 110, Annecy le Vieux, France
- * E-mail:
| |
Collapse
|
48
|
McGuffee SR, Elcock AH. Diffusion, crowding & protein stability in a dynamic molecular model of the bacterial cytoplasm. PLoS Comput Biol 2010; 6:e1000694. [PMID: 20221255 PMCID: PMC2832674 DOI: 10.1371/journal.pcbi.1000694] [Citation(s) in RCA: 524] [Impact Index Per Article: 37.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2009] [Accepted: 01/30/2010] [Indexed: 01/24/2023] Open
Abstract
A longstanding question in molecular biology is the extent to which the behavior of macromolecules observed in vitro accurately reflects their behavior in vivo. A number of sophisticated experimental techniques now allow the behavior of individual types of macromolecule to be studied directly in vivo; none, however, allow a wide range of molecule types to be observed simultaneously. In order to tackle this issue we have adopted a computational perspective, and, having selected the model prokaryote Escherichia coli as a test system, have assembled an atomically detailed model of its cytoplasmic environment that includes 50 of the most abundant types of macromolecules at experimentally measured concentrations. Brownian dynamics (BD) simulations of the cytoplasm model have been calibrated to reproduce the translational diffusion coefficients of Green Fluorescent Protein (GFP) observed in vivo, and “snapshots” of the simulation trajectories have been used to compute the cytoplasm's effects on the thermodynamics of protein folding, association and aggregation events. The simulation model successfully describes the relative thermodynamic stabilities of proteins measured in E. coli, and shows that effects additional to the commonly cited “crowding” effect must be included in attempts to understand macromolecular behavior in vivo. The interior of a typical bacterial cell is a highly crowded place in which molecules must jostle and compete with each other in order to carry out their biological functions. The conditions under which such molecules are typically studied in vitro, however, are usually quite different: one or a few different types of molecules are studied as they freely diffuse in a dilute, aqueous solution. There is therefore a significant disconnect between the conditions under which molecules can be most usefully studied and the conditions under which such molecules usually “live”, and developing ways to bridge this gap is likely to be important for properly understanding molecular behavior in vivo. Toward this end, we show in this work that computer simulations can be used to model the interior of bacterial cells at a near atomic level of detail: the rates of diffusion of proteins are matched to known experimental values, and their thermodynamic stabilities are found to be in good agreement with the few measurements that have so far been performed in vivo. While the simulation approach is certainly not free of assumptions, it offers a potentially important complement to experimental techniques and provides a vivid illustration of molecular behavior inside a biological cell that is likely to be of significant educational value.
Collapse
Affiliation(s)
- Sean R. McGuffee
- Department of Biochemistry, University of Iowa, Iowa City, Iowa, United States of America
| | - Adrian H. Elcock
- Department of Biochemistry, University of Iowa, Iowa City, Iowa, United States of America
- * E-mail:
| |
Collapse
|
49
|
Choi YS, Yang JS, Choi Y, Ryu SH, Kim S. Evolutionary conservation in multiple faces of protein interaction. Proteins 2009; 77:14-25. [PMID: 19350617 DOI: 10.1002/prot.22410] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Protein interfaces are believed to be evolutionarily more conserved than the rest of the protein surface, but this has not been properly verified using a large protein structural set. Furthermore, recent systematic protein interaction analyses have proved that proteins interacting with many partners have multiple interfaces to connect protein interaction networks, which have never taken into account for conservation analysis of protein interface. Here, we studied the evolutionary conservation of protein interfaces using a large-scale dataset of 2646 protein interfaces with the classification of homodimeric/heterodimeric and obligatory/transient interactions, considering all their known multiple interfaces. We found that protein interfaces were indeed more conserved than noninterface surfaces, and the conservation level of protein interfaces increased when multiple interfaces were properly considered. These findings suggest that conservation analysis should be a good descriptor for protein interface identification and protein-protein interaction predictions. We applied this evolutionary feature to filter docking decoys and found that protein interface conservation worked remarkably well in selecting the near-native structures from the large number of generated docking complexes. Moreover, we discovered that a strong correlation exist between protein interface size and protein interface conservation, which could be a useful filter for the prediction of protein-protein interactions.
Collapse
Affiliation(s)
- Yoon Sup Choi
- School of Interdisciplinary Bioscience and Bioengineering, Pohang University of Science and Technology (POSTECH), Pohang 790-784, Republic of Korea
| | | | | | | | | |
Collapse
|
50
|
Cho KI, Kim D, Lee D. A feature-based approach to modeling protein-protein interaction hot spots. Nucleic Acids Res 2009; 37:2672-87. [PMID: 19273533 PMCID: PMC2677884 DOI: 10.1093/nar/gkp132] [Citation(s) in RCA: 106] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Identifying features that effectively represent the energetic contribution of an individual interface residue to the interactions between proteins remains problematic. Here, we present several new features and show that they are more effective than conventional features. By combining the proposed features with conventional features, we develop a predictive model for interaction hot spots. Initially, 54 multifaceted features, composed of different levels of information including structure, sequence and molecular interaction information, are quantified. Then, to identify the best subset of features for predicting hot spots, feature selection is performed using a decision tree. Based on the selected features, a predictive model for hot spots is created using support vector machine (SVM) and tested on an independent test set. Our model shows better overall predictive accuracy than previous methods such as the alanine scanning methods Robetta and FOLDEF, and the knowledge-based method KFC. Subsequent analysis yields several findings about hot spots. As expected, hot spots have a larger relative surface area burial and are more hydrophobic than other residues. Unexpectedly, however, residue conservation displays a rather complicated tendency depending on the types of protein complexes, indicating that this feature is not good for identifying hot spots. Of the selected features, the weighted atomic packing density, relative surface area burial and weighted hydrophobicity are the top 3, with the weighted atomic packing density proving to be the most effective feature for predicting hot spots. Notably, we find that hot spots are closely related to π–related interactions, especially π · · · π interactions.
Collapse
Affiliation(s)
- Kyu-il Cho
- Department of Bio and Brain Engineering, KAIST, 305-701, Daejeon, South Korea
| | | | | |
Collapse
|