1
|
Sarma H, Jamir E, Sastry GN. Protein-protein interaction of RdRp with its co-factor NSP8 and NSP7 to decipher the interface hotspot residues for drug targeting: A comparison between SARS-CoV-2 and SARS-CoV. J Mol Struct 2022; 1257:132602. [PMID: 35153334 PMCID: PMC8824464 DOI: 10.1016/j.molstruc.2022.132602] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2021] [Revised: 02/03/2022] [Accepted: 02/07/2022] [Indexed: 02/09/2023]
Abstract
In this study we explored the molecular mechanism of RdRp (Non-Structural Protein, NSP12) interaction with its co-factors NSP7 and NSP8 which is the main toolbox for RNA replication and transcription of SARS-CoV-2 and SARS-CoV. The replication complex is a heterotetramer consists of one NSP12, one NSP7 and two NSP8. Extensive molecular dynamics (MD) simulations were applied on both the heterotetramer complexes to generate the conformations and were used to estimate the MMPBSA binding free energy (BFE) and per-residue energy decomposition of NSP12-NSP8 and NSP12-NSP7 and NSP7-NSP8 complexes. The BFE of SARS-CoV-2 heterotetramer complex with its corresponding partner protein was significantly higher as compared to SARS-CoV. Interface hotspot residues were predicted using different methods implemented in KFC (Knowledge-based FADA and Contracts), HotRegion and Robetta web servers. Per-residue energy decomposition analysis showed that the predicted interface hotspot residues contribute more energy towards the formation of complexes and most of the predicted hotspot residues are clustered together. However, there is a slight difference in the residue-wise energy contribution in the interface NSPs on heterotetramer viral replication complex of both coronaviruses. While the overall replication complex of SARS-CoV-2 was found to be slightly flexible as compared to SARS-CoV. This difference in terms of structural flexibility/stability and energetic characteristics of interface residues including hotspots at PPI interface in the viral replication complexes may be the reason of higher rate of RNA replication of SARS-CoV-2 as compared to SARS-CoV. Overall, the interaction profile at PPI interface such as, interface area, hotspot residues, nature of bonds and energies between NSPs, may provide valuable insights in designing of small molecules or peptide/peptidomimetic ligands which can fit into the PPI interface to disrupt the interaction.
Collapse
Affiliation(s)
- Himakshi Sarma
- Advanced Computation and Data Sciences Division, CSIR - North East Institute of Science and Technology, Jorhat, Assam, India
| | - Esther Jamir
- Advanced Computation and Data Sciences Division, CSIR - North East Institute of Science and Technology, Jorhat, Assam, India.,Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, India
| | - G Narahari Sastry
- Advanced Computation and Data Sciences Division, CSIR - North East Institute of Science and Technology, Jorhat, Assam, India.,Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, India
| |
Collapse
|
2
|
Das S, Chakrabarti S. Classification and prediction of protein-protein interaction interface using machine learning algorithm. Sci Rep 2021; 11:1761. [PMID: 33469042 PMCID: PMC7815773 DOI: 10.1038/s41598-020-80900-2] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2020] [Accepted: 12/15/2020] [Indexed: 01/29/2023] Open
Abstract
Structural insight of the protein-protein interaction (PPI) interface can provide knowledge about the kinetics, thermodynamics and molecular functions of the complex while elucidating its role in diseases and further enabling it as a potential therapeutic target. However, owing to experimental lag in solving protein-protein complex structures, three-dimensional (3D) knowledge of the PPI interfaces can be gained via computational approaches like molecular docking and post-docking analyses. Despite development of numerous docking tools and techniques, success in identification of native like interfaces based on docking score functions is limited. Hence, we employed an in-depth investigation of the structural features of the interface that might successfully delineate native complexes from non-native ones. We identify interface properties, which show statistically significant difference between native and non-native interfaces belonging to homo and hetero, protein-protein complexes. Utilizing these properties, a support vector machine (SVM) based classification scheme has been implemented to differentiate native and non-native like complexes generated using docking decoys. Benchmarking and comparative analyses suggest very good performance of our SVM classifiers. Further, protein interactions, which are proven via experimental findings but not resolved structurally, were subjected to this approach where 3D-models of the complexes were generated and most likely interfaces were predicted. A web server called Protein Complex Prediction by Interface Properties (PCPIP) is developed to predict whether interface of a given protein-protein dimer complex resembles known protein interfaces. The server is freely available at http://www.hpppi.iicb.res.in/pcpip/ .
Collapse
Affiliation(s)
- Subhrangshu Das
- grid.417635.20000 0001 2216 5074Structural Biology and Bioinformatics Division, CSIR-Indian Institute of Chemical Biology, Kolkata, WB India
| | - Saikat Chakrabarti
- grid.417635.20000 0001 2216 5074Structural Biology and Bioinformatics Division, CSIR-Indian Institute of Chemical Biology, Kolkata, WB India
| |
Collapse
|
3
|
Romero-Durana M, Jiménez-García B, Fernández-Recio J. pyDockEneRes: per-residue decomposition of protein-protein docking energy. Bioinformatics 2020; 36:2284-2285. [PMID: 31808797 DOI: 10.1093/bioinformatics/btz884] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2019] [Revised: 11/17/2019] [Accepted: 12/05/2019] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION Protein-protein interactions are key to understand biological processes at the molecular level. As a complement to experimental characterization of protein interactions, computational docking methods have become useful tools for the structural and energetics modeling of protein-protein complexes. A key aspect of such algorithms is the use of scoring functions to evaluate the generated docking poses and try to identify the best models. When the scoring functions are based on energetic considerations, they can help not only to provide a reliable structural model for the complex, but also to describe energetic aspects of the interaction. This is the case of the scoring function used in pyDock, a combination of electrostatics, desolvation and van der Waals energy terms. Its correlation with experimental binding affinity values of protein-protein complexes was explored in the past, but the per-residue decomposition of the docking energy was never systematically analyzed. RESULTS Here, we present pyDockEneRes (pyDock Energy per-Residue), a web server that provides pyDock docking energy partitioned at the residue level, giving a much more detailed description of the docking energy landscape. Additionally, pyDockEneRes computes the contribution to the docking energy of the side-chain atoms. This fast approach can be applied to characterize a complex structure in order to identify energetically relevant residues (hot-spots) and estimate binding affinity changes upon mutation to alanine. AVAILABILITY AND IMPLEMENTATION The server does not require registration by the user and is freely accessible for academics at https://life.bsc.es/pid/pydockeneres. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Miguel Romero-Durana
- Barcelona Supercomputing Center (BSC), 08034 Barcelona.,Instituto de Ciencias de la Vid y del Vino (ICVV), CSIC-Universidad de La Rioja-Gobierno de La Rioja, 26007 Logroño, Spain
| | - Brian Jiménez-García
- Barcelona Supercomputing Center (BSC), 08034 Barcelona.,Bijvoet Center for Biomolecular Research, Faculty of Science-Chemistry, Utrecht University, 3584 CH Utrecht, The Netherlands
| | - Juan Fernández-Recio
- Barcelona Supercomputing Center (BSC), 08034 Barcelona.,Instituto de Ciencias de la Vid y del Vino (ICVV), CSIC-Universidad de La Rioja-Gobierno de La Rioja, 26007 Logroño, Spain.,Institut de Biologia Molecular de Barcelona (IBMB), CSIC, 08028 Barcelona, Spain
| |
Collapse
|
4
|
Perthold JW, Oostenbrink C. GroScore: Accurate Scoring of Protein–Protein Binding Poses Using Explicit-Solvent Free-Energy Calculations. J Chem Inf Model 2019; 59:5074-5085. [DOI: 10.1021/acs.jcim.9b00687] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Jan Walther Perthold
- Institute of Molecular Modeling and Simulation, University of Natural Resources and Life Sciences, Muthgasse 18, 1190 Vienna, Austria
| | - Chris Oostenbrink
- Institute of Molecular Modeling and Simulation, University of Natural Resources and Life Sciences, Muthgasse 18, 1190 Vienna, Austria
| |
Collapse
|
5
|
Rosell M, Rodríguez‐Lumbreras LA, Romero‐Durana M, Jiménez‐García B, Díaz L, Fernández‐Recio J. Integrative modeling of protein‐protein interactions with pyDock for the new docking challenges. Proteins 2019; 88:999-1008. [DOI: 10.1002/prot.25858] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2019] [Revised: 10/30/2019] [Accepted: 11/15/2019] [Indexed: 01/12/2023]
Affiliation(s)
- Mireia Rosell
- Barcelona Supercomputing Center (BSC) Barcelona Spain
- Instituto de Ciencias de la Vid y del Vino (CSIC, Universidad de La Rioja, Gobierno de La Rioja) Logroño Spain
| | - Luis A. Rodríguez‐Lumbreras
- Barcelona Supercomputing Center (BSC) Barcelona Spain
- Instituto de Ciencias de la Vid y del Vino (CSIC, Universidad de La Rioja, Gobierno de La Rioja) Logroño Spain
| | - Miguel Romero‐Durana
- Barcelona Supercomputing Center (BSC) Barcelona Spain
- Instituto de Ciencias de la Vid y del Vino (CSIC, Universidad de La Rioja, Gobierno de La Rioja) Logroño Spain
- Structural Biology Unit, Instituto de Biología Molecular de Barcelona (IBMB‐CSIC) Barcelona Spain
| | | | - Lucía Díaz
- Barcelona Supercomputing Center (BSC) Barcelona Spain
| | - Juan Fernández‐Recio
- Barcelona Supercomputing Center (BSC) Barcelona Spain
- Instituto de Ciencias de la Vid y del Vino (CSIC, Universidad de La Rioja, Gobierno de La Rioja) Logroño Spain
- Structural Biology Unit, Instituto de Biología Molecular de Barcelona (IBMB‐CSIC) Barcelona Spain
| |
Collapse
|
6
|
Jankauskaite J, Jiménez-García B, Dapkunas J, Fernández-Recio J, Moal IH. SKEMPI 2.0: an updated benchmark of changes in protein-protein binding energy, kinetics and thermodynamics upon mutation. Bioinformatics 2019; 35:462-469. [PMID: 30020414 PMCID: PMC6361233 DOI: 10.1093/bioinformatics/bty635] [Citation(s) in RCA: 141] [Impact Index Per Article: 28.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2018] [Accepted: 07/17/2018] [Indexed: 11/18/2022] Open
Abstract
Motivation Understanding the relationship between the sequence, structure, binding energy, binding kinetics and binding thermodynamics of protein–protein interactions is crucial to understanding cellular signaling, the assembly and regulation of molecular complexes, the mechanisms through which mutations lead to disease, and protein engineering. Results We present SKEMPI 2.0, a major update to our database of binding free energy changes upon mutation for structurally resolved protein–protein interactions. This version now contains manually curated binding data for 7085 mutations, an increase of 133%, including changes in kinetics for 1844 mutations, enthalpy and entropy changes for 443 mutations, and 440 mutations, which abolish detectable binding. Availability and implementation The database is available as supplementary data and at https://life.bsc.es/pid/skempi2/. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Justina Jankauskaite
- Institute of Biotechnology, Life Sciences Center, Vilnius University, Vilnius, Lithuania
| | - Brian Jiménez-García
- Barcelona Supercomputing Center (BSC), Barcelona, Spain.,Bijvoet Center for Biomolecular Research, Faculty of Science, Utrecht University, Utrecht, the Netherlands
| | - Justas Dapkunas
- Institute of Biotechnology, Life Sciences Center, Vilnius University, Vilnius, Lithuania
| | - Juan Fernández-Recio
- Barcelona Supercomputing Center (BSC), Barcelona, Spain.,Institut de Biologia Molecular de Barcelona (IBMB), CSIC, Barcelona, Spain
| | - Iain H Moal
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, UK
| |
Collapse
|
7
|
Jiménez-García B, Roel-Touris J, Romero-Durana M, Vidal M, Jiménez-González D, Fernández-Recio J. LightDock: a new multi-scale approach to protein-protein docking. Bioinformatics 2018; 34:49-55. [PMID: 28968719 DOI: 10.1093/bioinformatics/btx555] [Citation(s) in RCA: 50] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2017] [Accepted: 09/01/2017] [Indexed: 12/18/2022] Open
Abstract
Motivation Computational prediction of protein-protein complex structure by docking can provide structural and mechanistic insights for protein interactions of biomedical interest. However, current methods struggle with difficult cases, such as those involving flexible proteins, low-affinity complexes or transient interactions. A major challenge is how to efficiently sample the structural and energetic landscape of the association at different resolution levels, given that each scoring function is often highly coupled to a specific type of search method. Thus, new methodologies capable of accommodating multi-scale conformational flexibility and scoring are strongly needed. Results We describe here a new multi-scale protein-protein docking methodology, LightDock, capable of accommodating conformational flexibility and a variety of scoring functions at different resolution levels. Implicit use of normal modes during the search and atomic/coarse-grained combined scoring functions yielded improved predictive results with respect to state-of-the-art rigid-body docking, especially in flexible cases. Availability and implementation The source code of the software and installation instructions are available for download at https://life.bsc.es/pid/lightdock/. Contact juanf@bsc.es. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Brian Jiménez-García
- Life Sciences Department, Barcelona Supercomputing Center (BSC), 08034 Barcelona, Spain
| | - Jorge Roel-Touris
- Life Sciences Department, Barcelona Supercomputing Center (BSC), 08034 Barcelona, Spain
| | - Miguel Romero-Durana
- Life Sciences Department, Barcelona Supercomputing Center (BSC), 08034 Barcelona, Spain
| | - Miquel Vidal
- Life Sciences Department, Barcelona Supercomputing Center (BSC), 08034 Barcelona, Spain
| | - Daniel Jiménez-González
- Life Sciences Department, Barcelona Supercomputing Center (BSC), 08034 Barcelona, Spain.,Department of Computer Architecture, Universitat Politècnica de Catalunya, 08034 Barcelona, Spain
| | - Juan Fernández-Recio
- Life Sciences Department, Barcelona Supercomputing Center (BSC), 08034 Barcelona, Spain.,Structural Biology Unit, IBMB-CSIC, 08028 Barcelona, Spain
| |
Collapse
|
8
|
Yan Y, Wen Z, Wang X, Huang SY. Addressing recent docking challenges: A hybrid strategy to integrate template-based and free protein-protein docking. Proteins 2017; 85:497-512. [PMID: 28026062 DOI: 10.1002/prot.25234] [Citation(s) in RCA: 80] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2016] [Revised: 12/15/2016] [Accepted: 12/16/2016] [Indexed: 12/23/2022]
Abstract
Protein-protein docking is an important computational tool for predicting protein-protein interactions. With the rapid development of proteomics projects, more and more experimental binding information ranging from mutagenesis data to three-dimensional structures of protein complexes are becoming available. Therefore, how to appropriately incorporate the biological information into traditional ab initio docking has been an important issue and challenge in the field of protein-protein docking. To address these challenges, we have developed a Hybrid DOCKing protocol of template-based and template-free approaches, referred to as HDOCK. The basic procedure of HDOCK is to model the structures of individual components based on the template complex by a template-based method if a template is available; otherwise, the component structures will be modeled based on monomer proteins by regular homology modeling. Then, the complex structure of the component models is predicted by traditional protein-protein docking. With the HDOCK protocol, we have participated in the CPARI experiment for rounds 28-35. Out of the 25 CASP-CAPRI targets for oligomer modeling, our HDOCK protocol predicted correct models for 16 targets, ranking one of the top algorithms in this challenge. Our docking method also made correct predictions on other CAPRI challenges such as protein-peptide binding for 6 out of 8 targets and water predictions for 2 out of 2 targets. The advantage of our hybrid docking approach over pure template-based docking was further confirmed by a comparative evaluation on 20 CASP-CAPRI targets. Proteins 2017; 85:497-512. © 2016 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Yumeng Yan
- School of Physics, Huazhong University of Science and Technology, Wuhan Hubei, 430074, People's Republic of China
| | - Zeyu Wen
- School of Physics, Huazhong University of Science and Technology, Wuhan Hubei, 430074, People's Republic of China
| | - Xinxiang Wang
- School of Physics, Huazhong University of Science and Technology, Wuhan Hubei, 430074, People's Republic of China
| | - Sheng-You Huang
- School of Physics, Huazhong University of Science and Technology, Wuhan Hubei, 430074, People's Republic of China
| |
Collapse
|
9
|
Pérez-Cano L, Romero-Durana M, Fernández-Recio J. Structural and energy determinants in protein-RNA docking. Methods 2016; 118-119:163-170. [PMID: 27816523 DOI: 10.1016/j.ymeth.2016.11.001] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2016] [Revised: 10/14/2016] [Accepted: 11/01/2016] [Indexed: 01/02/2023] Open
Abstract
Deciphering the structural and energetic determinants of protein-RNA interactions harbors the potential to understand key cell processes at molecular level, such as gene expression and regulation. With this purpose, computational methods like docking aim to complement current biophysical and structural biology efforts. However, the few reported docking algorithms for protein-RNA interactions show limited predictive success rates, mainly due to incomplete sampling of the conformational space of both the protein and the RNA molecules, as well as to the difficulties of the scoring function in identifying the correct docking models. Here, we have tested the predictive value of a variety of knowledge-based and energetic scoring functions on a recently published protein-RNA docking benchmark and developed a scoring function able to efficiently discriminate docking decoys. We first performed docking calculations with the bound conformation, which allowed us to analyze the problem in optimal conditions. We found that geometry-based terms and electrostatics were the most important scoring terms, while binding propensities and desolvation were much less relevant for the scoring of protein-RNA models. This is in contrast with what we observed for protein-protein docking. The results also showed an interesting dependence of the predictive rates on the flexibility of the protein molecule, which arises from the observed higher positive charge of flexible interfaces and provides hints for future development of more efficient protein-RNA docking methods.
Collapse
Affiliation(s)
- Laura Pérez-Cano
- Joint BSC-CRG-IRB Research Program in Computational Biology, Life Sciences Department, Barcelona Supercomputing Center (BSC), Jordi Girona 29, Barcelona 08034, Spain; Center for Neurobehavioral Genetics and Center for Autism Research and Treatment, Semel Institute, David Geffen School of Medicine, University of California, Los Angeles, CA, USA
| | - Miguel Romero-Durana
- Joint BSC-CRG-IRB Research Program in Computational Biology, Life Sciences Department, Barcelona Supercomputing Center (BSC), Jordi Girona 29, Barcelona 08034, Spain
| | - Juan Fernández-Recio
- Joint BSC-CRG-IRB Research Program in Computational Biology, Life Sciences Department, Barcelona Supercomputing Center (BSC), Jordi Girona 29, Barcelona 08034, Spain.
| |
Collapse
|
10
|
Pallara C, Jiménez-García B, Romero M, Moal IH, Fernández-Recio J. pyDock scoring for the new modeling challenges in docking: Protein-peptide, homo-multimers, and domain-domain interactions. Proteins 2016; 85:487-496. [PMID: 27701776 DOI: 10.1002/prot.25184] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2016] [Revised: 09/22/2016] [Accepted: 10/02/2016] [Indexed: 12/18/2022]
Abstract
The sixth CAPRI edition included new modeling challenges, such as the prediction of protein-peptide complexes, and the modeling of homo-oligomers and domain-domain interactions as part of the first joint CASP-CAPRI experiment. Other non-standard targets included the prediction of interfacial water positions and the modeling of the interactions between proteins and nucleic acids. We have participated in all proposed targets of this CAPRI edition both as predictors and as scorers, with new protocols to efficiently use our docking and scoring scheme pyDock in a large variety of scenarios. In addition, we have participated for the first time in the servers section, with our recently developed webserver, pyDockWeb. Excluding the CASP-CAPRI cases, we submitted acceptable models (or better) for 7 out of the 18 evaluated targets as predictors, 4 out of the 11 targets as scorers, and 6 out of the 18 targets as servers. The overall success rates were below those in past CAPRI editions. This shows the challenging nature of this last edition, with many difficult targets for which no participant submitted a single acceptable model. Interestingly, we submitted acceptable models for 83% of the evaluated protein-peptide targets. As for the 25 cases of the CASP-CAPRI experiment, in which we used a larger variety of modeling techniques (template-based, symmetry restraints, literature information, etc.), we submitted acceptable models for 56% of the targets. In summary, this CAPRI edition showed that pyDock scheme can be efficiently adapted to the increasing variety of problems that the protein interactions field is currently facing. Proteins 2017; 85:487-496. © 2016 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Chiara Pallara
- Life Sciences Department, Joint BSC-CRG-IRB Research Program in Computational Biology, Barcelona Supercomputing Center, Barcelona, Spain
| | - Brian Jiménez-García
- Life Sciences Department, Joint BSC-CRG-IRB Research Program in Computational Biology, Barcelona Supercomputing Center, Barcelona, Spain
| | - Miguel Romero
- Life Sciences Department, Joint BSC-CRG-IRB Research Program in Computational Biology, Barcelona Supercomputing Center, Barcelona, Spain
| | - Iain H Moal
- Life Sciences Department, Joint BSC-CRG-IRB Research Program in Computational Biology, Barcelona Supercomputing Center, Barcelona, Spain.,European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, United Kingdom
| | - Juan Fernández-Recio
- Life Sciences Department, Joint BSC-CRG-IRB Research Program in Computational Biology, Barcelona Supercomputing Center, Barcelona, Spain
| |
Collapse
|
11
|
Pallara C, Rueda M, Abagyan R, Fernández-Recio J. Conformational Heterogeneity of Unbound Proteins Enhances Recognition in Protein-Protein Encounters. J Chem Theory Comput 2016; 12:3236-49. [PMID: 27294484 DOI: 10.1021/acs.jctc.6b00204] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
To understand cellular processes at the molecular level we need to improve our knowledge of protein-protein interactions, from a structural, mechanistic, and energetic point of view. Current theoretical studies and computational docking simulations show that protein dynamics plays a key role in protein association and support the need for including protein flexibility in modeling protein interactions. Assuming the conformational selection binding mechanism, in which the unbound state can sample bound conformers, one possible strategy to include flexibility in docking predictions would be the use of conformational ensembles originated from unbound protein structures. Here we present an exhaustive computational study about the use of precomputed unbound ensembles in the context of protein docking, performed on a set of 124 cases of the Protein-Protein Docking Benchmark 3.0. Conformational ensembles were generated by conformational optimization and refinement with MODELLER and by short molecular dynamics trajectories with AMBER. We identified those conformers providing optimal binding and investigated the role of protein conformational heterogeneity in protein-protein recognition. Our results show that a restricted conformational refinement can generate conformers with better binding properties and improve docking encounters in medium-flexible cases. For more flexible cases, a more extended conformational sampling based on Normal Mode Analysis was proven helpful. We found that successful conformers provide better energetic complementarity to the docking partners, which is compatible with recent views of binding association. In addition to the mechanistic considerations, these findings could be exploited for practical docking predictions of improved efficiency.
Collapse
Affiliation(s)
- Chiara Pallara
- Joint BSC-CRG-IRB Research Program in Computational Biology, Life Sciences Department, Barcelona Supercomputing Center , C/Jordi Girona 29, Barcelona 08034, Spain
| | - Manuel Rueda
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego , 9500 Gilman Drive, La Jolla, California 92093, United States
| | - Ruben Abagyan
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego , 9500 Gilman Drive, La Jolla, California 92093, United States
| | - Juan Fernández-Recio
- Joint BSC-CRG-IRB Research Program in Computational Biology, Life Sciences Department, Barcelona Supercomputing Center , C/Jordi Girona 29, Barcelona 08034, Spain
| |
Collapse
|
12
|
Liu Q, Ren J, Song J, Li J. Co-Occurring Atomic Contacts for the Characterization of Protein Binding Hot Spots. PLoS One 2015; 10:e0144486. [PMID: 26675422 PMCID: PMC4684219 DOI: 10.1371/journal.pone.0144486] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2014] [Accepted: 11/19/2015] [Indexed: 11/19/2022] Open
Abstract
A binding hot spot is a small area at a protein-protein interface that can make significant contribution to binding free energy. This work investigates the substantial contribution made by some special co-occurring atomic contacts at a binding hot spot. A co-occurring atomic contact is a pair of atomic contacts that are close to each other with no more than three covalent-bond steps. We found that two kinds of co-occurring atomic contacts can play an important part in the accurate prediction of binding hot spot residues. One is the co-occurrence of two nearby hydrogen bonds. For example, mutations of any residue in a hydrogen bond network consisting of multiple co-occurring hydrogen bonds could disrupt the interaction considerably. The other kind of co-occurring atomic contact is the co-occurrence of a hydrophobic carbon contact and a contact between a hydrophobic carbon atom and a π ring. In fact, this co-occurrence signifies the collective effect of hydrophobic contacts. We also found that the B-factor measurements of several specific groups of amino acids are useful for the prediction of hot spots. Taking the B-factor, individual atomic contacts and the co-occurring contacts as features, we developed a new prediction method and thoroughly assessed its performance via cross-validation and independent dataset test. The results show that our method achieves higher prediction performance than well-known methods such as Robetta, FoldX and Hotpoint. We conclude that these contact descriptors, in particular the novel co-occurring atomic contacts, can be used to facilitate accurate and interpretable characterization of protein binding hot spots.
Collapse
Affiliation(s)
- Qian Liu
- Advanced Analytics Institute, University of Technology Sydney, Broadway, NSW 2007, Australia
| | - Jing Ren
- Advanced Analytics Institute, University of Technology Sydney, Broadway, NSW 2007, Australia
| | - Jiangning Song
- Department of Biochemistry and Molecular Biology, Faculty of Medicine, Monash University, Melbourne, VIC 3800, Australia
- Centre for Research in Intelligent Systems, Faculty of Information Technology, Monash University, Melbourne, VIC 3800, Australia
| | - Jinyan Li
- Advanced Analytics Institute and Centre for Health Technologies, University of Technology Sydney, Broadway, NSW 2007, Australia
| |
Collapse
|
13
|
Shih ESC, Hwang MJ. NPPD: A Protein-Protein Docking Scoring Function Based on Dyadic Differences in Networks of Hydrophobic and Hydrophilic Amino Acid Residues. BIOLOGY 2015; 4:282-97. [PMID: 25811640 PMCID: PMC4498300 DOI: 10.3390/biology4020282] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/27/2014] [Accepted: 03/16/2015] [Indexed: 11/16/2022]
Abstract
Protein-protein docking (PPD) predictions usually rely on the use of a scoring function to rank docking models generated by exhaustive sampling. To rank good models higher than bad ones, a large number of scoring functions have been developed and evaluated, but the methods used for the computation of PPD predictions remain largely unsatisfactory. Here, we report a network-based PPD scoring function, the NPPD, in which the network consists of two types of network nodes, one for hydrophobic and the other for hydrophilic amino acid residues, and the nodes are connected when the residues they represent are within a certain contact distance. We showed that network parameters that compute dyadic interactions and those that compute heterophilic interactions of the amino acid networks thus constructed allowed NPPD to perform well in a benchmark evaluation of 115 PPD scoring functions, most of which, unlike NPPD, are based on some sort of protein-protein interaction energy. We also showed that NPPD was highly complementary to these energy-based scoring functions, suggesting that the combined use of conventional scoring functions and NPPD might significantly improve the accuracy of current PPD predictions.
Collapse
Affiliation(s)
- Edward S C Shih
- Institute of Biomedical Sciences, Academia Sinica, Nankang, Taipei 115, Taiwan.
| | - Ming-Jing Hwang
- Institute of Biomedical Sciences, Academia Sinica, Nankang, Taipei 115, Taiwan.
| |
Collapse
|
14
|
Moal IH, Dapkūnas J, Fernández-Recio J. Inferring the microscopic surface energy of protein-protein interfaces from mutation data. Proteins 2015; 83:640-50. [PMID: 25586563 DOI: 10.1002/prot.24761] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2014] [Revised: 12/04/2014] [Accepted: 12/21/2014] [Indexed: 11/11/2022]
Abstract
Mutations at protein-protein recognition sites alter binding strength by altering the chemical nature of the interacting surfaces. We present a simple surface energy model, parameterized with empirical ΔΔG values, yielding mean energies of -48 cal mol(-1) Å(-2) for interactions between hydrophobic surfaces, -51 to -80 cal mol(-1) Å(-2) for surfaces of complementary charge, and 66-83 cal mol(-1) Å(-2) for electrostatically repelling surfaces, relative to the aqueous phase. This places the mean energy of hydrophobic surface burial at -24 cal mol(-1) Å(-2) . Despite neglecting configurational entropy and intramolecular changes, the model correlates with empirical binding free energies of a functionally diverse set of rigid-body interactions (r = 0.66). When used to rerank docking poses, it can place near-native solutions in the top 10 for 37% of the complexes evaluated, and 82% in the top 100. The method shows that hydrophobic burial is the driving force for protein association, accounting for 50-95% of the cohesive energy. The model is available open-source from http://life.bsc.es/pid/web/surface_energy/ and via the CCharpPPI web server http://life.bsc.es/pid/ccharppi/.
Collapse
Affiliation(s)
- Iain H Moal
- Joint BSC-IRB Research Program in Computational Biology, Life Science Department, Barcelona Supercomputing Center, Barcelona, 08034, Spain
| | | | | |
Collapse
|
15
|
Janin J. A minimal model of protein-protein binding affinities. Protein Sci 2014; 23:1813-7. [PMID: 25270898 DOI: 10.1002/pro.2560] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2014] [Accepted: 09/25/2014] [Indexed: 11/10/2022]
Abstract
A minimal model of protein-protein binding affinity that takes into account only two structural features of the complex, the size of its interface, and the amplitude of the conformation change between the free and bound subunits, is tested on the 144 complexes of a structure-affinity benchmark. It yields Kd values that are within two orders of magnitude of the experiment for 67% of the complexes, within three orders for 88%, and fails on 12%, which display either large conformation changes, or a very high or a low affinity. The minimal model lacks the specificity and accuracy needed to make useful affinity predictions, but it should help in assessing the added value of parameters used by more elaborate models, and set a baseline for evaluating their performances.
Collapse
Affiliation(s)
- Joël Janin
- IBBMC, CNRS UMR 8619, Université Paris-Sud 11, Orsay, France
| |
Collapse
|
16
|
Moal IH, Jiménez-García B, Fernández-Recio J. CCharPPI web server: computational characterization of protein-protein interactions from structure. Bioinformatics 2014; 31:123-5. [PMID: 25183488 DOI: 10.1093/bioinformatics/btu594] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023] Open
Abstract
SUMMARY The atomic structures of protein-protein interactions are central to understanding their role in biological systems, and a wide variety of biophysical functions and potentials have been developed for their characterization and the construction of predictive models. These tools are scattered across a multitude of stand-alone programs, and are often available only as model parameters requiring reimplementation. This acts as a significant barrier to their widespread adoption. CCharPPI integrates many of these tools into a single web server. It calculates up to 108 parameters, including models of electrostatics, desolvation and hydrogen bonding, as well as interface packing and complementarity scores, empirical potentials at various resolutions, docking potentials and composite scoring functions. AVAILABILITY AND IMPLEMENTATION The server does not require registration by the user and is freely available for non-commercial academic use at http://life.bsc.es/pid/ccharppi.
Collapse
Affiliation(s)
- Iain H Moal
- Joint BSC-IRB Research Programme in Computational Biology, Department of Life Sciences, Barcelona Supercomputing Center, C/Jordi Girona 29, 08034 Barcelona, Spain
| | - Brian Jiménez-García
- Joint BSC-IRB Research Programme in Computational Biology, Department of Life Sciences, Barcelona Supercomputing Center, C/Jordi Girona 29, 08034 Barcelona, Spain
| | - Juan Fernández-Recio
- Joint BSC-IRB Research Programme in Computational Biology, Department of Life Sciences, Barcelona Supercomputing Center, C/Jordi Girona 29, 08034 Barcelona, Spain
| |
Collapse
|
17
|
Huang SY. Search strategies and evaluation in protein–protein docking: principles, advances and challenges. Drug Discov Today 2014; 19:1081-96. [DOI: 10.1016/j.drudis.2014.02.005] [Citation(s) in RCA: 87] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2013] [Revised: 01/04/2014] [Accepted: 02/24/2014] [Indexed: 01/10/2023]
|
18
|
Liu Q, Hoi SCH, Kwoh CK, Wong L, Li J. Integrating water exclusion theory into β contacts to predict binding free energy changes and binding hot spots. BMC Bioinformatics 2014; 15:57. [PMID: 24568581 PMCID: PMC3941611 DOI: 10.1186/1471-2105-15-57] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2013] [Accepted: 02/19/2014] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Binding free energy and binding hot spots at protein-protein interfaces are two important research areas for understanding protein interactions. Computational methods have been developed previously for accurate prediction of binding free energy change upon mutation for interfacial residues. However, a large number of interrupted and unimportant atomic contacts are used in the training phase which caused accuracy loss. RESULTS This work proposes a new method, βACVASA, to predict the change of binding free energy after alanine mutations. βACVASA integrates accessible surface area (ASA) and our newly defined β contacts together into an atomic contact vector (ACV). A β contact between two atoms is a direct contact without being interrupted by any other atom between them. A β contact's potential contribution to protein binding is also supposed to be inversely proportional to its ASA to follow the water exclusion hypothesis of binding hot spots. Tested on a dataset of 396 alanine mutations, our method is found to be superior in classification performance to many other methods, including Robetta, FoldX, HotPOINT, an ACV method of β contacts without ASA integration, and ACVASA methods (similar to βACVASA but based on distance-cutoff contacts). Based on our data analysis and results, we can draw conclusions that: (i) our method is powerful in the prediction of binding free energy change after alanine mutation; (ii) β contacts are better than distance-cutoff contacts for modeling the well-organized protein-binding interfaces; (iii) β contacts usually are only a small fraction number of the distance-based contacts; and (iv) water exclusion is a necessary condition for a residue to become a binding hot spot. CONCLUSIONS βACVASA is designed using the advantages of both β contacts and water exclusion. It is an excellent tool to predict binding free energy changes and binding hot spots after alanine mutation.
Collapse
Affiliation(s)
- Qian Liu
- Advanced Analytics Institute and Center for Health Technologies, Faculty of Engineering and IT, University of Technology, Sydney, Australia
- School of Computer Engineering, Nanyang Technological University, Singapore 639798, Singapore
| | - Steven CH Hoi
- School of Computer Engineering, Nanyang Technological University, Singapore 639798, Singapore
| | - Chee Keong Kwoh
- School of Computer Engineering, Nanyang Technological University, Singapore 639798, Singapore
| | - Limsoon Wong
- School of Computing, National University of Singapore, Singapore 117417, Singapore
| | - Jinyan Li
- Advanced Analytics Institute and Center for Health Technologies, Faculty of Engineering and IT, University of Technology, Sydney, Australia
| |
Collapse
|
19
|
Moal IH, Torchala M, Bates PA, Fernández-Recio J. The scoring of poses in protein-protein docking: current capabilities and future directions. BMC Bioinformatics 2013; 14:286. [PMID: 24079540 PMCID: PMC3850738 DOI: 10.1186/1471-2105-14-286] [Citation(s) in RCA: 74] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2013] [Accepted: 09/25/2013] [Indexed: 12/16/2022] Open
Abstract
BACKGROUND Protein-protein docking, which aims to predict the structure of a protein-protein complex from its unbound components, remains an unresolved challenge in structural bioinformatics. An important step is the ranking of docked poses using a scoring function, for which many methods have been developed. There is a need to explore the differences and commonalities of these methods with each other, as well as with functions developed in the fields of molecular dynamics and homology modelling. RESULTS We present an evaluation of 115 scoring functions on an unbound docking decoy benchmark covering 118 complexes for which a near-native solution can be found, yielding top 10 success rates of up to 58%. Hierarchical clustering is performed, so as to group together functions which identify near-natives in similar subsets of complexes. Three set theoretic approaches are used to identify pairs of scoring functions capable of correctly scoring different complexes. This shows that functions in different clusters capture different aspects of binding and are likely to work together synergistically. CONCLUSIONS All functions designed specifically for docking perform well, indicating that functions are transferable between sampling methods. We also identify promising methods from the field of homology modelling. Further, differential success rates by docking difficulty and solution quality suggest a need for flexibility-dependent scoring. Investigating pairs of scoring functions, the set theoretic measures identify known scoring strategies as well as a number of novel approaches, indicating promising augmentations of traditional scoring methods. Such augmentation and parameter combination strategies are discussed in the context of the learning-to-rank paradigm.
Collapse
Affiliation(s)
- Iain H Moal
- Joint BSC-IRB Research Program in Computational Biology, Life Science Department, Barcelona Super computing Center, Barcelona 08034, Spain
| | - Mieczyslaw Torchala
- Biomolecular Modelling Laboratory, Cancer Research UK London Research Institute, London WC2A 3LY, UK
| | - Paul A Bates
- Biomolecular Modelling Laboratory, Cancer Research UK London Research Institute, London WC2A 3LY, UK
| | - Juan Fernández-Recio
- Joint BSC-IRB Research Program in Computational Biology, Life Science Department, Barcelona Super computing Center, Barcelona 08034, Spain
| |
Collapse
|