1
|
Zhou Y, Myung Y, Rodrigues CM, Ascher D. DDMut-PPI: predicting effects of mutations on protein-protein interactions using graph-based deep learning. Nucleic Acids Res 2024; 52:W207-W214. [PMID: 38783112 PMCID: PMC11223791 DOI: 10.1093/nar/gkae412] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2024] [Revised: 04/30/2024] [Accepted: 05/02/2024] [Indexed: 05/25/2024] Open
Abstract
Protein-protein interactions (PPIs) play a vital role in cellular functions and are essential for therapeutic development and understanding diseases. However, current predictive tools often struggle to balance efficiency and precision in predicting the effects of mutations on these complex interactions. To address this, we present DDMut-PPI, a deep learning model that efficiently and accurately predicts changes in PPI binding free energy upon single and multiple point mutations. Building on the robust Siamese network architecture with graph-based signatures from our prior work, DDMut, the DDMut-PPI model was enhanced with a graph convolutional network operated on the protein interaction interface. We used residue-specific embeddings from ProtT5 protein language model as node features, and a variety of molecular interactions as edge features. By integrating evolutionary context with spatial information, this framework enables DDMut-PPI to achieve a robust Pearson correlation of up to 0.75 (root mean squared error: 1.33 kcal/mol) in our evaluations, outperforming most existing methods. Importantly, the model demonstrated consistent performance across mutations that increase or decrease binding affinity. DDMut-PPI offers a significant advancement in the field and will serve as a valuable tool for researchers probing the complexities of protein interactions. DDMut-PPI is freely available as a web server and an application programming interface at https://biosig.lab.uq.edu.au/ddmut_ppi.
Collapse
Affiliation(s)
- Yunzhuo Zhou
- The Australian Centre for Ecogenomics, School of Chemistry and Molecular Biosciences, University of Queensland, St Lucia, Queensland 4072, Australia
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria 3004, Australia
| | - YooChan Myung
- The Australian Centre for Ecogenomics, School of Chemistry and Molecular Biosciences, University of Queensland, St Lucia, Queensland 4072, Australia
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria 3004, Australia
| | - Carlos H M Rodrigues
- The Australian Centre for Ecogenomics, School of Chemistry and Molecular Biosciences, University of Queensland, St Lucia, Queensland 4072, Australia
| | - David B Ascher
- The Australian Centre for Ecogenomics, School of Chemistry and Molecular Biosciences, University of Queensland, St Lucia, Queensland 4072, Australia
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria 3004, Australia
| |
Collapse
|
2
|
Raisinghani N, Alshahrani M, Gupta G, Xiao S, Tao P, Verkhivker G. AlphaFold2-Enabled Atomistic Modeling of Structure, Conformational Ensembles, and Binding Energetics of the SARS-CoV-2 Omicron BA.2.86 Spike Protein with ACE2 Host Receptor and Antibodies: Compensatory Functional Effects of Binding Hotspots in Modulating Mechanisms of Receptor Binding and Immune Escape. J Chem Inf Model 2024; 64:1657-1681. [PMID: 38373700 DOI: 10.1021/acs.jcim.3c01857] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/21/2024]
Abstract
The latest wave of SARS-CoV-2 Omicron variants displayed a growth advantage and increased viral fitness through convergent evolution of functional hotspots that work synchronously to balance fitness requirements for productive receptor binding and efficient immune evasion. In this study, we combined AlphaFold2-based structural modeling approaches with atomistic simulations and mutational profiling of binding energetics and stability for prediction and comprehensive analysis of the structure, dynamics, and binding of the SARS-CoV-2 Omicron BA.2.86 spike variant with ACE2 host receptor and distinct classes of antibodies. We adapted several AlphaFold2 approaches to predict both the structure and conformational ensembles of the Omicron BA.2.86 spike protein in the complex with the host receptor. The results showed that the AlphaFold2-predicted structural ensemble of the BA.2.86 spike protein complex with ACE2 can accurately capture the main conformational states of the Omicron variant. Complementary to AlphaFold2 structural predictions, microsecond molecular dynamics simulations reveal the details of the conformational landscape and produced equilibrium ensembles of the BA.2.86 structures that are used to perform mutational scanning of spike residues and characterize structural stability and binding energy hotspots. The ensemble-based mutational profiling of the receptor binding domain residues in the BA.2 and BA.2.86 spike complexes with ACE2 revealed a group of conserved hydrophobic hotspots and critical variant-specific contributions of the BA.2.86 convergent mutational hotspots R403K, F486P, and R493Q. To examine the immune evasion properties of BA.2.86 in atomistic detail, we performed structure-based mutational profiling of the spike protein binding interfaces with distinct classes of antibodies that displayed significantly reduced neutralization against the BA.2.86 variant. The results revealed the molecular basis of compensatory functional effects of the binding hotspots, showing that BA.2.86 lineage may have evolved to outcompete other Omicron subvariants by improving immune evasion while preserving binding affinity with ACE2 via through a compensatory effect of R493Q and F486P convergent mutational hotspots. This study demonstrated that an integrative approach combining AlphaFold2 predictions with complementary atomistic molecular dynamics simulations and robust ensemble-based mutational profiling of spike residues can enable accurate and comprehensive characterization of structure, dynamics, and binding mechanisms of newly emerging Omicron variants.
Collapse
Affiliation(s)
- Nishank Raisinghani
- Keck Center for Science and Engineering, Graduate Program in Computational and Data Sciences, Schmid College of Science and Technology, Chapman University, Orange, California 92866, United States of America
| | - Mohammed Alshahrani
- Keck Center for Science and Engineering, Graduate Program in Computational and Data Sciences, Schmid College of Science and Technology, Chapman University, Orange, California 92866, United States of America
| | - Grace Gupta
- Keck Center for Science and Engineering, Graduate Program in Computational and Data Sciences, Schmid College of Science and Technology, Chapman University, Orange, California 92866, United States of America
| | - Sian Xiao
- Department of Chemistry, Center for Research Computing, Center for Drug Discovery, Design, and Delivery (CD4), Southern Methodist University, Dallas, Texas 75275, United States of America
| | - Peng Tao
- Department of Chemistry, Center for Research Computing, Center for Drug Discovery, Design, and Delivery (CD4), Southern Methodist University, Dallas, Texas 75275, United States of America
| | - Gennady Verkhivker
- Keck Center for Science and Engineering, Graduate Program in Computational and Data Sciences, Schmid College of Science and Technology, Chapman University, Orange, California 92866, United States of America
- Department of Biomedical and Pharmaceutical Sciences, Chapman University School of Pharmacy, Irvine, California 92618, United States of America
| |
Collapse
|
3
|
Thakur S, Planeta Kepp K, Mehra R. Predicting virus Fitness: Towards a structure-based computational model. J Struct Biol 2023; 215:108042. [PMID: 37931730 DOI: 10.1016/j.jsb.2023.108042] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2023] [Revised: 10/12/2023] [Accepted: 11/03/2023] [Indexed: 11/08/2023]
Abstract
Predicting the impact of new emerging virus mutations is of major interest in surveillance and for understanding the evolutionary forces of the pathogens. The SARS-CoV-2 surface spike-protein (S-protein) binds to human ACE2 receptors as a critical step in host cell infection. At the same time, S-protein binding to human antibodies neutralizes the virus and prevents interaction with ACE2. Here we combine these two binding properties in a simple virus fitness model, using structure-based computation of all possible mutation effects averaged over 10 ACE2 complexes and 10 antibody complexes of the S-protein (∼380,000 computed mutations), and validated the approach against diverse experimental binding/escape data of ACE2 and antibodies. The ACE2-antibody selectivity change caused by mutation (i.e., the differential change in binding to ACE2 vs. immunity-inducing antibodies) is proposed to be a key metric of fitness model, enabling systematic error cancelation when evaluated. In this model, new mutations become fixated if they increase the selective binding to ACE2 relative to circulating antibodies, assuming that both are present in the host in a competitive binding situation. We use this model to categorize viral mutations that may best reach ACE2 before being captured by antibodies. Our model may aid the understanding of variant-specific vaccines and molecular mechanisms of viral evolution in the context of a human host.
Collapse
Affiliation(s)
- Shivani Thakur
- Department of Chemistry, Indian Institute of Technology Bhilai, Kutelabhata, Durg - 491001, Chhattisgarh, India
| | - Kasper Planeta Kepp
- DTU Chemistry, Technical University of Denmark, Building 206, 2800 Kongens Lyngby, Denmark
| | - Rukmankesh Mehra
- Department of Chemistry, Indian Institute of Technology Bhilai, Kutelabhata, Durg - 491001, Chhattisgarh, India; Department of Bioscience and Biomedical Engineering, Indian Institute of Technology Bhilai, Kutelabhata, Durg - 491001, Chhattisgarh, India.
| |
Collapse
|
4
|
Verkhivker G, Alshahrani M, Gupta G, Xiao S, Tao P. Probing conformational landscapes of binding and allostery in the SARS-CoV-2 omicron variant complexes using microsecond atomistic simulations and perturbation-based profiling approaches: hidden role of omicron mutations as modulators of allosteric signaling and epistatic relationships. Phys Chem Chem Phys 2023; 25:21245-21266. [PMID: 37548589 PMCID: PMC10536792 DOI: 10.1039/d3cp02042h] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/08/2023]
Abstract
In this study, we systematically examine the conformational dynamics, binding and allosteric communications in the Omicron BA.1, BA.2, BA.3 and BA.4/BA.5 spike protein complexes with the ACE2 host receptor using molecular dynamics simulations and perturbation-based network profiling approaches. Microsecond atomistic simulations provided a detailed characterization of the conformational landscapes and revealed the increased thermodynamic stabilization of the BA.2 variant which can be contrasted with the BA.4/BA.5 variants inducing a significant mobility of the complexes. Using the dynamics-based mutational scanning of spike residues, we identified structural stability and binding affinity hotspots in the Omicron complexes. Perturbation response scanning and network-based mutational profiling approaches probed the effect of the Omicron mutations on allosteric interactions and communications in the complexes. The results of this analysis revealed specific roles of Omicron mutations as conformationally plastic and evolutionary adaptable modulators of binding and allostery which are coupled to the major regulatory positions through interaction networks. Through perturbation network scanning of allosteric residue potentials in the Omicron variant complexes performed in the background of the original strain, we characterized regions of epistatic couplings that are centered around the binding affinity hotspots N501Y and Q498R. Our results dissected the vital role of these epistatic centers in regulating protein stability, efficient ACE2 binding and allostery which allows for accumulation of multiple Omicron immune escape mutations at other sites. Through integrative computational approaches, this study provides a systematic analysis of the effects of Omicron mutations on thermodynamics, binding and allosteric signaling in the complexes with ACE2 receptor.
Collapse
Affiliation(s)
- Gennady Verkhivker
- Keck Center for Science and Engineering, Schmid College of Science and Technology, Chapman University, Orange, CA 92866, USA.
- Department of Biomedical and Pharmaceutical Sciences, Chapman University School of Pharmacy, Irvine, CA 92618, USA.
- Department of Pharmacology, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, 9500 Gilman Drive, La Jolla, CA 92093, USA
| | - Mohammed Alshahrani
- Keck Center for Science and Engineering, Schmid College of Science and Technology, Chapman University, Orange, CA 92866, USA.
| | - Grace Gupta
- Keck Center for Science and Engineering, Schmid College of Science and Technology, Chapman University, Orange, CA 92866, USA.
| | - Sian Xiao
- Department of Chemistry, Center for Research Computing, Center for Drug Discovery, Design, and Delivery (CD4), Southern Methodist University, Dallas, Texas, 75275, USA.
| | - Peng Tao
- Department of Chemistry, Center for Research Computing, Center for Drug Discovery, Design, and Delivery (CD4), Southern Methodist University, Dallas, Texas, 75275, USA.
| |
Collapse
|
5
|
Narkhede YB, Bhardwaj A, Motsa BB, Saxena R, Sharma T, Chapagain PP, Stahelin RV, Wiest O. Elucidating Residue-Level Determinants Affecting Dimerization of Ebola Virus Matrix Protein Using High-Throughput Site Saturation Mutagenesis and Biophysical Approaches. J Phys Chem B 2023; 127:6449-6461. [PMID: 37458567 DOI: 10.1021/acs.jpcb.3c01759] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/28/2023]
Abstract
The Ebola virus (EBOV) is a filamentous virus that acquires its lipid envelope from the plasma membrane of the host cell it infects. EBOV assembly and budding from the host cell plasma membrane are mediated by a peripheral protein, known as the matrix protein VP40. VP40 is a 326 amino acid protein with two domains that are loosely linked. The VP40 N-terminal domain (NTD) contains a hydrophobic α-helix, which mediates VP40 dimerization. The VP40 C-terminal domain has a cationic patch, which mediates interactions with anionic lipids and a hydrophobic region that mediates VP40 dimer-dimer interactions. The VP40 dimer is necessary for trafficking to the plasma membrane inner leaflet and interactions with anionic lipids to mediate the VP40 assembly and oligomerization. Despite significant structural information available on the VP40 dimer structure, little is known on how the VP40 dimer is stabilized and how residues outside the NTD hydrophobic portion of the α-helical dimer interface contribute to dimer stability. To better understand how VP40 dimer stability is maintained, we performed computational studies using per-residue energy decomposition and site saturation mutagenesis. These studies revealed a number of novel keystone residues for VP40 dimer stability just adjacent to the α-helical dimer interface as well as distant residues in the VP40 CTD that can stabilize the VP40 dimer form. Experimental studies with representative VP40 mutants in vitro and in cells were performed to test computational predictions that reveal residues that alter VP40 dimer stability. Taken together, these studies provide important biophysical insights into VP40 dimerization and may be useful in strategies to weaken or alter the VP40 dimer structure as a means of inhibiting the EBOV assembly.
Collapse
Affiliation(s)
- Yogesh B Narkhede
- Department of Chemistry and Biochemistry, University of Notre Dame, Notre Dame, Indiana 46556, United States
| | - Atul Bhardwaj
- Department of Chemistry and Biochemistry, University of Notre Dame, Notre Dame, Indiana 46556, United States
| | - Balindile B Motsa
- Department of Medicinal Chemistry & Molecular Pharmacology, Purdue Institute of Inflammation, Immunology, and Infectious Disease, Purdue University, West Lafayette, Indiana 47907, United States
| | - Roopashi Saxena
- Department of Medicinal Chemistry & Molecular Pharmacology, Purdue Institute of Inflammation, Immunology, and Infectious Disease, Purdue University, West Lafayette, Indiana 47907, United States
| | | | | | - Robert V Stahelin
- Department of Medicinal Chemistry & Molecular Pharmacology, Purdue Institute of Inflammation, Immunology, and Infectious Disease, Purdue University, West Lafayette, Indiana 47907, United States
| | - Olaf Wiest
- Department of Chemistry and Biochemistry, University of Notre Dame, Notre Dame, Indiana 46556, United States
| |
Collapse
|
6
|
Thakur S, Verma RK, Kepp KP, Mehra R. Modelling SARS-CoV-2 spike-protein mutation effects on ACE2 binding. J Mol Graph Model 2023; 119:108379. [PMID: 36481587 PMCID: PMC9690204 DOI: 10.1016/j.jmgm.2022.108379] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2022] [Revised: 11/04/2022] [Accepted: 11/21/2022] [Indexed: 11/26/2022]
Abstract
The binding affinity of the SARS-CoV-2 spike (S)-protein to the human membrane protein ACE2 is critical for virus function. Computational structure-based screening of new S-protein mutations for ACE2 binding lends promise to rationalize virus function directly from protein structure and ideally aid early detection of potentially concerning variants. We used a computational protocol based on cryo-electron microscopy structures of the S-protein to estimate the change in ACE2-affinity due to S-protein mutation (ΔΔGbind) in good trend agreement with experimental ACE2 affinities. We then expanded predictions to all possible S-protein mutations in 21 different S-protein-ACE2 complexes (400,000 ΔΔGbind data points in total), using mutation group comparisons to reduce systematic errors. The results suggest that mutations that have arisen in major variants as a group maintain ACE2 affinity significantly more than random mutations in the total protein, at the interface, and at evolvable sites. Omicron mutations as a group had a modest change in binding affinity compared to mutations in other major variants. The single-mutation effects seem consistent with ACE2 binding being optimized and maintained in omicron, despite increased importance of other selection pressures (antigenic drift), however, epistasis, glycosylation and in vivo conditions will modulate these effects. Computational prediction of SARS-CoV-2 evolution remains far from achieved, but the feasibility of large-scale computation is substantially aided by using many structures and mutation groups rather than single mutation effects, which are very uncertain. Our results demonstrate substantial challenges but indicate ways forward to improve the quality of computer models for assessing SARS-CoV-2 mutation effects.
Collapse
Affiliation(s)
- Shivani Thakur
- Department of Chemistry, Indian Institute of Technology Bhilai, Sejbahar, Raipur, 492015, Chhattisgarh, India
| | - Rajaneesh Kumar Verma
- Department of Chemistry, Indian Institute of Technology Bhilai, Sejbahar, Raipur, 492015, Chhattisgarh, India
| | - Kasper Planeta Kepp
- DTU Chemistry, Technical University of Denmark, Building 206, 2800, Kongens Lyngby, Denmark.
| | - Rukmankesh Mehra
- Department of Chemistry, Indian Institute of Technology Bhilai, Sejbahar, Raipur, 492015, Chhattisgarh, India.
| |
Collapse
|
7
|
Nosrati M, Housaindokht MR. New insights into the effect of mutations on affibody-Fc interaction, a molecular dynamics simulation approach. J Struct Biol 2023; 215:107925. [PMID: 36470559 DOI: 10.1016/j.jsb.2022.107925] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Revised: 11/24/2022] [Accepted: 11/28/2022] [Indexed: 12/12/2022]
Abstract
Staphylococcal protein A (SpA) domain B (the basis of affibody) has been widely used in affinity chromatography and found therapeutic applications against inflammatory diseases through targeting the Fc part of immunoglobulin G (IgG). We have performed extensive molecular dynamics simulation of 41 SpA mutants and compared their dynamics and conformations to wild type. The simulations revealed the molecular details of structural and dynamics changes that occurred due to introducing point mutations and helped to explain the SPR results. It was observed in some variants a point mutation caused extensive structural changes far from the mutation site, while an effect of some other mutations was limited to the site of the mutated residue. Also, the pattern of hydrogen bond networks and hydrophobic core arrangements were investigated. We figured out mutations that occurred at positions 128, 136, 150 and 153, affected two hydrophobic cores at the interface as well as mutations introduced at positions 129 and 154 interrupted two hydrogen bond networks of the interface, SPR data showed all of these mutations reduced binding affinity significantly. Overall, by scanning the SpA-Fc interface through the large numbers of introduced mutations, the new insights have been gained which would help to design high- affinity ligands of IgG.
Collapse
Affiliation(s)
- Masoumeh Nosrati
- Department of Chemistry, Faculty of Science, Ferdowsi University of Mashhad, Mashhad, Iran; Department of Cell and Molecular Biology, Uppsala University, BMC, Uppsala, Sweden.
| | | |
Collapse
|
8
|
Liu J, Xia KL, Wu J, Yau SST, Wei GW. Biomolecular Topology: Modelling and Analysis. ACTA MATHEMATICA SINICA, ENGLISH SERIES 2022; 38:1901-1938. [PMID: 36407804 PMCID: PMC9640850 DOI: 10.1007/s10114-022-2326-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/27/2022] [Accepted: 07/12/2022] [Indexed: 05/25/2023]
Abstract
With the great advancement of experimental tools, a tremendous amount of biomolecular data has been generated and accumulated in various databases. The high dimensionality, structural complexity, the nonlinearity, and entanglements of biomolecular data, ranging from DNA knots, RNA secondary structures, protein folding configurations, chromosomes, DNA origami, molecular assembly, to others at the macromolecular level, pose a severe challenge in their analysis and characterization. In the past few decades, mathematical concepts, models, algorithms, and tools from algebraic topology, combinatorial topology, computational topology, and topological data analysis, have demonstrated great power and begun to play an essential role in tackling the biomolecular data challenge. In this work, we introduce biomolecular topology, which concerns the topological problems and models originated from the biomolecular systems. More specifically, the biomolecular topology encompasses topological structures, properties and relations that are emerged from biomolecular structures, dynamics, interactions, and functions. We discuss the various types of biomolecular topology from structures (of proteins, DNAs, and RNAs), protein folding, and protein assembly. A brief discussion of databanks (and databases), theoretical models, and computational algorithms, is presented. Further, we systematically review related topological models, including graphs, simplicial complexes, persistent homology, persistent Laplacians, de Rham-Hodge theory, Yau-Hausdorff distance, and the topology-based machine learning models.
Collapse
Affiliation(s)
- Jian Liu
- School of Mathematical Sciences, Hebei Normal University, Shijiazhuang, 050024 P. R. China
- Yanqi Lake Beijing Institute of Mathematical Sciences and Applications, Beijing, 101408 P. R. China
| | - Ke-Lin Xia
- School of Physical and Mathematical Sciences, Nanyang Technological University, Singapore, 639798 Singapore
| | - Jie Wu
- Yanqi Lake Beijing Institute of Mathematical Sciences and Applications, Beijing, 101408 P. R. China
- Department of Mathematical Sciences, Tsinghua University, Beijing, 100084 P. R. China
| | - Stephen Shing-Toung Yau
- Yanqi Lake Beijing Institute of Mathematical Sciences and Applications, Beijing, 101408 P. R. China
- Department of Mathematical Sciences, Tsinghua University, Beijing, 100084 P. R. China
| | - Guo-Wei Wei
- Department of Mathematics & Department of Biochemistry and Molecular Biology & Department of Electrical and Computer Engineering, Michigan State University, Wells Hall 619 Red Cedar Road, East Lansing, MI 48824-1027 USA
| |
Collapse
|
9
|
Liu X, Feng H, Wu J, Xia K. Hom-Complex-Based Machine Learning (HCML) for the Prediction of Protein-Protein Binding Affinity Changes upon Mutation. J Chem Inf Model 2022; 62:3961-3969. [PMID: 36040839 DOI: 10.1021/acs.jcim.2c00580] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Protein-protein interactions (PPIs) are involved in almost all biological processes in the cell. Understanding protein-protein interactions holds the key for the understanding of biological functions, diseases and the development of therapeutics. Recently, artificial intelligence (AI) models have demonstrated great power in PPIs. However, a key issue for all AI-based PPI models is efficient molecular representations and featurization. Here, we propose Hom-complex-based PPI representation, and Hom-complex-based machine learning models for the prediction of PPI binding affinity changes upon mutation, for the first time. In our model, various Hom complexes Hom(G1, G) can be generated for the graph representation G of protein-protein complex by using different graphs G1, which reveal G1-related inner connections within the graph representation G of protein-protein complex. Further, for a specific graph G1, a series of nested Hom complexes are generated to give a multiscale characterization of the PPIs. Its persistent homology and persistent Euler characteristic are used as molecular descriptors and further combined with the machine learning model, in particular, gradient boosting tree (GBT). We systematically test our model on the two most-commonly used data sets, that is, SKEMPI and AB-Bind. It has been found that our model outperforms all the existing models as far as we know, which demonstrates the great potential of our model for the analysis of PPIs. Our model can be used for the analysis and design of efficient antibodies for SARS-CoV-2.
Collapse
Affiliation(s)
- Xiang Liu
- Chern Institute of Mathematics and LPMC, Nankai University, Tianjin, China, 300071.,Division of Mathematical Sciences, School of Physical and Mathematical Sciences Nanyang Technological University, Singapore 637371
| | - Huitao Feng
- Division of Mathematical Sciences, School of Physical and Mathematical Sciences Nanyang Technological University, Singapore 637371.,Mathematical Science Research Center, Chongqing University of Technology, Chongqing, China, 400054
| | - Jie Wu
- Yanqi Lake Beijing Institute of Mathematical Sciences and Applications (BIMSA), Beijing, China,101408
| | - Kelin Xia
- Division of Mathematical Sciences, School of Physical and Mathematical Sciences Nanyang Technological University, Singapore 637371
| |
Collapse
|
10
|
Wee J, Xia K. Persistent spectral based ensemble learning (PerSpect-EL) for protein-protein binding affinity prediction. Brief Bioinform 2022; 23:6533501. [PMID: 35189639 DOI: 10.1093/bib/bbac024] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2021] [Revised: 12/30/2021] [Accepted: 01/17/2022] [Indexed: 12/14/2022] Open
Abstract
Protein-protein interactions (PPIs) play a significant role in nearly all cellular and biological activities. Data-driven machine learning models have demonstrated great power in PPIs. However, the design of efficient molecular featurization poses a great challenge for all learning models for PPIs. Here, we propose persistent spectral (PerSpect) based PPI representation and featurization, and PerSpect-based ensemble learning (PerSpect-EL) models for PPI binding affinity prediction, for the first time. In our model, a sequence of Hodge (or combinatorial) Laplacian (HL) matrices at various different scales are generated from a specially designed filtration process. PerSpect attributes, which are statistical and combinatorial properties of spectrum information from these HL matrices, are used as features for PPI characterization. Each PerSpect attribute is input into a 1D convolutional neural network (CNN), and these CNN networks are stacked together in our PerSpect-based ensemble learning models. We systematically test our model on the two most commonly used datasets, i.e. SKEMPI and AB-Bind. It has been found that our model can achieve state-of-the-art results and outperform all existing models to the best of our knowledge.
Collapse
Affiliation(s)
- JunJie Wee
- Division of Mathematical Sciences, School of Physical and Mathematical Sciences, Nanyang Technological University, Singapore 637371
| | - Kelin Xia
- Division of Mathematical Sciences, School of Physical and Mathematical Sciences, Nanyang Technological University, Singapore 637371
| |
Collapse
|
11
|
Xiong D, Lee D, Li L, Zhao Q, Yu H. Implications of disease-related mutations at protein-protein interfaces. Curr Opin Struct Biol 2022; 72:219-225. [PMID: 34959033 PMCID: PMC8863207 DOI: 10.1016/j.sbi.2021.11.012] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2021] [Revised: 11/01/2021] [Accepted: 11/18/2021] [Indexed: 02/03/2023]
Abstract
Protein-protein interfaces have been attracting great attention owing to their critical roles in protein-protein interactions and the fact that human disease-related mutations are generally enriched in them. Recently, substantial research progress has been made in this field, which has significantly promoted the understanding and treatment of various human diseases. For example, many studies have discovered the properties of disease-related mutations. Besides, as more large-scale experimental data become available, various computational approaches have been proposed to advance our understanding of disease mutations from the data. Here, we overview recent advances in characteristics of disease-related mutations at protein-protein interfaces, mutation effects on protein interactions, and investigation of mutations on specific diseases.
Collapse
Affiliation(s)
- Dapeng Xiong
- Department of Computational Biology, Cornell University, Ithaca, NY 14853, USA,Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY 14853, USA
| | - Dongjin Lee
- Department of Computational Biology, Cornell University, Ithaca, NY 14853, USA,Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY 14853, USA
| | - Le Li
- Department of Computational Biology, Cornell University, Ithaca, NY 14853, USA,Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY 14853, USA
| | - Qiuye Zhao
- Department of Computational Biology, Cornell University, Ithaca, NY 14853, USA,Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY 14853, USA
| | - Haiyuan Yu
- Department of Computational Biology, Cornell University, Ithaca, NY 14853, USA,Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY 14853, USA
| |
Collapse
|
12
|
Flores SC, Alexiou A, Glaros A. Mining the Protein Data Bank to improve prediction of changes in protein-protein binding. PLoS One 2021; 16:e0257614. [PMID: 34727109 PMCID: PMC8562805 DOI: 10.1371/journal.pone.0257614] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2021] [Accepted: 09/05/2021] [Indexed: 12/23/2022] Open
Abstract
Predicting the effect of mutations on protein-protein interactions is important for relating structure to function, as well as for in silico affinity maturation. The effect of mutations on protein-protein binding energy (ΔΔG) can be predicted by a variety of atomic simulation methods involving full or limited flexibility, and explicit or implicit solvent. Methods which consider only limited flexibility are naturally more economical, and many of them are quite accurate, however results are dependent on the atomic coordinate set used. In this work we perform a sequence and structure based search of the Protein Data Bank to find additional coordinate sets and repeat the calculation on each. The method increases precision and Positive Predictive Value, and decreases Root Mean Square Error, compared to using single structures. Given the ongoing growth of near-redundant structures in the Protein Data Bank, our method will only increase in applicability and accuracy.
Collapse
Affiliation(s)
| | - Athanasios Alexiou
- Department of Computer Science and Biomedical Informatics, University of Thessaly, Volos, Greece
| | - Anastasios Glaros
- Eukaryotic Single Cell Genomics Facility, Science For Life Laboratory, Stockholm, Sweden
| |
Collapse
|
13
|
Yu H, Alkhamis O, Canoura J, Liu Y, Xiao Y. Advances and Challenges in Small‐Molecule DNA Aptamer Isolation, Characterization, and Sensor Development. Angew Chem Int Ed Engl 2021. [DOI: 10.1002/ange.202008663] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Affiliation(s)
- Haixiang Yu
- Department of Chemistry and Biochemistry Florida International University 11200 SW 8th Street Miami FL 33199 USA
| | - Obtin Alkhamis
- Department of Chemistry and Biochemistry Florida International University 11200 SW 8th Street Miami FL 33199 USA
| | - Juan Canoura
- Department of Chemistry and Biochemistry Florida International University 11200 SW 8th Street Miami FL 33199 USA
| | - Yingzhu Liu
- Department of Chemistry and Biochemistry Florida International University 11200 SW 8th Street Miami FL 33199 USA
| | - Yi Xiao
- Department of Chemistry and Biochemistry Florida International University 11200 SW 8th Street Miami FL 33199 USA
| |
Collapse
|
14
|
Yu H, Alkhamis O, Canoura J, Liu Y, Xiao Y. Advances and Challenges in Small-Molecule DNA Aptamer Isolation, Characterization, and Sensor Development. Angew Chem Int Ed Engl 2021; 60:16800-16823. [PMID: 33559947 PMCID: PMC8292151 DOI: 10.1002/anie.202008663] [Citation(s) in RCA: 166] [Impact Index Per Article: 55.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2020] [Revised: 11/16/2021] [Indexed: 12/12/2022]
Abstract
Aptamers are short oligonucleotides isolated in vitro from randomized libraries that can bind to specific molecules with high affinity, and offer a number of advantages relative to antibodies as biorecognition elements in biosensors. However, it remains difficult and labor-intensive to develop aptamer-based sensors for small-molecule detection. Here, we review the challenges and advances in the isolation and characterization of small-molecule-binding DNA aptamers and their use in sensors. First, we discuss in vitro methodologies for the isolation of aptamers, and provide guidance on selecting the appropriate strategy for generating aptamers with optimal binding properties for a given application. We next examine techniques for characterizing aptamer-target binding and structure. Afterwards, we discuss various small-molecule sensing platforms based on original or engineered aptamers, and their detection applications. Finally, we conclude with a general workflow to develop aptamer-based small-molecule sensors for real-world applications.
Collapse
Affiliation(s)
- Haixiang Yu
- Department of Chemistry and Biochemistry, Florida International University, 11200 SW 8th Street, Miami, FL, 33199, USA
| | - Obtin Alkhamis
- Department of Chemistry and Biochemistry, Florida International University, 11200 SW 8th Street, Miami, FL, 33199, USA
| | - Juan Canoura
- Department of Chemistry and Biochemistry, Florida International University, 11200 SW 8th Street, Miami, FL, 33199, USA
| | - Yingzhu Liu
- Department of Chemistry and Biochemistry, Florida International University, 11200 SW 8th Street, Miami, FL, 33199, USA
| | - Yi Xiao
- Department of Chemistry and Biochemistry, Florida International University, 11200 SW 8th Street, Miami, FL, 33199, USA
| |
Collapse
|
15
|
Rodrigues CHM, Pires DEV, Ascher DB. mmCSM-PPI: predicting the effects of multiple point mutations on protein-protein interactions. Nucleic Acids Res 2021; 49:W417-W424. [PMID: 33893812 PMCID: PMC8262703 DOI: 10.1093/nar/gkab273] [Citation(s) in RCA: 35] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2021] [Revised: 03/18/2021] [Accepted: 04/15/2021] [Indexed: 11/16/2022] Open
Abstract
Protein-protein interactions play a crucial role in all cellular functions and biological processes and mutations leading to their disruption are enriched in many diseases. While a number of computational methods to assess the effects of variants on protein-protein binding affinity have been proposed, they are in general limited to the analysis of single point mutations and have been shown to perform poorly on independent test sets. Here, we present mmCSM-PPI, a scalable and effective machine learning model for accurately assessing changes in protein-protein binding affinity caused by single and multiple missense mutations. We expanded our well-established graph-based signatures in order to capture physicochemical and geometrical properties of multiple wild-type residue environments and integrated them with substitution scores and dynamics terms from normal mode analysis. mmCSM-PPI was able to achieve a Pearson's correlation of up to 0.75 (RMSE = 1.64 kcal/mol) under 10-fold cross-validation and 0.70 (RMSE = 2.06 kcal/mol) on a non-redundant blind test, outperforming existing methods. Our method is freely available as a user-friendly and easy-to-use web server and API at http://biosig.unimelb.edu.au/mmcsm_ppi.
Collapse
Affiliation(s)
- Carlos H M Rodrigues
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia
- Structural Biology and Bioinformatics, Department of Biochemistry and Pharmacology, University of Melbourne, Melbourne, Victoria, Australia
- Systems and Computational Biology, Bio21 Institute, University of Melbourne, Melbourne, Victoria, Australia
| | - Douglas E V Pires
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia
- Structural Biology and Bioinformatics, Department of Biochemistry and Pharmacology, University of Melbourne, Melbourne, Victoria, Australia
- Systems and Computational Biology, Bio21 Institute, University of Melbourne, Melbourne, Victoria, Australia
- School of Computing and Information Systems, University of Melbourne, Melbourne, Victoria, Australia
| | - David B Ascher
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia
- Structural Biology and Bioinformatics, Department of Biochemistry and Pharmacology, University of Melbourne, Melbourne, Victoria, Australia
- Systems and Computational Biology, Bio21 Institute, University of Melbourne, Melbourne, Victoria, Australia
- Department of Biochemistry, University of Cambridge, Cambridge, UK
| |
Collapse
|
16
|
Vihinen M. Functional effects of protein variants. Biochimie 2020; 180:104-120. [PMID: 33164889 DOI: 10.1016/j.biochi.2020.10.009] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2020] [Revised: 10/15/2020] [Accepted: 10/19/2020] [Indexed: 12/11/2022]
Abstract
Genetic and other variations frequently affect protein functions. Scientific articles can contain confusing descriptions about which function or property is affected, and in many cases the statements are pure speculation without any experimental evidence. To clarify functional effects of protein variations of genetic or non-genetic origin, a systematic conceptualisation and framework are introduced. This framework describes protein functional effects on abundance, activity, specificity and affinity, along with countermeasures, which allow cells, tissues and organisms to tolerate, avoid, repair, attenuate or resist (TARAR) the effects. Effects on abundance discussed include gene dosage, restricted expression, mis-localisation and degradation. Enzymopathies, effects on kinetics, allostery and regulation of protein activity are subtopics for the effects of variants on activity. Variation outcomes on specificity and affinity comprise promiscuity, specificity, affinity and moonlighting. TARAR mechanisms redress variations with active and passive processes including chaperones, redundancy, robustness, canalisation and metabolic and signalling rewiring. A framework for pragmatic protein function analysis and presentation is introduced. All of the mechanisms and effects are described along with representative examples, most often in relation to diseases. In addition, protein function is discussed from evolutionary point of view. Application of the presented framework facilitates unambiguous, detailed and specific description of functional effects and their systematic study.
Collapse
Affiliation(s)
- Mauno Vihinen
- Department of Experimental Medical Science, BMC B13, Lund University, SE-22 184, Lund, Sweden.
| |
Collapse
|
17
|
Huang X, Zheng W, Pearce R, Zhang Y. SSIPe: accurately estimating protein-protein binding affinity change upon mutations using evolutionary profiles in combination with an optimized physical energy function. Bioinformatics 2020; 36:2429-2437. [PMID: 31830252 DOI: 10.1093/bioinformatics/btz926] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2019] [Revised: 11/08/2019] [Accepted: 12/09/2019] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION Most proteins perform their biological functions through interactions with other proteins in cells. Amino acid mutations, especially those occurring at protein interfaces, can change the stability of protein-protein interactions (PPIs) and impact their functions, which may cause various human diseases. Quantitative estimation of the binding affinity changes (ΔΔGbind) caused by mutations can provide critical information for protein function annotation and genetic disease diagnoses. RESULTS We present SSIPe, which combines protein interface profiles, collected from structural and sequence homology searches, with a physics-based energy function for accurate ΔΔGbind estimation. To offset the statistical limits of the PPI structure and sequence databases, amino acid-specific pseudocounts were introduced to enhance the profile accuracy. SSIPe was evaluated on large-scale experimental data containing 2204 mutations from 177 proteins, where training and test datasets were stringently separated with the sequence identity between proteins from the two datasets below 30%. The Pearson correlation coefficient between estimated and experimental ΔΔGbind was 0.61 with a root-mean-square-error of 1.93 kcal/mol, which was significantly better than the other methods. Detailed data analyses revealed that the major advantage of SSIPe over other traditional approaches lies in the novel combination of the physical energy function with the new knowledge-based interface profile. SSIPe also considerably outperformed a former profile-based method (BindProfX) due to the newly introduced sequence profiles and optimized pseudocount technique that allows for consideration of amino acid-specific prior mutation probabilities. AVAILABILITY AND IMPLEMENTATION Web-server/standalone program, source code and datasets are freely available at https://zhanglab.ccmb.med.umich.edu/SSIPe and https://github.com/tommyhuangthu/SSIPe. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
| | - Wei Zheng
- Department of Computational Medicine and Bioinformatics
| | - Robin Pearce
- Department of Computational Medicine and Bioinformatics
| | - Yang Zhang
- Department of Computational Medicine and Bioinformatics.,Department of Biological Chemistry, University of Michigan, Ann Arbor, MI 48109, USA
| |
Collapse
|
18
|
Meseguer A, Dominguez L, Bota PM, Aguirre‐Plans J, Bonet J, Fernandez‐Fuentes N, Oliva B. Using collections of structural models to predict changes of binding affinity caused by mutations in protein-protein interactions. Protein Sci 2020; 29:2112-2130. [PMID: 32797645 PMCID: PMC7513729 DOI: 10.1002/pro.3930] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2020] [Revised: 08/04/2020] [Accepted: 08/05/2020] [Indexed: 12/24/2022]
Abstract
Protein-protein interactions (PPIs) in all the molecular aspects that take place both inside and outside cells. However, determining experimentally the structure and affinity of PPIs is expensive and time consuming. Therefore, the development of computational tools, as a complement to experimental methods, is fundamental. Here, we present a computational suite: MODPIN, to model and predict the changes of binding affinity of PPIs. In this approach we use homology modeling to derive the structures of PPIs and score them using state-of-the-art scoring functions. We explore the conformational space of PPIs by generating not a single structural model but a collection of structural models with different conformations based on several templates. We apply the approach to predict the changes in free energy upon mutations and splicing variants of large datasets of PPIs to statistically quantify the quality and accuracy of the predictions. As an example, we use MODPIN to study the effect of mutations in the interaction between colicin endonuclease 9 and colicin endonuclease 2 immune protein from Escherichia coli. Finally, we have compared our results with other state-of-art methods.
Collapse
Affiliation(s)
- Alberto Meseguer
- Structural Bioinformatics Group, Research Programme on Biomedical Informatics, Department of Experimental and Health ScienceUniversitat Pompeu FabraBarcelonaCataloniaSpain
| | - Lluis Dominguez
- Integrative Biomedical Informatics Group (GRIB‐IMIM). Department of Experimental and Life SciencesUniversitat Pompeu FabraBarcelonaCataloniaSpain
| | - Patricia M. Bota
- Structural Bioinformatics Group, Research Programme on Biomedical Informatics, Department of Experimental and Health ScienceUniversitat Pompeu FabraBarcelonaCataloniaSpain
- Department of BiosciencesUniversitat de Vic‐Universitat Central de CatalunyaVicCataloniaSpain
| | - Joaquim Aguirre‐Plans
- Structural Bioinformatics Group, Research Programme on Biomedical Informatics, Department of Experimental and Health ScienceUniversitat Pompeu FabraBarcelonaCataloniaSpain
| | - Jaume Bonet
- Structural Bioinformatics Group, Research Programme on Biomedical Informatics, Department of Experimental and Health ScienceUniversitat Pompeu FabraBarcelonaCataloniaSpain
| | - Narcis Fernandez‐Fuentes
- Department of BiosciencesUniversitat de Vic‐Universitat Central de CatalunyaVicCataloniaSpain
- Institute of Biological, Environmental and Rural SciencesAberystwyth UniversityAberystwythUK
| | - Baldo Oliva
- Structural Bioinformatics Group, Research Programme on Biomedical Informatics, Department of Experimental and Health ScienceUniversitat Pompeu FabraBarcelonaCataloniaSpain
| |
Collapse
|
19
|
Shringari SR, Giannakoulias S, Ferrie JJ, Petersson EJ. Rosetta custom score functions accurately predict ΔΔG of mutations at protein-protein interfaces using machine learning. Chem Commun (Camb) 2020; 56:6774-6777. [PMID: 32441721 DOI: 10.1039/d0cc01959c] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Protein-protein interfaces play essential roles in a variety of biological processes and many therapeutic molecules are targeted at these interfaces. However, accurate predictions of the effects of interfacial mutations to identify "hotspots" have remained elusive despite the myriad of modeling and machine learning methods tested. Here, for the first time, we demonstrate that nonlinear reweighting of energy terms from Rosetta, through the use of machine learning, exhibits improved predictability of ΔΔG values associated with interfacial mutations.
Collapse
Affiliation(s)
- Sumant R Shringari
- Department of Chemistry, University of Pennsylvania, 231 South 34th Street, Philadelphia, PA 19104, USA.
| | | | | | | |
Collapse
|
20
|
Surpeta B, Sequeiros-Borja CE, Brezovsky J. Dynamics, a Powerful Component of Current and Future in Silico Approaches for Protein Design and Engineering. Int J Mol Sci 2020; 21:E2713. [PMID: 32295283 PMCID: PMC7215530 DOI: 10.3390/ijms21082713] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2020] [Revised: 04/10/2020] [Accepted: 04/12/2020] [Indexed: 12/13/2022] Open
Abstract
Computational prediction has become an indispensable aid in the processes of engineering and designing proteins for various biotechnological applications. With the tremendous progress in more powerful computer hardware and more efficient algorithms, some of in silico tools and methods have started to apply the more realistic description of proteins as their conformational ensembles, making protein dynamics an integral part of their prediction workflows. To help protein engineers to harness benefits of considering dynamics in their designs, we surveyed new tools developed for analyses of conformational ensembles in order to select engineering hotspots and design mutations. Next, we discussed the collective evolution towards more flexible protein design methods, including ensemble-based approaches, knowledge-assisted methods, and provable algorithms. Finally, we highlighted apparent challenges that current approaches are facing and provided our perspectives on their further development.
Collapse
Affiliation(s)
- Bartłomiej Surpeta
- Laboratory of Biomolecular Interactions and Transport, Department of Gene Expression, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University, Uniwersytetu Poznanskiego 6, 61-614 Poznan, Poland; (B.S.); (C.E.S.-B.)
- International Institute of Molecular and Cell Biology in Warsaw, Ks Trojdena 4, 02-109 Warsaw, Poland
| | - Carlos Eduardo Sequeiros-Borja
- Laboratory of Biomolecular Interactions and Transport, Department of Gene Expression, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University, Uniwersytetu Poznanskiego 6, 61-614 Poznan, Poland; (B.S.); (C.E.S.-B.)
- International Institute of Molecular and Cell Biology in Warsaw, Ks Trojdena 4, 02-109 Warsaw, Poland
| | - Jan Brezovsky
- Laboratory of Biomolecular Interactions and Transport, Department of Gene Expression, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University, Uniwersytetu Poznanskiego 6, 61-614 Poznan, Poland; (B.S.); (C.E.S.-B.)
- International Institute of Molecular and Cell Biology in Warsaw, Ks Trojdena 4, 02-109 Warsaw, Poland
| |
Collapse
|
21
|
Jankauskaite J, Jiménez-García B, Dapkunas J, Fernández-Recio J, Moal IH. SKEMPI 2.0: an updated benchmark of changes in protein-protein binding energy, kinetics and thermodynamics upon mutation. Bioinformatics 2019; 35:462-469. [PMID: 30020414 PMCID: PMC6361233 DOI: 10.1093/bioinformatics/bty635] [Citation(s) in RCA: 161] [Impact Index Per Article: 32.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2018] [Accepted: 07/17/2018] [Indexed: 11/18/2022] Open
Abstract
Motivation Understanding the relationship between the sequence, structure, binding energy, binding kinetics and binding thermodynamics of protein–protein interactions is crucial to understanding cellular signaling, the assembly and regulation of molecular complexes, the mechanisms through which mutations lead to disease, and protein engineering. Results We present SKEMPI 2.0, a major update to our database of binding free energy changes upon mutation for structurally resolved protein–protein interactions. This version now contains manually curated binding data for 7085 mutations, an increase of 133%, including changes in kinetics for 1844 mutations, enthalpy and entropy changes for 443 mutations, and 440 mutations, which abolish detectable binding. Availability and implementation The database is available as supplementary data and at https://life.bsc.es/pid/skempi2/. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Justina Jankauskaite
- Institute of Biotechnology, Life Sciences Center, Vilnius University, Vilnius, Lithuania
| | - Brian Jiménez-García
- Barcelona Supercomputing Center (BSC), Barcelona, Spain.,Bijvoet Center for Biomolecular Research, Faculty of Science, Utrecht University, Utrecht, the Netherlands
| | - Justas Dapkunas
- Institute of Biotechnology, Life Sciences Center, Vilnius University, Vilnius, Lithuania
| | - Juan Fernández-Recio
- Barcelona Supercomputing Center (BSC), Barcelona, Spain.,Institut de Biologia Molecular de Barcelona (IBMB), CSIC, Barcelona, Spain
| | - Iain H Moal
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, UK
| |
Collapse
|
22
|
Ibarra A, Bartlett GJ, Hegedüs Z, Dutt S, Hobor F, Horner KA, Hetherington K, Spence K, Nelson A, Edwards TA, Woolfson DN, Sessions RB, Wilson AJ. Predicting and Experimentally Validating Hot-Spot Residues at Protein-Protein Interfaces. ACS Chem Biol 2019; 14:2252-2263. [PMID: 31525028 PMCID: PMC6804253 DOI: 10.1021/acschembio.9b00560] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2019] [Accepted: 09/16/2019] [Indexed: 01/02/2023]
Abstract
Protein-protein interactions (PPIs) are vital to all biological processes. These interactions are often dynamic, sometimes transient, typically occur over large topographically shallow protein surfaces, and can exhibit a broad range of affinities. Considerable progress has been made in determining PPI structures. However, given the above properties, understanding the key determinants of their thermodynamic stability remains a challenge in chemical biology. An improved ability to identify and engineer PPIs would advance understanding of biological mechanisms and mutant phenotypes and also provide a firmer foundation for inhibitor design. In silico prediction of PPI hot-spot amino acids using computational alanine scanning (CAS) offers a rapid approach for predicting key residues that drive protein-protein association. This can be applied to all known PPI structures; however there is a trade-off between throughput and accuracy. Here we describe a comparative analysis of multiple CAS methods, which highlights effective approaches to improve the accuracy of predicting hot-spot residues. Alongside this, we introduce a new method, BUDE Alanine Scanning, which can be applied to single structures from crystallography and to structural ensembles from NMR or molecular dynamics data. The comparative analyses facilitate accurate prediction of hot-spots that we validate experimentally with three diverse targets: NOXA-B/MCL-1 (an α-helix-mediated PPI), SIMS/SUMO, and GKAP/SHANK-PDZ (both β-strand-mediated interactions). Finally, the approach is applied to the accurate prediction of hot-spot residues at a topographically novel Affimer/BCL-xL protein-protein interface.
Collapse
Affiliation(s)
- Amaurys
A. Ibarra
- School
of Biochemistry, University of Bristol, Medical Sciences Building, University
Walk, Bristol BS8 1TD, U.K.
| | - Gail J. Bartlett
- School
of Chemistry, University of Bristol, Cantock’s Close, Bristol BS8 1TS, U.K.
| | - Zsöfia Hegedüs
- School
of Chemistry, University of Leeds, Woodhouse Lane, Leeds LS2 9JT, U.K.
- Astbury
Centre for Structural Molecular Biology, University of Leeds, Woodhouse Lane, Leeds LS2 9JT, U.K.
| | - Som Dutt
- School
of Chemistry, University of Leeds, Woodhouse Lane, Leeds LS2 9JT, U.K.
- Astbury
Centre for Structural Molecular Biology, University of Leeds, Woodhouse Lane, Leeds LS2 9JT, U.K.
| | - Fruzsina Hobor
- Astbury
Centre for Structural Molecular Biology, University of Leeds, Woodhouse Lane, Leeds LS2 9JT, U.K.
- School
of Molecular and Cellular Biology, University
of Leeds, Woodhouse Lane, Leeds LS2 9JT, U.K.
| | - Katherine A. Horner
- School
of Chemistry, University of Leeds, Woodhouse Lane, Leeds LS2 9JT, U.K.
- Astbury
Centre for Structural Molecular Biology, University of Leeds, Woodhouse Lane, Leeds LS2 9JT, U.K.
| | - Kristina Hetherington
- School
of Chemistry, University of Leeds, Woodhouse Lane, Leeds LS2 9JT, U.K.
- Astbury
Centre for Structural Molecular Biology, University of Leeds, Woodhouse Lane, Leeds LS2 9JT, U.K.
| | - Kirstin Spence
- Astbury
Centre for Structural Molecular Biology, University of Leeds, Woodhouse Lane, Leeds LS2 9JT, U.K.
- School
of Molecular and Cellular Biology, University
of Leeds, Woodhouse Lane, Leeds LS2 9JT, U.K.
| | - Adam Nelson
- School
of Chemistry, University of Leeds, Woodhouse Lane, Leeds LS2 9JT, U.K.
- Astbury
Centre for Structural Molecular Biology, University of Leeds, Woodhouse Lane, Leeds LS2 9JT, U.K.
| | - Thomas A. Edwards
- Astbury
Centre for Structural Molecular Biology, University of Leeds, Woodhouse Lane, Leeds LS2 9JT, U.K.
- School
of Molecular and Cellular Biology, University
of Leeds, Woodhouse Lane, Leeds LS2 9JT, U.K.
| | - Derek N. Woolfson
- School
of Biochemistry, University of Bristol, Medical Sciences Building, University
Walk, Bristol BS8 1TD, U.K.
- School
of Chemistry, University of Bristol, Cantock’s Close, Bristol BS8 1TS, U.K.
- BrisSynBio, University of Bristol, Life Sciences Building, Tyndall Avenue, Bristol BS8 1TQ, U.K.
| | - Richard B. Sessions
- School
of Biochemistry, University of Bristol, Medical Sciences Building, University
Walk, Bristol BS8 1TD, U.K.
- BrisSynBio, University of Bristol, Life Sciences Building, Tyndall Avenue, Bristol BS8 1TQ, U.K.
| | - Andrew J. Wilson
- School
of Chemistry, University of Leeds, Woodhouse Lane, Leeds LS2 9JT, U.K.
- Astbury
Centre for Structural Molecular Biology, University of Leeds, Woodhouse Lane, Leeds LS2 9JT, U.K.
| |
Collapse
|
23
|
Malhotra S, Alsulami AF, Heiyun Y, Ochoa BM, Jubb H, Forbes S, Blundell TL. Understanding the impacts of missense mutations on structures and functions of human cancer-related genes: A preliminary computational analysis of the COSMIC Cancer Gene Census. PLoS One 2019; 14:e0219935. [PMID: 31323058 PMCID: PMC6641202 DOI: 10.1371/journal.pone.0219935] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2019] [Accepted: 07/03/2019] [Indexed: 12/12/2022] Open
Abstract
Genomics and genome screening are proving central to the study of cancer. However, a good appreciation of the protein structures coded by cancer genes is also invaluable, especially for the understanding of functions, for assessing ligandability of potential targets, and for designing new drugs. To complement the wealth of information on the genetics of cancer in COSMIC, the most comprehensive database for cancer somatic mutations available, structural information obtained experimentally has been brought together recently in COSMIC-3D. Even where structural information is available for a gene in the Cancer Gene Census, a list of genes in COSMIC with substantial evidence supporting their impacts in cancer, this information is quite often for a single domain in a larger protein or for a single protomer in a multiprotein assembly. Here, we show that over 60% of the genes included in the Cancer Gene Census are predicted to possess multiple domains. Many are also multicomponent and membrane-associated molecular assemblies, with mutations recorded in COSMIC affecting such assemblies. However, only 469 of the gene products have a structure represented in the PDB, and of these only 87 structures have 90-100% coverage over the sequence and 69 have less than 10% coverage. As a first step to bridging gaps in our knowledge in the many cases where individual protein structures and domains are lacking, we discuss our attempts of protein structure modelling using our pipeline and investigating the effects of mutations using two of our in-house methods (SDM2 and mCSM) and identifying potential driver mutations. This allows us to begin to understand the effects of mutations not only on protein stability but also on protein-protein, protein-ligand and protein-nucleic acid interactions. In addition, we consider ways to combine the structural information with the wealth of mutation data available in COSMIC. We discuss the impacts of COSMIC missense mutations on protein structure in order to identify and assess the molecular consequences of cancer-driving mutations.
Collapse
Affiliation(s)
- Sony Malhotra
- Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom
| | - Ali F. Alsulami
- Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom
| | - Yang Heiyun
- Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom
| | | | - Harry Jubb
- Wellcome Genome Campus, Hinxton, Cambridgeshire, United Kingdom
| | - Simon Forbes
- Wellcome Genome Campus, Hinxton, Cambridgeshire, United Kingdom
| | - Tom L. Blundell
- Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom
| |
Collapse
|
24
|
Tan SK, Fong KP, Polizzi NF, Sternisha A, Slusky JSG, Yoon K, DeGrado WF, Bennett JS. Modulating Integrin αIIbβ3 Activity through Mutagenesis of Allosterically Regulated Intersubunit Contacts. Biochemistry 2019; 58:3251-3259. [PMID: 31264850 DOI: 10.1021/acs.biochem.9b00430] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Integrin αIIbβ3, a transmembrane heterodimer, mediates platelet aggregation when it switches from an inactive to an active ligand-binding conformation following platelet stimulation. Central to regulating αIIbβ3 activity is the interaction between the αIIb and β3 extracellular stalks, which form a tight heterodimer in the inactive state and dissociate in the active state. Here, we demonstrate that alanine replacements of sensitive positions in the heterodimer stalk interface destabilize the inactive conformation sufficiently to cause constitutive αIIbβ3 activation. To determine the structural basis for this effect, we performed a structural bioinformatics analysis and found that perturbing intersubunit contacts with favorable interaction geometry through substitutions to alanine quantitatively accounted for the degree of constitutive αIIbβ3 activation. This mutational study directly assesses the relationship between favorable interaction geometry at mutation-sensitive positions and the functional activity of those mutants, giving rise to a simple model that highlights the importance of interaction geometry in contributing to the stability between protein-protein interactions.
Collapse
Affiliation(s)
- Sophia K Tan
- Department of Pharmaceutical Chemistry , University of California, San Francisco , San Francisco , California 94158 , United States
| | - Karen P Fong
- Hematology-Oncology Division , University of Pennsylvania School of Medicine , Philadelphia , Pennsylvania 19104 , United States
| | - Nicholas F Polizzi
- Department of Pharmaceutical Chemistry , University of California, San Francisco , San Francisco , California 94158 , United States
| | - Alex Sternisha
- Hematology-Oncology Division , University of Pennsylvania School of Medicine , Philadelphia , Pennsylvania 19104 , United States
| | - Joanna S G Slusky
- Department of Molecular Biosciences and Center for Computational Biology , University of Kansas , Lawrence , Kansas 66045 , United States
| | - Kyungchul Yoon
- Hematology-Oncology Division , University of Pennsylvania School of Medicine , Philadelphia , Pennsylvania 19104 , United States
| | - William F DeGrado
- Department of Pharmaceutical Chemistry , University of California, San Francisco , San Francisco , California 94158 , United States
| | - Joel S Bennett
- Hematology-Oncology Division , University of Pennsylvania School of Medicine , Philadelphia , Pennsylvania 19104 , United States
| |
Collapse
|
25
|
Geng C, Xue LC, Roel‐Touris J, Bonvin AMJJ. Finding the ΔΔ
G
spot: Are predictors of binding affinity changes upon mutations in protein–protein interactions ready for it? WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE 2019. [DOI: 10.1002/wcms.1410] [Citation(s) in RCA: 43] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Affiliation(s)
- Cunliang Geng
- Bijvoet Center for Biomolecular Research, Faculty of Science—Chemistry Utrecht University Utrecht The Netherlands
| | - Li C. Xue
- Bijvoet Center for Biomolecular Research, Faculty of Science—Chemistry Utrecht University Utrecht The Netherlands
| | - Jorge Roel‐Touris
- Bijvoet Center for Biomolecular Research, Faculty of Science—Chemistry Utrecht University Utrecht The Netherlands
| | - Alexandre M. J. J. Bonvin
- Bijvoet Center for Biomolecular Research, Faculty of Science—Chemistry Utrecht University Utrecht The Netherlands
| |
Collapse
|
26
|
Geng C, Vangone A, Folkers GE, Xue LC, Bonvin AMJJ. iSEE: Interface structure, evolution, and energy-based machine learning predictor of binding affinity changes upon mutations. Proteins 2018; 87:110-119. [PMID: 30417935 PMCID: PMC6587874 DOI: 10.1002/prot.25630] [Citation(s) in RCA: 40] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2018] [Revised: 10/19/2018] [Accepted: 11/05/2018] [Indexed: 02/06/2023]
Abstract
Quantitative evaluation of binding affinity changes upon mutations is crucial for protein engineering and drug design. Machine learning‐based methods are gaining increasing momentum in this field. Due to the limited number of experimental data, using a small number of sensitive predictive features is vital to the generalization and robustness of such machine learning methods. Here we introduce a fast and reliable predictor of binding affinity changes upon single point mutation, based on a random forest approach. Our method, iSEE, uses a limited number of interface Structure, Evolution, and Energy‐based features for the prediction. iSEE achieves, using only 31 features, a high prediction performance with a Pearson correlation coefficient (PCC) of 0.80 and a root mean square error of 1.41 kcal/mol on a diverse training dataset consisting of 1102 mutations in 57 protein‐protein complexes. It competes with existing state‐of‐the‐art methods on two blind test datasets. Predictions for a new dataset of 487 mutations in 56 protein complexes from the recently published SKEMPI 2.0 database reveals that none of the current methods perform well (PCC < 0.42), although their combination does improve the predictions. Feature analysis for iSEE underlines the significance of evolutionary conservations for quantitative prediction of mutation effects. As an application example, we perform a full mutation scanning of the interface residues in the MDM2–p53 complex.
Collapse
Affiliation(s)
- Cunliang Geng
- Bijvoet Center for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Utrecht, The Netherlands
| | - Anna Vangone
- Bijvoet Center for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Utrecht, The Netherlands.,Roche Pharmaceutical Research and Early Development, Large Molecule Research, Roche Innovation Center Penzberg, Penzberg, Germany
| | - Gert E Folkers
- Bijvoet Center for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Utrecht, The Netherlands
| | - Li C Xue
- Bijvoet Center for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Utrecht, The Netherlands
| | - Alexandre M J J Bonvin
- Bijvoet Center for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Utrecht, The Netherlands
| |
Collapse
|
27
|
Šoštarić N, O'Reilly FJ, Giansanti P, Heck AJR, Gavin AC, van Noort V. Effects of Acetylation and Phosphorylation on Subunit Interactions in Three Large Eukaryotic Complexes. Mol Cell Proteomics 2018; 17:2387-2401. [PMID: 30181345 DOI: 10.1074/mcp.ra118.000892] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2018] [Revised: 08/27/2018] [Indexed: 01/18/2023] Open
Abstract
Protein post-translational modifications (PTMs) have an indispensable role in living cells as they expand chemical diversity of the proteome, providing a fine regulatory layer that can govern protein-protein interactions in changing environmental conditions. Here we investigated the effects of acetylation and phosphorylation on the stability of subunit interactions in purified Saccharomyces cerevisiae complexes, namely exosome, RNA polymerase II and proteasome. We propose a computational framework that consists of conformational sampling of the complexes by molecular dynamics simulations, followed by Gibbs energy calculation by MM/GBSA. After benchmarking against published tools such as FoldX and Mechismo, we could apply the framework for the first time on large protein assemblies with the aim of predicting the effects of PTMs located on interfaces of subunits on binding stability. We discovered that acetylation predominantly contributes to subunits' interactions in a locally stabilizing manner, while phosphorylation shows the opposite effect. Even though the local binding contributions of PTMs may be predictable to an extent, the long range effects and overall impact on subunits' binding were only captured because of our dynamical approach. Employing the developed, widely applicable workflow on other large systems will shed more light on the roles of PTMs in protein complex formation.
Collapse
Affiliation(s)
- Nikolina Šoštarić
- KU Leuven, Centre of Microbial and Plant Genetics, Kasteelpark Arenberg 20, Leuven, B-3001, Belgium
| | - Francis J O'Reilly
- European Molecular Biology Laboratory, Structural and Computational Biology Unit, Heidelberg, Germany; Technical University of Berlin, Berlin, Germany
| | - Piero Giansanti
- Biomolecular Mass Spectrometry and Proteomics, Bijvoet Center for Biomolecular Research and Utrecht Institute for Pharmaceutical Sciences, Science4Life, Utrecht University, Utrecht, The Netherlands; Netherlands Proteomics Centre, Utrecht, The Netherlands; Chair of Proteomics and Bioanalytics, Technical University of Munich, Freising, Germany
| | - Albert J R Heck
- Biomolecular Mass Spectrometry and Proteomics, Bijvoet Center for Biomolecular Research and Utrecht Institute for Pharmaceutical Sciences, Science4Life, Utrecht University, Utrecht, The Netherlands; Netherlands Proteomics Centre, Utrecht, The Netherlands
| | - Anne-Claude Gavin
- European Molecular Biology Laboratory, Structural and Computational Biology Unit, Heidelberg, Germany
| | - Vera van Noort
- KU Leuven, Centre of Microbial and Plant Genetics, Kasteelpark Arenberg 20, Leuven, B-3001, Belgium; Leiden University, Institute of Biology Leiden, Leiden, The Netherlands.
| |
Collapse
|
28
|
Banu H, Joseph MC, Nisar MN. In-silico approach to investigate death domains associated with nano-particle-mediated cellular responses. Comput Biol Chem 2018; 75:11-23. [DOI: 10.1016/j.compbiolchem.2018.04.013] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2017] [Revised: 04/01/2018] [Accepted: 04/21/2018] [Indexed: 11/29/2022]
|
29
|
Viricel C, de Givry S, Schiex T, Barbe S. Cost function network-based design of protein–protein interactions: predicting changes in binding affinity. Bioinformatics 2018; 34:2581-2589. [DOI: 10.1093/bioinformatics/bty092] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2017] [Accepted: 02/16/2018] [Indexed: 11/14/2022] Open
Affiliation(s)
- Clément Viricel
- Laboratoire d’Ingénierie des Systèmes Biologiques et des Procédés, Université de Toulouse, CNRS, INRA, INSA, Toulouse, France
- Unité de Mathématiques et Informatique Appliquées de Toulouse, INRA, Castanet Tolosan cedex, France
| | - Simon de Givry
- Unité de Mathématiques et Informatique Appliquées de Toulouse, INRA, Castanet Tolosan cedex, France
| | - Thomas Schiex
- Unité de Mathématiques et Informatique Appliquées de Toulouse, INRA, Castanet Tolosan cedex, France
| | - Sophie Barbe
- Laboratoire d’Ingénierie des Systèmes Biologiques et des Procédés, Université de Toulouse, CNRS, INRA, INSA, Toulouse, France
| |
Collapse
|
30
|
Barlow KA, Ó Conchúir S, Thompson S, Suresh P, Lucas JE, Heinonen M, Kortemme T. Flex ddG: Rosetta Ensemble-Based Estimation of Changes in Protein-Protein Binding Affinity upon Mutation. J Phys Chem B 2018; 122:5389-5399. [PMID: 29401388 DOI: 10.1021/acs.jpcb.7b11367] [Citation(s) in RCA: 141] [Impact Index Per Article: 23.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Abstract
Computationally modeling changes in binding free energies upon mutation (interface ΔΔ G) allows large-scale prediction and perturbation of protein-protein interactions. Additionally, methods that consider and sample relevant conformational plasticity should be able to achieve higher prediction accuracy over methods that do not. To test this hypothesis, we developed a method within the Rosetta macromolecular modeling suite (flex ddG) that samples conformational diversity using "backrub" to generate an ensemble of models and then applies torsion minimization, side chain repacking, and averaging across this ensemble to estimate interface ΔΔ G values. We tested our method on a curated benchmark set of 1240 mutants, and found the method outperformed existing methods that sampled conformational space to a lesser degree. We observed considerable improvements with flex ddG over existing methods on the subset of small side chain to large side chain mutations, as well as for multiple simultaneous non-alanine mutations, stabilizing mutations, and mutations in antibody-antigen interfaces. Finally, we applied a generalized additive model (GAM) approach to the Rosetta energy function; the resulting nonlinear reweighting model improved the agreement with experimentally determined interface ΔΔ G values but also highlighted the necessity of future energy function improvements.
Collapse
Affiliation(s)
- Kyle A Barlow
- Graduate Program in Bioinformatics , University of California San Francisco , San Francisco , California , United States of America
| | - Shane Ó Conchúir
- California Institute for Quantitative Biosciences , University of California San Francisco , San Francisco , California , United States of America.,Department of Bioengineering and Therapeutic Sciences , University of California San Francisco , San Francisco , California , United States of America
| | - Samuel Thompson
- Graduate Program in Biophysics , University of California San Francisco , San Francisco , California , United States of America
| | - Pooja Suresh
- Graduate Program in Biophysics , University of California San Francisco , San Francisco , California , United States of America
| | - James E Lucas
- Graduate Program in Bioengineering , University of California San Francisco , San Francisco , California , United States of America
| | - Markus Heinonen
- Department of Computer Science , Aalto University , Espoo , Finland.,Helsinki Institute for Information Technology (HIIT) , Helsinki , Finland
| | - Tanja Kortemme
- Graduate Program in Bioinformatics , University of California San Francisco , San Francisco , California , United States of America.,California Institute for Quantitative Biosciences , University of California San Francisco , San Francisco , California , United States of America.,Department of Bioengineering and Therapeutic Sciences , University of California San Francisco , San Francisco , California , United States of America.,Graduate Program in Biophysics , University of California San Francisco , San Francisco , California , United States of America.,Graduate Program in Bioengineering , University of California San Francisco , San Francisco , California , United States of America.,Chan Zuckerberg Biohub , San Francisco , California 94158 , United States
| |
Collapse
|
31
|
Buß O, Rudat J, Ochsenreither K. FoldX as Protein Engineering Tool: Better Than Random Based Approaches? Comput Struct Biotechnol J 2018; 16:25-33. [PMID: 30275935 PMCID: PMC6158775 DOI: 10.1016/j.csbj.2018.01.002] [Citation(s) in RCA: 141] [Impact Index Per Article: 23.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2017] [Revised: 12/21/2017] [Accepted: 01/20/2018] [Indexed: 02/04/2023] Open
Abstract
Improving protein stability is an important goal for basic research as well as for clinical and industrial applications but no commonly accepted and widely used strategy for efficient engineering is known. Beside random approaches like error prone PCR or physical techniques to stabilize proteins, e.g. by immobilization, in silico approaches are gaining more attention to apply target-oriented mutagenesis. In this review different algorithms for the prediction of beneficial mutation sites to enhance protein stability are summarized and the advantages and disadvantages of FoldX are highlighted. The question whether the prediction of mutation sites by the algorithm FoldX is more accurate than random based approaches is addressed.
Collapse
Affiliation(s)
- Oliver Buß
- Institute of Process Engineering in Life Sciences, Section II: Technical Biology, Karlsruhe Institute of Technology, Karlsruhe, Germany
| | | | | |
Collapse
|
32
|
Nosrati M, Solbak S, Nordesjö O, Nissbeck M, Dourado DFAR, Andersson KG, Housaindokht MR, Löfblom J, Virtanen A, Danielson UH, Flores SC. Insights from engineering the Affibody-Fc interaction with a computational-experimental method. Protein Eng Des Sel 2017; 30:593-601. [DOI: 10.1093/protein/gzx023] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2016] [Accepted: 04/12/2017] [Indexed: 01/25/2023] Open
|
33
|
Lees WD, Stejskal L, Moss DS, Shepherd AJ. Investigating Substitutions in Antibody-Antigen Complexes Using Molecular Dynamics: A Case Study with Broad-spectrum, Influenza A Antibodies. Front Immunol 2017; 8:143. [PMID: 28261207 PMCID: PMC5309259 DOI: 10.3389/fimmu.2017.00143] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2016] [Accepted: 01/30/2017] [Indexed: 11/20/2022] Open
Abstract
In studying the binding of host antibodies to the surface antigens of pathogens, the structural and functional characterization of antibody–antigen complexes by X-ray crystallography and binding assay is important. However, the characterization requires experiments that are typically time consuming and expensive: thus, many antibody–antigen complexes are under-characterized. For vaccine development and disease surveillance, it is often vital to assess the impact of amino acid substitutions on antibody binding. For example, are there antibody substitutions capable of improving binding without a loss of breadth, or antigen substitutions that lead to antigenic escape? The questions cannot be answered reliably from sequence variation alone, exhaustive substitution assays are usually impractical, and alanine scans provide at best an incomplete identification of the critical residue–residue interactions. Here, we show that, given an initial structure of an antibody bound to an antigen, molecular dynamics simulations using the energy method molecular mechanics with Generalized Born surface area (MM/GBSA) can model the impact of single amino acid substitutions on antibody–antigen binding energy. We apply the technique to three broad-spectrum antibodies to influenza A hemagglutinin and examine both previously characterized and novel variant strains observed in the human population that may give rise to antigenic escape. We find that in some cases the impact of a substitution is local, while in others it causes a reorientation of the antibody with wide-ranging impact on residue–residue interactions: this explains, in part, why the change in chemical properties of a residue can be, on its own, a poor predictor of overall change in binding energy. Our estimates are in good agreement with experimental results—indeed, they approximate the degree of agreement between different experimental techniques. Simulations were performed on commodity computer hardware; hence, this approach has the potential to be widely adopted by those undertaking infectious disease research. Novel aspects of this research include the application of MM/GBSA to investigate binding between broadly binding antibodies and a viral glycoprotein; the development of an approach for visualizing substrate–ligand interactions; and the use of experimental assay data to rescale our predictions, allowing us to make inferences about absolute, as well as relative, changes in binding energy.
Collapse
Affiliation(s)
- William D Lees
- Institute of Structural and Molecular Biology, Birkbeck College , London , UK
| | - Lenka Stejskal
- Institute of Structural and Molecular Biology, Birkbeck College , London , UK
| | - David S Moss
- Institute of Structural and Molecular Biology, Birkbeck College , London , UK
| | - Adrian J Shepherd
- Institute of Structural and Molecular Biology, Birkbeck College , London , UK
| |
Collapse
|
34
|
Gromiha MM, Yugandhar K, Jemimah S. Protein-protein interactions: scoring schemes and binding affinity. Curr Opin Struct Biol 2016; 44:31-38. [PMID: 27866112 DOI: 10.1016/j.sbi.2016.10.016] [Citation(s) in RCA: 80] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2016] [Revised: 09/30/2016] [Accepted: 10/25/2016] [Indexed: 01/16/2023]
Abstract
Protein-protein interactions mediate several cellular functions, which can be understood from the information obtained using the three-dimensional structures of protein-protein complexes and binding affinity data. This review focuses on computational aspects of predicting the best native-like complex structure and binding affinities. The first part covers the prediction of protein-protein complex structures and the advantages of conformational searching and scoring functions in protein-protein docking. The second part is devoted to various aspects of protein-protein interaction thermodynamics, such as databases for binding affinities and other thermodynamic parameters, computational methods to predict the binding affinity using either the three-dimensional structures of complexes or amino acid sequences, and change in binding affinities of the complexes upon mutations. We provide the latest developments on protein-protein docking and binding affinity studies along with a list of available computational resources for understanding protein-protein interactions.
Collapse
Affiliation(s)
- M Michael Gromiha
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, Tamil Nadu, India.
| | - K Yugandhar
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, Tamil Nadu, India
| | - Sherlyn Jemimah
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, Tamil Nadu, India
| |
Collapse
|
35
|
Dourado DFAR, Pohle S, Carvalho ATP, Dheeman DS, Caswell JM, Skvortsov T, Miskelly I, Brown RT, Quinn DJ, Allen CCR, Kulakov L, Huang M, Moody TS. Rational Design of a (S)-Selective-Transaminase for Asymmetric Synthesis of (1S)-1-(1,1′-biphenyl-2-yl)ethanamine. ACS Catal 2016. [DOI: 10.1021/acscatal.6b02380] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Affiliation(s)
- Daniel F. A. R. Dourado
- School
of Chemistry and Chemical Engineering, Queen’s University Belfast, David
Keir Building, Stranmillis Road, Belfast BT9 5AG, Northern Ireland, United Kingdom
- Department
of Biocatalysis and Isotope Chemistry, Almac Sciences, 20 Seagoe Industrial
Estate, Craigavon BT63
5QD, Northern Ireland, United Kingdom
| | - Stefan Pohle
- Department
of Biocatalysis and Isotope Chemistry, Almac Sciences, 20 Seagoe Industrial
Estate, Craigavon BT63
5QD, Northern Ireland, United Kingdom
| | - Alexandra T. P. Carvalho
- School
of Chemistry and Chemical Engineering, Queen’s University Belfast, David
Keir Building, Stranmillis Road, Belfast BT9 5AG, Northern Ireland, United Kingdom
- Department
of Biocatalysis and Isotope Chemistry, Almac Sciences, 20 Seagoe Industrial
Estate, Craigavon BT63
5QD, Northern Ireland, United Kingdom
| | - Dharmendra S. Dheeman
- Department
of Biocatalysis and Isotope Chemistry, Almac Sciences, 20 Seagoe Industrial
Estate, Craigavon BT63
5QD, Northern Ireland, United Kingdom
| | - Jill M. Caswell
- Department
of Biocatalysis and Isotope Chemistry, Almac Sciences, 20 Seagoe Industrial
Estate, Craigavon BT63
5QD, Northern Ireland, United Kingdom
| | - Timofey Skvortsov
- Department
of Biocatalysis and Isotope Chemistry, Almac Sciences, 20 Seagoe Industrial
Estate, Craigavon BT63
5QD, Northern Ireland, United Kingdom
- School
of Biological Sciences, Queen’s University Belfast, Medical Biology
Centre, 97 Lisburn Road, Belfast BT9 7BL, Northern Ireland, United Kingdom
| | - Iain Miskelly
- Department
of Biocatalysis and Isotope Chemistry, Almac Sciences, 20 Seagoe Industrial
Estate, Craigavon BT63
5QD, Northern Ireland, United Kingdom
| | - Rodney T. Brown
- Department
of Biocatalysis and Isotope Chemistry, Almac Sciences, 20 Seagoe Industrial
Estate, Craigavon BT63
5QD, Northern Ireland, United Kingdom
| | - Derek J. Quinn
- Department
of Biocatalysis and Isotope Chemistry, Almac Sciences, 20 Seagoe Industrial
Estate, Craigavon BT63
5QD, Northern Ireland, United Kingdom
| | - Christopher C. R. Allen
- School
of Biological Sciences, Queen’s University Belfast, Medical Biology
Centre, 97 Lisburn Road, Belfast BT9 7BL, Northern Ireland, United Kingdom
| | - Leonid Kulakov
- School
of Biological Sciences, Queen’s University Belfast, Medical Biology
Centre, 97 Lisburn Road, Belfast BT9 7BL, Northern Ireland, United Kingdom
| | - Meilan Huang
- School
of Chemistry and Chemical Engineering, Queen’s University Belfast, David
Keir Building, Stranmillis Road, Belfast BT9 5AG, Northern Ireland, United Kingdom
| | - Thomas S. Moody
- Department
of Biocatalysis and Isotope Chemistry, Almac Sciences, 20 Seagoe Industrial
Estate, Craigavon BT63
5QD, Northern Ireland, United Kingdom
| |
Collapse
|
36
|
Cheng RR, Nordesjö O, Hayes RL, Levine H, Flores SC, Onuchic JN, Morcos F. Connecting the Sequence-Space of Bacterial Signaling Proteins to Phenotypes Using Coevolutionary Landscapes. Mol Biol Evol 2016; 33:3054-3064. [PMID: 27604223 PMCID: PMC5100047 DOI: 10.1093/molbev/msw188] [Citation(s) in RCA: 48] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
Two-component signaling (TCS) is the primary means by which bacteria sense and respond to the environment. TCS involves two partner proteins working in tandem, which interact to perform cellular functions whereas limiting interactions with non-partners (i.e., cross-talk). We construct a Potts model for TCS that can quantitatively predict how mutating amino acid identities affect the interaction between TCS partners and non-partners. The parameters of this model are inferred directly from protein sequence data. This approach drastically reduces the computational complexity of exploring the sequence-space of TCS proteins. As a stringent test, we compare its predictions to a recent comprehensive mutational study, which characterized the functionality of 204 mutational variants of the PhoQ kinase in Escherichia coli We find that our best predictions accurately reproduce the amino acid combinations found in experiment, which enable functional signaling with its partner PhoP. These predictions demonstrate the evolutionary pressure to preserve the interaction between TCS partners as well as prevent unwanted cross-talk. Further, we calculate the mutational change in the binding affinity between PhoQ and PhoP, providing an estimate to the amount of destabilization needed to disrupt TCS.
Collapse
Affiliation(s)
- R R Cheng
- Center for Theoretical Biological Physics, Rice University, Houston, TX
| | - O Nordesjö
- Department of Cell and Molecular Biology, Uppsala University, Uppsala, Sweden
| | - R L Hayes
- Department of Biophysics, University of Michigan, Ann Arbor, MI
| | - H Levine
- Center for Theoretical Biological Physics, Rice University, Houston, TX.,Department of Bioengineering, Rice University, Houston, TX
| | - S C Flores
- Department of Cell and Molecular Biology, Uppsala University, Uppsala, Sweden
| | - J N Onuchic
- Center for Theoretical Biological Physics, Rice University, Houston, TX .,Department of Physics and Astronomy, Rice University, Houston, TX.,Department of Chemistry, and Biosciences, Rice University, Houston, TX
| | - F Morcos
- Department of Biological Sciences and Center for Systems Biology, University of Texas at Dallas, Dallas, TX
| |
Collapse
|
37
|
Geng C, Vangone A, Bonvin AMJJ. Exploring the interplay between experimental methods and the performance of predictors of binding affinity change upon mutations in protein complexes. Protein Eng Des Sel 2016; 29:291-299. [PMID: 27284087 DOI: 10.1093/protein/gzw020] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2016] [Accepted: 05/09/2016] [Indexed: 11/14/2022] Open
Abstract
Reliable prediction of binding affinity changes (ΔΔG) upon mutations in protein complexes relies not only on the performance of computational methods but also on the availability and quality of experimental data. Binding affinity changes can be measured by various experimental methods with different accuracies and limitations. To understand the impact of these on the prediction of binding affinity change, we present the Database of binding Affinity Change Upon Mutation (DACUM), a database of 1872 binding affinity changes upon single-point mutations, a subset of the SKEMPI database (Moal,I.H. and Fernández-Recio,J. Bioinformatics, 2012;28:2600-2607) extended with information on the experimental methods used for ΔΔG measurements. The ΔΔG data were classified into different data sets based on the experimental method used and the position of the mutation (interface and non-interface). We tested the prediction performance of the original HADDOCK score, a newly trained version of it and mutation Cutoff Scanning Matrix (Pires,D.E.V., Ascher,D.B. and Blundell,T.L. Bioinformatics 2014;30:335-342), one of the best reported ΔΔG predictors so far, on these various data sets. Our results demonstrate a strong impact of the experimental methods on the performance of binding affinity change predictors for protein complexes. This underscores the importance of properly considering and carefully choosing experimental methods in the development of novel binding affinity change predictors. The DACUM database is available online at https://github.com/haddocking/DACUM.
Collapse
Affiliation(s)
- Cunliang Geng
- Computational Structural Biology Group, Bijvoet Center for Biomolecular Research, Faculty of Science-Chemistry, Utrecht University, Padualaan 8, Utrecht 3584 CH, The Netherlands
| | - Anna Vangone
- Computational Structural Biology Group, Bijvoet Center for Biomolecular Research, Faculty of Science-Chemistry, Utrecht University, Padualaan 8, Utrecht 3584 CH, The Netherlands
| | - Alexandre M J J Bonvin
- Computational Structural Biology Group, Bijvoet Center for Biomolecular Research, Faculty of Science-Chemistry, Utrecht University, Padualaan 8, Utrecht 3584 CH, The Netherlands
| |
Collapse
|
38
|
Dourado DFAR, Flores SC. Modeling and fitting protein-protein complexes to predict change of binding energy. Sci Rep 2016; 6:25406. [PMID: 27173910 PMCID: PMC4865953 DOI: 10.1038/srep25406] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2016] [Accepted: 04/18/2016] [Indexed: 01/18/2023] Open
Abstract
It is possible to accurately and economically predict change in protein-protein interaction energy upon mutation (ΔΔG), when a high-resolution structure of the complex is available. This is of growing usefulness for design of high-affinity or otherwise modified binding proteins for therapeutic, diagnostic, industrial, and basic science applications. Recently the field has begun to pursue ΔΔG prediction for homology modeled complexes, but so far this has worked mostly for cases of high sequence identity. If the interacting proteins have been crystallized in free (uncomplexed) form, in a majority of cases it is possible to find a structurally similar complex which can be used as the basis for template-based modeling. We describe how to use MMB to create such models, and then use them to predict ΔΔG, using a dataset consisting of free target structures, co-crystallized template complexes with sequence identify with respect to the targets as low as 44%, and experimental ΔΔG measurements. We obtain similar results by fitting to a low-resolution Cryo-EM density map. Results suggest that other structural constraints may lead to a similar outcome, making the method even more broadly applicable.
Collapse
Affiliation(s)
- Daniel F A R Dourado
- Department of Cell and Molecular Biology, Computational and Systems Biology, Uppsala University, Biomedical Center Box 596, 751 24, Uppsala, Sweden
| | - Samuel Coulbourn Flores
- Department of Cell and Molecular Biology, Computational and Systems Biology, Uppsala University, Biomedical Center Box 596, 751 24, Uppsala, Sweden
| |
Collapse
|
39
|
Tek A, Korostelev AA, Flores SC. MMB-GUI: a fast morphing method demonstrates a possible ribosomal tRNA translocation trajectory. Nucleic Acids Res 2015; 44:95-105. [PMID: 26673695 PMCID: PMC4705676 DOI: 10.1093/nar/gkv1457] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2015] [Accepted: 11/28/2015] [Indexed: 02/07/2023] Open
Abstract
Easy-to-use macromolecular viewers, such as UCSF Chimera, are a standard tool in structural biology. They allow rendering and performing geometric operations on large complexes, such as viruses and ribosomes. Dynamical simulation codes enable modeling of conformational changes, but may require considerable time and many CPUs. There is an unmet demand from structural and molecular biologists for software in the middle ground, which would allow visualization combined with quick and interactive modeling of conformational changes, even of large complexes. This motivates MMB-GUI. MMB uses an internal-coordinate, multiscale approach, yielding as much as a 2000-fold speedup over conventional simulation methods. We use Chimera as an interactive graphical interface to control MMB. We show how this can be used for morphing of macromolecules that can be heterogeneous in biopolymer type, sequence, and chain count, accurately recapitulating structural intermediates. We use MMB-GUI to create a possible trajectory of EF-G mediated gate-passing translocation in the ribosome, with all-atom structures. This shows that the GUI makes modeling of large macromolecules accessible to a wide audience. The morph highlights similarities in tRNA conformational changes as tRNA translocates from A to P and from P to E sites and suggests that tRNA flexibility is critical for translocation completion.
Collapse
Affiliation(s)
- Alex Tek
- Cell and Molecular Biology Department, Uppsala University, Box 596, Uppsala 751 24, Sweden
| | - Andrei A Korostelev
- RNA Therapeutics Institute, Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, 368 Plantation St., Worcester, MA 01605, USA
| | | |
Collapse
|
40
|
Brender JR, Zhang Y. Predicting the Effect of Mutations on Protein-Protein Binding Interactions through Structure-Based Interface Profiles. PLoS Comput Biol 2015; 11:e1004494. [PMID: 26506533 PMCID: PMC4624718 DOI: 10.1371/journal.pcbi.1004494] [Citation(s) in RCA: 99] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2015] [Accepted: 08/06/2015] [Indexed: 11/18/2022] Open
Abstract
The formation of protein-protein complexes is essential for proteins to perform their physiological functions in the cell. Mutations that prevent the proper formation of the correct complexes can have serious consequences for the associated cellular processes. Since experimental determination of protein-protein binding affinity remains difficult when performed on a large scale, computational methods for predicting the consequences of mutations on binding affinity are highly desirable. We show that a scoring function based on interface structure profiles collected from analogous protein-protein interactions in the PDB is a powerful predictor of protein binding affinity changes upon mutation. As a standalone feature, the differences between the interface profile score of the mutant and wild-type proteins has an accuracy equivalent to the best all-atom potentials, despite being two orders of magnitude faster once the profile has been constructed. Due to its unique sensitivity in collecting the evolutionary profiles of analogous binding interactions and the high speed of calculation, the interface profile score has additional advantages as a complementary feature to combine with physics-based potentials for improving the accuracy of composite scoring approaches. By incorporating the sequence-derived and residue-level coarse-grained potentials with the interface structure profile score, a composite model was constructed through the random forest training, which generates a Pearson correlation coefficient >0.8 between the predicted and observed binding free-energy changes upon mutation. This accuracy is comparable to, or outperforms in most cases, the current best methods, but does not require high-resolution full-atomic models of the mutant structures. The binding interface profiling approach should find useful application in human-disease mutation recognition and protein interface design studies. Few proteins carry out their tasks in isolation. Instead, proteins combine with each other in complicated ways that can be affected by either the natural genetic variation that occurs among people or by disease causing mutations such as those that occur in cancer or in genetic disorders. To understand how these mutations affect our health, it is necessary to understand how mutations can affect the strength of the interactions that bind proteins together. This is a difficult task to do in a laboratory on a large scale and scientists are increasingly turning to computational methods to predict these effects in advance. We show that by looking at the multiple alignments of similar protein-protein complex structures at the interface regions, new constraints based on the evolution of the three dimensional structures of proteins can be made to predict which mutations are compatible with two proteins interacting and which are not.
Collapse
Affiliation(s)
- Jeffrey R. Brender
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Yang Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, United States of America
- Department of Biological Chemistry, University of Michigan, Ann Arbor, Michigan, United States of America
- * E-mail:
| |
Collapse
|
41
|
Koripella RK, Holm M, Dourado D, Mandava CS, Flores S, Sanyal S. A conserved histidine in switch-II of EF-G moderates release of inorganic phosphate. Sci Rep 2015; 5:12970. [PMID: 26264741 PMCID: PMC4532990 DOI: 10.1038/srep12970] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2015] [Accepted: 07/13/2015] [Indexed: 01/13/2023] Open
Abstract
Elongation factor G (EF-G), a translational GTPase responsible for tRNA-mRNA translocation possesses a conserved histidine (H91 in Escherichia coli) at the apex of switch-II, which has been implicated in GTPase activation and GTP hydrolysis. While H91A, H91R and H91E mutants showed different degrees of defect in ribosome associated GTP hydrolysis, H91Q behaved like the WT. However, all these mutants, including H91Q, are much more defective in inorganic phosphate (Pi) release, thereby suggesting that H91 facilitates Pi release. In crystal structures of the ribosome bound EF-G•GTP a tight coupling between H91 and the γ-phosphate of GTP can be seen. Following GTP hydrolysis, H91 flips ~140° in the opposite direction, probably with Pi still coupled to it. This, we suggest, promotes Pi to detach from GDP and reach the inter-domain space of EF-G, which constitutes an exit path for the Pi. Molecular dynamics simulations are consistent with this hypothesis and demonstrate a vital role of an Mg2+ ion in the process.
Collapse
Affiliation(s)
- Ravi Kiran Koripella
- Department of Cell and Molecular Biology, Uppsala University, Box-596, BMC, 75124, Uppsala, Sweden
| | - Mikael Holm
- Department of Cell and Molecular Biology, Uppsala University, Box-596, BMC, 75124, Uppsala, Sweden
| | - Daniel Dourado
- Department of Cell and Molecular Biology, Uppsala University, Box-596, BMC, 75124, Uppsala, Sweden
| | - Chandra Sekhar Mandava
- Department of Cell and Molecular Biology, Uppsala University, Box-596, BMC, 75124, Uppsala, Sweden
| | - Samuel Flores
- Department of Cell and Molecular Biology, Uppsala University, Box-596, BMC, 75124, Uppsala, Sweden
| | - Suparna Sanyal
- Department of Cell and Molecular Biology, Uppsala University, Box-596, BMC, 75124, Uppsala, Sweden
| |
Collapse
|
42
|
Petukh M, Li M, Alexov E. Predicting Binding Free Energy Change Caused by Point Mutations with Knowledge-Modified MM/PBSA Method. PLoS Comput Biol 2015; 11:e1004276. [PMID: 26146996 PMCID: PMC4492929 DOI: 10.1371/journal.pcbi.1004276] [Citation(s) in RCA: 86] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2015] [Accepted: 04/09/2015] [Indexed: 11/18/2022] Open
Abstract
A new methodology termed Single Amino Acid Mutation based change in Binding free Energy (SAAMBE) was developed to predict the changes of the binding free energy caused by mutations. The method utilizes 3D structures of the corresponding protein-protein complexes and takes advantage of both approaches: sequence- and structure-based methods. The method has two components: a MM/PBSA-based component, and an additional set of statistical terms delivered from statistical investigation of physico-chemical properties of protein complexes. While the approach is rigid body approach and does not explicitly consider plausible conformational changes caused by the binding, the effect of conformational changes, including changes away from binding interface, on electrostatics are mimicked with amino acid specific dielectric constants. This provides significant improvement of SAAMBE predictions as indicated by better match against experimentally determined binding free energy changes over 1300 mutations in 43 proteins. The final benchmarking resulted in a very good agreement with experimental data (correlation coefficient 0.624) while the algorithm being fast enough to allow for large-scale calculations (the average time is less than a minute per mutation).
Collapse
Affiliation(s)
- Marharyta Petukh
- Computational Biophysics and Bioinformatics, Department of Physics, Clemson University, Clemson, South Carolina, United States of America
| | - Minghui Li
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, United States of America
| | - Emil Alexov
- Computational Biophysics and Bioinformatics, Department of Physics, Clemson University, Clemson, South Carolina, United States of America
- * E-mail:
| |
Collapse
|
43
|
Fiesel FC, Caulfield TR, Moussaud-Lamodière EL, Ogaki K, Dourado DFAR, Flores SC, Ross OA, Springer W. Structural and Functional Impact of Parkinson Disease-Associated Mutations in the E3 Ubiquitin Ligase Parkin. Hum Mutat 2015; 36:774-86. [PMID: 25939424 DOI: 10.1002/humu.22808] [Citation(s) in RCA: 59] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2014] [Accepted: 04/23/2015] [Indexed: 12/24/2022]
Abstract
Mutations in the PARKIN/PARK2 gene that result in loss-of-function of the encoded, neuroprotective E3 ubiquitin ligase Parkin cause recessive, familial early-onset Parkinson disease. As an increasing number of rare Parkin sequence variants with unclear pathogenicity are identified, structure-function analyses will be critical to determine their disease relevance. Depending on the specific amino acids affected, several distinct pathomechanisms can result in loss of Parkin function. These include disruption of overall Parkin folding, decreased solubility, and protein aggregation. However pathogenic effects can also result from misregulation of Parkin autoinhibition and of its enzymatic functions. In addition, interference of binding to coenzymes, substrates, and adaptor proteins can affect its catalytic activity too. Herein, we have performed a comprehensive structural and functional analysis of 21 PARK2 missense mutations distributed across the individual protein domains. Using this combined approach, we were able to pinpoint some of the pathogenic mechanisms of individual sequence variants. Similar analyses will be critical in gaining a complete understanding of the complex regulations and enzymatic functions of Parkin. These studies will not only highlight the important residues, but will also help to develop novel therapeutics aimed at activating and preserving an active, neuroprotective form of Parkin.
Collapse
Affiliation(s)
| | | | | | - Kotaro Ogaki
- Department of Neuroscience, Mayo Clinic, Jacksonville, Florida
| | - Daniel F A R Dourado
- Department of Cell & Molecular Biology, Computational & Systems Biology, Uppsala University, Uppsala, Sweden
| | - Samuel C Flores
- Department of Cell & Molecular Biology, Computational & Systems Biology, Uppsala University, Uppsala, Sweden
| | - Owen A Ross
- Department of Neuroscience, Mayo Clinic, Jacksonville, Florida.,Mayo Graduate School, Neurobiology of Disease, Mayo Clinic, Jacksonville, Florida
| | - Wolfdieter Springer
- Department of Neuroscience, Mayo Clinic, Jacksonville, Florida.,Mayo Graduate School, Neurobiology of Disease, Mayo Clinic, Jacksonville, Florida
| |
Collapse
|
44
|
Ascher DB, Jubb HC, Pires DEV, Ochi T, Higueruelo A, Blundell TL. Protein-Protein Interactions: Structures and Druggability. MULTIFACETED ROLES OF CRYSTALLOGRAPHY IN MODERN DRUG DISCOVERY 2015. [DOI: 10.1007/978-94-017-9719-1_12] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
|
45
|
Caulfield TR, Fiesel FC, Moussaud-Lamodière EL, Dourado DFAR, Flores SC, Springer W. Phosphorylation by PINK1 releases the UBL domain and initializes the conformational opening of the E3 ubiquitin ligase Parkin. PLoS Comput Biol 2014; 10:e1003935. [PMID: 25375667 PMCID: PMC4222639 DOI: 10.1371/journal.pcbi.1003935] [Citation(s) in RCA: 88] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2014] [Accepted: 09/25/2014] [Indexed: 11/19/2022] Open
Abstract
Loss-of-function mutations in PINK1 or PARKIN are the most common causes of autosomal recessive Parkinson's disease. Both gene products, the Ser/Thr kinase PINK1 and the E3 Ubiquitin ligase Parkin, functionally cooperate in a mitochondrial quality control pathway. Upon stress, PINK1 activates Parkin and enables its translocation to and ubiquitination of damaged mitochondria to facilitate their clearance from the cell. Though PINK1-dependent phosphorylation of Ser65 is an important initial step, the molecular mechanisms underlying the activation of Parkin's enzymatic functions remain unclear. Using molecular modeling, we generated a complete structural model of human Parkin at all atom resolution. At steady state, the Ub ligase is maintained inactive in a closed, auto-inhibited conformation that results from intra-molecular interactions. Evidently, Parkin has to undergo major structural rearrangements in order to unleash its catalytic activity. As a spark, we have modeled PINK1-dependent Ser65 phosphorylation in silico and provide the first molecular dynamics simulation of Parkin conformations along a sequential unfolding pathway that could release its intertwined domains and enable its catalytic activity. We combined free (unbiased) molecular dynamics simulation, Monte Carlo algorithms, and minimal-biasing methods with cell-based high content imaging and biochemical assays. Phosphorylation of Ser65 results in widening of a newly defined cleft and dissociation of the regulatory N-terminal UBL domain. This motion propagates through further opening conformations that allow binding of an Ub-loaded E2 co-enzyme. Subsequent spatial reorientation of the catalytic centers of both enzymes might facilitate the transfer of the Ub moiety to charge Parkin. Our structure-function study provides the basis to elucidate regulatory mechanisms and activity of the neuroprotective Parkin. This may open up new avenues for the development of small molecule Parkin activators through targeted drug design. Parkinson's disease (PD) is a devastating neurological condition caused by the selective and progressive degeneration of dopaminergic neurons in the brain. Loss-of-function mutations in the PINK1 or PARKIN genes are the most common causes of recessively inherited PD. Together the encoded proteins coordinate a protective cellular quality control pathway that allows elimination of impaired mitochondria in order to prevent further cellular damage and ultimately death. Although it is known that the kinase PINK1 operates upstream and activates the E3 Ubiquitin ligase Parkin, the molecular mechanisms remain elusive. Here, we combined state-of-the art computational and functional biological methods to demonstrate that Parkin is sequentially activated through PINK1-dependent phosphorylation and subsequent structural rearrangement. The induced motions result in release of Parkin's closed, auto-inhibited conformation to liberate its enzymatic functions. We provide for the first time a complete protein structure of Parkin at an all atom resolution and a comprehensive molecular dynamics simulation of its activation and opening conformations. The generated models will allow uncovering the exact mechanisms of regulation and enzymatic activity of Parkin and potentially the development of novel therapeutics through a structure-function-based drug design.
Collapse
Affiliation(s)
- Thomas R. Caulfield
- Department of Neuroscience, Mayo Clinic Jacksonville, Florida, United States of America
- * E-mail: (TRC); (WS)
| | - Fabienne C. Fiesel
- Department of Neuroscience, Mayo Clinic Jacksonville, Florida, United States of America
| | | | - Daniel F. A. R. Dourado
- Department of Cell & Molecular Biology, Computational & Systems Biology, Uppsala University, Uppsala, Sweden
| | - Samuel C. Flores
- Department of Cell & Molecular Biology, Computational & Systems Biology, Uppsala University, Uppsala, Sweden
| | - Wolfdieter Springer
- Department of Neuroscience, Mayo Clinic Jacksonville, Florida, United States of America
- Mayo Graduate School, Neurobiology of Disease, Mayo Clinic, Jacksonville, Florida, United States of America
- * E-mail: (TRC); (WS)
| |
Collapse
|