1
|
Gizzio J, Thakur A, Haldane A, Post CB, Levy RM. Evolutionary sequence and structural basis for the distinct conformational landscapes of Tyr and Ser/Thr kinases. Nat Commun 2024; 15:6545. [PMID: 39095350 PMCID: PMC11297160 DOI: 10.1038/s41467-024-50812-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2024] [Accepted: 07/22/2024] [Indexed: 08/04/2024] Open
Abstract
Protein kinases are molecular machines with rich sequence variation that distinguishes the two main evolutionary branches - tyrosine kinases (TKs) from serine/threonine kinases (STKs). Using a sequence co-variation Potts statistical energy model we previously concluded that TK catalytic domains are more likely than STKs to adopt an inactive conformation with the activation loop in an autoinhibitory folded conformation, due to intrinsic sequence effects. Here we investigate the structural basis for this phenomenon by integrating the sequence-based model with structure-based molecular dynamics (MD) to determine the effects of mutations on the free energy difference between active and inactive conformations, using a thermodynamic cycle involving many (n = 108) protein-mutation free energy perturbation (FEP) simulations in the active and inactive conformations. The sequence and structure-based results are consistent and support the hypothesis that the inactive conformation DFG-out Activation Loop Folded, is a functional regulatory state that has been stabilized in TKs relative to STKs over the course of their evolution via the accumulation of residue substitutions in the activation loop and catalytic loop that facilitate distinct substrate binding modes in trans and additional modes of regulation in cis for TKs.
Collapse
Affiliation(s)
- Joan Gizzio
- Center for Biophysics and Computational Biology, Temple University, Philadelphia, PA, USA
- Department of Chemistry, Temple University, Philadelphia, PA, USA
| | - Abhishek Thakur
- Center for Biophysics and Computational Biology, Temple University, Philadelphia, PA, USA
- Department of Chemistry, Temple University, Philadelphia, PA, USA
| | - Allan Haldane
- Center for Biophysics and Computational Biology, Temple University, Philadelphia, PA, USA
- Department of Physics, Temple University, Philadelphia, PA, USA
| | - Carol Beth Post
- Borch Department of Medicinal Chemistry and Molecular Pharmacology, Purdue University, West Lafayette, IN, USA
| | - Ronald M Levy
- Center for Biophysics and Computational Biology, Temple University, Philadelphia, PA, USA.
- Department of Chemistry, Temple University, Philadelphia, PA, USA.
| |
Collapse
|
2
|
Valanciute A, Nygaard L, Zschach H, Maglegaard Jepsen M, Lindorff-Larsen K, Stein A. Accurate protein stability predictions from homology models. Comput Struct Biotechnol J 2022; 21:66-73. [PMID: 36514339 PMCID: PMC9729920 DOI: 10.1016/j.csbj.2022.11.048] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2022] [Revised: 11/22/2022] [Accepted: 11/23/2022] [Indexed: 11/27/2022] Open
Abstract
Calculating changes in protein stability (ΔΔG) has been shown to be central for predicting the consequences of single amino acid substitutions in protein engineering as well as interpretation of genomic variants for disease risk. Structure-based calculations are considered most accurate, however the tools used to calculate ΔΔGs have been developed on experimentally resolved structures. Extending those calculations to homology models based on related proteins would greatly extend their applicability as large parts of e.g. the human proteome are not structurally resolved. In this study we aim to investigate the accuracy of ΔΔG values predicted on homology models compared to crystal structures. Specifically, we identified four proteins with a large number of experimentally tested ΔΔGs and templates for homology modeling across a broad range of sequence identities, and selected three methods for ΔΔG calculations to test. We find that ΔΔG-values predicted from homology models compare equally well to experimental ΔΔGs as those predicted on experimentally established crystal structures, as long as the sequence identity of the model template to the target protein is at least 40%. In particular, the Rosetta cartesian_ddg protocol is robust against the small perturbations in the structure which homology modeling introduces. In an independent assessment, we observe a similar trend when using ΔΔGs to categorize variants as low or wild-type-like abundance. Overall, our results show that stability calculations performed on homology models can substitute for those on crystal structures with acceptable accuracy as long as the model is built on a template with sequence identity of at least 40% to the target protein.
Collapse
Affiliation(s)
- Audrone Valanciute
- Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Lasse Nygaard
- Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Henrike Zschach
- Section for Computational and RNA Biology, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Michael Maglegaard Jepsen
- Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Kresten Lindorff-Larsen
- Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark,Corresponding authors.
| | - Amelie Stein
- Section for Computational and RNA Biology, Department of Biology, University of Copenhagen, Copenhagen, Denmark,Corresponding authors.
| |
Collapse
|
3
|
Mohan S, Ozer HG, Ray WC. The Importance of Weakly Co-Evolving Residue Networks in Proteins is Revealed by Visual Analytics. FRONTIERS IN BIOINFORMATICS 2022; 2:836526. [PMID: 36304294 PMCID: PMC9580873 DOI: 10.3389/fbinf.2022.836526] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2021] [Accepted: 03/07/2022] [Indexed: 11/13/2022] Open
Abstract
Small changes in a protein’s core packing produce changes in function, and even small changes in function bias species fitness and survival. Therefore individually deleterious mutations should be evolutionarily coupled with compensating mutations that recover fitness. Co-evolving pairs of mutations should be littered across evolutionary history. Despite longstanding intuition, the results of co-evolution analyses have largely disappointed expectations. Regardless of the statistics applied, only a small majority of the most strongly co-evolving residues are typically found to be in contact, and much of the “meaning” of observed co-evolution has been opaque. In a medium-sized protein of 300 amino acids, there are almost 20 million potentially-important interdependencies. It is impossible to understand this data in textual format without extreme summarization or truncation. And, due to summarization and truncation, it is impossible to identify most patterns in the data. We developed a visualization approach that eschews the common “look at a long list of statistics” approach and instead enables the user to literally look at all of the co-evolution statistics simultaneously. Users of our tool reported visually obvious “clouds” of co-evolution statistics forming distinct patterns in the data, and analysis demonstrated that these clouds had structural relevance. To determine whether this phenomenon generalized, we repeated this experiment in three proteins we had not previously studied. The results provide evidence about how structural constrains have impacted co-evolution, why previous “examine the most frequently co-evolving residues” approaches have had limited success, and additionally shed light on the biophysical importance of different types of co-evolution.
Collapse
Affiliation(s)
- Sidharth Mohan
- Interdisciplinary Graduate Program in Biophysics, The Ohio State University, Columbus, OH, United States
| | - Hatice Gulcin Ozer
- Interdisciplinary Graduate Program in Biophysics, The Ohio State University, Columbus, OH, United States
- Lilly Research Laboratories, Eli Lilly and Company, Indianapolis, IN, United States
| | - William C. Ray
- Interdisciplinary Graduate Program in Biophysics, The Ohio State University, Columbus, OH, United States
- The Battelle Center for Mathematical Medicine, The Research Institute at Nationwide Children’s Hospital, Columbus, OH, United States
- *Correspondence: William C. Ray ,
| |
Collapse
|
4
|
Gaba A, Hix MA, Suhail S, Flath B, Boysan B, Williams DR, Pelletier T, Emerman M, Morcos F, Cisneros GA, Chelico L. Divergence in Dimerization and Activity of Primate APOBEC3C. J Mol Biol 2021; 433:167306. [PMID: 34666043 PMCID: PMC9202443 DOI: 10.1016/j.jmb.2021.167306] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2021] [Revised: 10/08/2021] [Accepted: 10/08/2021] [Indexed: 11/21/2022]
Abstract
The APOBEC3 (A3) family of single-stranded DNA cytidine deaminases are host restriction factors that inhibit lentiviruses, such as HIV-1, in the absence of the Vif protein that causes their degradation. Deamination of cytidine in HIV-1 (−)DNA forms uracil that causes inactivating mutations when uracil is used as a template for (+)DNA synthesis. For APOBEC3C (A3C), the chimpanzee and gorilla orthologues are more active than human A3C, and we determined that Old World Monkey A3C from rhesus macaque (rh) is not active against HIV-1. Biochemical, virological, and coevolutionary analyses combined with molecular dynamics simulations showed that the key amino acids needed to promote rhA3C antiviral activity, 44, 45, and 144, also promoted dimerization and changes to the dynamics of loop 1, near the enzyme active site. Although forced evolution of rhA3C resulted in a similar dimer interface with hominid A3C, the key amino acid contacts were different. Overall, our results determine the basis for why rhA3C is less active than human A3C and establish the amino acid network for dimerization and increased activity. Based on identification of the key amino acids determining Old World Monkey antiviral activity we predict that other Old World Monkey A3Cs did not impart anti-lentiviral activity, despite fixation of a key residue needed for hominid A3C activity. Overall, the coevolutionary analysis of the A3C dimerization interface presented also provides a basis from which to analyze dimerization interfaces of other A3 family members.
Collapse
Affiliation(s)
- Amit Gaba
- Department of Biochemistry, Microbiology, and Immunology, College of Medicine, University of Saskatchewan, Saskatoon, Canada. https://twitter.com/optimist1023
| | - Mark A Hix
- Department of Chemistry, University of North Texas, Denton, TX, USA. https://twitter.com/markahix
| | - Sana Suhail
- Department of Biological Sciences, Center for Systems Biology, University of Texas at Dallas, Richardson, TX, USA. https://twitter.com/sakuraa_329
| | - Ben Flath
- Department of Biochemistry, Microbiology, and Immunology, College of Medicine, University of Saskatchewan, Saskatoon, Canada
| | - Brock Boysan
- Department of Chemistry, University of North Texas, Denton, TX, USA
| | - Danielle R Williams
- Division of Human Biology, Fred Hutchinson Cancer Research Center, Seattle, WA, USA. https://twitter.com/dani_renee_
| | - Tomas Pelletier
- Department of Biochemistry, Microbiology, and Immunology, College of Medicine, University of Saskatchewan, Saskatoon, Canada
| | - Michael Emerman
- Division of Human Biology, Fred Hutchinson Cancer Research Center, Seattle, WA, USA; Division of Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, USA. https://twitter.com/memerman
| | - Faruck Morcos
- Department of Biological Sciences, Center for Systems Biology, University of Texas at Dallas, Richardson, TX, USA; Department of Bioengineering, University of Texas at Dallas, Dallas, TX, USA. https://twitter.com/MorcosLab
| | - G Andrés Cisneros
- Department of Chemistry, University of North Texas, Denton, TX, USA. https://twitter.com/CisnerosRes
| | - Linda Chelico
- Department of Biochemistry, Microbiology, and Immunology, College of Medicine, University of Saskatchewan, Saskatoon, Canada.
| |
Collapse
|