1
|
Cea PA, Pérez M, Herrera SM, Muñoz SM, Fuentes-Ugarte N, Coche-Miranda J, Maturana P, Guixé V, Castro-Fernandez V. Deciphering Structural Traits for Thermal and Kinetic Stability across Protein Family Evolution through Ancestral Sequence Reconstruction. Mol Biol Evol 2024; 41:msae127. [PMID: 38913681 PMCID: PMC11229819 DOI: 10.1093/molbev/msae127] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2024] [Revised: 05/17/2024] [Accepted: 06/17/2024] [Indexed: 06/26/2024] Open
Abstract
Natural proteins are frequently marginally stable, and an increase in environmental temperature can easily lead to unfolding. As a result, protein engineering to improve protein stability is an area of intensive research. Nonetheless, since there is usually a high degree of structural homology between proteins from thermophilic organisms and their mesophilic counterparts, the identification of structural determinants for thermoadaptation is challenging. Moreover, in many cases, it has become clear that the success of stabilization strategies is often dependent on the evolutionary history of a protein family. In the last few years, the use of ancestral sequence reconstruction (ASR) as a tool for elucidation of the evolutionary history of functional traits of a protein family has gained strength. Here, we used ASR to trace the evolutionary pathways between mesophilic and thermophilic kinases that participate in the biosynthetic pathway of vitamin B1 in bacteria. By combining biophysics approaches, X-ray crystallography, and molecular dynamics simulations, we found that the thermal stability of these enzymes correlates with their kinetic stability, where the highest thermal/kinetic stability is given by an increase in small hydrophobic amino acids that allow a higher number of interatomic hydrophobic contacts, making this type of interaction the main support for stability in this protein architecture. The results highlight the potential benefits of using ASR to explore the evolutionary history of protein sequence and structure to identify traits responsible for the kinetic and thermal stability of any protein architecture.
Collapse
Affiliation(s)
- Pablo A Cea
- Departamento de Biología, Facultad de Ciencias, Laboratorio de Bioquímica y Biología Molecular, Universidad de Chile, Santiago, Chile
| | - Myriam Pérez
- Departamento de Biología, Facultad de Ciencias, Laboratorio de Bioquímica y Biología Molecular, Universidad de Chile, Santiago, Chile
| | - Sixto M Herrera
- Departamento de Biología, Facultad de Ciencias, Laboratorio de Bioquímica y Biología Molecular, Universidad de Chile, Santiago, Chile
| | - Sebastián M Muñoz
- Departamento de Biología, Facultad de Ciencias, Laboratorio de Bioquímica y Biología Molecular, Universidad de Chile, Santiago, Chile
| | - Nicolás Fuentes-Ugarte
- Departamento de Biología, Facultad de Ciencias, Laboratorio de Bioquímica y Biología Molecular, Universidad de Chile, Santiago, Chile
| | - José Coche-Miranda
- Departamento de Biología, Facultad de Ciencias, Laboratorio de Bioquímica y Biología Molecular, Universidad de Chile, Santiago, Chile
| | - Pablo Maturana
- Departamento de Biología, Facultad de Ciencias, Laboratorio de Bioquímica y Biología Molecular, Universidad de Chile, Santiago, Chile
| | - Victoria Guixé
- Departamento de Biología, Facultad de Ciencias, Laboratorio de Bioquímica y Biología Molecular, Universidad de Chile, Santiago, Chile
| | - Victor Castro-Fernandez
- Departamento de Biología, Facultad de Ciencias, Laboratorio de Bioquímica y Biología Molecular, Universidad de Chile, Santiago, Chile
| |
Collapse
|
2
|
Nixon C, Lim SA, Sternke M, Barrick D, Harms MJ, Marqusee S. The importance of input sequence set to consensus-derived proteins and their relationship to reconstructed ancestral proteins. Protein Sci 2024; 33:e5011. [PMID: 38747388 PMCID: PMC11094778 DOI: 10.1002/pro.5011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2023] [Revised: 04/02/2024] [Accepted: 04/23/2024] [Indexed: 05/19/2024]
Abstract
A protein sequence encodes its energy landscape-all the accessible conformations, energetics, and dynamics. The evolutionary relationship between sequence and landscape can be probed phylogenetically by compiling a multiple sequence alignment of homologous sequences and generating common ancestors via Ancestral Sequence Reconstruction or a consensus protein containing the most common amino acid at each position. Both ancestral and consensus proteins are often more stable than their extant homologs-questioning the differences between them and suggesting that both approaches serve as general methods to engineer thermostability. We used the Ribonuclease H family to compare these approaches and evaluate how the evolutionary relationship of the input sequences affects the properties of the resulting consensus protein. While the consensus protein derived from our full Ribonuclease H sequence alignment is structured and active, it neither shows properties of a well-folded protein nor has enhanced stability. In contrast, the consensus protein derived from a phylogenetically-restricted set of sequences is significantly more stable and cooperatively folded, suggesting that cooperativity may be encoded by different mechanisms in separate clades and lost when too many diverse clades are combined to generate a consensus protein. To explore this, we compared pairwise covariance scores using a Potts formalism as well as higher-order sequence correlations using singular value decomposition (SVD). We find the SVD coordinates of a stable consensus sequence are close to coordinates of the analogous ancestor sequence and its descendants, whereas the unstable consensus sequences are outliers in SVD space.
Collapse
Affiliation(s)
- Charlotte Nixon
- Department of Molecular and Cell BiologyUniversity of California, BerkeleyBerkeleyCaliforniaUSA
| | - Shion A. Lim
- Department of Molecular and Cell BiologyUniversity of California, BerkeleyBerkeleyCaliforniaUSA
| | - Matt Sternke
- The T.C. Jenkins Department of BiophysicsJohns Hopkins UniversityBaltimoreMarylandUSA
| | - Doug Barrick
- The T.C. Jenkins Department of BiophysicsJohns Hopkins UniversityBaltimoreMarylandUSA
| | - Michael J. Harms
- Department of Chemistry and BiochemistryUniversity of OregonEugeneOregonUSA
| | - Susan Marqusee
- Department of Molecular and Cell BiologyUniversity of California, BerkeleyBerkeleyCaliforniaUSA
- Department of ChemistryUniversity of California, BerkeleyBerkeleyCaliforniaUSA
- California Institute for Quantitative Biosciences (QB3)BerkeleyCaliforniaUSA
| |
Collapse
|
3
|
Reddy KD, Rasool B, Akher FB, Kutlešić N, Pant S, Boudker O. Evolutionary analysis reveals the origin of sodium coupling in glutamate transporters. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.12.03.569786. [PMID: 38106174 PMCID: PMC10723334 DOI: 10.1101/2023.12.03.569786] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2023]
Abstract
Secondary active membrane transporters harness the energy of ion gradients to concentrate their substrates. Homologous transporters evolved to couple transport to different ions in response to changing environments and needs. The bases of such diversification, and thus principles of ion coupling, are unexplored. Employing phylogenetics and ancestral protein reconstruction, we investigated sodium-coupled transport in prokaryotic glutamate transporters, a mechanism ubiquitous across life domains and critical to neurotransmitter recycling in humans. We found that the evolutionary transition from sodium-dependent to independent substrate binding to the transporter preceded changes in the coupling mechanism. Structural and functional experiments suggest that the transition entailed allosteric mutations, making sodium binding dispensable without affecting ion-binding sites. Allosteric tuning of transporters' energy landscapes might be a widespread route of their functional diversification.
Collapse
Affiliation(s)
- Krishna D. Reddy
- Dept. of Physiology & Biophysics, Weill Cornell Medical College, 1300 York Ave, New York, NY 10021, USA
| | - Burha Rasool
- Dept. of Physiology & Biophysics, Weill Cornell Medical College, 1300 York Ave, New York, NY 10021, USA
| | - Farideh Badichi Akher
- Dept. of Physiology & Biophysics, Weill Cornell Medical College, 1300 York Ave, New York, NY 10021, USA
| | - Nemanja Kutlešić
- Dept. of Physiology & Biophysics, Weill Cornell Medical College, 1300 York Ave, New York, NY 10021, USA
| | - Swati Pant
- Dept. of Biochemistry, Weill Cornell Medical College, 1300 York Ave, New York, NY 10021, USA
| | - Olga Boudker
- Dept. of Physiology & Biophysics, Weill Cornell Medical College, 1300 York Ave, New York, NY 10021, USA
- Howard Hughes Medical Institute, Weill Cornell Medical College, 1300 York Ave, New York, NY 10021, USA
| |
Collapse
|
4
|
Sennett MA, Theobald DL. Extant Sequence Reconstruction: The Accuracy of Ancestral Sequence Reconstructions Evaluated by Extant Sequence Cross-Validation. J Mol Evol 2024; 92:181-206. [PMID: 38502220 PMCID: PMC10978691 DOI: 10.1007/s00239-024-10162-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2023] [Accepted: 02/20/2024] [Indexed: 03/21/2024]
Abstract
Ancestral sequence reconstruction (ASR) is a phylogenetic method widely used to analyze the properties of ancient biomolecules and to elucidate mechanisms of molecular evolution. Despite its increasingly widespread application, the accuracy of ASR is currently unknown, as it is generally impossible to compare resurrected proteins to the true ancestors. Which evolutionary models are best for ASR? How accurate are the resulting inferences? Here we answer these questions using a cross-validation method to reconstruct each extant sequence in an alignment with ASR methodology, a method we term "extant sequence reconstruction" (ESR). We thus can evaluate the accuracy of ASR methodology by comparing ESR reconstructions to the corresponding known true sequences. We find that a common measure of the quality of a reconstructed sequence, the average probability, is indeed a good estimate of the fraction of correct amino acids when the evolutionary model is accurate or overparameterized. However, the average probability is a poor measure for comparing reconstructions from different models, because, surprisingly, a more accurate phylogenetic model often results in reconstructions with lower probability. While better (more predictive) models may produce reconstructions with lower sequence identity to the true sequences, better models nevertheless produce reconstructions that are more biophysically similar to true ancestors. In addition, we find that a large fraction of sequences sampled from the reconstruction distribution may have fewer errors than the single most probable (SMP) sequence reconstruction, despite the fact that the SMP has the lowest expected error of all possible sequences. Our results emphasize the importance of model selection for ASR and the usefulness of sampling sequence reconstructions for analyzing ancestral protein properties. ESR is a powerful method for validating the evolutionary models used for ASR and can be applied in practice to any phylogenetic analysis of real biological sequences. Most significantly, ESR uses ASR methodology to provide a general method by which the biophysical properties of resurrected proteins can be compared to the properties of the true protein.
Collapse
Affiliation(s)
- Michael A Sennett
- Department of Biochemistry, Brandeis University, Waltham, MA, 02453, USA
| | - Douglas L Theobald
- Department of Biochemistry, Brandeis University, Waltham, MA, 02453, USA.
| |
Collapse
|
5
|
Nam K, Shao Y, Major DT, Wolf-Watz M. Perspectives on Computational Enzyme Modeling: From Mechanisms to Design and Drug Development. ACS OMEGA 2024; 9:7393-7412. [PMID: 38405524 PMCID: PMC10883025 DOI: 10.1021/acsomega.3c09084] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/14/2023] [Revised: 01/15/2024] [Accepted: 01/19/2024] [Indexed: 02/27/2024]
Abstract
Understanding enzyme mechanisms is essential for unraveling the complex molecular machinery of life. In this review, we survey the field of computational enzymology, highlighting key principles governing enzyme mechanisms and discussing ongoing challenges and promising advances. Over the years, computer simulations have become indispensable in the study of enzyme mechanisms, with the integration of experimental and computational exploration now established as a holistic approach to gain deep insights into enzymatic catalysis. Numerous studies have demonstrated the power of computer simulations in characterizing reaction pathways, transition states, substrate selectivity, product distribution, and dynamic conformational changes for various enzymes. Nevertheless, significant challenges remain in investigating the mechanisms of complex multistep reactions, large-scale conformational changes, and allosteric regulation. Beyond mechanistic studies, computational enzyme modeling has emerged as an essential tool for computer-aided enzyme design and the rational discovery of covalent drugs for targeted therapies. Overall, enzyme design/engineering and covalent drug development can greatly benefit from our understanding of the detailed mechanisms of enzymes, such as protein dynamics, entropy contributions, and allostery, as revealed by computational studies. Such a convergence of different research approaches is expected to continue, creating synergies in enzyme research. This review, by outlining the ever-expanding field of enzyme research, aims to provide guidance for future research directions and facilitate new developments in this important and evolving field.
Collapse
Affiliation(s)
- Kwangho Nam
- Department
of Chemistry and Biochemistry, University
of Texas at Arlington, Arlington, Texas 76019, United States
| | - Yihan Shao
- Department
of Chemistry and Biochemistry, University
of Oklahoma, Norman, Oklahoma 73019-5251, United States
| | - Dan T. Major
- Department
of Chemistry and Institute for Nanotechnology & Advanced Materials, Bar-Ilan University, Ramat-Gan 52900, Israel
| | | |
Collapse
|
6
|
Hayes RL, Nixon CF, Marqusee S, Brooks CL. Selection pressures on evolution of ribonuclease H explored with rigorous free-energy-based design. Proc Natl Acad Sci U S A 2024; 121:e2312029121. [PMID: 38194446 PMCID: PMC10801872 DOI: 10.1073/pnas.2312029121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2023] [Accepted: 11/22/2023] [Indexed: 01/11/2024] Open
Abstract
Understanding natural protein evolution and designing novel proteins are motivating interest in development of high-throughput methods to explore large sequence spaces. In this work, we demonstrate the application of multisite λ dynamics (MSλD), a rigorous free energy simulation method, and chemical denaturation experiments to quantify evolutionary selection pressure from sequence-stability relationships and to address questions of design. This study examines a mesophilic phylogenetic clade of ribonuclease H (RNase H), furthering its extensive characterization in earlier studies, focusing on E. coli RNase H (ecRNH) and a more stable consensus sequence (AncCcons) differing at 15 positions. The stabilities of 32,768 chimeras between these two sequences were computed using the MSλD framework. The most stable and least stable chimeras were predicted and tested along with several other sequences, revealing a designed chimera with approximately the same stability increase as AncCcons, but requiring only half the mutations. Comparing the computed stabilities with experiment for 12 sequences reveals a Pearson correlation of 0.86 and root mean squared error of 1.18 kcal/mol, an unprecedented level of accuracy well beyond less rigorous computational design methods. We then quantified selection pressure using a simple evolutionary model in which sequences are selected according to the Boltzmann factor of their stability. Selection temperatures from 110 to 168 K are estimated in three ways by comparing experimental and computational results to evolutionary models. These estimates indicate selection pressure is high, which has implications for evolutionary dynamics and for the accuracy required for design, and suggests accurate high-throughput computational methods like MSλD may enable more effective protein design.
Collapse
Affiliation(s)
- Ryan L. Hayes
- Department of Chemical and Biomolecular Engineering, University of California, Irvine, CA92697
- Department of Chemistry, University of Michigan, Ann Arbor, MI48109
| | - Charlotte F. Nixon
- Department of Molecular and Cell Biology, University of California, Berkeley, CA94720
| | - Susan Marqusee
- Department of Molecular and Cell Biology, University of California, Berkeley, CA94720
- California Institute for Quantitative Biosciences, University of California, Berkeley, CA94720
- Department of Chemistry, University of California, Berkeley, CA94720
| | - Charles L. Brooks
- Department of Chemistry, University of Michigan, Ann Arbor, MI48109
- Biophysics Program, University of Michigan, Ann Arbor, MI48109
| |
Collapse
|
7
|
Nixon C, Lim SA, Sternke M, Barrick D, Harms M, Marqusee S. The importance of input sequence set to consensus-derived proteins and their relationship to reconstructed ancestral proteins. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.06.29.547063. [PMID: 37425932 PMCID: PMC10327145 DOI: 10.1101/2023.06.29.547063] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/11/2023]
Abstract
A protein sequence encodes its energy landscape - all the accessible conformations, energetics, and dynamics. The evolutionary relationship between sequence and landscape can be probed phylogenetically by compiling a multiple sequence alignment of homologous sequences and generating common ancestors via Ancestral Sequence Reconstruction or a consensus protein containing the most common amino acid at each position. Both ancestral and consensus proteins are often more stable than their extant homologs - questioning the differences and suggesting that both approaches serve as general methods to engineer thermostability. We used the Ribonuclease H family to compare these approaches and evaluate how the evolutionary relationship of the input sequences affects the properties of the resulting consensus protein. While the overall consensus protein is structured and active, it neither shows properties of a well-folded protein nor has enhanced stability. In contrast, the consensus protein derived from a phylogenetically-restricted region is significantly more stable and cooperatively folded, suggesting that cooperativity may be encoded by different mechanisms in separate clades and lost when too many diverse clades are combined to generate a consensus protein. To explore this, we compared pairwise covariance scores using a Potts formalism as well as higher-order couplings using singular value decomposition (SVD). We find the SVD coordinates of a stable consensus sequence are close to coordinates of the analogous ancestor sequence and its descendants, whereas the unstable consensus sequences are outliers in SVD space.
Collapse
Affiliation(s)
- Charlotte Nixon
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA 94720
| | - Shion A Lim
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA 94720
| | - Matt Sternke
- The T.C. Jenkins Department of Biophysics, Johns Hopkins University, Baltimore, MD 21218
| | - Doug Barrick
- The T.C. Jenkins Department of Biophysics, Johns Hopkins University, Baltimore, MD 21218
| | - Mike Harms
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, OR 97403
| | - Susan Marqusee
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA 94720
- Department of Chemistry, University of California, Berkeley, Berkeley, CA 94720
- California Institute for Quantitative Biosciences (QB3), Berkeley
| |
Collapse
|
8
|
Cortez LM, Morrison AJ, Garen CR, Patterson S, Uyesugi T, Petrosyan R, Sekar RV, Harms MJ, Woodside MT, Sim VL. Probing the origin of prion protein misfolding via reconstruction of ancestral proteins. Protein Sci 2022; 31:e4477. [PMID: 36254680 PMCID: PMC9667828 DOI: 10.1002/pro.4477] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2022] [Revised: 10/11/2022] [Accepted: 10/13/2022] [Indexed: 12/13/2022]
Abstract
Prion diseases are fatal neurodegenerative diseases caused by pathogenic misfolding of the prion protein, PrP. They are transmissible between hosts, and sometimes between different species, as with transmission of bovine spongiform encephalopathy to humans. Although PrP is found in a wide range of vertebrates, prion diseases are seen only in certain mammals, suggesting that infectious misfolding was a recent evolutionary development. To explore when PrP acquired the ability to misfold infectiously, we reconstructed the sequences of ancestral versions of PrP from the last common primate, primate-rodent, artiodactyl, placental, bird, and amniote. Recombinant ancestral PrPs were then tested for their ability to form β-sheet aggregates, either spontaneously or when seeded with infectious prion strains from human, cervid, or rodent species. The ability to aggregate developed after the oldest ancestor (last common amniote), and aggregation capabilities diverged along evolutionary pathways consistent with modern-day susceptibilities. Ancestral bird PrP could not be seeded with modern-day prions, just as modern-day birds are resistant to prion disease. Computational modeling of structures suggested that differences in helix 2 could account for the resistance of ancestral bird PrP to seeding. Interestingly, ancestral primate PrP could be converted by all prion seeds, including both human and cervid prions, raising the possibility that species descended from an ancestral primate have retained the susceptibility to conversion by cervid prions. More generally, the results suggest that susceptibility to prion disease emerged prior to ~100 million years ago, with placental mammals possibly being generally susceptible to disease.
Collapse
Affiliation(s)
- Leonardo M. Cortez
- Centre for Prions and Protein Folding DiseasesUniversity of AlbertaEdmontonAlbertaCanada
- Division of Neurology, Department of MedicineUniversity of AlbertaEdmontonAlbertaCanada
- Neuroscience and Mental Health InstituteUniversity of AlbertaEdmontonAlbertaCanada
| | - Anneliese J. Morrison
- Institute of Molecular BiologyUniversity of OregonEugeneOregonUSA
- Department of Chemistry and BiochemistryUniversity of OregonEugeneOregonUSA
| | - Craig R. Garen
- Department of PhysicsUniversity of AlbertaEdmontonAlbertaCanada
| | - Sawyer Patterson
- Centre for Prions and Protein Folding DiseasesUniversity of AlbertaEdmontonAlbertaCanada
| | - Toshi Uyesugi
- Department of PhysicsUniversity of AlbertaEdmontonAlbertaCanada
| | - Rafayel Petrosyan
- Department of PhysicsUniversity of AlbertaEdmontonAlbertaCanada
- Present address:
Zaven & Sonia Akian College of Science and EngineeringAmerican University of ArmeniaYerevanArmenia
| | | | - Michael J. Harms
- Institute of Molecular BiologyUniversity of OregonEugeneOregonUSA
- Department of Chemistry and BiochemistryUniversity of OregonEugeneOregonUSA
| | - Michael T. Woodside
- Centre for Prions and Protein Folding DiseasesUniversity of AlbertaEdmontonAlbertaCanada
- Department of PhysicsUniversity of AlbertaEdmontonAlbertaCanada
- Li Ka Shing Institute of VirologyUniversity of AlbertaEdmontonAlbertaCanada
| | - Valerie L. Sim
- Centre for Prions and Protein Folding DiseasesUniversity of AlbertaEdmontonAlbertaCanada
- Division of Neurology, Department of MedicineUniversity of AlbertaEdmontonAlbertaCanada
- Neuroscience and Mental Health InstituteUniversity of AlbertaEdmontonAlbertaCanada
| |
Collapse
|
9
|
Dube N, Khan SH, Okafor CD. Ancestral sequence reconstruction for evolutionary characterization of proteins. Trends Biochem Sci 2021; 47:98-99. [PMID: 34810082 DOI: 10.1016/j.tibs.2021.10.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2021] [Revised: 10/25/2021] [Accepted: 10/28/2021] [Indexed: 11/19/2022]
Affiliation(s)
- Namita Dube
- Department of Biochemistry and Molecular Biology, Pennsylvania State University, University Park, PA 16802, USA
| | - Sabab H Khan
- Department of Biochemistry and Molecular Biology, Pennsylvania State University, University Park, PA 16802, USA
| | - C Denise Okafor
- Department of Biochemistry and Molecular Biology, Pennsylvania State University, University Park, PA 16802, USA.
| |
Collapse
|
10
|
Shrestha S, Clark AC. Evolution of the folding landscape of effector caspases. J Biol Chem 2021; 297:101249. [PMID: 34592312 PMCID: PMC8628267 DOI: 10.1016/j.jbc.2021.101249] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2021] [Revised: 09/22/2021] [Accepted: 09/23/2021] [Indexed: 11/07/2022] Open
Abstract
Caspases are a family of cysteinyl proteases that control programmed cell death and maintain homeostasis in multicellular organisms. The caspase family is an excellent model to study protein evolution because all caspases are produced as zymogens (procaspases [PCPs]) that must be activated to gain full activity; the protein structures are conserved through hundreds of millions of years of evolution; and some allosteric features arose with the early ancestor, whereas others are more recent evolutionary events. The apoptotic caspases evolved from a common ancestor (CA) into two distinct subfamilies: monomers (initiator caspases) or dimers (effector caspases). Differences in activation mechanisms of the two subfamilies, and their oligomeric forms, play a central role in the regulation of apoptosis. Here, we examine changes in the folding landscape by characterizing human effector caspases and their CA. The results show that the effector caspases unfold by a minimum three-state equilibrium model at pH 7.5, where the native dimer is in equilibrium with a partially folded monomeric (PCP-7, CA) or dimeric (PCP-6) intermediate. In comparison, the unfolding pathway of PCP-3 contains both oligomeric forms of the intermediate. Overall, the data show that the folding landscape was first established with the CA and was retained for >650 million years. Partially folded monomeric or dimeric intermediates in the ancestral ensemble provide mechanisms for evolutionary changes that affect stability of extant caspases. The conserved folding landscape allows for the fine-tuning of enzyme stability in a species-dependent manner while retaining the overall caspase–hemoglobinase fold.
Collapse
Affiliation(s)
- Suman Shrestha
- Department of Biology, University of Texas at Arlington, Arlington, Texas, USA
| | - A Clay Clark
- Department of Biology, University of Texas at Arlington, Arlington, Texas, USA.
| |
Collapse
|
11
|
Romero-Romero S, Kordes S, Michel F, Höcker B. Evolution, folding, and design of TIM barrels and related proteins. Curr Opin Struct Biol 2021; 68:94-104. [PMID: 33453500 PMCID: PMC8250049 DOI: 10.1016/j.sbi.2020.12.007] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2020] [Revised: 12/13/2020] [Accepted: 12/14/2020] [Indexed: 12/16/2022]
Abstract
Proteins are chief actors in life that perform a myriad of exquisite functions. This diversity has been enabled through the evolution and diversification of protein folds. Analysis of sequences and structures strongly suggest that numerous protein pieces have been reused as building blocks and propagated to many modern folds. This information can be traced to understand how the protein world has diversified. In this review, we discuss the latest advances in the analysis of protein evolutionary units, and we use as a model system one of the most abundant and versatile topologies, the TIM-barrel fold, to highlight the existing common principles that interconnect protein evolution, structure, folding, function, and design.
Collapse
Affiliation(s)
| | - Sina Kordes
- Department of Biochemistry, University of Bayreuth, 95447 Bayreuth, Germany
| | - Florian Michel
- Department of Biochemistry, University of Bayreuth, 95447 Bayreuth, Germany
| | - Birte Höcker
- Department of Biochemistry, University of Bayreuth, 95447 Bayreuth, Germany.
| |
Collapse
|
12
|
Maciuba K, Rajasekaran N, Chen X, Kaiser CM. Co-translational folding of nascent polypeptides: Multi-layered mechanisms for the efficient biogenesis of functional proteins. Bioessays 2021; 43:e2100042. [PMID: 33987870 PMCID: PMC8262109 DOI: 10.1002/bies.202100042] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2021] [Revised: 04/02/2021] [Accepted: 04/07/2021] [Indexed: 11/09/2022]
Abstract
The coupling of protein synthesis and folding is a crucial yet poorly understood aspect of cellular protein folding. Over the past few years, it has become possible to experimentally follow and define protein folding on the ribosome, revealing principles that shape co-translational folding and distinguish it from refolding in solution. Here, we highlight some of these recent findings from biochemical and biophysical studies and their potential significance for cellular protein biogenesis. In particular, we focus on nascent chain interactions with the ribosome, interactions within the nascent protein, modulation of translation elongation rates, and the role of mechanical force that accompanies nascent protein folding. The ability to obtain mechanistic insight in molecular detail has set the stage for exploring the intricate process of nascent protein folding. We believe that the aspects discussed here will be generally important for understanding how protein synthesis and folding are coupled and regulated.
Collapse
Affiliation(s)
- Kevin Maciuba
- CMDB Graduate Program, Johns Hopkins University, Baltimore, Maryland, USA
| | | | - Xiuqi Chen
- CMDB Graduate Program, Johns Hopkins University, Baltimore, Maryland, USA
| | - Christian M Kaiser
- Department of Biology, Johns Hopkins University, Baltimore, Maryland, USA.,Department of Biophysics, Johns Hopkins University, Baltimore, Maryland, USA
| |
Collapse
|
13
|
Razban RM, Dasmeh P, Serohijos AWR, Shakhnovich EI. Avoidance of protein unfolding constrains protein stability in long-term evolution. Biophys J 2021; 120:2413-2424. [PMID: 33932438 PMCID: PMC8390877 DOI: 10.1016/j.bpj.2021.03.042] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2020] [Revised: 02/24/2021] [Accepted: 03/17/2021] [Indexed: 11/28/2022] Open
Abstract
Every amino acid residue can influence a protein's overall stability, making stability highly susceptible to change throughout evolution. We consider the distribution of protein stabilities evolutionarily permittable under two previously reported protein fitness functions: flux dynamics and misfolding avoidance. We develop an evolutionary dynamics theory and find that it agrees better with an extensive protein stability data set for dihydrofolate reductase orthologs under the misfolding avoidance fitness function rather than the flux dynamics fitness function. Further investigation with ribonuclease H data demonstrates that not any misfolded state is avoided; rather, it is only the unfolded state. At the end, we discuss how our work pertains to the universal protein abundance-evolutionary rate correlation seen across organisms' proteomes. We derive a closed-form expression relating protein abundance to evolutionary rate that captures Escherichia coli, Saccharomyces cerevisiae, and Homo sapiens experimental trends without fitted parameters.
Collapse
Affiliation(s)
- Rostam M Razban
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts
| | - Pouria Dasmeh
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts; Departement de Biochimie, Université de Montréal, Montreal, Quebec, Canada
| | | | - Eugene I Shakhnovich
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts.
| |
Collapse
|
14
|
Zou T, Woodrum BW, Halloran N, Campitelli P, Bobkov AA, Ghirlanda G, Ozkan SB. Local Interactions That Contribute Minimal Frustration Determine Foldability. J Phys Chem B 2021; 125:2617-2626. [PMID: 33687216 DOI: 10.1021/acs.jpcb.1c00364] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Earlier experiments suggest that the evolutionary information (conservation and coevolution) encoded in protein sequences is necessary and sufficient to specify the fold of a protein family. However, there is no computational work to quantify the effect of such evolutionary information on the folding process. Here we explore the role of early folding steps for sequences designed using coevolution and conservation through a combination of computational and experimental methods. We simulated a repertoire of native and designed WW domain sequences to analyze early local contact formation and found that the N-terminal β-hairpin turn would not form correctly due to strong non-native local contacts in unfoldable sequences. Through a maximum likelihood approach, we identified five local contacts that play a critical role in folding, suggesting that a small subset of amino acid pairs can be used to solve the "needle in the haystack" problem to design foldable sequences. Thus, using the contact probability of those five local contacts that form during the early stage of folding, we built a classification model that predicts the foldability of a WW sequence with 81% accuracy. This classification model was used to redesign WW domain sequences that could not fold due to frustration and make them foldable by introducing a few mutations that led to the stabilization of these critical local contacts. The experimental analysis shows that a redesigned sequence folds and binds to polyproline peptides with a similar affinity as those observed for native WW domains. Overall, our analysis shows that evolutionary-designed sequences should not only satisfy the folding stability but also ensure a minimally frustrated folding landscape.
Collapse
Affiliation(s)
- Taisong Zou
- Department of Physics and Center for Biological Physics, Arizona State University, Tempe, Arizona 85287, United States
| | - Brian W Woodrum
- School of Molecular Sciences, Arizona State University, Tempe, Arizona 85287, United States
| | - Nicholas Halloran
- School of Molecular Sciences, Arizona State University, Tempe, Arizona 85287, United States
| | - Paul Campitelli
- Department of Physics and Center for Biological Physics, Arizona State University, Tempe, Arizona 85287, United States
| | - Andrey A Bobkov
- Conrad Prebys Center for Chemical Genomics, Sanford Burnham Prebys Medical Discovery Institute, La Jolla, California 92037, United States
| | - Giovanna Ghirlanda
- School of Molecular Sciences, Arizona State University, Tempe, Arizona 85287, United States
| | - Sefika Banu Ozkan
- Department of Physics and Center for Biological Physics, Arizona State University, Tempe, Arizona 85287, United States
| |
Collapse
|
15
|
Shimko TC, Fordyce PM, Orenstein Y. DeCoDe: degenerate codon design for complete protein-coding DNA libraries. Bioinformatics 2020; 36:3357-3364. [PMID: 32176271 DOI: 10.1093/bioinformatics/btaa162] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2019] [Revised: 02/13/2020] [Accepted: 03/13/2020] [Indexed: 11/12/2022] Open
Abstract
MOTIVATION High-throughput protein screening is a critical technique for dissecting and designing protein function. Libraries for these assays can be created through a number of means, including targeted or random mutagenesis of a template protein sequence or direct DNA synthesis. However, mutagenic library construction methods often yield vastly more nonfunctional than functional variants and, despite advances in large-scale DNA synthesis, individual synthesis of each desired DNA template is often prohibitively expensive. Consequently, many protein-screening libraries rely on the use of degenerate codons (DCs), mixtures of DNA bases incorporated at specific positions during DNA synthesis, to generate highly diverse protein-variant pools from only a few low-cost synthesis reactions. However, selecting DCs for sets of sequences that covary at multiple positions dramatically increases the difficulty of designing a DC library and leads to the creation of many undesired variants that can quickly outstrip screening capacity. RESULTS We introduce a novel algorithm for total DC library optimization, degenerate codon design (DeCoDe), based on integer linear programming. DeCoDe significantly outperforms state-of-the-art DC optimization algorithms and scales well to more than a hundred proteins sharing complex patterns of covariation (e.g. the lab-derived avGFP lineage). Moreover, DeCoDe is, to our knowledge, the first DC design algorithm with the capability to encode mixed-length protein libraries. We anticipate DeCoDe to be broadly useful for a variety of library generation problems, ranging from protein engineering attempts that leverage mutual information to the reconstruction of ancestral protein states. AVAILABILITY AND IMPLEMENTATION github.com/OrensteinLab/DeCoDe. CONTACT yaronore@bgu.ac.il. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
| | - Polly M Fordyce
- Department of Genetics
- Department of Bioengineering
- Stanford ChEM-H, Stanford University, Stanford, CA 94305, USA
- Chan Zuckerberg Biohub, San Francisco, CA 94158, USA
| | - Yaron Orenstein
- School of Electrical and Computer Engineering, Ben-Gurion University of the Negev, Beer-Sheva 8410501, Israel
| |
Collapse
|
16
|
Addabbo RM, Dalphin MD, Mecha MF, Liu Y, Staikos A, Guzman-Luna V, Cavagnero S. Complementary Role of Co- and Post-Translational Events in De Novo Protein Biogenesis. J Phys Chem B 2020; 124:6488-6507. [DOI: 10.1021/acs.jpcb.0c03039] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Affiliation(s)
- Rayna M. Addabbo
- Biophysics Graduate Degree Program, University of Wisconsin—Madison, Madison, Wisconsin 53706, United States
| | - Matthew D. Dalphin
- Biophysics Graduate Degree Program, University of Wisconsin—Madison, Madison, Wisconsin 53706, United States
| | - Miranda F. Mecha
- Biophysics Graduate Degree Program, University of Wisconsin—Madison, Madison, Wisconsin 53706, United States
| | - Yue Liu
- Department of Chemistry, University of Wisconsin—Madison, Madison, Wisconsin 53706, United States
| | - Alexios Staikos
- Department of Chemistry, University of Wisconsin—Madison, Madison, Wisconsin 53706, United States
| | - Valeria Guzman-Luna
- Department of Chemistry, University of Wisconsin—Madison, Madison, Wisconsin 53706, United States
| | - Silvia Cavagnero
- Biophysics Graduate Degree Program, University of Wisconsin—Madison, Madison, Wisconsin 53706, United States
- Department of Chemistry, University of Wisconsin—Madison, Madison, Wisconsin 53706, United States
| |
Collapse
|
17
|
Nandi T, Desai A, Ainavarapu SRK. The unfolding transition state of ubiquitin with charged residues has higher energy than that with hydrophobic residues. Phys Chem Chem Phys 2020; 22:23158-23168. [DOI: 10.1039/d0cp03876h] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
The native-state structure and folding pathways of a protein are encoded in its amino acid sequence.
Collapse
Affiliation(s)
- Tathagata Nandi
- Department of Chemical Sciences
- Tata Institute of Fundamental Research
- Mumbai 400005
- India
| | - Amogh Desai
- Department of Chemical Sciences
- Tata Institute of Fundamental Research
- Mumbai 400005
- India
| | | |
Collapse
|
18
|
Norbiato F, Seno F, Trovato A, Baiesi M. Folding Rate Optimization Promotes Frustrated Interactions in Entangled Protein Structures. Int J Mol Sci 2019; 21:ijms21010213. [PMID: 31892272 PMCID: PMC6981561 DOI: 10.3390/ijms21010213] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2019] [Revised: 12/16/2019] [Accepted: 12/25/2019] [Indexed: 12/15/2022] Open
Abstract
Many native structures of proteins accomodate complex topological motifs such as knots, lassos, and other geometrical entanglements. How proteins can fold quickly even in the presence of such topological obstacles is a debated question in structural biology. Recently, the hypothesis that energetic frustration might be a mechanism to avoid topological frustration has been put forward based on the empirical observation that loops involved in entanglements are stabilized by weak interactions between amino-acids at their extrema. To verify this idea, we use a toy lattice model for the folding of proteins into two almost identical structures, one entangled and one not. As expected, the folding time is longer when random sequences folds into the entangled structure. This holds also under an evolutionary pressure simulated by optimizing the folding time. It turns out that optmized protein sequences in the entangled structure are in fact characterized by frustrated interactions at the closures of entangled loops. This phenomenon is much less enhanced in the control case where the entanglement is not present. Our findings, which are in agreement with experimental observations, corroborate the idea that an evolutionary pressure shapes the folding funnel to avoid topological and kinetic traps.
Collapse
Affiliation(s)
- Federico Norbiato
- Department of Physics and Astronomy, University of Padova, Via Marzolo 8, I-35131 Padova, Italy; (F.N.); (F.S.); (A.T.)
| | - Flavio Seno
- Department of Physics and Astronomy, University of Padova, Via Marzolo 8, I-35131 Padova, Italy; (F.N.); (F.S.); (A.T.)
- INFN, Sezione di Padova, Via Marzolo 8, I-35131 Padova, Italy
| | - Antonio Trovato
- Department of Physics and Astronomy, University of Padova, Via Marzolo 8, I-35131 Padova, Italy; (F.N.); (F.S.); (A.T.)
- INFN, Sezione di Padova, Via Marzolo 8, I-35131 Padova, Italy
| | - Marco Baiesi
- Department of Physics and Astronomy, University of Padova, Via Marzolo 8, I-35131 Padova, Italy; (F.N.); (F.S.); (A.T.)
- INFN, Sezione di Padova, Via Marzolo 8, I-35131 Padova, Italy
- Correspondence:
| |
Collapse
|
19
|
Ding X, Zou Z, Brooks Iii CL. Deciphering protein evolution and fitness landscapes with latent space models. Nat Commun 2019; 10:5644. [PMID: 31822668 PMCID: PMC6904478 DOI: 10.1038/s41467-019-13633-0] [Citation(s) in RCA: 40] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2019] [Accepted: 11/12/2019] [Indexed: 12/03/2022] Open
Abstract
Protein sequences contain rich information about protein evolution, fitness landscapes, and stability. Here we investigate how latent space models trained using variational auto-encoders can infer these properties from sequences. Using both simulated and real sequences, we show that the low dimensional latent space representation of sequences, calculated using the encoder model, captures both evolutionary and ancestral relationships between sequences. Together with experimental fitness data and Gaussian process regression, the latent space representation also enables learning the protein fitness landscape in a continuous low dimensional space. Moreover, the model is also useful in predicting protein mutational stability landscapes and quantifying the importance of stability in shaping protein evolution. Overall, we illustrate that the latent space models learned using variational auto-encoders provide a mechanism for exploration of the rich data contained in protein sequences regarding evolution, fitness and stability and hence are well-suited to help guide protein engineering efforts.
Collapse
Affiliation(s)
- Xinqiang Ding
- Department of Computational Medicine & Bioinformatics, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Zhengting Zou
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Charles L Brooks Iii
- Department of Computational Medicine & Bioinformatics, University of Michigan, Ann Arbor, MI, 48109, USA.
- Department of Chemistry, University of Michigan, Ann Arbor, MI, 48109, USA.
- Biophysics Program, University of Michigan, Ann Arbor, MI, 48109, USA.
| |
Collapse
|
20
|
Nandi T, Yadav A, Ainavarapu SRK. Experimental comparison of energy landscape features of ubiquitin family proteins. Proteins 2019; 88:449-461. [DOI: 10.1002/prot.25822] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2019] [Revised: 09/05/2019] [Accepted: 09/18/2019] [Indexed: 11/09/2022]
Affiliation(s)
- Tathagata Nandi
- Department of Chemical SciencesTata Institute of Fundamental Research Mumbai India
| | - Anju Yadav
- Department of Chemical SciencesTata Institute of Fundamental Research Mumbai India
| | | |
Collapse
|
21
|
Guin D, Gruebele M. Weak Chemical Interactions That Drive Protein Evolution: Crowding, Sticking, and Quinary Structure in Folding and Function. Chem Rev 2019; 119:10691-10717. [PMID: 31356058 DOI: 10.1021/acs.chemrev.8b00753] [Citation(s) in RCA: 77] [Impact Index Per Article: 15.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
Abstract
In recent years, better instrumentation and greater computing power have enabled the imaging of elusive biomolecule dynamics in cells, driving many advances in understanding the chemical organization of biological systems. The focus of this Review is on interactions in the cell that affect both biomolecular stability and function and modulate them. The same protein or nucleic acid can behave differently depending on the time in the cell cycle, the location in a specific compartment, or the stresses acting on the cell. We describe in detail the crowding, sticking, and quinary structure in the cell and the current methods to quantify them both in vitro and in vivo. Finally, we discuss protein evolution in the cell in light of current biophysical evidence. We describe the factors that drive protein evolution and shape protein interaction networks. These interactions can significantly affect the free energy, ΔG, of marginally stable and low-population proteins and, due to epistasis, direct the evolutionary pathways in an organism. We finally conclude by providing an outlook on experiments to come and the possibility of collaborative evolutionary biology and biophysical efforts.
Collapse
Affiliation(s)
- Drishti Guin
- Department of Chemistry , University of Illinois , Urbana , Illinois 61801 , United States
| | - Martin Gruebele
- Department of Chemistry , University of Illinois , Urbana , Illinois 61801 , United States.,Department of Physics , University of Illinois , Urbana , Illinois 61801 , United States.,Center for Biophysics and Quantitative Biology , University of Illinois , Urbana , Illinois 61801 , United States
| |
Collapse
|
22
|
Qiu X, Duvvuri VR, Bahl J. Computational Approaches and Challenges to Developing Universal Influenza Vaccines. Vaccines (Basel) 2019; 7:E45. [PMID: 31141933 PMCID: PMC6631137 DOI: 10.3390/vaccines7020045] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2019] [Revised: 05/15/2019] [Accepted: 05/23/2019] [Indexed: 12/25/2022] Open
Abstract
The traditional design of effective vaccines for rapidly-evolving pathogens, such as influenza A virus, has failed to provide broad spectrum and long-lasting protection. With low cost whole genome sequencing technology and powerful computing capabilities, novel computational approaches have demonstrated the potential to facilitate the design of a universal influenza vaccine. However, few studies have integrated computational optimization in the design and discovery of new vaccines. Understanding the potential of computational vaccine design is necessary before these approaches can be implemented on a broad scale. This review summarizes some promising computational approaches under current development, including computationally optimized broadly reactive antigens with consensus sequences, phylogenetic model-based ancestral sequence reconstruction, and immunomics to compute conserved cross-reactive T-cell epitopes. Interactions between virus-host-environment determine the evolvability of the influenza population. We propose that with the development of novel technologies that allow the integration of data sources such as protein structural modeling, host antibody repertoire analysis and advanced phylodynamic modeling, computational approaches will be crucial for the development of a long-lasting universal influenza vaccine. Taken together, computational approaches are powerful and promising tools for the development of a universal influenza vaccine with durable and broad protection.
Collapse
Affiliation(s)
- Xueting Qiu
- Center for Ecology of Infectious Diseases, Department of Infectious Diseases, College of Veterinary Medicine, University of Georgia, Athens, GA 30602, USA.
| | - Venkata R Duvvuri
- Center for Ecology of Infectious Diseases, Department of Infectious Diseases, College of Veterinary Medicine, University of Georgia, Athens, GA 30602, USA.
| | - Justin Bahl
- Center for Ecology of Infectious Diseases, Department of Infectious Diseases, College of Veterinary Medicine, University of Georgia, Athens, GA 30602, USA.
- Department of Epidemiology and Biostatistics, College of Public Health, University of Georgia, Athens, GA 30606, USA.
- Duke-NUS Graduate Medical School, Singapore 169857, Singapore.
| |
Collapse
|
23
|
Socha RD, Chen J, Tokuriki N. The Molecular Mechanisms Underlying Hidden Phenotypic Variation among Metallo-β-Lactamases. J Mol Biol 2019; 431:1172-1185. [PMID: 30769117 DOI: 10.1016/j.jmb.2019.01.041] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2018] [Revised: 01/31/2019] [Accepted: 01/31/2019] [Indexed: 12/31/2022]
Abstract
Genetic variation among orthologous genes has been largely formed through neutral genetic drift while maintaining the functional role of these genes. However, because the evolution of gene occurs in the context of each host organism, their sequence changes are also associated with adaptation to a specific environment. Thus, genetic variation can create critical phenotypic variation, particularly when genes are transferred to a new host by horizontal gene transfer. Unveiling "hidden phenotypic variation" is particularly important for genes that confer resistance to antibiotics. However, our understanding of the molecular mechanisms that underlie phenotypic variation remains limited. Here we sought to determine the extent of phenotypic variation in the B1 metallo-β-lactamase (MBL) family and its molecular basis by systematically characterizing eight MBL orthologs, including NDM-1 and VIM-2 and IMP-1. We found that these MBLs confer diverse levels of resistance. The phenotypic variation cannot be explained by variation in catalytic efficiency alone; rather, it is the combination of the catalytic efficiency and abundance of functional periplasmic enzyme that best predicts the observed variation in resistance. The level of functional periplasmic expression varied dramatically between MBL orthologs. This was the result of changes at multiple levels of each ortholog's: (1) quantity of mRNA, (2) amount of MBL expressed, and (3) efficacy of functional enzyme translocation to the periplasm. Overall, it is the interaction between each gene and the host's underlying cellular processes (transcription, translation, and translocation) that determines MBL genetic incompatibility through horizontal gene transfer. These host-specific processes may constrain the effective spread and deployment of MBLs to certain host species and could explain the current observed distribution bias.
Collapse
Affiliation(s)
- Raymond D Socha
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC V6T 1Z4, Canada
| | - John Chen
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC V6T 1Z4, Canada
| | - Nobuhiko Tokuriki
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC V6T 1Z4, Canada.
| |
Collapse
|
24
|
Baier F, Hong N, Yang G, Pabis A, Miton CM, Barrozo A, Carr PD, Kamerlin SC, Jackson CJ, Tokuriki N. Cryptic genetic variation shapes the adaptive evolutionary potential of enzymes. eLife 2019; 8:40789. [PMID: 30719972 PMCID: PMC6372284 DOI: 10.7554/elife.40789] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2018] [Accepted: 01/22/2019] [Indexed: 12/21/2022] Open
Abstract
Genetic variation among orthologous proteins can cause cryptic phenotypic properties that only manifest in changing environments. Such variation may impact the evolvability of proteins, but the underlying molecular basis remains unclear. Here, we performed comparative directed evolution of four orthologous metallo-β-lactamases toward a new function and found that different starting genotypes evolved to distinct evolutionary outcomes. Despite a low initial fitness, one ortholog reached a significantly higher fitness plateau than its counterparts, via increasing catalytic activity. By contrast, the ortholog with the highest initial activity evolved to a less-optimal and phenotypically distinct outcome through changes in expression, oligomerization and activity. We show how cryptic molecular properties and conformational variation of active site residues in the initial genotypes cause epistasis, that could lead to distinct evolutionary outcomes. Our work highlights the importance of understanding the molecular details that connect genetic variation to protein function to improve the prediction of protein evolution.
Collapse
Affiliation(s)
- Florian Baier
- Michael Smith Laboratory, University of British Columbia, Vancouver, Canada
| | - Nansook Hong
- Research School of Chemistry, Australian National University, Canberra, Australia
| | - Gloria Yang
- Michael Smith Laboratory, University of British Columbia, Vancouver, Canada
| | - Anna Pabis
- Department of Cell and Molecular Biology, Uppsala University, Uppsala, Sweden
| | - Charlotte M Miton
- Michael Smith Laboratory, University of British Columbia, Vancouver, Canada
| | - Alexandre Barrozo
- Department of Cell and Molecular Biology, Uppsala University, Uppsala, Sweden
| | - Paul D Carr
- Research School of Chemistry, Australian National University, Canberra, Australia
| | - Shina Cl Kamerlin
- Department of Cell and Molecular Biology, Uppsala University, Uppsala, Sweden
| | - Colin J Jackson
- Research School of Chemistry, Australian National University, Canberra, Australia
| | - Nobuhiko Tokuriki
- Michael Smith Laboratory, University of British Columbia, Vancouver, Canada
| |
Collapse
|
25
|
Yan Z, Wang J. Superfunneled Energy Landscape of Protein Evolution Unifies the Principles of Protein Evolution, Folding, and Design. PHYSICAL REVIEW LETTERS 2019; 122:018103. [PMID: 31012725 DOI: 10.1103/physrevlett.122.018103] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/06/2017] [Revised: 11/08/2018] [Indexed: 06/09/2023]
Abstract
Evolution is essential for shaping the biological functions. Darwin proposed the selection as the driving force for evolution upon mutations. While mutations are clear, the quantification of the selection force is still challenging. In this study, we identified and quantified both thermodynamic stability and kinetic accessibility as the selection forces for protein evolution. The protein evolution can be viewed and quantified as a trajectory moving along a superfunneled energy landscape with a line attractor at the bottom. The resulting evolved sequences and structures show strong protein characteristics including the hydrophobic core, high designability, and fast folding. The evolution principle uncovered here is validated on real proteins and sheds light on the protein design.
Collapse
Affiliation(s)
- Zhiqiang Yan
- State Key Laboratory of Electroanalytical Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences, Changchun, Jilin 130022, China
| | - Jin Wang
- State Key Laboratory of Electroanalytical Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences, Changchun, Jilin 130022, China
- Department of Chemistry & Physics, State University of New York at Stony Brook, Stony Brook, New York 11790, USA
| |
Collapse
|
26
|
Jemth P, Karlsson E, Vögeli B, Guzovsky B, Andersson E, Hultqvist G, Dogan J, Güntert P, Riek R, Chi CN. Structure and dynamics conspire in the evolution of affinity between intrinsically disordered proteins. SCIENCE ADVANCES 2018; 4:eaau4130. [PMID: 30397651 PMCID: PMC6200366 DOI: 10.1126/sciadv.aau4130] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/07/2018] [Accepted: 09/13/2018] [Indexed: 05/10/2023]
Abstract
In every established species, protein-protein interactions have evolved such that they are fit for purpose. However, the molecular details of the evolution of new protein-protein interactions are poorly understood. We have used nuclear magnetic resonance spectroscopy to investigate the changes in structure and dynamics during the evolution of a protein-protein interaction involving the intrinsically disordered CREBBP (CREB-binding protein) interaction domain (CID) and nuclear coactivator binding domain (NCBD) from the transcriptional coregulators NCOA (nuclear receptor coactivator) and CREBBP/p300, respectively. The most ancient low-affinity "Cambrian-like" [540 to 600 million years (Ma) ago] CID/NCBD complex contained less secondary structure and was more dynamic than the complexes from an evolutionarily younger "Ordovician-Silurian" fish ancestor (ca. 440 Ma ago) and extant human. The most ancient Cambrian-like CID/NCBD complex lacked one helix and several interdomain interactions, resulting in a larger solvent-accessible surface area. Furthermore, the most ancient complex had a high degree of millisecond-to-microsecond dynamics distributed along the entire sequences of both CID and NCBD. These motions were reduced in the Ordovician-Silurian CID/NCBD complex and further redistributed in the extant human CID/NCBD complex. Isothermal calorimetry experiments show that complex formation is enthalpically favorable and that affinity is modulated by a largely unfavorable entropic contribution to binding. Our data demonstrate how changes in structure and motion conspire to shape affinity during the evolution of a protein-protein complex and provide direct evidence for the role of structural, dynamic, and frustrational plasticity in the evolution of interactions between intrinsically disordered proteins.
Collapse
Affiliation(s)
- Per Jemth
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC Box 582, SE-75123 Uppsala, Sweden
- Corresponding author. (C.N.C.); (P.J.)
| | - Elin Karlsson
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC Box 582, SE-75123 Uppsala, Sweden
| | - Beat Vögeli
- Department of Biochemistry and Molecular Genetics, University of Colorado at Denver, 12801 East 17th Avenue, Aurora, CO 80045, USA
| | - Brenda Guzovsky
- Protein Physiology Lab, FCEyN-Universidad de Buenos Aires, IQUIBICEN/CONICET, Intendente Güiraldes 2160, Ciudad Universitaria, C1428EGA Buenos Aires, Argentina
| | - Eva Andersson
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC Box 582, SE-75123 Uppsala, Sweden
| | - Greta Hultqvist
- Department of Pharmaceutical Biosciences, Uppsala University, BMC Box 591, SE-75124 Uppsala, Sweden
| | - Jakob Dogan
- Department of Biochemistry and Biophysics, Stockholm University, SE-10691 Stockholm, Sweden
| | - Peter Güntert
- Laboratory of Physical Chemistry, ETH Zürich, ETH-Hönggerberg, Zürich, Switzerland
- Institute of Biophysical Chemistry, Center for Biomolecular Magnetic Resonance, Goethe University, 60438 Frankfurt am Main, Germany
- Graduate School of Science, Tokyo Metropolitan University, Tokyo 192-0397, Japan
| | - Roland Riek
- Laboratory of Physical Chemistry, ETH Zürich, ETH-Hönggerberg, Zürich, Switzerland
| | - Celestine N. Chi
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC Box 582, SE-75123 Uppsala, Sweden
- Corresponding author. (C.N.C.); (P.J.)
| |
Collapse
|
27
|
Lim SA, Bolin ER, Marqusee S. Tracing a protein's folding pathway over evolutionary time using ancestral sequence reconstruction and hydrogen exchange. eLife 2018; 7:38369. [PMID: 30204082 PMCID: PMC6158009 DOI: 10.7554/elife.38369] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2018] [Accepted: 09/09/2018] [Indexed: 12/15/2022] Open
Abstract
The conformations populated during protein folding have been studied for decades; yet, their evolutionary importance remains largely unexplored. Ancestral sequence reconstruction allows access to proteins across evolutionary time, and new methods such as pulsed-labeling hydrogen exchange coupled with mass spectrometry allow determination of folding intermediate structures at near amino-acid resolution. Here, we combine these techniques to monitor the folding of the ribonuclease H family along the evolutionary lineages of T. thermophilus and E. coli RNase H. All homologs and ancestral proteins studied populate a similar folding intermediate despite being separated by billions of years of evolution. Even though this conformation is conserved, the pathway leading to it has diverged over evolutionary time, and rational mutations can alter this trajectory. Our results demonstrate that evolutionary processes can affect the energy landscape to preserve or alter specific features of a protein’s folding pathway.
Collapse
Affiliation(s)
- Shion An Lim
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, United States.,Institute for Quantitative Biosciences, University of California, Berkeley, Berkeley, United States
| | - Eric Richard Bolin
- Institute for Quantitative Biosciences, University of California, Berkeley, Berkeley, United States.,Biophysics Graduate Program, University of California, Berkeley, Berkeley, United States
| | - Susan Marqusee
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, United States.,Institute for Quantitative Biosciences, University of California, Berkeley, Berkeley, United States.,Department of Chemistry, University of California, Berkeley, Berkeley, United States.,Chan Zuckerberg Biohub, San Francisco, United States
| |
Collapse
|
28
|
Beerens K, Mazurenko S, Kunka A, Marques SM, Hansen N, Musil M, Chaloupkova R, Waterman J, Brezovsky J, Bednar D, Prokop Z, Damborsky J. Evolutionary Analysis As a Powerful Complement to Energy Calculations for Protein Stabilization. ACS Catal 2018. [DOI: 10.1021/acscatal.8b01677] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Affiliation(s)
- Koen Beerens
- Loschmidt Laboratories, Department of Experimental Biology and Research Centre for Toxic Compounds in the Environment RECETOX, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic
| | - Stanislav Mazurenko
- Loschmidt Laboratories, Department of Experimental Biology and Research Centre for Toxic Compounds in the Environment RECETOX, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic
| | - Antonin Kunka
- Loschmidt Laboratories, Department of Experimental Biology and Research Centre for Toxic Compounds in the Environment RECETOX, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic
| | - Sergio M. Marques
- Loschmidt Laboratories, Department of Experimental Biology and Research Centre for Toxic Compounds in the Environment RECETOX, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic
- International Clinical Research Center, St. Anne’s University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| | - Niels Hansen
- Institute of Thermodynamics and Thermal Process Engineering, University of Stuttgart, D-70569 Stuttgart, Germany
| | - Milos Musil
- Loschmidt Laboratories, Department of Experimental Biology and Research Centre for Toxic Compounds in the Environment RECETOX, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic
- Department of Information Systems, Faculty of Information Technology, Brno University of Technology, 612 66 Brno, Czech Republic
| | - Radka Chaloupkova
- Loschmidt Laboratories, Department of Experimental Biology and Research Centre for Toxic Compounds in the Environment RECETOX, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic
- International Clinical Research Center, St. Anne’s University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| | - Jitka Waterman
- Diamond Light Source, Harwell Science and Innovation Campus, Didcot OX11 0DE, United Kingdom
| | - Jan Brezovsky
- Loschmidt Laboratories, Department of Experimental Biology and Research Centre for Toxic Compounds in the Environment RECETOX, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic
- International Clinical Research Center, St. Anne’s University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| | - David Bednar
- Loschmidt Laboratories, Department of Experimental Biology and Research Centre for Toxic Compounds in the Environment RECETOX, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic
- International Clinical Research Center, St. Anne’s University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| | - Zbynek Prokop
- Loschmidt Laboratories, Department of Experimental Biology and Research Centre for Toxic Compounds in the Environment RECETOX, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic
- International Clinical Research Center, St. Anne’s University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| | - Jiri Damborsky
- Loschmidt Laboratories, Department of Experimental Biology and Research Centre for Toxic Compounds in the Environment RECETOX, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic
- International Clinical Research Center, St. Anne’s University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| |
Collapse
|
29
|
Lim SA, Marqusee S. The burst-phase folding intermediate of ribonuclease H changes conformation over evolutionary history. Biopolymers 2018; 109:e23086. [PMID: 29152711 PMCID: PMC6047922 DOI: 10.1002/bip.23086] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2017] [Revised: 10/23/2017] [Accepted: 10/30/2017] [Indexed: 11/06/2022]
Abstract
The amino acid sequence encodes the energy landscape of a protein. Therefore, we expect evolutionary mutations to change features of the protein energy landscape, including the conformations adopted by a polypeptide as it folds to its native state. Ribonucleases H (RNase H) from Escherichia coli and Thermus thermophilus both fold via a partially folded intermediate in which the core region of the protein (helices A-D and strands 4-5) is structured. Strand 1, however, uniquely contributes to the T. thermophilus RNase H folding intermediate (Icore+1 ), but not the E. coli RNase H intermediate (Icore ) (Rosen & Marqusee, PLoS One 2015). We explore the origin of this difference by characterizing the folding intermediate of seven ancestral RNases H spanning the evolutionary history of these two homologs. Using fragment models with or without strand 1 and FRET probes to characterize the folding intermediate of each ancestor, we find a distinct evolutionary trend across the family-the involvement of strand 1 in the folding intermediate is an ancestral feature that is maintained in the thermophilic lineage and is gradually lost in the mesophilic lineage. Evolutionary sequence changes indeed modulate the conformations present on the folding landscape and altered the folding trajectory of RNase H.
Collapse
Affiliation(s)
- Shion An Lim
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, United States
- Institute for Quantitative Biosciences (QB3), University of California, Berkeley, Berkeley, CA, United States
| | - Susan Marqusee
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, United States
- Institute for Quantitative Biosciences (QB3), University of California, Berkeley, Berkeley, CA, United States
| |
Collapse
|
30
|
Chaturvedi D, Mahalakshmi R. Position-Specific contribution of interface tryptophans on membrane protein energetics. BIOCHIMICA ET BIOPHYSICA ACTA-BIOMEMBRANES 2017; 1860:451-457. [PMID: 29128310 DOI: 10.1016/j.bbamem.2017.11.003] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/14/2017] [Revised: 10/11/2017] [Accepted: 11/07/2017] [Indexed: 02/06/2023]
Abstract
Interface tryptophans are key residues that facilitate the folding and stability of membrane proteins. Escherichia coli OmpX possesses two unique interface tryptophans, namely Trp76, which is present at the interface and is solvent-exposed, and Trp140, which is relatively more lipid solvated than Trp76 in symmetric lipid membranes. Here, we address the requirement for tryptophan and the consequences of aromatic amino acid substitutions on the folding and stability of OmpX. Using spectroscopic measurements of OmpX-Trp/Tyr/Phe mutants, we show that the specific mutation W76→Y allows barrel assembly >1.5-fold faster than native OmpX, and increases stability by ~0.4kcalmol-1. In contrast, mutating W140→F/Y lowers OmpX thermodynamic stability by ~0.4kcalmol-1, without affecting the folding kinetics. We conclude that the stabilizing effect of tryptophan at the membrane interface can be position-and local environment-specific. We propose that the thermodynamic contributions for interface residues be interpreted with caution.
Collapse
Affiliation(s)
- Deepti Chaturvedi
- Molecular Biophysics Laboratory, Department of Biological Sciences, Indian Institute of Science Education and Research, Bhopal 462066, India
| | - Radhakrishnan Mahalakshmi
- Molecular Biophysics Laboratory, Department of Biological Sciences, Indian Institute of Science Education and Research, Bhopal 462066, India.
| |
Collapse
|