1
|
Yehorova D, Crean RM, Kasson PM, Kamerlin SCL. Friends and relatives: insight into conformational regulation from orthologues and evolutionary lineages using KIF and KIN. Faraday Discuss 2024; 252:341-353. [PMID: 38842247 PMCID: PMC11389856 DOI: 10.1039/d4fd00018h] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/07/2024]
Abstract
Noncovalent interaction networks provide a powerful means to represent and analyze protein structure. Such networks can represent both static structures and dynamic conformational ensembles. We have recently developed two tools for analyzing such interaction networks and generating hypotheses for protein engineering. Here, we apply these tools to the conformational regulation of substrate specificity in class A β-lactamases, particularly the evolutionary development from generalist to specialist catalytic function and how that can be recapitulated or reversed by protein engineering. These tools, KIF and KIN, generate a set of prioritized residues and interactions as targets for experimental protein engineering.
Collapse
Affiliation(s)
- Dariia Yehorova
- School of Chemistry and Biochemistry, Georgia Institute of Technology, USA.
| | - Rory M Crean
- Department of Chemistry-BMC, Uppsala University, Sweden
| | - Peter M Kasson
- Department of Biomedical Engineering, University of Virginia, USA
- Department of Cell and Molecular Biology, Uppsala University, Sweden
- Departments of Chemistry & Biochemistry and Biomedical Engineering, Georgia Institute of Technology, USA.
| | | |
Collapse
|
2
|
Chisholm LO, Orlandi KN, Phillips SR, Shavlik MJ, Harms MJ. Ancestral Reconstruction and the Evolution of Protein Energy Landscapes. Annu Rev Biophys 2024; 53:127-146. [PMID: 38134334 PMCID: PMC11192866 DOI: 10.1146/annurev-biophys-030722-125440] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2023]
Abstract
A protein's sequence determines its conformational energy landscape. This, in turn, determines the protein's function. Understanding the evolution of new protein functions therefore requires understanding how mutations alter the protein energy landscape. Ancestral sequence reconstruction (ASR) has proven a valuable tool for tackling this problem. In ASR, one phylogenetically infers the sequences of ancient proteins, allowing characterization of their properties. When coupled to biophysical, biochemical, and functional characterization, ASR can reveal how historical mutations altered the energy landscape of ancient proteins, allowing the evolution of enzyme activity, altered conformations, binding specificity, oligomerization, and many other protein features. In this article, we review how ASR studies have been used to dissect the evolution of energy landscapes. We also discuss ASR studies that reveal how energy landscapes have shaped protein evolution. Finally, we propose that thinking about evolution from the perspective of an energy landscape can improve how we approach and interpret ASR studies.
Collapse
Affiliation(s)
- Lauren O Chisholm
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon, USA;
- Institute of Molecular Biology, University of Oregon, Eugene, Oregon, USA
| | - Kona N Orlandi
- Institute of Molecular Biology, University of Oregon, Eugene, Oregon, USA
- Department of Biology, University of Oregon, Eugene, Oregon, USA
| | - Sophia R Phillips
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon, USA;
- Institute of Molecular Biology, University of Oregon, Eugene, Oregon, USA
| | - Michael J Shavlik
- Institute of Molecular Biology, University of Oregon, Eugene, Oregon, USA
- Department of Biology, University of Oregon, Eugene, Oregon, USA
| | - Michael J Harms
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon, USA;
- Institute of Molecular Biology, University of Oregon, Eugene, Oregon, USA
| |
Collapse
|
3
|
Jones BS, Ross CM, Foley G, Pozhydaieva N, Sharratt JW, Kress N, Seibt LS, Thomson RES, Gumulya Y, Hayes MA, Gillam EMJ, Flitsch SL. Engineering Biocatalysts for the C-H Activation of Fatty Acids by Ancestral Sequence Reconstruction. Angew Chem Int Ed Engl 2024; 63:e202314869. [PMID: 38163289 DOI: 10.1002/anie.202314869] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2023] [Revised: 12/28/2023] [Accepted: 12/29/2023] [Indexed: 01/03/2024]
Abstract
Selective, one-step C-H activation of fatty acids from biomass is an attractive concept in sustainable chemistry. Biocatalysis has shown promise for generating high-value hydroxy acids, but to date enzyme discovery has relied on laborious screening and produced limited hits, which predominantly oxidise the subterminal positions of fatty acids. Herein we show that ancestral sequence reconstruction (ASR) is an effective tool to explore the sequence-activity landscape of a family of multidomain, self-sufficient P450 monooxygenases. We resurrected 11 catalytically active CYP116B ancestors, each with a unique regioselectivity fingerprint that varied from subterminal in the older ancestors to mid-chain in the lineage leading to the extant, P450-TT. In lineages leading to extant enzymes in thermophiles, thermostability increased from ancestral to extant forms, as expected if thermophily had arisen de novo. Our studies show that ASR can be applied to multidomain enzymes to develop active, self-sufficient monooxygenases as regioselective biocatalysts for fatty acid hydroxylation.
Collapse
Affiliation(s)
- Bethan S Jones
- School of Chemistry, The University of Manchester, Manchester Institute of Biotechnology (MIB), 131 Princess Street, Manchester, M1 7DN, UK
| | - Connie M Ross
- School of Chemistry and Molecular Biosciences, The University of Queensland, St. Lucia, Brisbane, 4072, Australia
| | - Gabriel Foley
- School of Chemistry and Molecular Biosciences, The University of Queensland, St. Lucia, Brisbane, 4072, Australia
| | - Nadiia Pozhydaieva
- School of Chemistry, The University of Manchester, Manchester Institute of Biotechnology (MIB), 131 Princess Street, Manchester, M1 7DN, UK
| | - Joseph W Sharratt
- School of Chemistry, The University of Manchester, Manchester Institute of Biotechnology (MIB), 131 Princess Street, Manchester, M1 7DN, UK
| | - Nico Kress
- School of Chemistry, The University of Manchester, Manchester Institute of Biotechnology (MIB), 131 Princess Street, Manchester, M1 7DN, UK
| | - Lisa S Seibt
- School of Chemistry, The University of Manchester, Manchester Institute of Biotechnology (MIB), 131 Princess Street, Manchester, M1 7DN, UK
| | - Raine E S Thomson
- School of Chemistry and Molecular Biosciences, The University of Queensland, St. Lucia, Brisbane, 4072, Australia
| | - Yosephine Gumulya
- School of Chemistry and Molecular Biosciences, The University of Queensland, St. Lucia, Brisbane, 4072, Australia
| | - Martin A Hayes
- Compound Synthesis and Management, Discovery Sciences, R&D, AstraZeneca, Gothenburg, SE
| | - Elizabeth M J Gillam
- School of Chemistry and Molecular Biosciences, The University of Queensland, St. Lucia, Brisbane, 4072, Australia
| | - Sabine L Flitsch
- School of Chemistry, The University of Manchester, Manchester Institute of Biotechnology (MIB), 131 Princess Street, Manchester, M1 7DN, UK
| |
Collapse
|
4
|
Severino R, Moreno-Paz M, Puente-Sánchez F, Sánchez-García L, Risso VA, Sanchez-Ruiz JM, Cabrol N, Parro V. Immunoanalytical Approach for Detecting and Identifying Ancestral Peptide Biomarkers in Early Earth Analogue Environments. Anal Chem 2023; 95:5323-5330. [PMID: 36926836 PMCID: PMC10061368 DOI: 10.1021/acs.analchem.2c05386] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2022] [Accepted: 02/13/2023] [Indexed: 03/18/2023]
Abstract
Several mass spectrometry and spectroscopic techniques have been used in the search for molecular biomarkers on Mars. A major constraint is their capability to detect and identify large and complex compounds such as peptides or other biopolymers. Multiplex immunoassays can detect these compounds, but antibodies must be produced for a large number of sequence-dependent molecular targets. Ancestral Sequence Reconstruction (ASR) followed by protein "resurrection" in the lab can help to narrow the selection of targets. Herein, we propose an immunoanalytical method to identify ancient and universally conserved protein/peptide sequences as targets for identifying ancestral biomarkers in nature. We have developed, tested, and validated this approach by producing antibodies to eight previously described ancestral resurrected proteins (three β-lactamases, three thioredoxins, one Elongation Factor Tu, and one RuBisCO, all of them theoretically dated as Precambrian), and used them as a proxy to search for any potential feature of them that could be present in current natural environments. By fluorescent sandwich microarray immunoassays (FSMI), we have detected positive immunoreactions with antibodies to the oldest β-lactamase and thioredoxin proteins (ca. 4 Ga) in samples from a hydrothermal environment. Fine epitope mapping and inhibitory immunoassays allowed the identification of well-conserved epitope peptide sequences that resulted from ASR and were present in the sample. We corroborated these results by metagenomic sequencing and found several genes encoding analogue proteins with significant matches to the peptide epitopes identified with the antibodies. The results demonstrated that peptides inferred from ASR studies have true counterpart analogues in Nature, which validates and strengthens the well-known ASR/protein resurrection technique and our immunoanalytical approach for investigating ancient environments and metabolisms on Earth and elsewhere.
Collapse
Affiliation(s)
- Rita Severino
- Centro
de Astrobiología (CAB), CSIC-INTA, 28850 Torrejón de Ardoz, Madrid, Spain
- PhD
Program in Space Research and Astrobiology, University of Alcalá (UAH), 28805 Alcalá de Henares, Madrid, Spain
| | - Mercedes Moreno-Paz
- Centro
de Astrobiología (CAB), CSIC-INTA, 28850 Torrejón de Ardoz, Madrid, Spain
| | - Fernando Puente-Sánchez
- Department
of Aquatic Sciences and Assessment, Swedish
University of Agricultural Sciences (SLU), 75651 Uppsala, Sweden
| | - Laura Sánchez-García
- Centro
de Astrobiología (CAB), CSIC-INTA, 28850 Torrejón de Ardoz, Madrid, Spain
| | - Valeria A. Risso
- Departamento
de Química Física, Facultad de Ciencias, Unidad de Excelencia
de Química Aplicada a Biomedicina y Medioambiente (UEQ), Universidad de Granada, 18071 Granada, Spain
| | - Jose M. Sanchez-Ruiz
- Departamento
de Química Física, Facultad de Ciencias, Unidad de Excelencia
de Química Aplicada a Biomedicina y Medioambiente (UEQ), Universidad de Granada, 18071 Granada, Spain
| | - Nathalie Cabrol
- Carl
Sagan Center for the Study of Life in the Universe, SETI Institute, Mountain
View, California 94043, United States
| | - Victor Parro
- Centro
de Astrobiología (CAB), CSIC-INTA, 28850 Torrejón de Ardoz, Madrid, Spain
| |
Collapse
|
5
|
Medina-Carmona E, Gutierrez-Rus LI, Manssour-Triedo F, Newton MS, Gamiz-Arco G, Mota AJ, Reiné P, Cuerva JM, Ortega-Muñoz M, Andrés-León E, Ortega-Roldan JL, Seelig B, Ibarra-Molero B, Sanchez-Ruiz JM. Cell Survival Enabled by Leakage of a Labile Metabolic Intermediate. Mol Biol Evol 2023; 40:7036845. [PMID: 36788592 PMCID: PMC9989741 DOI: 10.1093/molbev/msad032] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2022] [Revised: 01/19/2023] [Accepted: 02/07/2023] [Indexed: 02/16/2023] Open
Abstract
Many metabolites are generated in one step of a biochemical pathway and consumed in a subsequent step. Such metabolic intermediates are often reactive molecules which, if allowed to freely diffuse in the intracellular milieu, could lead to undesirable side reactions and even become toxic to the cell. Therefore, metabolic intermediates are often protected as protein-bound species and directly transferred between enzyme active sites in multi-functional enzymes, multi-enzyme complexes, and metabolons. Sequestration of reactive metabolic intermediates thus contributes to metabolic efficiency. It is not known, however, whether this evolutionary adaptation can be relaxed in response to challenges to organismal survival. Here, we report evolutionary repair experiments on Escherichia coli cells in which an enzyme crucial for the biosynthesis of proline has been deleted. The deletion makes cells unable to grow in a culture medium lacking proline. Remarkably, however, cell growth is efficiently restored by many single mutations (12 at least) in the gene of glutamine synthetase. The mutations cause the leakage to the intracellular milieu of a highly reactive phosphorylated intermediate common to the biosynthetic pathways of glutamine and proline. This intermediate is generally assumed to exist only as a protein-bound species. Nevertheless, its diffusion upon mutation-induced leakage enables a new route to proline biosynthesis. Our results support that leakage of sequestered metabolic intermediates can readily occur and contribute to organismal adaptation in some scenarios. Enhanced availability of reactive molecules may enable the generation of new biochemical pathways and the potential of mutation-induced leakage in metabolic engineering is noted.
Collapse
Affiliation(s)
- Encarnación Medina-Carmona
- Departamento de Quimica Fisica, Facultad de Ciencias, Unidad de Excelencia de Quimica Aplicada a Biomedicina y Medioambiente (UEQ), Universidad de Granada, Granada, Spain.,School of Biosciences, University of Kent, Canterbury, United Kingdom
| | - Luis I Gutierrez-Rus
- Departamento de Quimica Fisica, Facultad de Ciencias, Unidad de Excelencia de Quimica Aplicada a Biomedicina y Medioambiente (UEQ), Universidad de Granada, Granada, Spain
| | - Fadia Manssour-Triedo
- Departamento de Quimica Fisica, Facultad de Ciencias, Unidad de Excelencia de Quimica Aplicada a Biomedicina y Medioambiente (UEQ), Universidad de Granada, Granada, Spain
| | - Matilda S Newton
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN.,BioTechnology Institute, University of Minnesota, St Paul, MN
| | - Gloria Gamiz-Arco
- Departamento de Quimica Fisica, Facultad de Ciencias, Unidad de Excelencia de Quimica Aplicada a Biomedicina y Medioambiente (UEQ), Universidad de Granada, Granada, Spain
| | - Antonio J Mota
- Departamento de Quimica Inorganica, Facultad de Ciencias, Unidad de Excelencia de Quimica Aplicada a Biomedicina y Medioambiente (UEQ), Universidad de Granada, Granada, Spain
| | - Pablo Reiné
- Departamento de Quimica Organica, Facultad de Ciencias, Unidad de Excelencia de Quimica Aplicada a Biomedicina y Medioambiente (UEQ), Universidad de Granada, Granada, Spain
| | - Juan Manuel Cuerva
- Departamento de Quimica Organica, Facultad de Ciencias, Unidad de Excelencia de Quimica Aplicada a Biomedicina y Medioambiente (UEQ), Universidad de Granada, Granada, Spain
| | - Mariano Ortega-Muñoz
- Departamento de Quimica Organica, Facultad de Ciencias, Unidad de Excelencia de Quimica Aplicada a Biomedicina y Medioambiente (UEQ), Universidad de Granada, Granada, Spain
| | - Eduardo Andrés-León
- Unidad de Bioinformática, Instituto de Parasitología y Biomedicina "Lopez Neyra", CSIC, Armilla, Granada, Spain
| | | | - Burckhard Seelig
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN.,BioTechnology Institute, University of Minnesota, St Paul, MN
| | - Beatriz Ibarra-Molero
- Departamento de Quimica Fisica, Facultad de Ciencias, Unidad de Excelencia de Quimica Aplicada a Biomedicina y Medioambiente (UEQ), Universidad de Granada, Granada, Spain
| | - Jose M Sanchez-Ruiz
- Departamento de Quimica Fisica, Facultad de Ciencias, Unidad de Excelencia de Quimica Aplicada a Biomedicina y Medioambiente (UEQ), Universidad de Granada, Granada, Spain
| |
Collapse
|
6
|
Hu J, Chen X, Zhang L, Zhou J, Xu G, Ni Y. Engineering the Thermostability of a d-Carbamoylase Based on Ancestral Sequence Reconstruction for the Efficient Synthesis of d-Tryptophan. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2023; 71:660-670. [PMID: 36541894 DOI: 10.1021/acs.jafc.2c07781] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]
Abstract
Employing ancestral sequence reconstruction and consensus sequence analysis, the thermostability of a novel d-carbamoylase derived from Nitratireductor indicus (NiHyuC) was engineered through greedy-oriented iterative combinatorial mutagenesis. A mutant S202P/E208D/R277L (M4Th3) was obtained with significantly elevated thermostability. M4Th3 has a half-life of 36.5 h at 40 °C, about 28.5 times of 1.3 h of its parent M4. For the reaction at 40 °C, M4Th3 can catalyze 10 mM N-carbamoyl-d-tryptophan to produce d-tryptophan with a conversion ratio of 96.4% after 12 h, which is significantly higher than 64.1% of M4. MD simulation reveals that new hydrogen bonds emerging from E208D on the surface can increase the hydrophobicity of the protein, leading to improved stability. More importantly, R277L could contribute to enhanced interface stability of homodimeric M4. This study provides a thermostable d-carbamoylase for the "hydantoinase process", which has potential in the industrial synthesis of optically pure natural and non-natural amino acids.
Collapse
Affiliation(s)
- Jiamin Hu
- Key laboratory of industrial Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi214122, Jiangsu, China
| | - Xiaoyu Chen
- Key laboratory of industrial Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi214122, Jiangsu, China
| | - Lu Zhang
- Key laboratory of industrial Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi214122, Jiangsu, China
| | - Jieyu Zhou
- Key laboratory of industrial Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi214122, Jiangsu, China
| | - Guochao Xu
- Key laboratory of industrial Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi214122, Jiangsu, China
| | - Ye Ni
- Key laboratory of industrial Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi214122, Jiangsu, China
| |
Collapse
|
7
|
Hager M, Pöhler MT, Reinhardt F, Wellner K, Hübner J, Betat H, Prohaska S, Mörl M. Substrate Affinity Versus Catalytic Efficiency: Ancestral Sequence Reconstruction of tRNA Nucleotidyltransferases Solves an Enzyme Puzzle. Mol Biol Evol 2022; 39:6835633. [PMID: 36409584 PMCID: PMC9728577 DOI: 10.1093/molbev/msac250] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
In tRNA maturation, CCA-addition by tRNA nucleotidyltransferase is a unique and highly accurate reaction. While the mechanism of nucleotide selection and polymerization is well understood, it remains a mystery why bacterial and eukaryotic enzymes exhibit an unexpected and surprisingly low tRNA substrate affinity while they efficiently catalyze the CCA-addition. To get insights into the evolution of this high-fidelity RNA synthesis, the reconstruction and characterization of ancestral enzymes is a versatile tool. Here, we investigate a reconstructed candidate of a 2 billion years old CCA-adding enzyme from Gammaproteobacteria and compare it to the corresponding modern enzyme of Escherichia coli. We show that the ancestral candidate catalyzes an error-free CCA-addition, but has a much higher tRNA affinity compared with the extant enzyme. The consequence of this increased substrate binding is an enhanced reverse reaction, where the enzyme removes the CCA end from the mature tRNA. As a result, the ancestral candidate exhibits a lower catalytic efficiency in vitro as well as in vivo. Furthermore, the efficient tRNA interaction leads to a processive polymerization, while the extant enzyme catalyzes nucleotide addition in a distributive way. Thus, the modern enzymes increased their polymerization efficiency by lowering the binding affinity to tRNA, so that CCA synthesis is efficiently promoted due to a reduced reverse reaction. Hence, the puzzling and at a first glance contradicting and detrimental weak substrate interaction represents a distinct activity enhancement in the evolution of CCA-adding enzymes.
Collapse
Affiliation(s)
| | | | - Franziska Reinhardt
- Computational EvoDevo Group, Institute for Computer Science, Leipzig University, Härtelstr. 16-18, 04109 Leipzig, Germany,Interdisciplinary Centre for Bioinformatics, Leipzig University, Härtelstr. 16-18, 04109 Leipzig, Germany
| | - Karolin Wellner
- Institute for Biochemistry, Leipzig University, Brüderstraße 34, D-04103 Leipzig, Germany
| | - Jessica Hübner
- Computational EvoDevo Group, Institute for Computer Science, Leipzig University, Härtelstr. 16-18, 04109 Leipzig, Germany
| | - Heike Betat
- Institute for Biochemistry, Leipzig University, Brüderstraße 34, D-04103 Leipzig, Germany
| | - Sonja Prohaska
- Computational EvoDevo Group, Institute for Computer Science, Leipzig University, Härtelstr. 16-18, 04109 Leipzig, Germany,Interdisciplinary Centre for Bioinformatics, Leipzig University, Härtelstr. 16-18, 04109 Leipzig, Germany,Santa Fe Institute, 1399 Hyde Park Road, Santa Fe, NM 87501, USA,Complexity Science Hub Vienna, Josefstädter Str. 39, 1080 Wien, Austria
| | | |
Collapse
|
8
|
Youssef N, Susko E, Roger AJ, Bielawski JP. Evolution of amino acid propensities under stability-mediated epistasis. Mol Biol Evol 2022; 39:6522130. [PMID: 35134997 PMCID: PMC8896634 DOI: 10.1093/molbev/msac030] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Site-specific amino acid preferences are influenced by the genetic background of the protein. The preferences for resident amino acids are expected to, on average, increase over time because of replacements at other sites - a nonadaptive phenomenon referred to as the 'evolutionary Stokes shift'. Alternatively, decreases in resident amino acid propensity have recently been viewed as evidence of adaptations to external environmental changes. Using population genetics theory and thermodynamic stability-constraints, we show that nonadaptive evolution can lead to both positive and negative shifts in propensities following the fixation of an amino acid, emphasizing that the detection of negative shifts is not conclusive evidence of adaptation. Considering shifts in propensities over windows between substitutions at a focal site, we find that following ≈ 50% of substitutions the propensity for the new resident amino acid decreases over time, and both positive and negative shifts were comparable in magnitude. Preferences were often conserved via a significant negative autocorrelation in propensity changes-increases in propensities often followed by decreases, and vice versa. Lastly, we explore the underlying mechanisms that lead propensities to fluctuate. We observe that stabilizing replacements increase the mutational tolerance at a site and in doing so decrease the propensity for the resident amino acid. In contrast, destabilizing substitutions result in more rugged fitness landscapes that tend to favor the resident amino acid. In summary, our results characterize propensity trajectories under nonadaptive stability-constrained evolution against which evidence of adaptations should be calibrated.
Collapse
Affiliation(s)
- Noor Youssef
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA
| | - Edward Susko
- Department of Mathematics and Statistics, Dalhousie University, Halifax, NS, Canada
| | - Andrew J Roger
- Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, NS, Canada
| | - Joseph P Bielawski
- Department of Biology, Dalhousie University, Halifax, Nova Scotia, Canada Department of Mathematics and Statistics, Dalhousie University, Halifax, Nova Scotia, Canada
| |
Collapse
|
9
|
Gamiz-Arco G, Risso VA, Gaucher EA, Gavira JA, Naganathan AN, Ibarra-Molero B, Sanchez-Ruiz JM. Combining Ancestral Reconstruction with Folding-Landscape Simulations to Engineer Heterologous Protein Expression. J Mol Biol 2021; 433:167321. [PMID: 34687715 DOI: 10.1016/j.jmb.2021.167321] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2021] [Revised: 10/01/2021] [Accepted: 10/17/2021] [Indexed: 11/30/2022]
Abstract
Obligate symbionts typically exhibit high evolutionary rates. Consequently, their proteins may differ considerably from their modern and ancestral homologs in terms of both sequence and properties, thus providing excellent models to study protein evolution. Also, obligate symbionts are challenging to culture in the lab and proteins from uncultured organisms must be produced in heterologous hosts using recombinant DNA technology. Obligate symbionts thus replicate a fundamental scenario of metagenomics studies aimed at the functional characterization and biotechnological exploitation of proteins from the bacteria in soil. Here, we use the thioredoxin from Candidatus Photodesmus katoptron, an uncultured symbiont of flashlight fish, to explore evolutionary and engineering aspects of protein folding in heterologous hosts. The symbiont protein is a standard thioredoxin in terms of 3D-structure, stability and redox activity. However, its folding outside the original host is severely impaired, as shown by a very slow refolding in vitro and an inefficient expression in E. coli that leads mostly to insoluble protein. By contrast, resurrected Precambrian thioredoxins express efficiently in E. coli, plausibly reflecting an ancient adaptation to unassisted folding. We have used a statistical-mechanical model of the folding landscape to guide back-to-ancestor engineering of the symbiont protein. Remarkably, we find that the efficiency of heterologous expression correlates with the in vitro (i.e., unassisted) folding rate and that the ancestral expression efficiency can be achieved with only 1-2 back-to-ancestor replacements. These results demonstrate a minimal-perturbation, sequence-engineering approach to rescue inefficient heterologous expression which may potentially be useful in metagenomics efforts targeting recent adaptations.
Collapse
Affiliation(s)
- Gloria Gamiz-Arco
- Departamento de Quimica Fisica, Facultad de Ciencias, Unidad de Excelencia de Quimica Aplicada a Biomedicina y Medioambiente (UEQ), Universidad de Granada, 18071 Granada, Spain
| | - Valeria A Risso
- Departamento de Quimica Fisica, Facultad de Ciencias, Unidad de Excelencia de Quimica Aplicada a Biomedicina y Medioambiente (UEQ), Universidad de Granada, 18071 Granada, Spain
| | - Eric A Gaucher
- Department of Biology, Georgia State University, Atlanta, GA 30303, USA
| | - Jose A Gavira
- Laboratorio de Estudios Cristalograficos, Instituto Andaluz de Ciencias de la Tierra, CSIC, Unidad de Excelencia de Quimica Aplicada a Biomedicina y Medioambiente (UEQ), Universidad de Granada, Avenida de las Palmeras 4, Armilla, Granada 18100, Spain. https://twitter.com/Gavirius
| | - Athi N Naganathan
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, India.
| | - Beatriz Ibarra-Molero
- Departamento de Quimica Fisica, Facultad de Ciencias, Unidad de Excelencia de Quimica Aplicada a Biomedicina y Medioambiente (UEQ), Universidad de Granada, 18071 Granada, Spain.
| | - Jose M Sanchez-Ruiz
- Departamento de Quimica Fisica, Facultad de Ciencias, Unidad de Excelencia de Quimica Aplicada a Biomedicina y Medioambiente (UEQ), Universidad de Granada, 18071 Granada, Spain.
| |
Collapse
|
10
|
Youssef N, Susko E, Roger AJ, Bielawski JP. Shifts in amino acid preferences as proteins evolve: A synthesis of experimental and theoretical work. Protein Sci 2021; 30:2009-2028. [PMID: 34322924 PMCID: PMC8442975 DOI: 10.1002/pro.4161] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2021] [Revised: 07/19/2021] [Accepted: 07/26/2021] [Indexed: 11/08/2022]
Abstract
Amino acid preferences vary across sites and time. While variation across sites is widely accepted, the extent and frequency of temporal shifts are contentious. Our understanding of the drivers of amino acid preference change is incomplete: To what extent are temporal shifts driven by adaptive versus nonadaptive evolutionary processes? We review phenomena that cause preferences to vary (e.g., evolutionary Stokes shift, contingency, and entrenchment) and clarify how they differ. To determine the extent and prevalence of shifted preferences, we review experimental and theoretical studies. Analyses of natural sequence alignments often detect decreases in homoplasy (convergence and reversions) rates, and variation in replacement rates with time-signals that are consistent with temporally changing preferences. While approaches inferring shifts in preferences from patterns in natural alignments are valuable, they are indirect since multiple mechanisms (both adaptive and nonadaptive) could lead to the observed signal. Alternatively, site-directed mutagenesis experiments allow for a more direct assessment of shifted preferences. They corroborate evidence from multiple sequence alignments, revealing that the preference for an amino acid at a site varies depending on the background sequence. However, shifts in preferences are usually minor in magnitude and sites with significantly shifted preferences are low in frequency. The small yet consistent perturbations in preferences could, nevertheless, jeopardize the accuracy of inference procedures, which assume constant preferences. We conclude by discussing if and how such shifts in preferences might influence widely used time-homogenous inference procedures and potential ways to mitigate such effects.
Collapse
Affiliation(s)
- Noor Youssef
- Department of BiologyDalhousie UniversityHalifaxNova ScotiaCanada
| | - Edward Susko
- Department of Mathematics and StatisticsDalhousie UniversityHalifaxNova ScotiaCanada
| | - Andrew J. Roger
- Department of Biochemistry and Molecular BiologyDalhousie UniversityHalifaxNova ScotiaCanada
| | - Joseph P. Bielawski
- Department of BiologyDalhousie UniversityHalifaxNova ScotiaCanada
- Department of Mathematics and StatisticsDalhousie UniversityHalifaxNova ScotiaCanada
| |
Collapse
|
11
|
Romero-Romero S, Costas M, Silva Manzano DA, Kordes S, Rojas-Ortega E, Tapia C, Guerra Y, Shanmugaratnam S, Rodríguez-Romero A, Baker D, Höcker B, Fernández-Velasco DA. The Stability Landscape of de novo TIM Barrels Explored by a Modular Design Approach. J Mol Biol 2021; 433:167153. [PMID: 34271011 PMCID: PMC8404036 DOI: 10.1016/j.jmb.2021.167153] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2021] [Revised: 06/18/2021] [Accepted: 07/06/2021] [Indexed: 11/25/2022]
Abstract
The TIM barrel is a versatile fold to understand structure-stability relationships. A collection of de novo TIM barrels with improved hydrophobic cores was designed. DeNovoTIMs are reversible in chemical and thermal unfolding, which is uncommon in TIM barrels. Epistatic effects play a central role in DeNovoTIMs stabilization. DeNovoTIMs navigate a previously uncharted region of the stability landscape.
The ability to design stable proteins with custom-made functions is a major goal in biochemistry with practical relevance for our environment and society. Understanding and manipulating protein stability provide crucial information on the molecular determinants that modulate structure and stability, and expand the applications of de novo proteins. Since the (β/⍺)8-barrel or TIM-barrel fold is one of the most common functional scaffolds, in this work we designed a collection of stable de novo TIM barrels (DeNovoTIMs), using a computational fixed-backbone and modular approach based on improved hydrophobic packing of sTIM11, the first validated de novo TIM barrel, and subjected them to a thorough folding analysis. DeNovoTIMs navigate a region of the stability landscape previously uncharted by natural TIM barrels, with variations spanning 60 degrees in melting temperature and 22 kcal per mol in conformational stability throughout the designs. Significant non-additive or epistatic effects were observed when stabilizing mutations from different regions of the barrel were combined. The molecular basis of epistasis in DeNovoTIMs appears to be related to the extension of the hydrophobic cores. This study is an important step towards the fine-tuned modulation of protein stability by design.
Collapse
Affiliation(s)
- Sergio Romero-Romero
- Laboratorio de Fisicoquímica e Ingeniería de Proteínas, Departamento de Bioquímica, Facultad de Medicina, Universidad Nacional Autónoma de México, 04510 Mexico City, Mexico; Department of Biochemistry, University of Bayreuth, 95447 Bayreuth, Germany
| | - Miguel Costas
- Laboratorio de Biofisicoquímica, Departamento de Fisicoquímica, Facultad de Química, Universidad Nacional Autónoma de México, 04510 Mexico City, Mexico
| | - Daniel-Adriano Silva Manzano
- Department of Biochemistry, University of Washington, 98195 Seattle, USA; Institute for Protein Design, University of Washington, 98195 Seattle, USA
| | - Sina Kordes
- Department of Biochemistry, University of Bayreuth, 95447 Bayreuth, Germany
| | - Erendira Rojas-Ortega
- Laboratorio de Fisicoquímica e Ingeniería de Proteínas, Departamento de Bioquímica, Facultad de Medicina, Universidad Nacional Autónoma de México, 04510 Mexico City, Mexico
| | - Cinthya Tapia
- Laboratorio de Fisicoquímica e Ingeniería de Proteínas, Departamento de Bioquímica, Facultad de Medicina, Universidad Nacional Autónoma de México, 04510 Mexico City, Mexico
| | - Yasel Guerra
- Laboratorio de Fisicoquímica e Ingeniería de Proteínas, Departamento de Bioquímica, Facultad de Medicina, Universidad Nacional Autónoma de México, 04510 Mexico City, Mexico
| | | | - Adela Rodríguez-Romero
- Instituto de Química, Universidad Nacional Autónoma de México, 04510 Mexico City, Mexico
| | - David Baker
- Department of Biochemistry, University of Washington, 98195 Seattle, USA; Institute for Protein Design, University of Washington, 98195 Seattle, USA.
| | - Birte Höcker
- Department of Biochemistry, University of Bayreuth, 95447 Bayreuth, Germany.
| | - D Alejandro Fernández-Velasco
- Laboratorio de Fisicoquímica e Ingeniería de Proteínas, Departamento de Bioquímica, Facultad de Medicina, Universidad Nacional Autónoma de México, 04510 Mexico City, Mexico.
| |
Collapse
|
12
|
Carletti MS, Monzon AM, Garcia-Rios E, Benitez G, Hirsh L, Fornasari MS, Parisi G. Revenant: a database of resurrected proteins. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2021; 2020:5828294. [PMID: 32400867 PMCID: PMC7218706 DOI: 10.1093/database/baaa031] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/03/2019] [Revised: 03/06/2020] [Accepted: 03/31/2020] [Indexed: 11/29/2022]
Abstract
Revenant is a database of resurrected proteins coming from extinct organisms. Currently, it contains a manually curated collection of 84 resurrected proteins derived from bibliographic data. Each protein is extensively annotated, including structural, biochemical and biophysical information. Revenant contains a browse capability designed as a timeline from where the different proteins can be accessed. The oldest Revenant entries are between 4200 and 3500 million years ago, while the younger entries are between 8.8 and 6.3 million years ago. These proteins have been resurrected using computational tools called ancestral sequence reconstruction techniques combined with wet-laboratory synthesis and expression. Resurrected proteins are commonly used, with a noticeable increase during the past years, to explore and test different evolutionary hypotheses such as protein stability, to explore the origin of new functions, to get biochemical insights into past metabolisms and to explore specificity and promiscuous behaviour of ancient proteins.
Collapse
Affiliation(s)
- Matias Sebastian Carletti
- Departamento de Ciencia y Tecnología, CONICET, Universidad Nacional de Quilmes, Roque Saenz Peña 182, Bernal, B1876BXD, Buenos Aires, Argentina
| | - Alexander Miguel Monzon
- Departamento de Ciencia y Tecnología, CONICET, Universidad Nacional de Quilmes, Roque Saenz Peña 182, Bernal, B1876BXD, Buenos Aires, Argentina.,Department of Biomedical Sciences, University of Padova, Viale G. Colombo 3, Padova, I-35131, Padova, Italy
| | - Emilio Garcia-Rios
- Departamento de Ingeniería, Pontificia Universidad Católica del Perú, Lima, Perú
| | - Guillermo Benitez
- Departamento de Ciencia y Tecnología, CONICET, Universidad Nacional de Quilmes, Roque Saenz Peña 182, Bernal, B1876BXD, Buenos Aires, Argentina
| | - Layla Hirsh
- Departamento de Ingeniería, Pontificia Universidad Católica del Perú, Lima, Perú
| | - Maria Silvina Fornasari
- Departamento de Ciencia y Tecnología, CONICET, Universidad Nacional de Quilmes, Roque Saenz Peña 182, Bernal, B1876BXD, Buenos Aires, Argentina
| | - Gustavo Parisi
- Departamento de Ciencia y Tecnología, CONICET, Universidad Nacional de Quilmes, Roque Saenz Peña 182, Bernal, B1876BXD, Buenos Aires, Argentina
| |
Collapse
|
13
|
Razban RM, Dasmeh P, Serohijos AWR, Shakhnovich EI. Avoidance of protein unfolding constrains protein stability in long-term evolution. Biophys J 2021; 120:2413-2424. [PMID: 33932438 PMCID: PMC8390877 DOI: 10.1016/j.bpj.2021.03.042] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2020] [Revised: 02/24/2021] [Accepted: 03/17/2021] [Indexed: 11/28/2022] Open
Abstract
Every amino acid residue can influence a protein's overall stability, making stability highly susceptible to change throughout evolution. We consider the distribution of protein stabilities evolutionarily permittable under two previously reported protein fitness functions: flux dynamics and misfolding avoidance. We develop an evolutionary dynamics theory and find that it agrees better with an extensive protein stability data set for dihydrofolate reductase orthologs under the misfolding avoidance fitness function rather than the flux dynamics fitness function. Further investigation with ribonuclease H data demonstrates that not any misfolded state is avoided; rather, it is only the unfolded state. At the end, we discuss how our work pertains to the universal protein abundance-evolutionary rate correlation seen across organisms' proteomes. We derive a closed-form expression relating protein abundance to evolutionary rate that captures Escherichia coli, Saccharomyces cerevisiae, and Homo sapiens experimental trends without fitted parameters.
Collapse
Affiliation(s)
- Rostam M Razban
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts
| | - Pouria Dasmeh
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts; Departement de Biochimie, Université de Montréal, Montreal, Quebec, Canada
| | | | - Eugene I Shakhnovich
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts.
| |
Collapse
|
14
|
Xie VC, Pu J, Metzger BP, Thornton JW, Dickinson BC. Contingency and chance erase necessity in the experimental evolution of ancestral proteins. eLife 2021; 10:67336. [PMID: 34061027 PMCID: PMC8282340 DOI: 10.7554/elife.67336] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2021] [Accepted: 05/30/2021] [Indexed: 12/13/2022] Open
Abstract
The roles of chance, contingency, and necessity in evolution are unresolved because they have never been assessed in a single system or on timescales relevant to historical evolution. We combined ancestral protein reconstruction and a new continuous evolution technology to mutate and select proteins in the B-cell lymphoma-2 (BCL-2) family to acquire protein–protein interaction specificities that occurred during animal evolution. By replicating evolutionary trajectories from multiple ancestral proteins, we found that contingency generated over long historical timescales steadily erased necessity and overwhelmed chance as the primary cause of acquired sequence variation; trajectories launched from phylogenetically distant proteins yielded virtually no common mutations, even under strong and identical selection pressures. Chance arose because many sets of mutations could alter specificity at any timepoint; contingency arose because historical substitutions changed these sets. Our results suggest that patterns of variation in BCL-2 sequences – and likely other proteins, too – are idiosyncratic products of a particular and unpredictable course of historical events. One of the most fundamental and unresolved questions in evolutionary biology is whether the outcomes of evolution are predictable. Is the diversity of life we see today the expected result of organisms adapting to their environment throughout history (also known as natural selection) or the product of random chance? Or did chance events early in history shape the paths that evolution could take next, determining the biological forms that emerged under natural selection much later? These questions are hard to study because evolution happened only once, long ago. To overcome this barrier, Xie, Pu, Metzger et al. developed an experimental approach that can evolve reconstructed ancestral proteins that existed deep in the past. Using this method, it is possible to replay evolution multiple times, from various historical starting points, under conditions similar to those that existed long ago. The end products of the evolutionary trajectories can then be compared to determine how predictable evolution actually is. Xie, Pu, Metzger et al. studied proteins belonging to the BCL-2 family, which originated some 800 million years ago. These proteins have diversified greatly over time in both their genetic sequences and their ability to bind to specific partner proteins called co-regulators. Xie, Pu, Metzger et al. synthesized BCL-2 proteins that existed at various times in the past. Each ancestral protein was then allowed to evolve repeatedly under natural selection to acquire the same co-regulator binding functions that evolved during history. At the end of each evolutionary trajectory, the genetic sequence of the resulting BCL-2 proteins was recorded. This revealed that the outcomes of evolution were almost completely unpredictable: trajectories initiated from the same ancestral protein produced proteins with very different sequences, and proteins launched from different ancestral starting points were even more dissimilar. Further experiments identified the mutations in each trajectory that caused changes in coregulator binding. When these mutations were introduced into other ancestral proteins, they did not yield the same change in function. This suggests that early chance events influenced each protein’s evolution in an unpredictable way by opening and closing the paths available to it in the future. This research expands our understanding of evolution on a molecular level whilst providing a new experimental approach for studying evolutionary drivers in more detail. The results suggest that BCL-2 proteins, in all their various forms, are unique products of a particular, unpredictable course of history set in motion by ancient chance events.
Collapse
Affiliation(s)
| | - Jinyue Pu
- Department of Chemistry, University of Chicago, Chicago, United States
| | - Brian Ph Metzger
- Department of Ecology and Evolution, University of Chicago, Chicago, United States
| | - Joseph W Thornton
- Department of Ecology and Evolution, University of Chicago, Chicago, United States.,Department of Human Genetics, University of Chicago, Chicago, United States
| | - Bryan C Dickinson
- Department of Chemistry, University of Chicago, Chicago, United States
| |
Collapse
|
15
|
Liu R, Wang J, Xiong P, Chen Q, Liu H. De novo sequence redesign of a functional Ras-binding domain globally inverted the surface charge distribution and led to extreme thermostability. Biotechnol Bioeng 2021; 118:2031-2042. [PMID: 33590881 DOI: 10.1002/bit.27716] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2020] [Revised: 02/05/2021] [Accepted: 02/14/2021] [Indexed: 11/05/2022]
Abstract
To acquire extremely thermostable proteins of given functions is challenging for conventional protein engineering. Here we applied ABACUS, a statistical energy function we developed for de novo amino acid sequence design, to globally redesign a Ras-binding domain (RBD), and obtained an extremely thermostable RBD that unfolds reversibly at above 110°C, the redesigned RBD experimentally confirmed to have expected structure and Ras-binding interface. Directed evolution of the redesigned RBD improved its Ras-binding affinity to the native protein level without excessive loss of thermostability. The designed amino acid substitutions were mostly at the protein surface. For many substitutions, strong epistasis or significantly differentiated effects on thermostability in the native sequence context relative to the redesigned sequence context were observed, suggesting the globally redesigned sequence to be unreachable through combining beneficial mutations of the native sequence. Further analyses revealed that by replacing 38 of a total of 48 non-interfacial surface residues at once, ABACUS redesign was able to globally "invert" the protein's charge distribution pattern in an optimized way. Our study demonstrates that computational protein design provides powerful new tools to solve challenging protein engineering problems.
Collapse
Affiliation(s)
- Ruicun Liu
- School of Life Sciences, University of Science and Technology of China, Hefei, Anhui, China
| | - Jichao Wang
- School of Life Sciences, University of Science and Technology of China, Hefei, Anhui, China
| | - Peng Xiong
- School of Life Sciences, University of Science and Technology of China, Hefei, Anhui, China
| | - Quan Chen
- School of Life Sciences, University of Science and Technology of China, Hefei, Anhui, China.,Hefei National Laboratory for Physical Sciences at the Microscale, University of Science and Technology of China, Hefei, Anhui, China
| | - Haiyan Liu
- School of Life Sciences, University of Science and Technology of China, Hefei, Anhui, China.,Hefei National Laboratory for Physical Sciences at the Microscale, University of Science and Technology of China, Hefei, Anhui, China.,School of Data Science, University of Science and Technology of China, Hefei, Anhui, China
| |
Collapse
|
16
|
Stolyarova AV, Nabieva E, Ptushenko VV, Favorov AV, Popova AV, Neverov AD, Bazykin GA. Senescence and entrenchment in evolution of amino acid sites. Nat Commun 2020; 11:4603. [PMID: 32929079 PMCID: PMC7490271 DOI: 10.1038/s41467-020-18366-z] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2019] [Accepted: 08/20/2020] [Indexed: 01/01/2023] Open
Abstract
Amino acid propensities at a site change in the course of protein evolution. This may happen for two reasons. Changes may be triggered by substitutions at epistatically interacting sites elsewhere in the genome. Alternatively, they may arise due to environmental changes that are external to the genome. Here, we design a framework for distinguishing between these alternatives. Using analytical modelling and simulations, we show that they cause opposite dynamics of the fitness of the allele currently occupying the site: it tends to increase with the time since its origin due to epistasis ("entrenchment"), but to decrease due to random environmental fluctuations ("senescence"). By analysing the genomes of vertebrates and insects, we show that the amino acids originating at negatively selected sites experience strong entrenchment. By contrast, the amino acids originating at positively selected sites experience senescence. We propose that senescence of the current allele is a cause of adaptive evolution.
Collapse
Affiliation(s)
- A V Stolyarova
- Center of Life Sciences, Skolkovo Institute of Science and Technology, Skolkovo, 143028, Russia.
| | - E Nabieva
- Center of Life Sciences, Skolkovo Institute of Science and Technology, Skolkovo, 143028, Russia
- Institute for Information Transmission Problems (Kharkevich Institute), Russian Academy of Sciences, Moscow, 127051, Russia
| | - V V Ptushenko
- Department of Photochemistry and Photobiology, N. M. Emanuel Institute of Biochemical Physics of Russian Academy of Sciences, Moscow, 119334, Russia
- A. N. Belozersky Institute of Physical-Chemical Biology, M. V. Lomonosov Moscow State University, Moscow, 119992, Russia
| | - A V Favorov
- Division of Biostatistics and Bioinformatics, Department of Oncology, Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins School of Medicine, Baltimore, MD, 21205, USA
- Laboratory of System Biology and Computational Genetics, Vavilov Institute of General Genetics, Moscow, 119991, Russia
| | - A V Popova
- Department of Molecular Diagnostics, Central Research Institute for Epidemiology, Moscow, 111123, Russia
| | - A D Neverov
- Department of Molecular Diagnostics, Central Research Institute for Epidemiology, Moscow, 111123, Russia
| | - G A Bazykin
- Center of Life Sciences, Skolkovo Institute of Science and Technology, Skolkovo, 143028, Russia
- Institute for Information Transmission Problems (Kharkevich Institute), Russian Academy of Sciences, Moscow, 127051, Russia
| |
Collapse
|
17
|
Youssef N, Susko E, Bielawski JP. Consequences of Stability-Induced Epistasis for Substitution Rates. Mol Biol Evol 2020; 37:3131-3148. [DOI: 10.1093/molbev/msaa151] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023] Open
Abstract
AbstractDo interactions between residues in a protein (i.e., epistasis) significantly alter evolutionary dynamics? If so, what consequences might they have on inference from traditional codon substitution models which assume site-independence for the sake of computational tractability? To investigate the effects of epistasis on substitution rates, we employed a mechanistic mutation-selection model in conjunction with a fitness framework derived from protein stability. We refer to this as the stability-informed site-dependent (S-SD) model and developed a new stability-informed site-independent (S-SI) model that captures the average effect of stability constraints on individual sites of a protein. Comparison of S-SI and S-SD offers a novel and direct method for investigating the consequences of stability-induced epistasis on protein evolution. We developed S-SI and S-SD models for three natural proteins and showed that they generate sequences consistent with real alignments. Our analyses revealed that epistasis tends to increase substitution rates compared with the rates under site-independent evolution. We then assessed the epistatic sensitivity of individual site and discovered a counterintuitive effect: Highly connected sites were less influenced by epistasis relative to exposed sites. Lastly, we show that, despite the unrealistic assumptions, traditional models perform comparably well in the presence and absence of epistasis and provide reasonable summaries of average selection intensities. We conclude that epistatic models are critical to understanding protein evolutionary dynamics, but epistasis might not be required for reasonable inference of selection pressure when averaging over time and sites.
Collapse
Affiliation(s)
- Noor Youssef
- Department of Biology, Dalhousie University, Halifax, Nova Scotia, Canada
- Centre for Genomics and Evolutionary Bioinformatics, Dalhousie University, Halifax, Nova Scotia, Canada
| | - Edward Susko
- Centre for Genomics and Evolutionary Bioinformatics, Dalhousie University, Halifax, Nova Scotia, Canada
- Department of Mathematics and Statistics, Dalhousie University, Halifax, Nova Scotia, Canada
| | - Joseph P Bielawski
- Department of Biology, Dalhousie University, Halifax, Nova Scotia, Canada
- Centre for Genomics and Evolutionary Bioinformatics, Dalhousie University, Halifax, Nova Scotia, Canada
- Department of Mathematics and Statistics, Dalhousie University, Halifax, Nova Scotia, Canada
| |
Collapse
|
18
|
Non-conservation of folding rates in the thioredoxin family reveals degradation of ancestral unassisted-folding. Biochem J 2020; 476:3631-3647. [PMID: 31750876 PMCID: PMC6906118 DOI: 10.1042/bcj20190739] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2019] [Revised: 11/19/2019] [Accepted: 11/21/2019] [Indexed: 01/04/2023]
Abstract
Evolution involves not only adaptation, but also the degradation of superfluous features. Many examples of degradation at the morphological level are known (vestigial organs, for instance). However, the impact of degradation on molecular evolution has been rarely addressed. Thioredoxins serve as general oxidoreductases in all cells. Here, we report extensive mutational analyses on the folding of modern and resurrected ancestral bacterial thioredoxins. Contrary to claims from recent literature, in vitro folding rates in the thioredoxin family are not evolutionarily conserved, but span at least a ∼100-fold range. Furthermore, modern thioredoxin folding is often substantially slower than ancestral thioredoxin folding. Unassisted folding, as probed in vitro, thus emerges as an ancestral vestigial feature that underwent degradation, plausibly upon the evolutionary emergence of efficient cellular folding assistance. More generally, our results provide evidence that degradation of ancestral features shapes, not only morphological evolution, but also the evolution of individual proteins.
Collapse
|
19
|
Phylogeny, evolution, and potential ecological relationship of cytochrome CYP52 enzymes in Saccharomycetales yeasts. Sci Rep 2020; 10:10269. [PMID: 32581293 PMCID: PMC7314818 DOI: 10.1038/s41598-020-67200-5] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2019] [Accepted: 05/07/2020] [Indexed: 01/16/2023] Open
Abstract
Cytochrome P450s from the CYP52 family participate in the assimilation of alkanes and fatty acids in fungi. In this work, the evolutionary history of a set of orthologous and paralogous CYP52 proteins from Saccharomycetales yeasts was inferred. Further, the phenotypic assimilation profiles were related with the distribution of cytochrome CYP52 members among species. The maximum likelihood phylogeny of CYP52 inferred proteins reveled a frequent ancient and modern duplication and loss events that generated orthologous and paralogous groups. Phylogeny and assimilation profiles of alkanes and fatty acids showed a family expansion in yeast isolated from hydrophobic-rich environments. Docking analysis of deduced ancient CYP52 proteins suggests that the most ancient function was the oxidation of C4-C11 alkanes, while the oxidation of >10 carbon alkanes and fatty acids is a derived character. The ancient CYP52 paralogs displayed partial specialization and promiscuous interaction with hydrophobic substrates. Additionally, functional optimization was not evident. Changes in the interaction of ancient CYP52 with different alkanes and fatty acids could be associated with modifications in spatial orientations of the amino acid residues that comprise the active site. The extended family of CYP52 proteins is likely evolving toward functional specialization, and certain redundancy for substrates is being maintained.
Collapse
|
20
|
Gardner JM, Biler M, Risso VA, Sanchez-Ruiz JM, Kamerlin SCL. Manipulating Conformational Dynamics To Repurpose Ancient Proteins for Modern Catalytic Functions. ACS Catal 2020. [DOI: 10.1021/acscatal.0c00722] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Affiliation(s)
- Jasmine M. Gardner
- Department of Chemistry - BMC, Uppsala University, Box 576, 751 23 Uppsala, Sweden
| | - Michal Biler
- Department of Chemistry - BMC, Uppsala University, Box 576, 751 23 Uppsala, Sweden
| | - Valeria A. Risso
- Departamento de Quı́mica Fisica, Facultad de Ciencias, Unidad de Excelencia de Quı́mica Aplicada a Biomedicina y Medioambiente (UEQ), Universidad de Granada, 18071 Granada, Spain
| | - Jose M. Sanchez-Ruiz
- Departamento de Quı́mica Fisica, Facultad de Ciencias, Unidad de Excelencia de Quı́mica Aplicada a Biomedicina y Medioambiente (UEQ), Universidad de Granada, 18071 Granada, Spain
| | - Shina C. L. Kamerlin
- Department of Chemistry - BMC, Uppsala University, Box 576, 751 23 Uppsala, Sweden
| |
Collapse
|
21
|
Molecular origins of folding rate differences in the thioredoxin family. Biochem J 2020; 477:1083-1087. [DOI: 10.1042/bcj20190864] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2019] [Revised: 02/19/2020] [Accepted: 02/20/2020] [Indexed: 12/13/2022]
Abstract
Thioredoxins are a family of conserved oxidoreductases responsible for maintaining redox balance within cells. They have also served as excellent model systems for protein design and engineering studies particularly through ancestral sequence reconstruction methods. The recent work by Gamiz-Arco et al. [Biochem J (2019) 476, 3631–3647] answers fundamental questions on how specific sequence differences can contribute to differences in folding rates between modern and ancient thioredoxins but also among a selected subset of modern thioredoxins. They surprisingly find that rapid unassisted folding, a feature of ancient thioredoxins, is not conserved in the modern descendants suggestive of co-evolution of better folding machinery that likely enabled the accumulation of mutations that slow-down folding. The work thus provides an interesting take on the expected folding-stability-function constraint while arguing for additional factors that contribute to sequence evolution and hence impact folding efficiency.
Collapse
|
22
|
Campitelli P, Modi T, Kumar S, Ozkan SB. The Role of Conformational Dynamics and Allostery in Modulating Protein Evolution. Annu Rev Biophys 2020; 49:267-288. [PMID: 32075411 DOI: 10.1146/annurev-biophys-052118-115517] [Citation(s) in RCA: 89] [Impact Index Per Article: 22.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Advances in sequencing techniques and statistical methods have made it possible not only to predict sequences of ancestral proteins but also to identify thousands of mutations in the human exome, some of which are disease associated. These developments have motivated numerous theories and raised many questions regarding the fundamental principles behind protein evolution, which have been traditionally investigated horizontally using the tip of the phylogenetic tree through comparative studies of extant proteins within a family. In this article, we review a vertical comparison of the modern and resurrected ancestral proteins. We focus mainly on the dynamical properties responsible for a protein's ability to adapt new functions in response to environmental changes. Using the Dynamic Flexibility Index and the Dynamic Coupling Index to quantify the relative flexibility and dynamic coupling at a site-specific, single-amino-acid level, we provide evidence that the migration of hinges, which are often functionally critical rigid sites, is a mechanism through which proteins can rapidly evolve. Additionally, we show that disease-associated mutations in proteins often result in flexibility changes even at positions distal from mutational sites, particularly in the modulation of active site dynamics.
Collapse
Affiliation(s)
- Paul Campitelli
- Center for Biological Physics, Department of Physics, Arizona State University, Tempe, Arizona 85281, USA; , ,
| | - Tushar Modi
- Center for Biological Physics, Department of Physics, Arizona State University, Tempe, Arizona 85281, USA; , ,
| | - Sudhir Kumar
- Institute for Genomics and Evolutionary Medicine, Temple University, Philadelphia, Pennsylvania 19122, USA; .,Department of Biology, Temple University, Philadelphia, Pennsylvania 19122, USA.,Center for Excellence in Genome Medicine and Research, King Abdulaziz University, Jeddah 21589, Saudi Arabia
| | - S Banu Ozkan
- Center for Biological Physics, Department of Physics, Arizona State University, Tempe, Arizona 85281, USA; , ,
| |
Collapse
|
23
|
Arai S, Shibazaki C, Shimizu R, Adachi M, Ishibashi M, Tokunaga H, Tokunaga M. Catalytic mechanism and evolutionary characteristics of thioredoxin from Halobacterium salinarum NRC-1. Acta Crystallogr D Struct Biol 2020; 76:73-84. [PMID: 31909745 DOI: 10.1107/s2059798319015894] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2019] [Accepted: 11/25/2019] [Indexed: 01/08/2023] Open
Abstract
Thioredoxin (TRX) is an important antioxidant against oxidative stress. TRX from the extremely halophilic archaeon Halobacterium salinarum NRC-1 (HsTRX-A), which has the highest acidic residue content [(Asp + Glu)/(Arg + Lys + His) = 9.0] among known TRXs, was chosen to elucidate the catalytic mechanism and evolutionary characteristics associated with haloadaptation. X-ray crystallographic analysis revealed that the main-chain structure of HsTRX-A is similar to those of homologous TRXs; for example, the root-mean-square deviations on Cα atoms were <2.3 Å for extant archaeal TRXs and <1.5 Å for resurrected Precambrian TRXs. A unique water network was located near the active-site residues (Cys45 and Cys48) in HsTRX-A, which may enhance the proton transfer required for the reduction of substrates under a high-salt environment. The high density of negative charges on the molecular surface (3.6 × 10-3 e Å-2) should improve the solubility and haloadaptivity. Moreover, circular-dichroism measurements and enzymatic assays using a mutant HsTRX-A with deletion of the long flexible N-terminal region (Ala2-Pro17) revealed that Ala2-Pro17 improves the structural stability and the enzymatic activity of HsTRX-A under high-salt environments (>2 M NaCl). The elongation of the N-terminal region in HsTRX-A accompanies the increased hydrophilicity and acidic residue content but does not affect the structure of the active site. These observations offer insights into molecular evolution for haloadaptation and potential applications in halophilic protein-related biotechnology.
Collapse
Affiliation(s)
- Shigeki Arai
- Institute for Quantum Life Science, National Institutes for Quantum and Radiological Science and Technology, 2-4 Shirakata, Tokai, Ibaraki 319-1106, Japan
| | - Chie Shibazaki
- Institute for Quantum Life Science, National Institutes for Quantum and Radiological Science and Technology, 2-4 Shirakata, Tokai, Ibaraki 319-1106, Japan
| | - Rumi Shimizu
- Institute for Quantum Life Science, National Institutes for Quantum and Radiological Science and Technology, 2-4 Shirakata, Tokai, Ibaraki 319-1106, Japan
| | - Motoyasu Adachi
- Institute for Quantum Life Science, National Institutes for Quantum and Radiological Science and Technology, 2-4 Shirakata, Tokai, Ibaraki 319-1106, Japan
| | - Matsujiro Ishibashi
- Applied and Molecular Microbiology, Faculty of Agriculture, Kagoshima University, 1-21-24 Korimoto, Kagoshima 890-0065, Japan
| | - Hiroko Tokunaga
- Applied and Molecular Microbiology, Faculty of Agriculture, Kagoshima University, 1-21-24 Korimoto, Kagoshima 890-0065, Japan
| | - Masao Tokunaga
- Applied and Molecular Microbiology, Faculty of Agriculture, Kagoshima University, 1-21-24 Korimoto, Kagoshima 890-0065, Japan
| |
Collapse
|
24
|
Pollack JD, Gerard D, Makhatadze GI, Pearl DK. Evolutionary conservation and structural localizations suggest a physical trace of metabolism’s progressive geochronological emergence. J Biomol Struct Dyn 2019; 38:3700-3719. [DOI: 10.1080/07391102.2019.1679666] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Affiliation(s)
- J. Dennis Pollack
- Department of Molecular Virology, Immunology and Medical Genetics, College of Medicine, The Ohio State University, Columbus, Ohio, USA
| | - David Gerard
- Department of Mathematics and Statistics, American University, Washington, DC, USA
| | - George I. Makhatadze
- Department of Biological Sciences, Rensselaer Polytechnic Institute, Troy, New York, USA
| | - Dennis K. Pearl
- Department of Statistics, Penn State University, University Park, Pennsylvania, USA
| |
Collapse
|
25
|
Thomas A, Cutlan R, Finnigan W, van der Giezen M, Harmer N. Highly thermostable carboxylic acid reductases generated by ancestral sequence reconstruction. Commun Biol 2019; 2:429. [PMID: 31799431 PMCID: PMC6874671 DOI: 10.1038/s42003-019-0677-y] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2019] [Accepted: 11/04/2019] [Indexed: 12/19/2022] Open
Abstract
Carboxylic acid reductases (CARs) are biocatalysts of industrial importance. Their properties, especially their poor stability, render them sub-optimal for use in a bioindustrial pipeline. Here, we employed ancestral sequence reconstruction (ASR) - a burgeoning engineering tool that can identify stabilizing but enzymatically neutral mutations throughout a protein. We used a three-algorithm approach to reconstruct functional ancestors of the Mycobacterial and Nocardial CAR1 orthologues. Ancestral CARs (AncCARs) were confirmed to be CAR enzymes with a preference for aromatic carboxylic acids. Ancestors also showed varied tolerances to solvents, pH and in vivo-like salt concentrations. Compared to well-studied extant CARs, AncCARs had a Tm up to 35 °C higher, with half-lives up to nine times longer than the greatest previously observed. Using ancestral reconstruction we have expanded the existing CAR toolbox with three new thermostable CAR enzymes, providing access to the high temperature biosynthesis of aldehydes to drive new applications in biocatalysis.
Collapse
Affiliation(s)
- Adam Thomas
- Living Systems Institute, Stocker Road, Exeter, EX4 4QD UK
- Present Address: Department of Biosciences, Geoffrey Pope Building, Stocker Road, Exeter, EX4 4QD UK
| | - Rhys Cutlan
- Living Systems Institute, Stocker Road, Exeter, EX4 4QD UK
- Present Address: Department of Biosciences, Geoffrey Pope Building, Stocker Road, Exeter, EX4 4QD UK
| | - William Finnigan
- Present Address: Department of Biosciences, Geoffrey Pope Building, Stocker Road, Exeter, EX4 4QD UK
| | - Mark van der Giezen
- Present Address: Department of Biosciences, Geoffrey Pope Building, Stocker Road, Exeter, EX4 4QD UK
- Centre for Organelle Research, University of Stavanger, Richard Johnsens gate 4, Stavanger, 4021 Norway
| | - Nicholas Harmer
- Living Systems Institute, Stocker Road, Exeter, EX4 4QD UK
- Present Address: Department of Biosciences, Geoffrey Pope Building, Stocker Road, Exeter, EX4 4QD UK
| |
Collapse
|
26
|
Tomala K, Zrebiec P, Hartl DL. Limits to Compensatory Mutations: Insights from Temperature-Sensitive Alleles. Mol Biol Evol 2019; 36:1874-1883. [PMID: 31058959 PMCID: PMC6735812 DOI: 10.1093/molbev/msz110] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Previous experiments with temperature-sensitive mutants of the yeast enzyme orotidine 5'-phosphate decarboxylase (encoded in gene URA3) yielded the unexpected result that reversion occurs only through exact reversal of the original mutation (Jakubowska A, Korona R. 2009. Lack of evolutionary conservation at positions important for thermal stability in the yeast ODCase protein. Mol Biol Evol. 26(7):1431-1434.). We recreated a set of these mutations in which the codon had two nucleotide substitutions, making exact reversion much less likely. We screened these double mutants for reversion and obtained a number of compensatory mutations occurring at alternative sites in the molecule. None of these compensatory mutations fully restored protein performance. The mechanism of partial compensation is consistent with a model in which protein stabilization is additive, as the same secondary mutations can compensate different primary alternations. The distance between primary and compensatory residues precludes direct interaction between the sites. Instead, most of the compensatory mutants were clustered in proximity to the catalytic center. All of the second-site compensatory substitutions occurred at relatively conserved sites, and the amino acid replacements were to residues found at these sites in a multispecies alignment of the protein. Based on the estimated distribution of changes in Gibbs free energy among a large number of amino acid replacements, we estimate that, for most proteins, the probability that a second-site mutation would have a sufficiently large stabilizing effect to offset a temperature-sensitive mutation in the order of 10-4 or less. Hence compensation is likely to take place only for slightly destabilizing mutations because highly stabilizing mutations are exceeding rare.
Collapse
Affiliation(s)
- Katarzyna Tomala
- Institute of Environmental Sciences, Faculty of Biology, Jagiellonian University, Krakow, Poland
| | - Piotr Zrebiec
- Institute of Environmental Sciences, Faculty of Biology, Jagiellonian University, Krakow, Poland
| | - Daniel L Hartl
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA
| |
Collapse
|
27
|
Ferrada E. Gene Families, Epistasis and the Amino Acid Preferences of Protein Homologs. Evol Bioinform Online 2019; 15:1176934319870485. [PMID: 31452598 PMCID: PMC6698995 DOI: 10.1177/1176934319870485] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2019] [Accepted: 07/27/2019] [Indexed: 11/16/2022] Open
Abstract
In order to preserve structure and function, proteins tend to preferentially conserve amino acids at particular sites along the sequence. Because mutations can affect structure and function, the question arises whether the preference of a protein site for a particular amino acid varies between protein homologs, and to what extent that variation depends on sequence divergence. Answering these questions can help in the development of models of sequence evolution, as well as provide insights on the dependence of the fitness effects of mutations on the genetic background of sequences, a phenomenon known as epistasis. Here, I comment on recent computational work providing a systematic analysis of the extent to which the amino acid preferences of proteins depend on the background mutations of protein homologs.
Collapse
Affiliation(s)
- Evandro Ferrada
- Center for Genomics and Bioinformatics, Faculty of Science, Universidad Mayor, Santiago, Chile
| |
Collapse
|
28
|
Ferrada E. The Site-Specific Amino Acid Preferences of Homologous Proteins Depend on Sequence Divergence. Genome Biol Evol 2019; 11:121-135. [PMID: 30496400 PMCID: PMC6326188 DOI: 10.1093/gbe/evy261] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/26/2018] [Indexed: 12/20/2022] Open
Abstract
The propensity of protein sites to be occupied by any of the 20 amino acids is known as site-specific amino acid preferences (SSAP). Under the assumption that SSAP are conserved among homologs, they can be used to parameterize evolutionary models for the reconstruction of accurate phylogenetic trees. However, simulations and experimental studies have not been able to fully assess the relative conservation of SSAP as a function of sequence divergence between protein homologs. Here, we implement a computational procedure to predict the SSAP of proteins based on the effect of changes in thermodynamic stability upon mutation. An advantage of this computational approach is that it allows us to interrogate a large and unbiased sample of homologous proteins, over the entire spectrum of sequence divergence, and under selection for the same molecular trait. We show that computational predictions have reproducibilities that resemble those obtained in experimental replicates, and can largely recapitulate the SSAP observed in a large-scale mutagenesis experiment. Our results support recent experimental reports on the conservation of SSAP of related homologs, with a slowly increasing fraction of up to 15% of different sites at sequence distances lower than 40%. However, even under the sole contribution of thermodynamic stability, our conservative approach identifies up to 30% of significant different sites between divergent homologs. We show that this relation holds for homologs of diverse sizes and structural classes. Analyses of residue contact networks suggest that an important determinant of these differences is the increasing accumulation of structural deviations that results from sequence divergence.
Collapse
Affiliation(s)
- Evandro Ferrada
- Center for Genomics and Bioinformatics, Faculty of Science, Universidad Mayor, Camino La Pirámide 5750, Huechuraba, 8580745, Santiago, Chile
| |
Collapse
|
29
|
Posfai A, Zhou J, Plotkin JB, Kinney JB, McCandlish DM. Selection for Protein Stability Enriches for Epistatic Interactions. Genes (Basel) 2018; 9:E423. [PMID: 30134605 PMCID: PMC6162820 DOI: 10.3390/genes9090423] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2018] [Revised: 07/30/2018] [Accepted: 08/14/2018] [Indexed: 12/15/2022] Open
Abstract
A now classical argument for the marginal thermodynamic stability of proteins explains the distribution of observed protein stabilities as a consequence of an entropic pull in protein sequence space. In particular, most sequences that are sufficiently stable to fold will have stabilities near the folding threshold. Here, we extend this argument to consider its predictions for epistatic interactions for the effects of mutations on the free energy of folding. Although there is abundant evidence to indicate that the effects of mutations on the free energy of folding are nearly additive and conserved over evolutionary time, we show that these observations are compatible with the hypothesis that a non-additive contribution to the folding free energy is essential for observed proteins to maintain their native structure. In particular, through both simulations and analytical results, we show that even very small departures from additivity are sufficient to drive this effect.
Collapse
Affiliation(s)
- Anna Posfai
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA.
| | - Juannan Zhou
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA.
| | - Joshua B Plotkin
- Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA.
| | - Justin B Kinney
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA.
| | - David M McCandlish
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA.
| |
Collapse
|
30
|
Abstract
Genotype-phenotype relationships are notoriously complicated. Idiosyncratic interactions between specific combinations of mutations occur and are difficult to predict. Yet it is increasingly clear that many interactions can be understood in terms of global epistasis. That is, mutations may act additively on some underlying, unobserved trait, and this trait is then transformed via a nonlinear function to the observed phenotype as a result of subsequent biophysical and cellular processes. Here we infer the shape of such global epistasis in three proteins, based on published high-throughput mutagenesis data. To do so, we develop a maximum-likelihood inference procedure using a flexible family of monotonic nonlinear functions spanned by an I-spline basis. Our analysis uncovers dramatic nonlinearities in all three proteins; in some proteins a model with global epistasis accounts for virtually all of the measured variation, whereas in others we find substantial local epistasis as well. This method allows us to test hypotheses about the form of global epistasis and to distinguish variance components attributable to global epistasis, local epistasis, and measurement error.
Collapse
|
31
|
Petrović D, Risso VA, Kamerlin SCL, Sanchez-Ruiz JM. Conformational dynamics and enzyme evolution. J R Soc Interface 2018; 15:20180330. [PMID: 30021929 PMCID: PMC6073641 DOI: 10.1098/rsif.2018.0330] [Citation(s) in RCA: 115] [Impact Index Per Article: 19.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2018] [Accepted: 06/27/2018] [Indexed: 12/21/2022] Open
Abstract
Enzymes are dynamic entities, and their dynamic properties are clearly linked to their biological function. It follows that dynamics ought to play an essential role in enzyme evolution. Indeed, a link between conformational diversity and the emergence of new enzyme functionalities has been recognized for many years. However, it is only recently that state-of-the-art computational and experimental approaches are revealing the crucial molecular details of this link. Specifically, evolutionary trajectories leading to functional optimization for a given host environment or to the emergence of a new function typically involve enriching catalytically competent conformations and/or the freezing out of non-competent conformations of an enzyme. In some cases, these evolutionary changes are achieved through distant mutations that shift the protein ensemble towards productive conformations. Multifunctional intermediates in evolutionary trajectories are probably multi-conformational, i.e. able to switch between different overall conformations, each competent for a given function. Conformational diversity can assist the emergence of a completely new active site through a single mutation by facilitating transition-state binding. We propose that this mechanism may have played a role in the emergence of enzymes at the primordial, progenote stage, where it was plausibly promoted by high environmental temperatures and the possibility of additional phenotypic mutations.
Collapse
Affiliation(s)
- Dušan Petrović
- Department of Chemistry, BMC, Uppsala University, Box 576, 751 23 Uppsala, Sweden
| | - Valeria A Risso
- Departamento de Quimica Fisica, Facultad de Ciencias, University of Granada, 18071 Granada, Spain
| | | | - Jose M Sanchez-Ruiz
- Departamento de Quimica Fisica, Facultad de Ciencias, University of Granada, 18071 Granada, Spain
| |
Collapse
|
32
|
Modi T, Huihui J, Ghosh K, Ozkan SB. Ancient thioredoxins evolved to modern-day stability-function requirement by altering native state ensemble. Philos Trans R Soc Lond B Biol Sci 2018; 373:20170184. [PMID: 29735738 PMCID: PMC5941179 DOI: 10.1098/rstb.2017.0184] [Citation(s) in RCA: 32] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/20/2018] [Indexed: 02/06/2023] Open
Abstract
Thioredoxins (THRXs)-small globular proteins that reduce other proteins-are ubiquitous in all forms of life, from Archaea to mammals. Although ancestral thioredoxins share sequential and structural similarity with the modern-day (extant) homologues, they exhibit significantly different functional activity and stability. We investigate this puzzle by comparative studies of their (ancient and modern-day THRXs') native state ensemble, as quantified by the dynamic flexibility index (DFI), a metric for the relative resilience of an amino acid to perturbations in the rest of the protein. Clustering proteins using DFI profiles strongly resemble an alternative classification scheme based on their activity and stability. The DFI profiles of the extant proteins are substantially different around the α3, α4 helices and catalytic regions. Likewise, allosteric coupling of the active site with the rest of the protein is different between ancient and extant THRXs, possibly explaining the decreased catalytic activity at low pH with evolution. At a global level, we note that the population of low-flexibility (called hinges) and high-flexibility sites increases with evolution. The heterogeneity (quantified by the variance) in DFI distribution increases with the decrease in the melting temperature typically associated with the evolution of ancient proteins to their modern-day counterparts.This article is part of a discussion meeting issue 'Allostery and molecular machines'.
Collapse
Affiliation(s)
- Tushar Modi
- Department of Physics and Center for Biological Physics, Arizona State University, Tempe, AZ 85281, USA
| | - Jonathan Huihui
- Department of Physics and Astronomy, University of Denver, Denver, CO 80209, USA
| | - Kingshuk Ghosh
- Department of Physics and Astronomy, University of Denver, Denver, CO 80209, USA
| | - S Banu Ozkan
- Department of Physics and Center for Biological Physics, Arizona State University, Tempe, AZ 85281, USA
| |
Collapse
|
33
|
Storz JF. Compensatory mutations and epistasis for protein function. Curr Opin Struct Biol 2018; 50:18-25. [PMID: 29100081 PMCID: PMC5936477 DOI: 10.1016/j.sbi.2017.10.009] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2017] [Revised: 10/05/2017] [Accepted: 10/12/2017] [Indexed: 01/09/2023]
Abstract
Adaptive protein evolution may be facilitated by neutral amino acid mutations that confer no benefit when they first arise but which potentiate subsequent function-altering mutations via direct or indirect structural mechanisms. Theoretical and empirical results indicate that such compensatory interactions (intramolecular epistasis) can exert a strong influence on trajectories of protein evolution. For this reason, assessing the form and prevalence of intramolecular epistasis and characterizing biophysical mechanisms of compensatory interaction are important research goals at the nexus of structural biology and molecular evolution. Here I review recent insights derived from protein-engineering studies, and I describe an approach for identifying and characterizing mechanisms of epistasis that integrates experimental data on structure-function relationships with analyses of comparative sequence data.
Collapse
Affiliation(s)
- Jay F Storz
- University of Nebraska, School of Biological Sciences, Lincoln, NE 68588-0114, United States.
| |
Collapse
|
34
|
Risso VA, Sanchez-Ruiz JM, Ozkan SB. Biotechnological and protein-engineering implications of ancestral protein resurrection. Curr Opin Struct Biol 2018; 51:106-115. [PMID: 29660672 DOI: 10.1016/j.sbi.2018.02.007] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2018] [Revised: 02/18/2018] [Accepted: 02/20/2018] [Indexed: 10/17/2022]
Abstract
Approximations to the sequences of ancestral proteins can be derived from the sequences of their modern descendants. Proteins encoded by such reconstructed sequences can be prepared in the laboratory and subjected to experimental scrutiny. These 'resurrected' ancestral proteins often display remarkable properties, reflecting ancestral adaptations to intra-cellular and extra-cellular environments that differed from the environments hosting modern/extant proteins. Recent experimental and computational work has specifically discussed high stability, substrate and catalytic promiscuity, conformational flexibility/diversity and altered patterns of interaction with other sub-cellular components. In this review, we discuss these remarkable properties as well as recent attempts to explore their biotechnological and protein-engineering potential.
Collapse
Affiliation(s)
- Valeria A Risso
- Departamento de Quimica Fisica, Facultad de Ciencias, University of Granada, 18071 Granada, Spain
| | - Jose M Sanchez-Ruiz
- Departamento de Quimica Fisica, Facultad de Ciencias, University of Granada, 18071 Granada, Spain.
| | - S Banu Ozkan
- Department of Physics and Center for Biological Physics, Arizona State University, Tempe, AZ 85281, United States.
| |
Collapse
|
35
|
Pervasive contingency and entrenchment in a billion years of Hsp90 evolution. Proc Natl Acad Sci U S A 2018; 115:4453-4458. [PMID: 29626131 DOI: 10.1073/pnas.1718133115] [Citation(s) in RCA: 60] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open
Abstract
Interactions among mutations within a protein have the potential to make molecular evolution contingent and irreversible, but the extent to which epistasis actually shaped historical evolutionary trajectories is unclear. To address this question, we experimentally measured how the fitness effects of historical sequence substitutions changed during the billion-year evolutionary history of the heat shock protein 90 (Hsp90) ATPase domain beginning from a deep eukaryotic ancestor to modern Saccharomyces cerevisiae We found a pervasive influence of epistasis. Of 98 derived amino acid states that evolved along this lineage, about half compromise fitness when introduced into the reconstructed ancestral Hsp90. And the vast majority of ancestral states reduce fitness when introduced into the extant S. cerevisiae Hsp90. Overall, more than 75% of historical substitutions were contingent on permissive substitutions that rendered the derived state nondeleterious, became entrenched by subsequent restrictive substitutions that made the ancestral state deleterious, or both. This epistasis was primarily caused by specific interactions among sites rather than a general effect on the protein's tolerance to mutation. Our results show that epistasis continually opened and closed windows of mutational opportunity over evolutionary timescales, producing histories and biological states that reflect the transient internal constraints imposed by the protein's fleeting sequence states.
Collapse
|
36
|
Haddox HK, Dingens AS, Hilton SK, Overbaugh J, Bloom JD. Mapping mutational effects along the evolutionary landscape of HIV envelope. eLife 2018; 7:34420. [PMID: 29590010 PMCID: PMC5910023 DOI: 10.7554/elife.34420] [Citation(s) in RCA: 75] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2017] [Accepted: 03/15/2018] [Indexed: 01/04/2023] Open
Abstract
The immediate evolutionary space accessible to HIV is largely determined by how single amino acid mutations affect fitness. These mutational effects can shift as the virus evolves. However, the prevalence of such shifts in mutational effects remains unclear. Here, we quantify the effects on viral growth of all amino acid mutations to two HIV envelope (Env) proteins that differ at >100 residues. Most mutations similarly affect both Envs, but the amino acid preferences of a minority of sites have clearly shifted. These shifted sites usually prefer a specific amino acid in one Env, but tolerate many amino acids in the other. Surprisingly, shifts are only slightly enriched at sites that have substituted between the Envs—and many occur at residues that do not even contact substitutions. Therefore, long-range epistasis can unpredictably shift Env’s mutational tolerance during HIV evolution, although the amino acid preferences of most sites are conserved between moderately diverged viral strains. The virus that causes AIDS, or HIV, has a protein called Env on its surface, which is essential for the virus to infect cells. Env can also be recognized by the immune system, which then targets the virus for destruction or blocks it from infecting cells. Unfortunately, Env evolves very quickly, which means that HIV can evade our defenses. However, there are limits to how much this protein can change, since it still needs to perform its essential role in helping viruses enter cells. In the century since HIV first appeared in human populations, the virus has evolved considerably. There are now many HIV strains that infect people, and they bear Env proteins with substantially different sequences. However, it is not clear if these changes in sequence have resulted in Envs from distinct strains being able to tolerate different mutations. To examine this question, Haddox et al. compared how the Envs from two strains of HIV react to modifications in their sequences. They created all possible individual mutations in the proteins, and the resulting collections of mutated viruses were then tested for their ability to infect cells in the laboratory. Most mutations had similar effects in both Env proteins. This allowed Haddox et al. to identify portions of the protein that easily accommodate changes, and portions that must remain unchanged for viruses to remain infectious—at least in the laboratory. Some of these mutations are under different types of pressures when the virus faces the immune system, and those were identified using computational approaches. However, some mutations were tolerated differently by the two Env proteins. Therefore, viral strains differ in how their Env proteins can evolve. The parts of Env that showed differences in mutational tolerance between the strains were not necessarily the parts that differ in sequence. This shows that changes in sequence in one part of the protein can modify how other portions evolve. It remains to be determined whether changes in tolerance to mutations translate into differences in how the virus can escape immunity. This is an important question given that the rapid evolution of Env is a major obstacle to creating a vaccine for HIV.
Collapse
Affiliation(s)
- Hugh K Haddox
- Basic Sciences Division and Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, United States.,Molecular and Cellular Biology PhD program, University of Washington, Seattle, United States
| | - Adam S Dingens
- Basic Sciences Division and Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, United States.,Molecular and Cellular Biology PhD program, University of Washington, Seattle, United States
| | - Sarah K Hilton
- Basic Sciences Division and Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, United States.,Department of Genome Sciences, University of Washington, Seattle, United States
| | - Julie Overbaugh
- Human Biology Division, Fred Hutchinson Cancer Research Center, Seattle, United States.,Epidemiology Program, Fred Hutchinson Cancer Research Center, Seattle, United States
| | - Jesse D Bloom
- Basic Sciences Division and Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, United States.,Department of Genome Sciences, University of Washington, Seattle, United States
| |
Collapse
|
37
|
Cortés Cabrera Á, Sánchez-Murcia PA, Gago F. Making sense of the past: hyperstability of ancestral thioredoxins explained by free energy simulations. Phys Chem Chem Phys 2018; 19:23239-23246. [PMID: 28825743 DOI: 10.1039/c7cp03659k] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Thioredoxin (Trx), a small and globular protein, is present in all kinds of organisms, from Archea to higher mammals. Throughout evolution, the Trx sequence has undergone subtle modifications to adapt to varying environmental conditions. The high degree of sequence conservation makes Trx very amenable to ancestral protein reconstruction techniques. In this work, we address the study of the structural and energetic determinants of thermostability in E. coli Trx using a dataset of mutations inspired by ancestral reconstruction. We compute, from first principles, the expected contribution of 19 different amino acid substitutions to the stability (ΔΔG) and the melting temperature (ΔTm) of the protein. We also describe the specific changes in structure and protein dynamics responsible for the stabilizing or destabilizing effects of these mutations. Our results point to local and independent changes for most of the variants. Our predictions are accurate enough to substantiate the proposal of new hypotheses regarding evolutionary relationships between mutations, as in the case of T89R, P68A and G74S or K90L and F102A, and reach beyond the initial set to suggest improved variants, such as K90I or K90Y.
Collapse
Affiliation(s)
- Álvaro Cortés Cabrera
- Área de Farmacología, Departamento de Ciencias Biomédicas, Facultad de Medicina y Ciencias de la Salud, Universidad de Alcalá, E-28805 Alcalá de Henares, Madrid, Spain.
| | - Pedro A Sánchez-Murcia
- Área de Farmacología, Departamento de Ciencias Biomédicas, Facultad de Medicina y Ciencias de la Salud, Universidad de Alcalá, E-28805 Alcalá de Henares, Madrid, Spain.
| | - Federico Gago
- Área de Farmacología, Departamento de Ciencias Biomédicas, Facultad de Medicina y Ciencias de la Salud, Universidad de Alcalá, E-28805 Alcalá de Henares, Madrid, Spain.
| |
Collapse
|
38
|
Broadly neutralizing antiviral responses induced by a single-molecule HPV vaccine based on thermostable thioredoxin-L2 multiepitope nanoparticles. Sci Rep 2017; 7:18000. [PMID: 29269879 PMCID: PMC5740060 DOI: 10.1038/s41598-017-18177-1] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2017] [Accepted: 12/04/2017] [Indexed: 12/22/2022] Open
Abstract
Vaccines targeting the human papillomavirus (HPV) minor capsid protein L2 are emerging as chemico-physically robust and broadly protective alternatives to the current HPV (L1-VLP) vaccines. We have previously developed a trivalent L2 vaccine prototype exploiting Pyrococcus furiosus thioredoxin (PfTrx) as a thermostable scaffold for the separate presentation of three distinct HPV L2(20–38) epitopes. With the aim of achieving a highly immunogenic, yet simpler and more GMP-production affordable formulation, we report here on a novel thermostable nanoparticle vaccine relying on genetic fusion of PfTrx-L2 with the heptamerizing coiled-coil polypeptide OVX313. A prototype HPV16 monoepitope version of this nanoparticle vaccine (PfTrx-L2-OVX313; median radius: 8.6 ± 1.0 nm) proved to be approximately 10-fold more immunogenic and with a strikingly enhanced cross-neutralization capacity compared to its monomeric counterpart. Vaccine-induced (cross-)neutralizing responses were further potentiated in a multiepitope derivative displaying eight different L2(20–38) epitopes, which elicited neutralizing antibodies against 10 different HPVs including three viral types not represented in the vaccine. Considering the prospective safety of the PfTrx scaffold and of the OVX313 heptamerization module, PfTrx-OVX313 nanoparticles lend themselves as robust L2-based immunogens with a high translational potential as a 3rd generation HPV vaccine, but also as a novel and extremely versatile peptide-antigen presentation platform.
Collapse
|
39
|
Medina-Carmona E, Fuchs JE, Gavira JA, Mesa-Torres N, Neira JL, Salido E, Palomino-Morales R, Burgos M, Timson DJ, Pey AL. Enhanced vulnerability of human proteins towards disease-associated inactivation through divergent evolution. Hum Mol Genet 2017; 26:3531-3544. [DOI: 10.1093/hmg/ddx238] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2017] [Accepted: 06/14/2017] [Indexed: 12/16/2022] Open
|
40
|
Exploring the past and the future of protein evolution with ancestral sequence reconstruction: the 'retro' approach to protein engineering. Biochem J 2017; 474:1-19. [PMID: 28008088 DOI: 10.1042/bcj20160507] [Citation(s) in RCA: 84] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2016] [Revised: 11/07/2016] [Accepted: 11/10/2016] [Indexed: 12/22/2022]
Abstract
A central goal in molecular evolution is to understand the ways in which genes and proteins evolve in response to changing environments. In the absence of intact DNA from fossils, ancestral sequence reconstruction (ASR) can be used to infer the evolutionary precursors of extant proteins. To date, ancestral proteins belonging to eubacteria, archaea, yeast and vertebrates have been inferred that have been hypothesized to date from between several million to over 3 billion years ago. ASR has yielded insights into the early history of life on Earth and the evolution of proteins and macromolecular complexes. Recently, however, ASR has developed from a tool for testing hypotheses about protein evolution to a useful means for designing novel proteins. The strength of this approach lies in the ability to infer ancestral sequences encoding proteins that have desirable properties compared with contemporary forms, particularly thermostability and broad substrate range, making them good starting points for laboratory evolution. Developments in technologies for DNA sequencing and synthesis and computational phylogenetic analysis have led to an escalation in the number of ancient proteins resurrected in the last decade and greatly facilitated the use of ASR in the burgeoning field of synthetic biology. However, the primary challenge of ASR remains in accurately inferring ancestral states, despite the uncertainty arising from evolutionary models, incomplete sequences and limited phylogenetic trees. This review will focus, firstly, on the use of ASR to uncover links between sequence and phenotype and, secondly, on the practical application of ASR in protein engineering.
Collapse
|
41
|
Fast folding and slow unfolding of a resurrected Precambrian protein. Proc Natl Acad Sci U S A 2017; 114:E4122-E4123. [PMID: 28512228 DOI: 10.1073/pnas.1703227114] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
|
42
|
Chan YH, Venev SV, Zeldovich KB, Matthews CR. Correlation of fitness landscapes from three orthologous TIM barrels originates from sequence and structure constraints. Nat Commun 2017; 8:14614. [PMID: 28262665 PMCID: PMC5343507 DOI: 10.1038/ncomms14614] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2016] [Accepted: 01/11/2017] [Indexed: 02/07/2023] Open
Abstract
Sequence divergence of orthologous proteins enables adaptation to environmental stresses and promotes evolution of novel functions. Limits on evolution imposed by constraints on sequence and structure were explored using a model TIM barrel protein, indole-3-glycerol phosphate synthase (IGPS). Fitness effects of point mutations in three phylogenetically divergent IGPS proteins during adaptation to temperature stress were probed by auxotrophic complementation of yeast with prokaryotic, thermophilic IGPS. Analysis of beneficial mutations pointed to an unexpected, long-range allosteric pathway towards the active site of the protein. Significant correlations between the fitness landscapes of distant orthologues implicate both sequence and structure as primary forces in defining the TIM barrel fitness landscape and suggest that fitness landscapes can be translocated in sequence space. Exploration of fitness landscapes in the context of a protein fold provides a strategy for elucidating the sequence-structure-fitness relationships in other common motifs. The TIM barrel fold is an evolutionarily conserved motif found in proteins with a variety of enzymatic functions. Here the authors explore the fitness landscape of the TIM barrel protein IGPS and uncover evolutionary constraints on both sequence and structure, accompanied by long range allosteric interactions.
Collapse
Affiliation(s)
- Yvonne H Chan
- Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, 364 Plantation Street, Worcester, Massachusetts 01605, USA
| | - Sergey V Venev
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, 368 Plantation Street, Worcester, Massachusetts 01605, USA
| | - Konstantin B Zeldovich
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, 368 Plantation Street, Worcester, Massachusetts 01605, USA
| | - C Robert Matthews
- Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, 364 Plantation Street, Worcester, Massachusetts 01605, USA
| |
Collapse
|
43
|
Bastolla U, Dehouck Y, Echave J. What evolution tells us about protein physics, and protein physics tells us about evolution. Curr Opin Struct Biol 2017; 42:59-66. [DOI: 10.1016/j.sbi.2016.10.020] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2016] [Revised: 10/19/2016] [Accepted: 10/24/2016] [Indexed: 12/21/2022]
|
44
|
Bloom JD. Identification of positive selection in genes is greatly improved by using experimentally informed site-specific models. Biol Direct 2017; 12:1. [PMID: 28095902 PMCID: PMC5240389 DOI: 10.1186/s13062-016-0172-z] [Citation(s) in RCA: 48] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2016] [Accepted: 12/14/2016] [Indexed: 12/23/2022] Open
Abstract
Background Sites of positive selection are identified by comparing observed evolutionary patterns to those expected under a null model for evolution in the absence of such selection. For protein-coding genes, the most common null model is that nonsynonymous and synonymous mutations fix at equal rates; this unrealistic model has limited power to detect many interesting forms of selection. Results I describe a new approach that uses a null model based on experimental measurements of a gene’s site-specific amino-acid preferences generated by deep mutational scanning in the lab. This null model makes it possible to identify both diversifying selection for repeated amino-acid change and differential selection for mutations to amino acids that are unexpected given the measurements made in the lab. I show that this approach identifies sites of adaptive substitutions in four genes (lactamase, Gal4, influenza nucleoprotein, and influenza hemagglutinin) far better than a comparable method that simply compares the rates of nonsynonymous and synonymous substitutions. Conclusions As rapid increases in biological data enable increasingly nuanced descriptions of the constraints on individual protein sites, approaches like the one here can improve our ability to identify many interesting forms of selection in natural sequences. Reviewers This article was reviewed by Sebastian Maurer-Stroh, Olivier Tenaillon, and Tal Pupko. All three reviewers are members of the Biology Direct editorial board. Electronic supplementary material The online version of this article (doi:10.1186/s13062-016-0172-z) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Jesse D Bloom
- Division of Basic Sciences and Computational Biology Program, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave N, Seattle, 98109, WA, USA.
| |
Collapse
|
45
|
Engineering ancestral protein hyperstability. Biochem J 2016; 473:3611-3620. [PMID: 27528732 DOI: 10.1042/bcj20160532] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2016] [Accepted: 08/15/2016] [Indexed: 01/23/2023]
Abstract
Many experimental analyses and proposed scenarios support that ancient life was thermophilic. In congruence with this hypothesis, proteins encoded by reconstructed sequences corresponding to ancient phylogenetic nodes often display very high stability. Here, we show that such 'reconstructed ancestral hyperstability' can be further engineered on the basis of a straightforward approach that uses exclusively information afforded by the ancestral reconstruction process itself. Since evolution does not imply continuous progression, screening of the mutations between two evolutionarily related resurrected ancestral proteins may identify mutations that further stabilize the most stable one. To explore this approach, we have used a resurrected thioredoxin corresponding to the last common ancestor of the cyanobacterial, Deinococcus and Thermus groups (LPBCA thioredoxin), which has a denaturation temperature of ∼123°C. This high value is within the top 0.1% of the denaturation temperatures in the ProTherm database and, therefore, achieving further stabilization appears a priori as a challenging task. Nevertheless, experimental comparison with a resurrected thioredoxin corresponding to the last common ancestor of bacteria (denaturation temperature of ∼115°C) immediately identifies three mutations that increase the denaturation temperature of LPBCA thioredoxin to ∼128°C. Comparison between evolutionarily related resurrected ancestral proteins thus emerges as a simple approach to expand the capability of ancestral reconstruction to search sequence space for extreme protein properties of biotechnological interest. The fact that ancestral sequences for many phylogenetic nodes can be derived from a single alignment of modern sequences should contribute to the general applicability of this approach.
Collapse
|
46
|
Spielman SJ, Wilke CO. Extensively Parameterized Mutation-Selection Models Reliably Capture Site-Specific Selective Constraint. Mol Biol Evol 2016; 33:2990-3002. [PMID: 27512115 DOI: 10.1093/molbev/msw171] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
The mutation-selection model of coding sequence evolution has received renewed attention for its use in estimating site-specific amino acid propensities and selection coefficient distributions. Two computationally tractable mutation-selection inference frameworks have been introduced: One framework employs a fixed-effects, highly parameterized maximum likelihood approach, whereas the other employs a random-effects Bayesian Dirichlet Process approach. While both implementations follow the same model, they appear to make distinct predictions about the distribution of selection coefficients. The fixed-effects framework estimates a large proportion of highly deleterious substitutions, whereas the random-effects framework estimates that all substitutions are either nearly neutral or weakly deleterious. It remains unknown, however, how accurately each method infers evolutionary constraints at individual sites. Indeed, selection coefficient distributions pool all site-specific inferences, thereby obscuring a precise assessment of site-specific estimates. Therefore, in this study, we use a simulation-based strategy to determine how accurately each approach recapitulates the selective constraint at individual sites. We find that the fixed-effects approach, despite its extensive parameterization, consistently and accurately estimates site-specific evolutionary constraint. By contrast, the random-effects Bayesian approach systematically underestimates the strength of natural selection, particularly for slowly evolving sites. We also find that, despite the strong differences between their inferred selection coefficient distributions, the fixed- and random-effects approaches yield surprisingly similar inferences of site-specific selective constraint. We conclude that the fixed-effects mutation-selection framework provides the more reliable software platform for model application and future development.
Collapse
Affiliation(s)
- Stephanie J Spielman
- Department of Integrative Biology, Center for Computational Biology and Bioinformatics, The University of Texas at Austin, Austin, TX Institute for Cellular and Molecular Biology, The University of Texas at Austin, Austin, TX Present address: Institute for Genomics and Evolutionary Medicine, Temple University, Philadelphia, PA
| | - Claus O Wilke
- Department of Integrative Biology, Center for Computational Biology and Bioinformatics, The University of Texas at Austin, Austin, TX Institute for Cellular and Molecular Biology, The University of Texas at Austin, Austin, TX
| |
Collapse
|
47
|
Trudeau DL, Kaltenbach M, Tawfik DS. On the Potential Origins of the High Stability of Reconstructed Ancestral Proteins. Mol Biol Evol 2016; 33:2633-41. [PMID: 27413048 DOI: 10.1093/molbev/msw138] [Citation(s) in RCA: 85] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Ancestral reconstruction provides instrumental insights regarding the biochemical and biophysical characteristics of past proteins. A striking observation relates to the remarkably high thermostability of reconstructed ancestors. The latter has been linked to high environmental temperatures in the Precambrian era, the era relating to most reconstructed proteins. We found that inferred ancestors of the serum paraoxonase (PON) enzyme family, including the mammalian ancestor, exhibit dramatically increased thermostabilities compared with the extant, human enzyme (up to 30 °C higher melting temperature). However, the environmental temperature at the time of emergence of mammals is presumed to be similar to the present one. Additionally, the mammalian PON ancestor has superior folding properties (kinetic stability)-unlike the extant mammalian PONs, it expresses in E. coli in a soluble and functional form, and at a high yield. We discuss two potential origins of this unexpectedly high stability. First, ancestral stability may be overestimated by a "consensus effect," whereby replacing amino acids that are rare in contemporary sequences with the amino acid most common in the family increases protein stability. Comparison to other reconstructed ancestors indicates that the consensus effect may bias some but not all reconstructions. Second, we note that high stability may relate to factors other than high environmental temperature such as oxidative stress or high radiation levels. Foremost, intrinsic factors such as high rates of genetic mutations and/or of transcriptional and translational errors, and less efficient protein quality control systems, may underlie the high kinetic and thermodynamic stability of past proteins.
Collapse
Affiliation(s)
- Devin L Trudeau
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel
| | - Miriam Kaltenbach
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel
| | - Dan S Tawfik
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel
| |
Collapse
|
48
|
Starr TN, Thornton JW. Epistasis in protein evolution. Protein Sci 2016; 25:1204-18. [PMID: 26833806 PMCID: PMC4918427 DOI: 10.1002/pro.2897] [Citation(s) in RCA: 304] [Impact Index Per Article: 38.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2015] [Revised: 01/25/2016] [Accepted: 01/27/2016] [Indexed: 01/18/2023]
Abstract
The structure, function, and evolution of proteins depend on physical and genetic interactions among amino acids. Recent studies have used new strategies to explore the prevalence, biochemical mechanisms, and evolutionary implications of these interactions-called epistasis-within proteins. Here we describe an emerging picture of pervasive epistasis in which the physical and biological effects of mutations change over the course of evolution in a lineage-specific fashion. Epistasis can restrict the trajectories available to an evolving protein or open new paths to sequences and functions that would otherwise have been inaccessible. We describe two broad classes of epistatic interactions, which arise from different physical mechanisms and have different effects on evolutionary processes. Specific epistasis-in which one mutation influences the phenotypic effect of few other mutations-is caused by direct and indirect physical interactions between mutations, which nonadditively change the protein's physical properties, such as conformation, stability, or affinity for ligands. In contrast, nonspecific epistasis describes mutations that modify the effect of many others; these typically behave additively with respect to the physical properties of a protein but exhibit epistasis because of a nonlinear relationship between the physical properties and their biological effects, such as function or fitness. Both types of interaction are rampant, but specific epistasis has stronger effects on the rate and outcomes of evolution, because it imposes stricter constraints and modulates evolutionary potential more dramatically; it therefore makes evolution more contingent on low-probability historical events and leaves stronger marks on the sequences, structures, and functions of protein families.
Collapse
Affiliation(s)
- Tyler N Starr
- Graduate Program in Biochemistry and Molecular Biophysics, University of Chicago, Chicago, Illinois, 60637
| | - Joseph W Thornton
- Departments of Ecology and Evolution and Human Genetics, University of Chicago, Chicago, Illinois, 60637
| |
Collapse
|
49
|
Accurate Measurement of the Effects of All Amino-Acid Mutations on Influenza Hemagglutinin. Viruses 2016; 8:v8060155. [PMID: 27271655 PMCID: PMC4926175 DOI: 10.3390/v8060155] [Citation(s) in RCA: 141] [Impact Index Per Article: 17.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2016] [Revised: 05/21/2016] [Accepted: 05/25/2016] [Indexed: 12/17/2022] Open
Abstract
Influenza genes evolve mostly via point mutations, and so knowing the effect of every amino-acid mutation provides information about evolutionary paths available to the virus. We and others have combined high-throughput mutagenesis with deep sequencing to estimate the effects of large numbers of mutations to influenza genes. However, these measurements have suffered from substantial experimental noise due to a variety of technical problems, the most prominent of which is bottlenecking during the generation of mutant viruses from plasmids. Here we describe advances that ameliorate these problems, enabling us to measure with greatly improved accuracy and reproducibility the effects of all amino-acid mutations to an H1 influenza hemagglutinin on viral replication in cell culture. The largest improvements come from using a helper virus to reduce bottlenecks when generating viruses from plasmids. Our measurements confirm at much higher resolution the results of previous studies suggesting that antigenic sites on the globular head of hemagglutinin are highly tolerant of mutations. We also show that other regions of hemagglutinin—including the stalk epitopes targeted by broadly neutralizing antibodies—have a much lower inherent capacity to tolerate point mutations. The ability to accurately measure the effects of all influenza mutations should enhance efforts to understand and predict viral evolution.
Collapse
|
50
|
Romero-Romero ML, Risso VA, Martinez-Rodriguez S, Gaucher EA, Ibarra-Molero B, Sanchez-Ruiz JM. Selection for Protein Kinetic Stability Connects Denaturation Temperatures to Organismal Temperatures and Provides Clues to Archaean Life. PLoS One 2016; 11:e0156657. [PMID: 27253436 PMCID: PMC4890807 DOI: 10.1371/journal.pone.0156657] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2016] [Accepted: 05/17/2016] [Indexed: 01/24/2023] Open
Abstract
The relationship between the denaturation temperatures of proteins (Tm values) and the living temperatures of their host organisms (environmental temperatures: TENV values) is poorly understood. Since different proteins in the same organism may show widely different Tm’s, no simple universal relationship between Tm and TENV should hold, other than Tm≥TENV. Yet, when analyzing a set of homologous proteins from different hosts, Tm’s are oftentimes found to correlate with TENV’s but this correlation is shifted upward on the Tm axis. Supporting this trend, we recently reported Tm’s for resurrected Precambrian thioredoxins that mirror a proposed environmental cooling over long geological time, while remaining a shocking ~50°C above the proposed ancestral ocean temperatures. Here, we show that natural selection for protein kinetic stability (denaturation rate) can produce a Tm↔TENV correlation with a large upward shift in Tm. A model for protein stability evolution suggests a link between the Tm shift and the in vivo lifetime of a protein and, more specifically, allows us to estimate ancestral environmental temperatures from experimental denaturation rates for resurrected Precambrian thioredoxins. The TENV values thus obtained match the proposed ancestral ocean cooling, support comparatively high Archaean temperatures, and are consistent with a recent proposal for the environmental temperature (above 75°C) that hosted the last universal common ancestor. More generally, this work provides a framework for understanding how features of protein stability reflect the environmental temperatures of the host organisms.
Collapse
Affiliation(s)
- M. Luisa Romero-Romero
- Departamento de Quimica Fisica, Facultad de Ciencias, Universidad de Granada, 18071, Granada, Spain
| | - Valeria A. Risso
- Departamento de Quimica Fisica, Facultad de Ciencias, Universidad de Granada, 18071, Granada, Spain
| | | | - Eric A. Gaucher
- Georgia Institute of Technology, School of Biology, School of Chemistry and Biochemistry, and Parker H. Petit Institute for Bioengineering and Biosciences, Atlanta, Georgia, 30332, United States of America
| | - Beatriz Ibarra-Molero
- Departamento de Quimica Fisica, Facultad de Ciencias, Universidad de Granada, 18071, Granada, Spain
| | - Jose M. Sanchez-Ruiz
- Departamento de Quimica Fisica, Facultad de Ciencias, Universidad de Granada, 18071, Granada, Spain
- * E-mail:
| |
Collapse
|