1
|
Chi LA, Barnes JE, Patel JS, Ytreberg FM. Exploring the ability of the MD+FoldX method to predict SARS-CoV-2 antibody escape mutations using large-scale data. Sci Rep 2024; 14:23122. [PMID: 39366988 PMCID: PMC11452645 DOI: 10.1038/s41598-024-72491-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2024] [Accepted: 09/09/2024] [Indexed: 10/06/2024] Open
Abstract
Antibody escape mutations pose a significant challenge to the effectiveness of vaccines and antibody-based therapies. The ability to predict these escape mutations with computer simulations would allow us to detect threats early and develop effective countermeasures, but a lack of large-scale experimental data has hampered the validation of these calculations. In this study, we evaluate the ability of the MD+FoldX molecular modeling method to predict escape mutations by leveraging a large deep mutational scanning dataset, focusing on the SARS-CoV-2 receptor binding domain. Our results show a positive correlation between predicted and experimental data, indicating that mutations with reduced predicted binding affinity correlate moderately with higher experimental escape fractions. We also demonstrate that higher precision can be achieved using affinity cutoffs tailored to distinct SARS-CoV-2 antibodies from four different classes rather than a one-size-fits-all approach. Further, we suggest that the quartile values of optimized cutoffs reported for each class in this study can serve as a valuable guide for future work on escape mutation predictions. We find that 70% of the systems surpass the 50% precision mark, and demonstrate success in identifying mutations present in significant variants of concern and variants of interest. Despite promising results for some systems, our study highlights the challenges in comparing predicted and experimental values. It also emphasizes the need for new binding affinity methods with improved accuracy that are fast enough to estimate hundreds to thousands of antibody-antigen binding affinities.
Collapse
Affiliation(s)
- L América Chi
- Institute for Modeling Collaboration and Innovation, University of Idaho, Moscow, ID, 83844, USA
| | - Jonathan E Barnes
- Institute for Modeling Collaboration and Innovation, University of Idaho, Moscow, ID, 83844, USA
| | - Jagdish Suresh Patel
- Institute for Modeling Collaboration and Innovation, University of Idaho, Moscow, ID, 83844, USA.
- Department of Chemical and Biological Engineering, University of Idaho, Moscow, ID, 83844, USA.
| | - F Marty Ytreberg
- Institute for Modeling Collaboration and Innovation, University of Idaho, Moscow, ID, 83844, USA.
- Department of Physics, University of Idaho, Moscow, ID, 83844, USA.
| |
Collapse
|
2
|
Chi LA, Barnes JE, Suresh Patel J, Ytreberg FM. Exploring the ability of the MD+FoldX method to predict SARS-CoV-2 antibody escape mutations using large-scale data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.22.595230. [PMID: 38826284 PMCID: PMC11142147 DOI: 10.1101/2024.05.22.595230] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2024]
Abstract
Antibody escape mutations pose a significant challenge to the effectiveness of vaccines and antibody-based therapies. The ability to predict these escape mutations with computer simulations would allow us to detect threats early and develop effective countermeasures, but a lack of large-scale experimental data has hampered the validation of these calculations. In this study, we evaluate the ability of the MD+FoldX molecular modeling method to predict escape mutations by leveraging a large deep mutational scanning dataset, focusing on the SARS-CoV-2 receptor binding domain. Our results show a positive correlation between predicted and experimental data, indicating that mutations with reduced predicted binding affinity correlate moderately with higher experimental escape fractions. We also demonstrate that better performance can be achieved using affinity cutoffs tailored to distinct antibody-antigen interactions rather than a one-size-fits-all approach. We find that 70% of the systems surpass the 50% precision mark, and demonstrate success in identifying mutations present in significant variants of concern and variants of interest. Despite promising results for some systems, our study highlights the challenges in comparing predicted and experimental values. It also emphasizes the need for new binding affinity methods with improved accuracy that are fast enough to estimate hundreds to thousands of antibody-antigen binding affinities.
Collapse
Affiliation(s)
- L. América Chi
- Institute for Modeling Collaboration and Innovation, University of Idaho, Moscow, ID 83843, USA
| | - Jonathan E. Barnes
- Institute for Modeling Collaboration and Innovation, University of Idaho, Moscow, ID 83843, USA
| | - Jagdish Suresh Patel
- Institute for Modeling Collaboration and Innovation, University of Idaho, Moscow, ID 83843, USA
- Department of Chemical and Biological Engineering, University of Idaho, Moscow, ID 83843, USA
| | - F. Marty Ytreberg
- Institute for Modeling Collaboration and Innovation, University of Idaho, Moscow, ID 83843, USA
- Department of Physics, University of Idaho, Moscow, ID 83843, USA
| |
Collapse
|
3
|
Standley M, Blay V, Beleva Guthrie V, Kim J, Lyman A, Moya A, Karchin R, Camps M. Experimental and In Silico Analysis of TEM β-Lactamase Adaptive Evolution. ACS Infect Dis 2022; 8:2451-2463. [PMID: 36377311 PMCID: PMC9745794 DOI: 10.1021/acsinfecdis.2c00216] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Multiple mutations often have non-additive (epistatic) phenotypic effects. Epistasis is of fundamental biological relevance but is not well understood mechanistically. Adaptive evolution, i.e., the evolution of new biochemical activities, is rich in epistatic interactions. To better understand the principles underlying epistasis during genetic adaptation, we studied the evolution of TEM-1 β-lactamase variants exhibiting cefotaxime resistance. We report the collection of a library of 487 observed evolutionary trajectories for TEM-1 and determine the epistasis status based on cefotaxime resistance phenotype for 206 combinations of 2-3 TEM-1 mutations involving 17 positions under adaptive selective pressure. Gain-of-function (GOF) mutations are gatekeepers for adaptation. To see if GOF phenotypes can be inferred based solely on sequence data, we calculated the enrichment of GOF mutations in the different categories of epistatic pairs. Our results suggest that this is possible because GOF mutations are particularly enriched in sign and reciprocal sign epistasis, which leave a major imprint on the sequence space accessible to evolution. We also used FoldX to explore the relationship between thermodynamic stability and epistasis. We found that mutations in observed evolutionary trajectories tend to destabilize the folded structure of the protein, albeit their cumulative effects are consistently below the protein's free energy of folding. The destabilizing effect is stronger for epistatic pairs, suggesting that modest or local alterations in folding stability can modulate catalysis. Finally, we report a significant relationship between epistasis and the degree to which two protein positions are structurally and dynamically coupled, even in the absence of ligand.
Collapse
Affiliation(s)
- Melissa Standley
- Department
of Microbiology and Environmental Toxicology, University of California, Santa
Cruz, California95064, United States
| | - Vincent Blay
- Department
of Microbiology and Environmental Toxicology, University of California, Santa
Cruz, California95064, United States,Institute
for Integrative Systems Biology (I2Sysbio), Universitat de València and Spanish Research Council (CSIC), 46980Valencia, Spain,
| | - Violeta Beleva Guthrie
- Department
of Biomedical Engineering and Institute for Computational Medicine, The Johns Hopkins University, Baltimore, Maryland21218, United States
| | - Jay Kim
- Department
of Microbiology and Environmental Toxicology, University of California, Santa
Cruz, California95064, United States
| | - Audrey Lyman
- Department
of Microbiology and Environmental Toxicology, University of California, Santa
Cruz, California95064, United States
| | - Andrés Moya
- Institute
for Integrative Systems Biology (I2Sysbio), Universitat de València and Spanish Research Council (CSIC), 46980Valencia, Spain,Foundation
for the Promotion of Sanitary and Biomedical Research of Valencia
Region (FISABIO), 46021Valencia, Spain,CIBER
in Epidemiology and Public Health (CIBEResp), 28029Madrid, Spain
| | - Rachel Karchin
- Department
of Biomedical Engineering and Institute for Computational Medicine, The Johns Hopkins University, Baltimore, Maryland21218, United States
| | - Manel Camps
- Department
of Microbiology and Environmental Toxicology, University of California, Santa
Cruz, California95064, United States,
| |
Collapse
|
4
|
Ding D, Green AG, Wang B, Lite TLV, Weinstein EN, Marks DS, Laub MT. Co-evolution of interacting proteins through non-contacting and non-specific mutations. Nat Ecol Evol 2022; 6:590-603. [PMID: 35361892 PMCID: PMC9090974 DOI: 10.1038/s41559-022-01688-0] [Citation(s) in RCA: 22] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2021] [Accepted: 01/31/2022] [Indexed: 01/08/2023]
Abstract
Proteins often accumulate neutral mutations that do not affect current functions but can profoundly influence future mutational possibilities and functions. Understanding such hidden potential has major implications for protein design and evolutionary forecasting but has been limited by a lack of systematic efforts to identify potentiating mutations. Here, through the comprehensive analysis of a bacterial toxin-antitoxin system, we identified all possible single substitutions in the toxin that enable it to tolerate otherwise interface-disrupting mutations in its antitoxin. Strikingly, the majority of enabling mutations in the toxin do not contact and promote tolerance non-specifically to many different antitoxin mutations, despite covariation in homologues occurring primarily between specific pairs of contacting residues across the interface. In addition, the enabling mutations we identified expand future mutational paths that both maintain old toxin-antitoxin interactions and form new ones. These non-specific mutations are missed by widely used covariation and machine learning methods. Identifying such enabling mutations will be critical for ensuring continued binding of therapeutically relevant proteins, such as antibodies, aimed at evolving targets.
Collapse
Affiliation(s)
- David Ding
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA
| | - Anna G Green
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Boyuan Wang
- Department of Pharmacology, UT Southwestern Medical Center, Dallas, TX, USA
| | - Thuy-Lan Vo Lite
- Harvard-MIT Division of Health Sciences and Technology, Harvard Medical School, Boston, MA, USA
| | | | - Debora S Marks
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA
| | - Michael T Laub
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA.
- Howard Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, MA, USA.
| |
Collapse
|
5
|
Akanuma S, Yamaguchi M, Yamagishi A. Comprehensive mutagenesis to identify amino acid residues contributing to the difference in thermostability between two originally thermostable ancestral proteins. PLoS One 2021; 16:e0258821. [PMID: 34673819 PMCID: PMC8530338 DOI: 10.1371/journal.pone.0258821] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2021] [Accepted: 10/05/2021] [Indexed: 11/19/2022] Open
Abstract
Further improvement of the thermostability of inherently thermostable proteins is an attractive challenge because more thermostable proteins are industrially more useful and serve as better scaffolds for protein engineering. To establish guidelines that can be applied for the rational design of hyperthermostable proteins, we compared the amino acid sequences of two ancestral nucleoside diphosphate kinases, Arc1 and Bac1, reconstructed in our previous study. Although Bac1 is a thermostable protein whose unfolding temperature is around 100°C, Arc1 is much more thermostable with an unfolding temperature of 114°C. However, only 12 out of 139 amino acids are different between the two sequences. In this study, one or a combination of amino acid(s) in Bac1 was/were substituted by a residue(s) found in Arc1 at the same position(s). The best mutant, which contained three amino acid substitutions (S108D, G116A and L120P substitutions), showed an unfolding temperature more than 10°C higher than that of Bac1. Furthermore, a combination of the other nine amino acid substitutions also led to improved thermostability of Bac1, although the effects of individual substitutions were small. Therefore, not only the sum of the contributions of individual amino acids, but also the synergistic effects of multiple amino acids are deeply involved in the stability of a hyperthermostable protein. Such insights will be helpful for future rational design of hyperthermostable proteins.
Collapse
Affiliation(s)
- Satoshi Akanuma
- Faculty of Human Sciences, Waseda University, Tokorozawa, Saitama, Japan
- * E-mail:
| | - Minako Yamaguchi
- Department of Applied Life Sciences, Tokyo University of Pharmacy and Life Sciences, Hachioji, Tokyo, Japan
| | - Akihiko Yamagishi
- Department of Applied Life Sciences, Tokyo University of Pharmacy and Life Sciences, Hachioji, Tokyo, Japan
| |
Collapse
|
6
|
Marcos ML, Echave J. The variation among sites of protein structure divergence is shaped by mutation and scaled by selection. Curr Res Struct Biol 2021; 2:156-163. [PMID: 34235475 PMCID: PMC8244499 DOI: 10.1016/j.crstbi.2020.08.002] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2020] [Revised: 07/09/2020] [Accepted: 08/17/2020] [Indexed: 12/30/2022] Open
Abstract
Protein structures do not evolve uniformly, but the degree of structure divergence varies among sites. The resulting site-dependent structure divergence patterns emerge from a process that involves mutation and selection, which may both, in principle, influence the emergent pattern. In contrast with sequence divergence patterns, which are known to be mainly determined by selection, the relative contributions of mutation and selection to structure divergence patterns is unclear. Here, studying 6 protein families with a mechanistic biophysical model of protein evolution, we untangle the effects of mutation and selection. We found that even in the absence of selection, structure divergence varies from site to site because the mutational sensitivity is not uniform. Selection scales the profile, increasing its amplitude, without changing its shape. This scaling effect follows from the similarity between mutational sensitivity and sequence variability profiles. The degree of evolutionary divergence of protein structures varies among sites. A Mutation-Selection model (MSM) of protein structure evolution with selection for stability is developed. Even in the case of no selection, the sensitivity of the structure to random mutations varies among sites. Selection amplifies this variation but it does not affect its shape. This scaling effect of selection follows from the similarity between the selection-independent mutational sensitivity and the selection-dependent sequence divergence, the two contributions that are combined to produce the observed structural divergence profile.
Collapse
Affiliation(s)
- María Laura Marcos
- Instituto de Ciencias Físicas, Escuela de Ciencia y Tecnología, Universidad Nacional de San Martín, Martín de Irigoyen 3100, 1650 San Martín, Buenos Aires, Argentina
| | - Julian Echave
- Instituto de Ciencias Físicas, Escuela de Ciencia y Tecnología, Universidad Nacional de San Martín, Martín de Irigoyen 3100, 1650 San Martín, Buenos Aires, Argentina
| |
Collapse
|
7
|
Escalera-Zamudio M, Golden M, Gutiérrez B, Thézé J, Keown JR, Carrique L, Bowden TA, Pybus OG. Parallel evolution in the emergence of highly pathogenic avian influenza A viruses. Nat Commun 2020; 11:5511. [PMID: 33139731 PMCID: PMC7608645 DOI: 10.1038/s41467-020-19364-x] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2019] [Accepted: 10/12/2020] [Indexed: 01/30/2023] Open
Abstract
Parallel molecular evolution and adaptation are important phenomena commonly observed in viruses. Here, we exploit parallel molecular evolution to understand virulence evolution in avian influenza viruses (AIV). Highly-pathogenic AIVs evolve independently from low-pathogenic ancestors via acquisition of polybasic cleavage sites. Why some AIV lineages but not others evolve in this way is unknown. We hypothesise that the parallel emergence of highly-pathogenic AIV may be facilitated by permissive or compensatory mutations occurring across the viral genome. We combine phylogenetic, statistical and structural approaches to discover parallel mutations in AIV genomes associated with the highly-pathogenic phenotype. Parallel mutations were screened using a statistical test of mutation-phenotype association and further evaluated in the contexts of positive selection and protein structure. Our resulting mutational panel may help to reveal new links between virulence evolution and other traits, and raises the possibility of predicting aspects of AIV evolution.
Collapse
Affiliation(s)
| | - Michael Golden
- Department of Zoology, Oxford University, Parks Rd, Oxford, OX1 3PS, UK
| | | | - Julien Thézé
- Department of Zoology, Oxford University, Parks Rd, Oxford, OX1 3PS, UK
| | - Jeremy Russell Keown
- Division of Structural Biology, Wellcome Centre for Human Genetics, Oxford, OX3 7BN, UK
| | - Loic Carrique
- Division of Structural Biology, Wellcome Centre for Human Genetics, Oxford, OX3 7BN, UK
| | - Thomas A Bowden
- Division of Structural Biology, Wellcome Centre for Human Genetics, Oxford, OX3 7BN, UK
| | - Oliver G Pybus
- Department of Zoology, Oxford University, Parks Rd, Oxford, OX1 3PS, UK.
- Department of Pathobiology and Population Sciences, Royal Veterinary College, London, UK.
| |
Collapse
|
8
|
Bai X, Li D, Ma F, Deng X, Luo M, Feng Y, Yang G. Improved thermostability of creatinase from Alcaligenes Faecalis through non-biased phylogenetic consensus-guided mutagenesis. Microb Cell Fact 2020; 19:194. [PMID: 33069232 PMCID: PMC7568399 DOI: 10.1186/s12934-020-01451-9] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2020] [Accepted: 10/07/2020] [Indexed: 02/06/2023] Open
Abstract
Background Enzymatic quantification of creatinine has become an essential method for clinical evaluation of renal function. Although creatinase (CR) is frequently used for this purpose, its poor thermostability severely limits industrial applications. Herein, we report a novel creatinase from Alcaligenes faecalis (afCR) with higher catalytic activity and lower KM value, than currently used creatinases. Furthermore, we developed a non-biased phylogenetic consensus method to improve the thermostability of afCR. Results We applied a non-biased phylogenetic consensus method to identify 59 candidate consensus residues from 24 creatinase family homologs for screening afCR mutants with improved thermostability. Twenty-one amino acids of afCR were selected to mutagenesis and 11 of them exhibited improved thermostability compared to the parent enzyme (afCR-M0). Combination of single-site mutations in sequential screens resulted in a quadruple mutant D17V/T199S/L6P/T251C (M4-2) which showed ~ 1700-fold enhanced half-life at 57 °C and a 4.2 °C higher T5015 than that of afCR-M0. The mutant retained catalytic activity equivalent to afCR-M0, and thus showed strong promise for application in creatinine detection. Structural homology modeling revealed a wide range of potential molecular interactions associated with individual mutations that contributed to improving afCR thermostability. Conclusions Results of this study clearly demonstrated that the non-biased-phylogenetic consensus design for improvement of thermostability in afCR is effective and promising in improving the thermostability of more enzymes.
Collapse
Affiliation(s)
- Xue Bai
- Institute of Biothermal Science and Technology, University of Shanghai for Science and Technology, Shanghai, 200093, People's Republic of China
| | - Daixi Li
- Institute of Biothermal Science and Technology, University of Shanghai for Science and Technology, Shanghai, 200093, People's Republic of China.
| | - Fuqiang Ma
- CAS Key Lab of Bio-Medical Diagnostics, Suzhou Institute of Biomedical Engineering and Technology, Chinese Academy of Sciences, Suzhou, 215163, Jiangsu, People's Republic of China
| | - Xi Deng
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, 800 Dongchuan Rd., Shanghai, 200240, People's Republic of China
| | - Manjie Luo
- Wuhan Hzymes Biotechnology Co., Ltd., Wuhan, 430000, Hubei, People's Republic of China
| | - Yan Feng
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, 800 Dongchuan Rd., Shanghai, 200240, People's Republic of China
| | - Guangyu Yang
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, 800 Dongchuan Rd., Shanghai, 200240, People's Republic of China.
| |
Collapse
|
9
|
Early prediction of antigenic transitions for influenza A/H3N2. PLoS Comput Biol 2020; 16:e1007683. [PMID: 32069282 PMCID: PMC7048310 DOI: 10.1371/journal.pcbi.1007683] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2019] [Revised: 02/28/2020] [Accepted: 01/26/2020] [Indexed: 11/20/2022] Open
Abstract
Influenza A/H3N2 is a rapidly evolving virus which experiences major antigenic transitions every two to eight years. Anticipating the timing and outcome of transitions is critical to developing effective seasonal influenza vaccines. Using a published phylodynamic model of influenza transmission, we identified indicators of future evolutionary success for an emerging antigenic cluster and quantified fundamental trade-offs in our ability to make such predictions. The eventual fate of a new cluster depends on its initial epidemiological growth rate––which is a function of mutational load and population susceptibility to the cluster––along with the variance in growth rate across co-circulating viruses. Logistic regression can predict whether a cluster at 5% relative frequency will eventually succeed with ~80% sensitivity, providing up to eight months advance warning. As a cluster expands, the predictions improve while the lead-time for vaccine development and other interventions decreases. However, attempts to make comparable predictions from 12 years of empirical influenza surveillance data, which are far sparser and more coarse-grained, achieve only 56% sensitivity. By expanding influenza surveillance to obtain more granular estimates of the frequencies of and population-wide susceptibility to emerging viruses, we can better anticipate major antigenic transitions. This provides added incentives for accelerating the vaccine production cycle to reduce the lead time required for strain selection. The efficacy of annual seasonal influenza vaccines depends on selecting the strain that best matches circulating viruses. This selection takes place 9–12 months prior to the influenza season. To advise this decision, we used an influenza A/H3N2 phylodynamic simulation to explore how reliably and how far in advance can we identify strains that will dominate future influenza seasons? What data should we collect to accelerate and improve the accuracy of such forecasts? And importantly, what is the gap between the theoretical limit of prediction and prediction based on current influenza surveillance? Our results suggest that even with detailed virological information, the tight race between the antigenic turnover dynamics and the vaccine development timeline limits early detection of emerging viruses. Predictions based on current influenza surveillance do not achieve the theoretical limit and thus our results provide impetus for denser sampling and the development of rapid methods for estimating viral fitness.
Collapse
|
10
|
Foy SG, Wilson BA, Bertram J, Cordes MHJ, Masel J. A Shift in Aggregation Avoidance Strategy Marks a Long-Term Direction to Protein Evolution. Genetics 2019; 211:1345-1355. [PMID: 30692195 PMCID: PMC6456324 DOI: 10.1534/genetics.118.301719] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2018] [Accepted: 01/25/2019] [Indexed: 01/06/2023] Open
Abstract
To detect a direction to evolution, without the pitfalls of reconstructing ancestral states, we need to compare "more evolved" to "less evolved" entities. But because all extant species have the same common ancestor, none are chronologically more evolved than any other. However, different gene families were born at different times, allowing us to compare young protein-coding genes to those that are older and hence have been evolving for longer. To be retained during evolution, a protein must not only have a function, but must also avoid toxic dysfunction such as protein aggregation. There is conflict between the two requirements: hydrophobic amino acids form the cores of protein folds, but also promote aggregation. Young genes avoid strongly hydrophobic amino acids, which is presumably the simplest solution to the aggregation problem. Here we show that young genes' few hydrophobic residues are clustered near one another along the primary sequence, presumably to assist folding. The higher aggregation risk created by the higher hydrophobicity of older genes is counteracted by more subtle effects in the ordering of the amino acids, including a reduction in the clustering of hydrophobic residues until they eventually become more interspersed than if distributed randomly. This interspersion has previously been reported to be a general property of proteins, but here we find that it is restricted to old genes. Quantitatively, the index of dispersion delineates a gradual trend, i.e., a decrease in the clustering of hydrophobic amino acids over billions of years.
Collapse
Affiliation(s)
- Scott G Foy
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721
| | - Benjamin A Wilson
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721
| | - Jason Bertram
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721
| | - Matthew H J Cordes
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, Arizona 85721
| | - Joanna Masel
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721
| |
Collapse
|
11
|
Gutierrez B, Escalera-Zamudio M, Pybus OG. Parallel molecular evolution and adaptation in viruses. Curr Opin Virol 2019; 34:90-96. [PMID: 30703578 PMCID: PMC7102768 DOI: 10.1016/j.coviro.2018.12.006] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2018] [Accepted: 12/11/2018] [Indexed: 01/05/2023]
Abstract
Parallel molecular evolution is the independent evolution of the same genotype or phenotype from distinct ancestors. The simple genomes and rapid evolution of many viruses mean they are useful model systems for studying parallel evolution by natural selection. Parallel adaptation occurs in the context of several viral behaviours, including cross-species transmission, drug resistance, and host immune escape, and its existence suggests that at least some aspects of virus evolution and emergence are repeatable and predictable. We introduce examples of virus parallel evolution and summarise key concepts. We outline the difficulties in detecting parallel adaptation using virus genomes, with a particular focus on phylogenetic and structural approaches, and we discuss future approaches that may improve our understanding of the phenomenon.
Collapse
Affiliation(s)
| | | | - Oliver G Pybus
- Department of Zoology, University of Oxford, Oxford, United Kingdom.
| |
Collapse
|
12
|
Echave J. Beyond Stability Constraints: A Biophysical Model of Enzyme Evolution with Selection on Stability and Activity. Mol Biol Evol 2018; 36:613-620. [DOI: 10.1093/molbev/msy244] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Affiliation(s)
- Julian Echave
- Escuela de Ciencia y Tecnología, Universidad Nacional de San Martín (UNSAM), Buenos Aires, Argentina
| |
Collapse
|
13
|
Phillips AM, Doud MB, Gonzalez LO, Butty VL, Lin YS, Bloom JD, Shoulders MD. Enhanced ER proteostasis and temperature differentially impact the mutational tolerance of influenza hemagglutinin. eLife 2018; 7:38795. [PMID: 30188321 PMCID: PMC6172027 DOI: 10.7554/elife.38795] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2018] [Accepted: 08/29/2018] [Indexed: 12/23/2022] Open
Abstract
We systematically and quantitatively evaluate whether endoplasmic reticulum (ER) proteostasis factors impact the mutational tolerance of secretory pathway proteins. We focus on influenza hemaggluttinin (HA), a viral membrane protein that folds in the host’s ER via a complex pathway. By integrating chemical methods to modulate ER proteostasis with deep mutational scanning to assess mutational tolerance, we discover that upregulation of ER proteostasis factors broadly enhances HA mutational tolerance across diverse structural elements. Remarkably, this proteostasis network-enhanced mutational tolerance occurs at the same sites where mutational tolerance is most reduced by propagation at fever-like temperature. These findings have important implications for influenza evolution, because influenza immune escape is contingent on HA possessing sufficient mutational tolerance to evade antibodies while maintaining the capacity to fold and function. More broadly, this work provides the first experimental evidence that ER proteostasis mechanisms define the mutational tolerance and, therefore, the evolution of secretory pathway proteins.
Collapse
Affiliation(s)
- Angela M Phillips
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, United States
| | - Michael B Doud
- Fred Hutchinson Cancer Research Center, Seattle, United States.,Department of Genome Sciences, University of Washington, Seattle, United States
| | - Luna O Gonzalez
- Department of Mathematics, Massachusetts Institute of Technology, Cambridge, United States
| | - Vincent L Butty
- BioMicro Center, Massachusetts Institute of Technology, Cambridge, United States
| | - Yu-Shan Lin
- Department of Chemistry, Tufts University, Medford, United States
| | - Jesse D Bloom
- Fred Hutchinson Cancer Research Center, Seattle, United States.,Department of Genome Sciences, University of Washington, Seattle, United States
| | - Matthew D Shoulders
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, United States
| |
Collapse
|
14
|
Stability of the Influenza Virus Hemagglutinin Protein Correlates with Evolutionary Dynamics. mSphere 2018; 3:mSphere00554-17. [PMID: 29299534 PMCID: PMC5750392 DOI: 10.1128/mspheredirect.00554-17] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2017] [Accepted: 12/04/2017] [Indexed: 12/25/2022] Open
Abstract
One of the constraints on fast-evolving viruses, such as influenza virus, is protein stability, or how strongly the folded protein holds together. Despite the importance of this protein property, there has been limited investigation of the impact of the stability of the influenza virus hemagglutinin protein—the primary antibody target of the immune system—on its evolution. Using a combination of computational estimates of stability and experiments, our analysis found that viruses with more-stable hemagglutinin proteins were associated with long-term persistence in the population. There are two potential reasons for the observed persistence. One is that more-stable proteins tolerate destabilizing mutations that less-stable proteins could not, thus increasing opportunities for immune escape. The second is that greater stability increases the fitness of the virus through increased production of infectious particles. Further research on the relative importance of these mechanisms could help inform the annual influenza vaccine composition decision process. Protein thermodynamics are an integral determinant of viral fitness and one of the major drivers of protein evolution. Mutations in the influenza A virus (IAV) hemagglutinin (HA) protein can eliminate neutralizing antibody binding to mediate escape from preexisting antiviral immunity. Prior research on the IAV nucleoprotein suggests that protein stability may constrain seasonal IAV evolution; however, the role of stability in shaping the evolutionary dynamics of the HA protein has not been explored. We used the full coding sequence of 9,797 H1N1pdm09 HA sequences and 16,716 human seasonal H3N2 HA sequences to computationally estimate relative changes in the thermal stability of the HA protein between 2009 and 2016. Phylogenetic methods were used to characterize how stability differences impacted the evolutionary dynamics of the virus. We found that pandemic H1N1 IAV strains split into two lineages that had different relative HA protein stabilities and that later variants were descended from the higher-stability lineage. Analysis of the mutations associated with the selective sweep of the higher-stability lineage found that they were characterized by the early appearance of highly stabilizing mutations, the earliest of which was not located in a known antigenic site. Experimental evidence further suggested that H1N1 HA stability may be correlated with in vitro virus production and infection. A similar analysis of H3N2 strains found that surviving lineages were also largely descended from viruses predicted to encode more-stable HA proteins. Our results suggest that HA protein stability likely plays a significant role in the persistence of different IAV lineages. IMPORTANCE One of the constraints on fast-evolving viruses, such as influenza virus, is protein stability, or how strongly the folded protein holds together. Despite the importance of this protein property, there has been limited investigation of the impact of the stability of the influenza virus hemagglutinin protein—the primary antibody target of the immune system—on its evolution. Using a combination of computational estimates of stability and experiments, our analysis found that viruses with more-stable hemagglutinin proteins were associated with long-term persistence in the population. There are two potential reasons for the observed persistence. One is that more-stable proteins tolerate destabilizing mutations that less-stable proteins could not, thus increasing opportunities for immune escape. The second is that greater stability increases the fitness of the virus through increased production of infectious particles. Further research on the relative importance of these mechanisms could help inform the annual influenza vaccine composition decision process.
Collapse
|
15
|
Vernon RM, Chong PA, Lin H, Yang Z, Zhou Q, Aleksandrov AA, Dawson JE, Riordan JR, Brouillette CG, Thibodeau PH, Forman-Kay JD. Stabilization of a nucleotide-binding domain of the cystic fibrosis transmembrane conductance regulator yields insight into disease-causing mutations. J Biol Chem 2017; 292:14147-14164. [PMID: 28655774 PMCID: PMC5572908 DOI: 10.1074/jbc.m116.772335] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2016] [Revised: 06/16/2017] [Indexed: 11/06/2022] Open
Abstract
Characterization of the second nucleotide-binding domain (NBD2) of the cystic fibrosis transmembrane conductance regulator (CFTR) has lagged behind research into the NBD1 domain, in part because NBD1 contains the F508del mutation, which is the dominant cause of cystic fibrosis. Research on NBD2 has also been hampered by the overall instability of the domain and the difficulty of producing reagents. Nonetheless, multiple disease-causing mutations reside in NBD2, and the domain is critical for CFTR function, because channel gating involves NBD1/NBD2 dimerization, and NBD2 contains the catalytically active ATPase site in CFTR. Recognizing the paucity of structural and biophysical data on NBD2, here we have defined a bioinformatics-based method for manually identifying stabilizing substitutions in NBD2, and we used an iterative process of screening single substitutions against thermal melting points to both produce minimally mutated stable constructs and individually characterize mutations. We present a range of stable constructs with minimal mutations to help inform further research on NBD2. We have used this stabilized background to study the effects of NBD2 mutations identified in cystic fibrosis (CF) patients, demonstrating that mutants such as N1303K and G1349D are characterized by lower stability, as shown previously for some NBD1 mutations, suggesting a potential role for NBD2 instability in the pathology of CF.
Collapse
Affiliation(s)
- Robert M Vernon
- From the Program in Molecular Medicine, Hospital for Sick Children, Toronto, Ontario M5G 0A4, Canada
| | - P Andrew Chong
- From the Program in Molecular Medicine, Hospital for Sick Children, Toronto, Ontario M5G 0A4, Canada
| | - Hong Lin
- From the Program in Molecular Medicine, Hospital for Sick Children, Toronto, Ontario M5G 0A4, Canada
| | - Zhengrong Yang
- Center for Structural Biology and Department of Chemistry, University of Alabama at Birmingham, Birmingham, Alabama 35294
| | - Qingxian Zhou
- Center for Structural Biology and Department of Chemistry, University of Alabama at Birmingham, Birmingham, Alabama 35294
| | - Andrei A Aleksandrov
- Department of Biochemistry and Biophysics, Cystic Fibrosis Treatment and Research Center, University of North Carolina, Chapel Hill, North Carolina 27599, and
| | - Jennifer E Dawson
- From the Program in Molecular Medicine, Hospital for Sick Children, Toronto, Ontario M5G 0A4, Canada
| | - John R Riordan
- Department of Biochemistry and Biophysics, Cystic Fibrosis Treatment and Research Center, University of North Carolina, Chapel Hill, North Carolina 27599, and
| | - Christie G Brouillette
- Center for Structural Biology and Department of Chemistry, University of Alabama at Birmingham, Birmingham, Alabama 35294
| | - Patrick H Thibodeau
- Department of Microbiology and Molecular Genetics, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania 15219
| | - Julie D Forman-Kay
- From the Program in Molecular Medicine, Hospital for Sick Children, Toronto, Ontario M5G 0A4, Canada,; Department of Biochemistry, University of Toronto, Toronto, Ontario M5S 1A8, Canada.
| |
Collapse
|
16
|
Echave J, Wilke CO. Biophysical Models of Protein Evolution: Understanding the Patterns of Evolutionary Sequence Divergence. Annu Rev Biophys 2017; 46:85-103. [PMID: 28301766 DOI: 10.1146/annurev-biophys-070816-033819] [Citation(s) in RCA: 75] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
For decades, rates of protein evolution have been interpreted in terms of the vague concept of functional importance. Slowly evolving proteins or sites within proteins were assumed to be more functionally important and thus subject to stronger selection pressure. More recently, biophysical models of protein evolution, which combine evolutionary theory with protein biophysics, have completely revolutionized our view of the forces that shape sequence divergence. Slowly evolving proteins have been found to evolve slowly because of selection against toxic misfolding and misinteractions, linking their rate of evolution primarily to their abundance. Similarly, most slowly evolving sites in proteins are not directly involved in function, but mutating these sites has a large impact on protein structure and stability. In this article, we review the studies in the emerging field of biophysical protein evolution that have shaped our current understanding of sequence divergence patterns. We also propose future research directions to develop this nascent field.
Collapse
Affiliation(s)
- Julian Echave
- Escuela de Ciencia y Tecnología, Universidad Nacional de San Martín, 1650 San Martín, Buenos Aires, Argentina; .,Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Argentina
| | - Claus O Wilke
- Department of Integrative Biology, The University of Texas at Austin, Texas 78712;
| |
Collapse
|
17
|
Redondo RAF, de Vladar HP, Włodarski T, Bollback JP. Evolutionary interplay between structure, energy and epistasis in the coat protein of the ϕX174 phage family. J R Soc Interface 2017; 14:20160139. [PMID: 28053111 PMCID: PMC5310724 DOI: 10.1098/rsif.2016.0139] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2016] [Accepted: 11/29/2016] [Indexed: 01/01/2023] Open
Abstract
Viral capsids are structurally constrained by interactions among the amino acids (AAs) of their constituent proteins. Therefore, epistasis is expected to evolve among physically interacting sites and to influence the rates of substitution. To study the evolution of epistasis, we focused on the major structural protein of the ϕX174 phage family by first reconstructing the ancestral protein sequences of 18 species using a Bayesian statistical framework. The inferred ancestral reconstruction differed at eight AAs, for a total of 256 possible ancestral haplotypes. For each ancestral haplotype and the extant species, we estimated, in silico, the distribution of free energies and epistasis of the capsid structure. We found that free energy has not significantly increased but epistasis has. We decomposed epistasis up to fifth order and found that higher-order epistasis sometimes compensates pairwise interactions making the free energy seem additive. The dN/dS ratio is low, suggesting strong purifying selection, and that structure is under stabilizing selection. We synthesized phages carrying ancestral haplotypes of the coat protein gene and measured their fitness experimentally. Our findings indicate that stabilizing mutations can have higher fitness, and that fitness optima do not necessarily coincide with energy minima.
Collapse
Affiliation(s)
| | - Harold P de Vladar
- IST Austria, Am Campus 1, 3400 Klosterneuburg, Austria
- Center for the Conceptual Foundations of Science, Parmenides Foundation, 82049 Pullach, Germany
| | - Tomasz Włodarski
- Department of Structural and Molecular Biology, University College London, London WC1E 6BT, UK
| | | |
Collapse
|
18
|
Effects of point mutations on the thermostability of B. subtilis lipase: investigating nonadditivity. J Comput Aided Mol Des 2016; 30:899-916. [DOI: 10.1007/s10822-016-9978-0] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2016] [Accepted: 09/22/2016] [Indexed: 11/26/2022]
|
19
|
Trudeau DL, Kaltenbach M, Tawfik DS. On the Potential Origins of the High Stability of Reconstructed Ancestral Proteins. Mol Biol Evol 2016; 33:2633-41. [PMID: 27413048 DOI: 10.1093/molbev/msw138] [Citation(s) in RCA: 89] [Impact Index Per Article: 11.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Ancestral reconstruction provides instrumental insights regarding the biochemical and biophysical characteristics of past proteins. A striking observation relates to the remarkably high thermostability of reconstructed ancestors. The latter has been linked to high environmental temperatures in the Precambrian era, the era relating to most reconstructed proteins. We found that inferred ancestors of the serum paraoxonase (PON) enzyme family, including the mammalian ancestor, exhibit dramatically increased thermostabilities compared with the extant, human enzyme (up to 30 °C higher melting temperature). However, the environmental temperature at the time of emergence of mammals is presumed to be similar to the present one. Additionally, the mammalian PON ancestor has superior folding properties (kinetic stability)-unlike the extant mammalian PONs, it expresses in E. coli in a soluble and functional form, and at a high yield. We discuss two potential origins of this unexpectedly high stability. First, ancestral stability may be overestimated by a "consensus effect," whereby replacing amino acids that are rare in contemporary sequences with the amino acid most common in the family increases protein stability. Comparison to other reconstructed ancestors indicates that the consensus effect may bias some but not all reconstructions. Second, we note that high stability may relate to factors other than high environmental temperature such as oxidative stress or high radiation levels. Foremost, intrinsic factors such as high rates of genetic mutations and/or of transcriptional and translational errors, and less efficient protein quality control systems, may underlie the high kinetic and thermodynamic stability of past proteins.
Collapse
Affiliation(s)
- Devin L Trudeau
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel
| | - Miriam Kaltenbach
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel
| | - Dan S Tawfik
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel
| |
Collapse
|
20
|
Abstract
A popular and successful strategy in semi-rational design of protein stability is the use of evolutionary information encapsulated in homologous protein sequences. Consensus design is based on the hypothesis that at a given position, the respective consensus amino acid contributes more than average to the stability of the protein than non-conserved amino acids. Here, we review the consensus design approach, its theoretical underpinnings, successes, limitations and challenges, as well as providing a detailed guide to its application in protein engineering.
Collapse
Affiliation(s)
- Benjamin T Porebski
- Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Faculty of Medicine, Monash University, Clayton, Victoria 3800, Australia Medical Research Council Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, UK
| | - Ashley M Buckle
- Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Faculty of Medicine, Monash University, Clayton, Victoria 3800, Australia
| |
Collapse
|
21
|
Mirzaei N, Poursina F, Moghim S, Ghaempanah AM, Safaei HG. The Bioinformatics Report of Mutation Outcome on NADPH Flavin Oxidoreductase Protein Sequence in Clinical Isolates of H. pylori. Curr Microbiol 2016; 72:596-605. [PMID: 26821239 DOI: 10.1007/s00284-016-0992-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2015] [Accepted: 12/14/2015] [Indexed: 12/23/2022]
Abstract
frxA gene has been implicated in the metronidazole nitro reduction by H. pylori. Alternatively, frxA is expected to contribute to the protection of urease and to the in vivo survival of H. pylori. The aim of present study is to report the mutation effects on the frxA protein sequence in clinical isolates of H. pylori in our community. Metronidazole resistance was proven in 27 of 48 isolates. glmM and frxA genes were used for molecular confirmation of H. pylori isolates. The primer set for detection of whole sequence of frxA gene for the effect of mutation on protein sequence was used. DNA and protein sequence evaluation and analysis were done by blast, Clustal Omega, and T COFFEE programs. Then, FrxA protein sequences from six metronidazole-resistant clinical isolates were analyzed by web-based bioinformatics tools. The result of six metronidazole-resistant clinical isolates in comparison with strain 26695 showed ten missense mutations. The result with the STRING program revealed that no change was seen after alterations in these sequences. According to consensus data involving four methods, residue substitutions at 40, 13, and 141 increase the stability of protein sequence after mutation, while other alterations decrease. Residue substitutions at 40, 43, 141, 138, 169, and 179 are deleterious, while, V7I, Q10R, V34I, and V96I alterations are neutral. As FrxA contribute to survival of bacterium and in regard to the effect of mutations on protein function, it might affect the survival and bacterium phenotype and it need to be studied more. Also, none of the stability prediction tool is perfect; iStable is the best predictor method among all methods.
Collapse
Affiliation(s)
- Nasrin Mirzaei
- Department of Microbiology, Tonekabon Branch, Islamic Azad University, Tonekabon, Iran
| | - Farkhondeh Poursina
- Department of Microbiology, Isfahan University of Medical Sciences, Isfahan, Iran
| | - Sharareh Moghim
- Department of Microbiology, Isfahan University of Medical Sciences, Isfahan, Iran
| | | | | |
Collapse
|
22
|
Echave J, Spielman SJ, Wilke CO. Causes of evolutionary rate variation among protein sites. Nat Rev Genet 2016; 17:109-21. [PMID: 26781812 DOI: 10.1038/nrg.2015.18] [Citation(s) in RCA: 176] [Impact Index Per Article: 22.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
It has long been recognized that certain sites within a protein, such as sites in the protein core or catalytic residues in enzymes, are evolutionarily more conserved than other sites. However, our understanding of rate variation among sites remains surprisingly limited. Recent progress to address this includes the development of a wide array of reliable methods to estimate site-specific substitution rates from sequence alignments. In addition, several molecular traits have been identified that correlate with site-specific mutation rates, and novel mechanistic biophysical models have been proposed to explain the observed correlations. Nonetheless, current models explain, at best, approximately 60% of the observed variance, highlighting the limitations of current methods and models and the need for new research directions.
Collapse
Affiliation(s)
- Julian Echave
- Escuela de Ciencia y Tecnología, Universidad Nacional de San Martín, 1650 San Martín, Buenos Aires, Argentina
| | - Stephanie J Spielman
- Department of Integrative Biology, Center for Computational Biology and Bioinformatics, and Institute for Cellular and Molecular Biology, The University of Texas at Austin, Austin, Texas 78712, USA
| | - Claus O Wilke
- Department of Integrative Biology, Center for Computational Biology and Bioinformatics, and Institute for Cellular and Molecular Biology, The University of Texas at Austin, Austin, Texas 78712, USA
| |
Collapse
|
23
|
|
24
|
Wilson BA, Garud NR, Feder AF, Assaf ZJ, Pennings PS. The population genetics of drug resistance evolution in natural populations of viral, bacterial and eukaryotic pathogens. Mol Ecol 2016; 25:42-66. [PMID: 26578204 PMCID: PMC4943078 DOI: 10.1111/mec.13474] [Citation(s) in RCA: 35] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2015] [Revised: 09/28/2015] [Accepted: 10/08/2015] [Indexed: 01/09/2023]
Abstract
Drug resistance is a costly consequence of pathogen evolution and a major concern in public health. In this review, we show how population genetics can be used to study the evolution of drug resistance and also how drug resistance evolution is informative as an evolutionary model system. We highlight five examples from diverse organisms with particular focus on: (i) identifying drug resistance loci in the malaria parasite Plasmodium falciparum using the genomic signatures of selective sweeps, (ii) determining the role of epistasis in drug resistance evolution in influenza, (iii) quantifying the role of standing genetic variation in the evolution of drug resistance in HIV, (iv) using drug resistance mutations to study clonal interference dynamics in tuberculosis and (v) analysing the population structure of the core and accessory genome of Staphylococcus aureus to understand the spread of methicillin resistance. Throughout this review, we discuss the uses of sequence data and population genetic theory in studying the evolution of drug resistance.
Collapse
Affiliation(s)
| | | | | | - Zoe J. Assaf
- Department of GeneticsStanford UniversityStanfordCA94305USA
| | - Pleuni S. Pennings
- Department of BiologySan Francisco State UniversityRoom 520Hensill Hall1600 Holloway AveSan FranciscoCA94132USA
| |
Collapse
|
25
|
Figliuzzi M, Jacquier H, Schug A, Tenaillon O, Weigt M. Coevolutionary Landscape Inference and the Context-Dependence of Mutations in Beta-Lactamase TEM-1. Mol Biol Evol 2015; 33:268-80. [PMID: 26446903 PMCID: PMC4693977 DOI: 10.1093/molbev/msv211] [Citation(s) in RCA: 167] [Impact Index Per Article: 18.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
The quantitative characterization of mutational landscapes is a task of outstanding importance in evolutionary and medical biology: It is, for example, of central importance for our understanding of the phenotypic effect of mutations related to disease and antibiotic drug resistance. Here we develop a novel inference scheme for mutational landscapes, which is based on the statistical analysis of large alignments of homologs of the protein of interest. Our method is able to capture epistatic couplings between residues, and therefore to assess the dependence of mutational effects on the sequence context where they appear. Compared with recent large-scale mutagenesis data of the beta-lactamase TEM-1, a protein providing resistance against beta-lactam antibiotics, our method leads to an increase of about 40% in explicative power as compared with approaches neglecting epistasis. We find that the informative sequence context extends to residues at native distances of about 20 Å from the mutated site, reaching thus far beyond residues in direct physical contact.
Collapse
Affiliation(s)
- Matteo Figliuzzi
- UPMC, Institut de Calcul et de la Simulation, Sorbonne Universités, Paris, France Computational and Quantitative Biology, UPMC, UMR 7238, Sorbonne Universités, Paris, France Computational and Quantitative Biology, CNRS, UMR 7238, Paris, France
| | - Hervé Jacquier
- Infection, Antimicrobials, Modelling, Evolution, INSERM, Université Denis Diderot Paris 7, UMR 1137, Sorbonne Paris Cité, Paris, France Service de Bactériologie-Virologie, Groupe Hospitalier Lariboisiére-Fernand Widal, Assistance Publique-Hôpitaux de Paris (AP-HP), Paris, France
| | - Alexander Schug
- Steinbuch Centre for Computing, Karlsruhe Institute for Technology, Eggenstein-Leopoldshafen, Germany
| | - Oliver Tenaillon
- Infection, Antimicrobials, Modelling, Evolution, INSERM, Université Denis Diderot Paris 7, UMR 1137, Sorbonne Paris Cité, Paris, France
| | - Martin Weigt
- Computational and Quantitative Biology, UPMC, UMR 7238, Sorbonne Universités, Paris, France Computational and Quantitative Biology, CNRS, UMR 7238, Paris, France
| |
Collapse
|
26
|
Koelle K, Rasmussen DA. The effects of a deleterious mutation load on patterns of influenza A/H3N2's antigenic evolution in humans. eLife 2015; 4:e07361. [PMID: 26371556 PMCID: PMC4611170 DOI: 10.7554/elife.07361] [Citation(s) in RCA: 48] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2015] [Accepted: 09/14/2015] [Indexed: 11/19/2022] Open
Abstract
Recent phylogenetic analyses indicate that RNA virus populations carry a significant deleterious mutation load. This mutation load has the potential to shape patterns of adaptive evolution via genetic linkage to beneficial mutations. Here, we examine the effect of deleterious mutations on patterns of influenza A subtype H3N2's antigenic evolution in humans. By first analyzing simple models of influenza that incorporate a mutation load, we show that deleterious mutations, as expected, act to slow the virus's rate of antigenic evolution, while making it more punctuated in nature. These models further predict three distinct molecular pathways by which antigenic cluster transitions occur, and we find phylogenetic patterns consistent with each of these pathways in influenza virus sequences. Simulations of a more complex phylodynamic model further indicate that antigenic mutations act in concert with deleterious mutations to reproduce influenza's spindly hemagglutinin phylogeny, co-circulation of antigenic variants, and high annual attack rates. DOI:http://dx.doi.org/10.7554/eLife.07361.001 Each year, up to 15% of the world's population experience symptoms of an influenza infection, also commonly known as flu. The most common culprit is a strain of the virus called influenza type A subtype H3N2. One reason that so many people become infected each year is that this virus evolves rapidly. Within a few years, proteins on the surface of the virus known as antigens become less recognizable to the immune system of a person who has been previously infected. This means that the person can become ill with the virus again because their immune system cannot mount an effective response to the evolved virus strain. Influenza virus strains evolve rapidly because their genetic material accumulates mutations quickly. Although some of these mutations are beneficial to the virus, other mutations are harmful and reduce the ability of the virus to spread. Sometimes beneficial mutations may occur alongside harmful ones, but it is not known how the harmful mutations affect the evolution of the virus. Here, Koelle and Rasmussen used computer models of H3N2 influenza to examine the effect of harmful mutations on the evolution of this virus population. The models show that harmful mutations limit how quickly the antigens can evolve. Also, the presence of these harmful mutations effectively acts as a sieve: they allow only large changes in the antigens to establish in the virus population. The models suggest that there are three routes by which large changes in the antigens on H3N2 viruses may occur. The first is by a single mutation that has a big effect on the antigens in viruses that only carry a few harmful mutations, but these large mutations would not happen very often. Another route may be through more common mutations that have only a small or moderate benefit, which would allow the virus to become more common in the population before it acquires a beneficial mutation with a much greater effect. The third possibility is that a large beneficial mutation may arise in viruses that have many harmful mutations. These harmful mutations may initially limit the ability of the virus to spread, but over time, some of these harmful mutations may then be lost. Koelle and Rasmussen found that the computer models could recreate the patterns of virus evolution that have been observed in real strains of H3N2. Researchers use predictions of influenza evolution to help them decide which virus strains should be included in flu vaccines each year. Koelle and Rasmussen findings indicate that harmful mutations should be considered when making these predictions. DOI:http://dx.doi.org/10.7554/eLife.07361.002
Collapse
Affiliation(s)
- Katia Koelle
- Department of Biology, Duke University, Durham, United States.,Fogarty International Center, National Institutes of Health, Bethesda, United States
| | - David A Rasmussen
- Department of Biology, Duke University, Durham, United States.,Department of Biosystems Science and Engineering, Eidgenössische Technische Hochschule Zürich, Basel, Switzerland
| |
Collapse
|
27
|
Suzuki Y. Selecting vaccine strains for H3N2 human influenza A virus. Meta Gene 2015; 4:64-72. [PMID: 25893173 PMCID: PMC4392175 DOI: 10.1016/j.mgene.2015.03.003] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2014] [Revised: 02/17/2015] [Accepted: 03/20/2015] [Indexed: 12/23/2022] Open
Abstract
H3N2 human influenza A virus causes epidemics of influenza mainly in the winter season in temperate regions. Since the antigenicity of this virus evolves rapidly, several attempts have been made to predict the major amino acid sequence of hemagglutinin 1 (HA1) in the target season of vaccination. However, the usefulness of predicted sequence was unclear because its relationship to the antigenicity was unknown. Here the antigenic model for estimating the degree of antigenic difference (antigenic distance) between amino acid sequences of HA1 was integrated into the process of selecting vaccine strains for H3N2 human influenza A virus. When the effectiveness of a potential vaccine strain for a target season was evaluated retrospectively using the average antigenic distance between the strain and the epidemic viruses sampled in the target season, the most effective vaccine strain was identified mostly in the season one year before the target season (pre-target season). Effectiveness of actual vaccines appeared to be lower than that of the strains randomly chosen in the pre-target season on average. It was recommended to replace the vaccine strain for every target season with the strain having the smallest average antigenic distance to the others in the pre-target season. The procedure of selecting vaccine strains for future epidemic seasons described in the present study was implemented in the influenza virus forecasting system (INFLUCAST) (http://www.nsc.nagoya-cu.ac.jp/~yossuzuk/influcast.html).
Collapse
Affiliation(s)
- Yoshiyuki Suzuki
- Graduate School of Natural Sciences, Nagoya City University, Nagoya, Japan
| |
Collapse
|
28
|
Sikosek T, Chan HS. Biophysics of protein evolution and evolutionary protein biophysics. J R Soc Interface 2015; 11:20140419. [PMID: 25165599 DOI: 10.1098/rsif.2014.0419] [Citation(s) in RCA: 150] [Impact Index Per Article: 16.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
The study of molecular evolution at the level of protein-coding genes often entails comparing large datasets of sequences to infer their evolutionary relationships. Despite the importance of a protein's structure and conformational dynamics to its function and thus its fitness, common phylogenetic methods embody minimal biophysical knowledge of proteins. To underscore the biophysical constraints on natural selection, we survey effects of protein mutations, highlighting the physical basis for marginal stability of natural globular proteins and how requirement for kinetic stability and avoidance of misfolding and misinteractions might have affected protein evolution. The biophysical underpinnings of these effects have been addressed by models with an explicit coarse-grained spatial representation of the polypeptide chain. Sequence-structure mappings based on such models are powerful conceptual tools that rationalize mutational robustness, evolvability, epistasis, promiscuous function performed by 'hidden' conformational states, resolution of adaptive conflicts and conformational switches in the evolution from one protein fold to another. Recently, protein biophysics has been applied to derive more accurate evolutionary accounts of sequence data. Methods have also been developed to exploit sequence-based evolutionary information to predict biophysical behaviours of proteins. The success of these approaches demonstrates a deep synergy between the fields of protein biophysics and protein evolution.
Collapse
Affiliation(s)
- Tobias Sikosek
- Department of Biochemistry, University of Toronto, Toronto, Ontario, Canada M5S 1A8 Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada M5S 1A8 Department of Physics, University of Toronto, Toronto, Ontario, Canada M5S 1A8
| | - Hue Sun Chan
- Department of Biochemistry, University of Toronto, Toronto, Ontario, Canada M5S 1A8 Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada M5S 1A8 Department of Physics, University of Toronto, Toronto, Ontario, Canada M5S 1A8
| |
Collapse
|
29
|
Faure G, Koonin EV. Universal distribution of mutational effects on protein stability, uncoupling of protein robustness from sequence evolution and distinct evolutionary modes of prokaryotic and eukaryotic proteins. Phys Biol 2015; 12:035001. [PMID: 25927823 DOI: 10.1088/1478-3975/12/3/035001] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
Robustness to destabilizing effects of mutations is thought of as a key factor of protein evolution. The connections between two measures of robustness, the relative core size and the computationally estimated effect of mutations on protein stability (ΔΔG), protein abundance and the selection pressure on protein-coding genes (dN/dS) were analyzed for the organisms with a large number of available protein structures including four eukaryotes, two bacteria and one archaeon. The distribution of the effects of mutations in the core on protein stability is universal and indistinguishable in eukaryotes and bacteria, centered at slightly destabilizing amino acid replacements, and with a heavy tail of more strongly destabilizing replacements. The distribution of mutational effects in the hyperthermophilic archaeon Thermococcus gammatolerans is significantly shifted toward strongly destabilizing replacements which is indicative of stronger constraints that are imposed on proteins in hyperthermophiles. The median effect of mutations is strongly, positively correlated with the relative core size, in evidence of the congruence between the two measures of protein robustness. However, both measures show only limited correlations to the expression level and selection pressure on protein-coding genes. Thus, the degree of robustness reflected in the universal distribution of mutational effects appears to be a fundamental, ancient feature of globular protein folds whereas the observed variations are largely neutral and uncoupled from short term protein evolution. A weak anticorrelation between protein core size and selection pressure is observed only for surface residues in prokaryotes but a stronger anticorrelation is observed for all residues in eukaryotic proteins. This substantial difference between proteins of prokaryotes and eukaryotes is likely to stem from the demonstrable higher compactness of prokaryotic proteins.
Collapse
Affiliation(s)
- Guilhem Faure
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | | |
Collapse
|
30
|
Echave J, Jackson EL, Wilke CO. Relationship between protein thermodynamic constraints and variation of evolutionary rates among sites. Phys Biol 2015; 12:025002. [PMID: 25787027 PMCID: PMC4391963 DOI: 10.1088/1478-3975/12/2/025002] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Abstract
Evolutionary-rate variation among sites within proteins depends on functional and biophysical properties that constrain protein evolution. It is generally accepted that proteins must be able to fold stably in order to function. However, the relationship between stability constraints and among-sites rate variation is not well understood. Here, we present a biophysical model that links the thermodynamic stability changes due to mutations at sites in proteins ([Formula: see text]) to the rate at which mutations accumulate at those sites over evolutionary time. We find that such a 'stability model' generally performs well, displaying correlations between predicted and empirically observed rates of up to 0.75 for some proteins. We further find that our model has comparable predictive power as does an alternative, recently proposed 'stress model' that explains evolutionary-rate variation among sites in terms of the excess energy needed for mutants to adopt the correct active structure ([Formula: see text]). The two models make distinct predictions, though, and for some proteins the stability model outperforms the stress model and vice versa. We conclude that both stability and stress constrain site-specific sequence evolution in proteins.
Collapse
|
31
|
Haldane A, Manhart M, Morozov AV. Biophysical fitness landscapes for transcription factor binding sites. PLoS Comput Biol 2014; 10:e1003683. [PMID: 25010228 PMCID: PMC4091707 DOI: 10.1371/journal.pcbi.1003683] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2013] [Accepted: 05/11/2014] [Indexed: 11/18/2022] Open
Abstract
Phenotypic states and evolutionary trajectories available to cell populations are ultimately dictated by complex interactions among DNA, RNA, proteins, and other molecular species. Here we study how evolution of gene regulation in a single-cell eukaryote S. cerevisiae is affected by interactions between transcription factors (TFs) and their cognate DNA sites. Our study is informed by a comprehensive collection of genomic binding sites and high-throughput in vitro measurements of TF-DNA binding interactions. Using an evolutionary model for monomorphic populations evolving on a fitness landscape, we infer fitness as a function of TF-DNA binding to show that the shape of the inferred fitness functions is in broad agreement with a simple functional form inspired by a thermodynamic model of two-state TF-DNA binding. However, the effective parameters of the model are not always consistent with physical values, indicating selection pressures beyond the biophysical constraints imposed by TF-DNA interactions. We find little statistical support for the fitness landscape in which each position in the binding site evolves independently, indicating that epistasis is common in the evolution of gene regulation. Finally, by correlating TF-DNA binding energies with biological properties of the sites or the genes they regulate, we are able to rule out several scenarios of site-specific selection, under which binding sites of the same TF would experience different selection pressures depending on their position in the genome. These findings support the existence of universal fitness landscapes which shape evolution of all sites for a given TF, and whose properties are determined in part by the physics of protein-DNA interactions. Specialized proteins called transcription factors turn genes on and off by binding to short stretches of DNA in their regulatory regions. Precise gene regulation is essential for cellular survival and proliferation, and its evolution and maintenance under mutational pressure are central issues in biology. Here we discuss how evolution of gene regulation is shaped by the need to maintain favorable binding energies between transcription factors and their genomic binding sites. We show that, surprisingly, transcription factor binding is not affected by many biological properties, such as the essentiality of the gene it regulates. Rather, all sites for a given factor appear to evolve under a universal set of constraints, which can be rationalized in terms of a simple model inspired by transcription factor – DNA binding thermodynamics.
Collapse
Affiliation(s)
- Allan Haldane
- Department of Physics and Astronomy, Rutgers University, Piscataway, New Jersey, United States of America
| | - Michael Manhart
- Department of Physics and Astronomy, Rutgers University, Piscataway, New Jersey, United States of America
| | - Alexandre V. Morozov
- Department of Physics and Astronomy, Rutgers University, Piscataway, New Jersey, United States of America
- BioMaPS Institute for Quantitative Biology, Rutgers University, Piscataway, New Jersey, United States of America
- * E-mail:
| |
Collapse
|
32
|
Wu NC, Young AP, Al-Mawsawi LQ, Olson CA, Feng J, Qi H, Chen SH, Lu IH, Lin CY, Chin RG, Luan HH, Nguyen N, Nelson SF, Li X, Wu TT, Sun R. High-throughput profiling of influenza A virus hemagglutinin gene at single-nucleotide resolution. Sci Rep 2014; 4:4942. [PMID: 24820965 PMCID: PMC4018626 DOI: 10.1038/srep04942] [Citation(s) in RCA: 100] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2014] [Accepted: 04/16/2014] [Indexed: 01/12/2023] Open
Abstract
Genetic research on influenza virus biology has been informed in large part by nucleotide variants present in seasonal or pandemic samples, or individual mutants generated in the laboratory, leaving a substantial part of the genome uncharacterized. Here, we have developed a single-nucleotide resolution genetic approach to interrogate the fitness effect of point mutations in 98% of the amino acid positions in the influenza A virus hemagglutinin (HA) gene. Our HA fitness map provides a reference to identify indispensable regions to aid in drug and vaccine design as targeting these regions will increase the genetic barrier for the emergence of escape mutations. This study offers a new platform for studying genome dynamics, structure-function relationships, virus-host interactions, and can further rational drug and vaccine design. Our approach can also be applied to any virus that can be genetically manipulated.
Collapse
Affiliation(s)
- Nicholas C Wu
- 1] Department of Molecular and Medical Pharmacology, David Geffen School of Medicine, University of California, Los Angeles, CA 90095, USA [2] Molecular Biology Institute, University of California, Los Angeles, CA 90095, USA [3]
| | - Arthur P Young
- 1] Department of Molecular and Medical Pharmacology, David Geffen School of Medicine, University of California, Los Angeles, CA 90095, USA [2]
| | - Laith Q Al-Mawsawi
- Department of Molecular and Medical Pharmacology, David Geffen School of Medicine, University of California, Los Angeles, CA 90095, USA
| | - C Anders Olson
- Department of Molecular and Medical Pharmacology, David Geffen School of Medicine, University of California, Los Angeles, CA 90095, USA
| | - Jun Feng
- Department of Molecular and Medical Pharmacology, David Geffen School of Medicine, University of California, Los Angeles, CA 90095, USA
| | - Hangfei Qi
- Department of Molecular and Medical Pharmacology, David Geffen School of Medicine, University of California, Los Angeles, CA 90095, USA
| | - Shu-Hwa Chen
- Institute of Information Science, Academia Sinica, Taipei, Taiwan
| | - I-Hsuan Lu
- Institute of Information Science, Academia Sinica, Taipei, Taiwan
| | - Chung-Yen Lin
- Institute of Information Science, Academia Sinica, Taipei, Taiwan
| | - Robert G Chin
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, CA 90095, USA
| | - Harding H Luan
- Department of Molecular and Medical Pharmacology, David Geffen School of Medicine, University of California, Los Angeles, CA 90095, USA
| | - Nguyen Nguyen
- Department of Molecular and Medical Pharmacology, David Geffen School of Medicine, University of California, Los Angeles, CA 90095, USA
| | - Stanley F Nelson
- 1] Molecular Biology Institute, University of California, Los Angeles, CA 90095, USA [2] Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, CA 90095, USA
| | - Xinmin Li
- Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California, Los Angeles, CA 90095, USA
| | - Ting-Ting Wu
- Department of Molecular and Medical Pharmacology, David Geffen School of Medicine, University of California, Los Angeles, CA 90095, USA
| | - Ren Sun
- 1] Department of Molecular and Medical Pharmacology, David Geffen School of Medicine, University of California, Los Angeles, CA 90095, USA [2] Molecular Biology Institute, University of California, Los Angeles, CA 90095, USA [3] AIDS Institute, University of California, Los Angeles, CA 90095, USA
| |
Collapse
|
33
|
Raghunathan G, Sokalingam S, Soundrarajan N, Madan B, Munussami G, Lee SG. Modulation of protein stability and aggregation properties by surface charge engineering. MOLECULAR BIOSYSTEMS 2014; 9:2379-89. [PMID: 23861008 DOI: 10.1039/c3mb70068b] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
An attempt to alter protein surface charges through traditional protein engineering approaches often affects the native protein structure significantly and induces misfolding. This limitation is a major hindrance in modulating protein properties through surface charge variations. In this study, as a strategy to overcome such a limitation, we attempted to co-introduce stabilizing mutations that can neutralize the destabilizing effect of protein surface charge variation. Two sets of rational mutations were designed; one to increase the number of surface charged amino acids and the other to decrease the number of surface charged amino acids by mutating surface polar uncharged amino acids and charged amino acids, respectively. These two sets of mutations were introduced into Green Fluorescent Protein (GFP) together with or without stabilizing mutations. The co-introduction of stabilizing mutations along with mutations for surface charge modification allowed us to obtain functionally active protein variants (s-GFP(+15-17) and s-GFP(+5-6)). When the protein properties such as fluorescent activity, folding rate and kinetic stability were assessed, we found the possibility that the protein stability can be modulated independently of activity and folding by engineering protein surface charges. The aggregation properties of GFP could also be altered through the surface charge engineering.
Collapse
Affiliation(s)
- Govindan Raghunathan
- Department of Chemical Engineering, Pusan National University, Busan 609-735, South Korea
| | | | | | | | | | | |
Collapse
|
34
|
Abstract
The seasonal human influenza A/H3N2 virus undergoes rapid evolution, which produces significant year-to-year sequence turnover in the population of circulating strains. Adaptive mutations respond to human immune challenge and occur primarily in antigenic epitopes, the antibody-binding domains of the viral surface protein haemagglutinin. Here we develop a fitness model for haemagglutinin that predicts the evolution of the viral population from one year to the next. Two factors are shown to determine the fitness of a strain: adaptive epitope changes and deleterious mutations outside the epitopes. We infer both fitness components for the strains circulating in a given year, using population-genetic data of all previous strains. From fitness and frequency of each strain, we predict the frequency of its descendent strains in the following year. This fitness model maps the adaptive history of influenza A and suggests a principled method for vaccine selection. Our results call for a more comprehensive epidemiology of influenza and other fast-evolving pathogens that integrates antigenic phenotypes with other viral functions coupled by genetic linkage.
Collapse
|
35
|
Kaltenbach M, Tokuriki N. Dynamics and constraints of enzyme evolution. JOURNAL OF EXPERIMENTAL ZOOLOGY PART B-MOLECULAR AND DEVELOPMENTAL EVOLUTION 2014; 322:468-87. [DOI: 10.1002/jez.b.22562] [Citation(s) in RCA: 68] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/20/2013] [Accepted: 01/06/2014] [Indexed: 12/23/2022]
Affiliation(s)
- Miriam Kaltenbach
- Michael Smith Laboratories; University of British Columbia; Vancouver British Columbia Canada
| | - Nobuhiko Tokuriki
- Michael Smith Laboratories; University of British Columbia; Vancouver British Columbia Canada
| |
Collapse
|
36
|
Mutational effects on stability are largely conserved during protein evolution. Proc Natl Acad Sci U S A 2013; 110:21071-6. [PMID: 24324165 DOI: 10.1073/pnas.1314781111] [Citation(s) in RCA: 105] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Protein stability and folding are the result of cooperative interactions among many residues, yet phylogenetic approaches assume that sites are independent. This discrepancy has engendered concerns about large evolutionary shifts in mutational effects that might confound phylogenetic approaches. Here we experimentally investigate this issue by introducing the same mutations into a set of diverged homologs of the influenza nucleoprotein and measuring the effects on stability. We find that mutational effects on stability are largely conserved across the homologs. We reach qualitatively similar conclusions when we simulate protein evolution with molecular-mechanics force fields. Our results do not mean that proteins evolve without epistasis, which can still arise even when mutational stability effects are conserved. However, our findings indicate that large evolutionary shifts in mutational effects on stability are rare, at least among homologs with similar structures and functions. We suggest that properly describing the clearly observable and highly conserved amino acid preferences at individual sites is likely to be far more important for phylogenetic analyses than accounting for rare shifts in amino acid propensities due to site covariation.
Collapse
|
37
|
Udatha DBRKG, Rasmussen S, Sicheritz-Pontén T, Panagiotou G. Targeted metabolic engineering guided by computational analysis of single-nucleotide polymorphisms (SNPs). Methods Mol Biol 2013; 985:409-428. [PMID: 23417815 DOI: 10.1007/978-1-62703-299-5_20] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]
Abstract
The non-synonymous SNPs, the so-called non-silent SNPs, which are single-nucleotide variations in the coding regions that give "birth" to amino acid mutations, are often involved in the modulation of protein function. Understanding the effect of individual amino acid mutations on a protein/enzyme function or stability is useful for altering its properties for a wide variety of engineering studies. Since measuring the effects of amino acid mutations experimentally is a laborious process, a variety of computational methods have been discussed here that aid to extract direct genotype to phenotype information.
Collapse
Affiliation(s)
- D B R K Gupta Udatha
- Department of Chemical and Biological Engineering, Industrial Biotechnology, Chalmers University of Technology, Gothenburg, Sweden
| | | | | | | |
Collapse
|
38
|
Araya CL, Fowler DM, Chen W, Muniez I, Kelly JW, Fields S. A fundamental protein property, thermodynamic stability, revealed solely from large-scale measurements of protein function. Proc Natl Acad Sci U S A 2012; 109:16858-63. [PMID: 23035249 PMCID: PMC3479514 DOI: 10.1073/pnas.1209751109] [Citation(s) in RCA: 166] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
The ability of a protein to carry out a given function results from fundamental physicochemical properties that include the protein's structure, mechanism of action, and thermodynamic stability. Traditional approaches to study these properties have typically required the direct measurement of the property of interest, oftentimes a laborious undertaking. Although protein properties can be probed by mutagenesis, this approach has been limited by its low throughput. Recent technological developments have enabled the rapid quantification of a protein's function, such as binding to a ligand, for numerous variants of that protein. Here, we measure the ability of 47,000 variants of a WW domain to bind to a peptide ligand and use these functional measurements to identify stabilizing mutations without directly assaying stability. Our approach is rooted in the well-established concept that protein function is closely related to stability. Protein function is generally reduced by destabilizing mutations, but this decrease can be rescued by stabilizing mutations. Based on this observation, we introduce partner potentiation, a metric that uses this rescue ability to identify stabilizing mutations, and identify 15 candidate stabilizing mutations in the WW domain. We tested six candidates by thermal denaturation and found two highly stabilizing mutations, one more stabilizing than any previously known mutation. Thus, physicochemical properties such as stability are latent within these large-scale protein functional data and can be revealed by systematic analysis. This approach should allow other protein properties to be discovered.
Collapse
Affiliation(s)
| | - Douglas M. Fowler
- Departments of Genome Sciences and
- The Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195; and
| | | | | | - Jeffery W. Kelly
- Departments of Chemistry and
- Molecular and Experimental Medicine and
- Skaggs Institute for Chemical Biology, The Scripps Research Institute, La Jolla, CA 92037
| | - Stanley Fields
- Departments of Genome Sciences and
- Medicine and
- The Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195; and
| |
Collapse
|
39
|
Manhart M, Haldane A, Morozov AV. A universal scaling law determines time reversibility and steady state of substitutions under selection. Theor Popul Biol 2012; 82:66-76. [PMID: 22838027 PMCID: PMC3613437 DOI: 10.1016/j.tpb.2012.03.007] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Monomorphic loci evolve through a series of substitutions on a fitness landscape. Understanding how mutation, selection, and genetic drift drive this process, and uncovering the structure of the fitness landscape from genomic data are two major goals of evolutionary theory. Population genetics models of the substitution process have traditionally focused on the weak-selection regime, which is accurately described by diffusion theory. Predictions in this regime can be considered universal in the sense that many population models exhibit equivalent behavior in the diffusion limit. However, a growing number of experimental studies suggest that strong selection plays a key role in some systems, and thus there is a need to understand universal properties of models without a priori assumptions about selection strength. Here we study time reversibility in a general substitution model of a monomorphic haploid population. We show that for any time-reversible population model, such as the Moran process, substitution rates obey an exact scaling law. For several other irreversible models, such as the simple Wright–Fisher process and its extensions, the scaling law is accurate up to selection strengths that are well outside the diffusion regime. Time reversibility gives rise to a power-law expression for the steady-state distribution of populations on an arbitrary fitness landscape. The steady-state behavior is dominated by weak selection and is thus adequately described by the diffusion approximation, which guarantees universality of the steady-state formula and its applicability to the problem of reconstructing fitness landscapes from DNA or protein sequence data.
Collapse
Affiliation(s)
- Michael Manhart
- Department of Physics and Astronomy, Rutgers University, 136 Frelinghuysen Road, Piscataway, NJ USA 08854
| | - Allan Haldane
- Department of Physics and Astronomy, Rutgers University, 136 Frelinghuysen Road, Piscataway, NJ USA 08854
| | - Alexandre V. Morozov
- Department of Physics and Astronomy, Rutgers University, 136 Frelinghuysen Road, Piscataway, NJ USA 08854
- BioMaPS Institute for Quantitative Biology, Rutgers University, 110 Frelinghuysen Road, Piscataway, NJ USA 08854
| |
Collapse
|
40
|
Abstract
The seasonal influenza A virus undergoes rapid evolution to escape human immune response. Adaptive changes occur primarily in antigenic epitopes, the antibody-binding domains of the viral hemagglutinin. This process involves recurrent selective sweeps, in which clusters of simultaneous nucleotide fixations in the hemagglutinin coding sequence are observed about every 4 years. Here, we show that influenza A (H3N2) evolves by strong clonal interference. This mode of evolution is a red queen race between viral strains with different beneficial mutations. Clonal interference explains and quantifies the observed sweep pattern: we find an average of at least one strongly beneficial amino acid substitution per year, and a given selective sweep has three to four driving mutations on average. The inference of selection and clonal interference is based on frequency time series of single-nucleotide polymorphisms, which are obtained from a sample of influenza genome sequences over 39 years. Our results imply that mode and speed of influenza evolution are governed not only by positive selection within, but also by background selection outside antigenic epitopes: immune adaptation and conservation of other viral functions interfere with each other. Hence, adapting viral proteins are predicted to be particularly brittle. We conclude that a quantitative understanding of influenza's evolutionary and epidemiological dynamics must be based on all genomic domains and functions coupled by clonal interference.
Collapse
|
41
|
Abstract
Much molecular-evolution research is concerned with sequence analysis. Yet these sequences represent real, three-dimensional molecules with complex structure and function. Here I highlight a growing trend in the field to incorporate molecular structure and function into computational molecular-evolution work. I consider three focus areas: reconstruction and analysis of past evolutionary events, such as phylogenetic inference or methods to infer selection pressures; development of toy models and simulations to identify fundamental principles of molecular evolution; and atom-level, highly realistic computational modeling of molecular structure and function aimed at making predictions about possible future evolutionary events.
Collapse
Affiliation(s)
- Claus O Wilke
- Institute of Cell and Molecular Biology, The University of Texas at Austin, Austin, Texas, United States of America.
| |
Collapse
|
42
|
Wainreb G, Wolf L, Ashkenazy H, Dehouck Y, Ben-Tal N. Protein stability: a single recorded mutation aids in predicting the effects of other mutations in the same amino acid site. ACTA ACUST UNITED AC 2011; 27:3286-92. [PMID: 21998155 PMCID: PMC3223369 DOI: 10.1093/bioinformatics/btr576] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Motivation: Accurate prediction of protein stability is important for understanding the molecular underpinnings of diseases and for the design of new proteins. We introduce a novel approach for the prediction of changes in protein stability that arise from a single-site amino acid substitution; the approach uses available data on mutations occurring in the same position and in other positions. Our algorithm, named Pro-Maya (Protein Mutant stAbilitY Analyzer), combines a collaborative filtering baseline model, Random Forests regression and a diverse set of features. Pro-Maya predicts the stability free energy difference of mutant versus wild type, denoted as ΔΔG. Results: We evaluated our algorithm extensively using cross-validation on two previously utilized datasets of single amino acid mutations and a (third) validation set. The results indicate that using known ΔΔG values of mutations at the query position improves the accuracy of ΔΔG predictions for other mutations in that position. The accuracy of our predictions in such cases significantly surpasses that of similar methods, achieving, e.g. a Pearson's correlation coefficient of 0.79 and a root mean square error of 0.96 on the validation set. Because Pro-Maya uses a diverse set of features, including predictions using two other methods, it also performs slightly better than other methods in the absence of additional experimental data on the query positions. Availability: Pro-Maya is freely available via web server at http://bental.tau.ac.il/ProMaya. Contact:nirb@tauex.tau.ac.il; wolf@cs.tau.ac.il Supplementary Information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Gilad Wainreb
- Department of Biochemistry and Molecular Biology, Tel-Aviv University, Ramat Aviv 69978, Israel
| | | | | | | | | |
Collapse
|
43
|
Akanuma S, Iwami S, Yokoi T, Nakamura N, Watanabe H, Yokobori SI, Yamagishi A. Phylogeny-Based Design of a B-Subunit of DNA Gyrase and Its ATPase Domain Using a Small Set of Homologous Amino Acid Sequences. J Mol Biol 2011; 412:212-25. [DOI: 10.1016/j.jmb.2011.07.042] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2010] [Revised: 07/19/2011] [Accepted: 07/20/2011] [Indexed: 10/17/2022]
|
44
|
Bloom JD, Nayak JS, Baltimore D. A computational-experimental approach identifies mutations that enhance surface expression of an oseltamivir-resistant influenza neuraminidase. PLoS One 2011; 6:e22201. [PMID: 21799795 PMCID: PMC3140507 DOI: 10.1371/journal.pone.0022201] [Citation(s) in RCA: 44] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2011] [Accepted: 06/16/2011] [Indexed: 12/31/2022] Open
Abstract
The His274→Tyr (H274Y) oseltamivir (Tamiflu) resistance mutation causes a substantial decrease in the total levels of surface-expressed neuraminidase protein and activity in early isolates of human seasonal H1N1 influenza, and in the swine-origin pandemic H1N1. In seasonal H1N1, H274Y only became widespread after the occurrence of secondary mutations that counteracted this decrease. H274Y is currently rare in pandemic H1N1, and it remains unclear whether secondary mutations exist that might similarly counteract the decreased neuraminidase surface expression associated with this resistance mutation in pandemic H1N1. Here we investigate the possibility of predicting such secondary mutations. We first test the ability of several computational approaches to retrospectively identify the secondary mutations that enhanced levels of surface-expressed neuraminidase protein and activity in seasonal H1N1 shortly before the emergence of oseltamivir resistance. We then use the most successful computational approach to predict a set of candidate secondary mutations to the pandemic H1N1 neuraminidase. We experimentally screen these mutations, and find that several of them do indeed partially counteract the decrease in neuraminidase surface expression caused by H274Y. Two of the secondary mutations together restore surface-expressed neuraminidase activity to wildtype levels, and also eliminate the very slight decrease in viral growth in tissue-culture caused by H274Y. Our work therefore demonstrates a combined computational-experimental approach for identifying mutations that enhance neuraminidase surface expression, and describes several specific mutations with the potential to be of relevance to the spread of oseltamivir resistance in pandemic H1N1.
Collapse
MESH Headings
- Computational Biology
- Drug Resistance, Viral/genetics
- Gene Expression Regulation, Viral/drug effects
- Gene Expression Regulation, Viral/genetics
- HEK293 Cells
- Humans
- Influenza A Virus, H1N1 Subtype/drug effects
- Influenza A Virus, H1N1 Subtype/enzymology
- Influenza A Virus, H1N1 Subtype/genetics
- Influenza A Virus, H1N1 Subtype/growth & development
- Influenza, Human/epidemiology
- Influenza, Human/virology
- Models, Molecular
- Mutation
- Neuraminidase/chemistry
- Neuraminidase/genetics
- Oseltamivir/pharmacology
- Pandemics
- Protein Conformation
- Tissue Culture Techniques
Collapse
Affiliation(s)
- Jesse D. Bloom
- Division of Biology, California Institute of Technology, Pasadena, California, United States of America
| | - Jagannath S. Nayak
- Division of Biology, California Institute of Technology, Pasadena, California, United States of America
| | - David Baltimore
- Division of Biology, California Institute of Technology, Pasadena, California, United States of America
| |
Collapse
|
45
|
The relationship between relative solvent accessibility and evolutionary rate in protein evolution. Genetics 2011; 188:479-88. [PMID: 21467571 DOI: 10.1534/genetics.111.128025] [Citation(s) in RCA: 87] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Recent work with Saccharomyces cerevisiae shows a linear relationship between the evolutionary rate of sites and the relative solvent accessibility (RSA) of the corresponding residues in the folded protein. Here, we aim to develop a mathematical model that can reproduce this linear relationship. We first demonstrate that two models that both seem reasonable choices (a simple model in which selection strength correlates with RSA and a more complex model based on RSA-dependent amino acid distributions) fail to reproduce the observed relationship. We then develop a model on the basis of observed site-specific amino acid distributions and show that this model behaves appropriately. We conclude that evolutionary rates are directly linked to the distribution of amino acids at individual sites. Because of this link, any future insight into the biophysical mechanisms that determine amino acid distributions will improve our understanding of evolutionary rates.
Collapse
|
46
|
Madsen KM, Udatha GDBRK, Semba S, Otero JM, Koetter P, Nielsen J, Ebizuka Y, Kushiro T, Panagiotou G. Linking genotype and phenotype of Saccharomyces cerevisiae strains reveals metabolic engineering targets and leads to triterpene hyper-producers. PLoS One 2011; 6:e14763. [PMID: 21445244 PMCID: PMC3060802 DOI: 10.1371/journal.pone.0014763] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2010] [Accepted: 02/16/2011] [Indexed: 01/06/2023] Open
Abstract
BACKGROUND Metabolic engineering is an attractive approach in order to improve the microbial production of drugs. Triterpenes is a chemically diverse class of compounds and many among them are of interest from a human health perspective. A systematic experimental or computational survey of all feasible gene modifications to determine the genotype yielding the optimal triterpene production phenotype is a laborious and time-consuming process. METHODOLOGY/PRINCIPAL FINDINGS Based on the recent genome-wide sequencing of Saccharomyces cerevisiae CEN.PK 113-7D and its phenotypic differences with the S288C strain, we implemented a strategy for the construction of a β-amyrin production platform. The genes Erg8, Erg9 and HFA1 contained non-silent SNPs that were computationally analyzed to evaluate the changes that cause in the respective protein structures. Subsequently, Erg8, Erg9 and HFA1 were correlated with the increased levels of ergosterol and fatty acids in CEN.PK 113-7D and single, double, and triple gene over-expression strains were constructed. CONCLUSIONS The six out of seven gene over-expression constructs had a considerable impact on both ergosterol and β-amyrin production. In the case of β-amyrin formation the triple over-expression construct exhibited a nearly 500% increase over the control strain making our metabolic engineering strategy the most successful design of triterpene microbial producers.
Collapse
Affiliation(s)
- Karina M. Madsen
- Center for Microbial Biotechnology, Department of Systems Biology, Technical University of Denmark, Kgs. Lyngby, Denmark
| | - Gupta D. B. R. K. Udatha
- Department of Chemical and Biological Engineering, Industrial Biotechnology, Chalmers University of Technology, Gothenburg, Sweden
| | - Saori Semba
- Graduate School of Pharmaceutical Sciences, The University of Tokyo, Hongo, Bunkyo-ku, Tokyo, Japan
| | - Jose M. Otero
- Center for Microbial Biotechnology, Department of Systems Biology, Technical University of Denmark, Kgs. Lyngby, Denmark
- Department of Chemical and Biological Engineering, Systems Biology, Chalmers University of Technology, Gothenburg, Sweden
| | - Peter Koetter
- Institute for Microbiology, Johann Wolfgang Goethe-University of Frankfurt, Frankfurt, Germany
| | - Jens Nielsen
- Department of Chemical and Biological Engineering, Systems Biology, Chalmers University of Technology, Gothenburg, Sweden
| | - Yutaka Ebizuka
- Graduate School of Pharmaceutical Sciences, The University of Tokyo, Hongo, Bunkyo-ku, Tokyo, Japan
| | - Tetsuo Kushiro
- Graduate School of Pharmaceutical Sciences, The University of Tokyo, Hongo, Bunkyo-ku, Tokyo, Japan
| | - Gianni Panagiotou
- Graduate School of Pharmaceutical Sciences, The University of Tokyo, Hongo, Bunkyo-ku, Tokyo, Japan
- Center for Microbial Biotechnology, Department of Systems Biology, Technical University of Denmark, Kgs. Lyngby, Denmark
- Center for Biological Sequence Analysis, Department of Systems Biology, Technical University of Denmark, Lyngby, Denmark
| |
Collapse
|
47
|
Jäckel C, Hilvert D. Biocatalysts by evolution. Curr Opin Biotechnol 2010; 21:753-9. [DOI: 10.1016/j.copbio.2010.08.008] [Citation(s) in RCA: 111] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2010] [Revised: 08/15/2010] [Accepted: 08/19/2010] [Indexed: 11/28/2022]
|
48
|
Bloom JD, Gong LI, Baltimore D. Permissive secondary mutations enable the evolution of influenza oseltamivir resistance. Science 2010; 328:1272-5. [PMID: 20522774 PMCID: PMC2913718 DOI: 10.1126/science.1187816] [Citation(s) in RCA: 509] [Impact Index Per Article: 36.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
The His274-->Tyr274 (H274Y) mutation confers oseltamivir resistance on N1 influenza neuraminidase but had long been thought to compromise viral fitness. However, beginning in 2007-2008, viruses containing H274Y rapidly became predominant among human seasonal H1N1 isolates. We show that H274Y decreases the amount of neuraminidase that reaches the cell surface and that this defect can be counteracted by secondary mutations that also restore viral fitness. Two such mutations occurred in seasonal H1N1 shortly before the widespread appearance of H274Y. The evolution of oseltamivir resistance was therefore enabled by "permissive" mutations that allowed the virus to tolerate subsequent occurrences of H274Y. An understanding of this process may provide a basis for predicting the evolution of oseltamivir resistance in other influenza strains.
Collapse
MESH Headings
- Amino Acid Substitution
- Animals
- Antiviral Agents/pharmacology
- Cell Line
- Cell Line, Tumor
- Cell Membrane/metabolism
- Drug Resistance, Viral/genetics
- Evolution, Molecular
- Genes, Viral
- Genetic Fitness
- Humans
- Influenza A Virus, H1N1 Subtype/drug effects
- Influenza A Virus, H1N1 Subtype/genetics
- Influenza A Virus, H1N1 Subtype/growth & development
- Influenza, Human/drug therapy
- Influenza, Human/virology
- Mutation
- Neuraminidase/antagonists & inhibitors
- Neuraminidase/chemistry
- Neuraminidase/genetics
- Neuraminidase/metabolism
- Oseltamivir/pharmacology
- Phylogeny
- Selection, Genetic
Collapse
Affiliation(s)
- Jesse D. Bloom
- Division of Biology, California Institute of Technology, Pasadena, CA 91125, USA
| | - Lizhi Ian Gong
- Division of Biology, California Institute of Technology, Pasadena, CA 91125, USA
| | - David Baltimore
- Division of Biology, California Institute of Technology, Pasadena, CA 91125, USA
| |
Collapse
|
49
|
Jäckel C, Bloom JD, Kast P, Arnold FH, Hilvert D. Consensus protein design without phylogenetic bias. J Mol Biol 2010; 399:541-6. [PMID: 20433850 DOI: 10.1016/j.jmb.2010.04.039] [Citation(s) in RCA: 64] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2010] [Revised: 04/20/2010] [Accepted: 04/22/2010] [Indexed: 11/17/2022]
Abstract
Consensus design is an appealing strategy for the stabilization of proteins. It exploits amino acid conservation in sets of homologous proteins to identify likely beneficial mutations. Nevertheless, its success depends on the phylogenetic diversity of the sequence set available. Here, we show that randomization of a single protein represents a reliable alternative source of sequence diversity that is essentially free of phylogenetic bias. A small number of functional protein sequences selected from binary-patterned libraries suffice as input for the consensus design of active enzymes that are easier to produce and substantially more stable than individual members of the starting data set. Although catalytic activity correlates less consistently with sequence conservation in these extensively randomized proteins, less extreme mutagenesis strategies might be adopted in practice to augment stability while maintaining function.
Collapse
Affiliation(s)
- Christian Jäckel
- Laboratory of Organic Chemistry, ETH Zurich, Wolfgang-Pauli-Strasse 10, 8093 Zurich, Switzerland
| | | | | | | | | |
Collapse
|
50
|
Sadowski MI, Taylor WR. Protein structures, folds and fold spaces. JOURNAL OF PHYSICS. CONDENSED MATTER : AN INSTITUTE OF PHYSICS JOURNAL 2010; 22:033103. [PMID: 21386276 DOI: 10.1088/0953-8984/22/3/033103] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]
Abstract
There has been considerable progress towards the goal of understanding the space of possible tertiary structures adopted by proteins. Despite a greatly increased rate of structure determination and a deliberate strategy of sequencing proteins expected to be very different from those already known, it is now rare to see a genuinely new fold, leading to the conclusion that we have seen the majority of natural structural types. The increase in knowledge has also led to a critical examination of traditional fold-based classifications and their meaning for evolution and protein structures. We review these issues and discuss possible solutions.
Collapse
Affiliation(s)
- Michael I Sadowski
- Division of Mathematical Biology, MRC National Institute for Medical Research, The Ridgeway, Mill Hill, London NW7 1AA, UK
| | | |
Collapse
|