1
|
Jemth P. Protein binding and folding through an evolutionary lens. Curr Opin Struct Biol 2025; 90:102980. [PMID: 39817990 DOI: 10.1016/j.sbi.2024.102980] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2024] [Revised: 12/18/2024] [Accepted: 12/19/2024] [Indexed: 01/18/2025]
Abstract
Protein-protein associations are often mediated by an intrinsically disordered protein region interacting with a folded domain in a coupled binding and folding reaction. Classic physical organic chemistry approaches together with structural biology have shed light on mechanistic aspects of such reactions. Further insight into general principles may be obtained by interpreting the results through an evolutionary lens. This review attempts to provide an overview on how the analysis of binding and folding reactions can benefit from an evolutionary approach, and is aimed at protein scientists without a background in evolution. Evolution constantly reshapes existing proteins by sampling more or less fit variants. Most new variants are weeded out as generations and new species come and go over hundreds to hundreds of millions of years. The huge ongoing genome sequencing efforts have provided us with a snapshot of existing adapted fit-for-purpose protein homologs in thousands of different organisms. Comparison of present-day orthologs and paralogs highlights general principles of the evolution of coupled binding and folding reactions and demonstrate a great potential for evolution to operate on disordered regions and modulate affinity and specificity of the interactions.
Collapse
Affiliation(s)
- Per Jemth
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC, Box 582, SE-75123 Uppsala, Sweden.
| |
Collapse
|
2
|
Wäneskog M, Hoch-Schneider EE, Garg S, Kronborg Cantalapiedra C, Schäfer E, Krogh Jensen M, Damgaard Jensen E. Accurate phenotype-to-genotype mapping of high-diversity yeast libraries by heat-shock-electroporation (HEEL). mBio 2024:e0319724. [PMID: 39704499 DOI: 10.1128/mbio.03197-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2024] [Accepted: 11/19/2024] [Indexed: 12/21/2024] Open
Abstract
High-throughput DNA transformation techniques are invaluable when generating high-diversity mutant libraries, a cornerstone of successful protein engineering. However, transformation efficiencies have a direct correlation with the probability of introducing multiple DNA molecules into each cell, although reliable library screenings require cells that contain a single unique genotype. Thus, transformation methods that yield a high multiplicity of transformations are unsuitable for high-diversity library screenings. Here, we describe an innovative yeast library transformation method that is both simple and highly efficient. Our dual heat-shock and electroporation approach (HEEL) creates high-quality DNA libraries by increasing the fraction of mono-transformed yeast cells from 20% to over 70% of all transformed cells, thus allowing for near-perfect phenotype-to-genotype associations. HEEL also allows more than 107 yeast cells per reaction to be transformed with a circular plasmid molecule, which corresponds to an almost 100-fold improvement compared with current yeast transformation methods. To further refine our library screening approach, we integrated an automated yeast genotyping workflow with a dual-barcode design that employs both a single nucleotide polymorphism and a high-diversity region. This design allows for robust identification and quantification of unique genotypes within a heterogeneous population using standard Sanger sequencing. Our findings demonstrate that the longstanding trade-off between the size and quality of transformed yeast libraries can be overcome. By employing the HEEL method, large DNA libraries can be transformed into yeast with high-efficiency, while maintaining high library quality, essential for successful mutant screenings. This advancement holds significant promise for the fields of molecular biology and protein engineering.IMPORTANCEWith the recent expansion of artificial intelligence in the field of synthetic biology, there has never been a greater need for high-quality data and reliable measurements of phenotype-to-genotype relationships. However, one major obstacle to creating accurate computer-based models is the current abundance of low-quality phenotypic measurements originating from numerous high-throughput but low-resolution assays. Rather than increasing the quantity of measurements, new studies should aim to generate as accurate measurements as possible. The HEEL methodology presented here aims to address this issue by minimizing the problem of multi-plasmid uptake during high-throughput yeast DNA transformations, which leads to the creation of heterogeneous cellular genotypes. HEEL should enable highly accurate phenotype-to-genotype measurements going forward, which could be used to construct better computer-based models.
Collapse
Affiliation(s)
- Marcus Wäneskog
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Lyngby, Denmark
| | - Emma Elise Hoch-Schneider
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Lyngby, Denmark
| | - Shilpa Garg
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Lyngby, Denmark
| | | | - Elena Schäfer
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Lyngby, Denmark
| | - Michael Krogh Jensen
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Lyngby, Denmark
| | - Emil Damgaard Jensen
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Lyngby, Denmark
| |
Collapse
|
3
|
Zhu XX, Zheng WQ, Xia ZW, Chen XR, Jin T, Ding XW, Chen FF, Chen Q, Xu JH, Kong XD, Zheng GW. Evolutionary insights into the stereoselectivity of imine reductases based on ancestral sequence reconstruction. Nat Commun 2024; 15:10330. [PMID: 39609402 PMCID: PMC11605051 DOI: 10.1038/s41467-024-54613-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2024] [Accepted: 11/14/2024] [Indexed: 11/30/2024] Open
Abstract
The stereoselectivity of enzymes plays a central role in asymmetric biocatalytic reactions, but there remains a dearth of evolution-driven biochemistry studies investigating the evolutionary trajectory of this vital property. Imine reductases (IREDs) are one such enzyme that possesses excellent stereoselectivity, and stereocomplementary members are pervasive in the family. However, the regulatory mechanism behind stereocomplementarity remains cryptic. Herein, we reconstruct a panel of active ancestral IREDs and trace the evolution of stereoselectivity from ancestors to extant IREDs. Combined with coevolution analysis, we reveal six historical mutations capable of recapitulating stereoselectivity evolution. An investigation of the mechanism with X-ray crystallography shows that they collectively reshape the substrate-binding pocket to regulate stereoselectivity inversion. In addition, we construct an empirical fitness landscape and discover that epistasis is prevalent in stereoselectivity evolution. Our findings emphasize the power of ASR in circumventing the time-consuming large-scale mutagenesis library screening for identifying mutations that change functions and support a Darwinian premise from a molecular perspective that the evolution of biological functions is a stepwise process.
Collapse
Affiliation(s)
- Xin-Xin Zhu
- State Key Laboratory of Bioreactor Engineering, Shanghai Collaborative Innovation Center for Biomanufacturing, East China University of Science and Technology, Shanghai, China
| | - Wen-Qing Zheng
- State Key Laboratory of Bioreactor Engineering, Shanghai Collaborative Innovation Center for Biomanufacturing, East China University of Science and Technology, Shanghai, China
| | - Zi-Wei Xia
- State Key Laboratory of Bioreactor Engineering, Shanghai Collaborative Innovation Center for Biomanufacturing, East China University of Science and Technology, Shanghai, China
| | - Xin-Ru Chen
- State Key Laboratory of Bioreactor Engineering, Shanghai Collaborative Innovation Center for Biomanufacturing, East China University of Science and Technology, Shanghai, China
| | - Tian Jin
- State Key Laboratory of Bioreactor Engineering, Shanghai Collaborative Innovation Center for Biomanufacturing, East China University of Science and Technology, Shanghai, China
| | - Xu-Wei Ding
- State Key Laboratory of Bioreactor Engineering, Shanghai Collaborative Innovation Center for Biomanufacturing, East China University of Science and Technology, Shanghai, China
| | - Fei-Fei Chen
- State Key Laboratory of Bioreactor Engineering, Shanghai Collaborative Innovation Center for Biomanufacturing, East China University of Science and Technology, Shanghai, China
| | - Qi Chen
- State Key Laboratory of Bioreactor Engineering, Shanghai Collaborative Innovation Center for Biomanufacturing, East China University of Science and Technology, Shanghai, China
| | - Jian-He Xu
- State Key Laboratory of Bioreactor Engineering, Shanghai Collaborative Innovation Center for Biomanufacturing, East China University of Science and Technology, Shanghai, China
| | - Xu-Dong Kong
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, and Zhangjiang Institute for Advanced Study, Shanghai Jiao Tong University, Shanghai, China.
| | - Gao-Wei Zheng
- State Key Laboratory of Bioreactor Engineering, Shanghai Collaborative Innovation Center for Biomanufacturing, East China University of Science and Technology, Shanghai, China.
| |
Collapse
|
4
|
Crandall JG, Zhou X, Rokas A, Hittinger CT. Specialization Restricts the Evolutionary Paths Available to Yeast Sugar Transporters. Mol Biol Evol 2024; 41:msae228. [PMID: 39492761 PMCID: PMC11571961 DOI: 10.1093/molbev/msae228] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2024] [Revised: 10/22/2024] [Accepted: 10/25/2024] [Indexed: 11/05/2024] Open
Abstract
Functional innovation at the protein level is a key source of evolutionary novelties. The constraints on functional innovations are likely to be highly specific in different proteins, which are shaped by their unique histories and the extent of global epistasis that arises from their structures and biochemistries. These contextual nuances in the sequence-function relationship have implications both for a basic understanding of the evolutionary process and for engineering proteins with desirable properties. Here, we have investigated the molecular basis of novel function in a model member of an ancient, conserved, and biotechnologically relevant protein family. These Major Facilitator Superfamily sugar porters are a functionally diverse group of proteins that are thought to be highly plastic and evolvable. By dissecting a recent evolutionary innovation in an α-glucoside transporter from the yeast Saccharomyces eubayanus, we show that the ability to transport a novel substrate requires high-order interactions between many protein regions and numerous specific residues proximal to the transport channel. To reconcile the functional diversity of this family with the constrained evolution of this model protein, we generated new, state-of-the-art genome annotations for 332 Saccharomycotina yeast species spanning ∼400 My of evolution. By integrating phylogenetic and phenotypic analyses across these species, we show that the model yeast α-glucoside transporters likely evolved from a multifunctional ancestor and became subfunctionalized. The accumulation of additive and epistatic substitutions likely entrenched this subfunction, which made the simultaneous acquisition of multiple interacting substitutions the only reasonably accessible path to novelty.
Collapse
Affiliation(s)
- Johnathan G Crandall
- Laboratory of Genetics, J. F. Crow Institute for the Study of Evolution, Center for Genomic Science Innovation, DOE Great Lakes Bioenergy Research Center, Wisconsin Energy Institute, University of Wisconsin-Madison, Madison, WI 53726, USA
| | - Xiaofan Zhou
- Guangdong Province Key Laboratory of Microbial Signals and Disease Control, Integrative Microbiology Research Center, South China Agricultural University, Guangzhou 510642, China
- Department of Biological Sciences and Evolutionary Studies Initiative, Vanderbilt University, Nashville, TN 37235, USA
| | - Antonis Rokas
- Department of Biological Sciences and Evolutionary Studies Initiative, Vanderbilt University, Nashville, TN 37235, USA
| | - Chris Todd Hittinger
- Laboratory of Genetics, J. F. Crow Institute for the Study of Evolution, Center for Genomic Science Innovation, DOE Great Lakes Bioenergy Research Center, Wisconsin Energy Institute, University of Wisconsin-Madison, Madison, WI 53726, USA
| |
Collapse
|
5
|
Alpay BA, Desai MM. Effects of selection stringency on the outcomes of directed evolution. PLoS One 2024; 19:e0311438. [PMID: 39401192 PMCID: PMC11472920 DOI: 10.1371/journal.pone.0311438] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2024] [Accepted: 09/18/2024] [Indexed: 10/17/2024] Open
Abstract
Directed evolution makes mutant lineages compete in climbing complicated sequence-function landscapes. Given this underlying complexity it is unclear how selection stringency, a ubiquitous parameter of directed evolution, impacts the outcome. Here we approach this question in terms of the fitnesses of the candidate variants at each round and the heterogeneity of their distributions of fitness effects. We show that even if the fittest mutant is most likely to yield the fittest mutants in the next round of selection, diversification can improve outcomes by sampling a larger variety of fitness effects. We find that heterogeneity in fitness effects between variants, larger population sizes, and evolution over a greater number of rounds all encourage diversification.
Collapse
Affiliation(s)
- Berk A. Alpay
- Systems, Synthetic, and Quantitative Biology Program, Harvard University, Cambridge, MA, United States of America
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA, United States of America
| | - Michael M. Desai
- Systems, Synthetic, and Quantitative Biology Program, Harvard University, Cambridge, MA, United States of America
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA, United States of America
- Department of Physics, Harvard University, Cambridge, MA, United States of America
| |
Collapse
|
6
|
Chen JZ, Bisardi M, Lee D, Cotogno S, Zamponi F, Weigt M, Tokuriki N. Understanding epistatic networks in the B1 β-lactamases through coevolutionary statistical modeling and deep mutational scanning. Nat Commun 2024; 15:8441. [PMID: 39349467 PMCID: PMC11442494 DOI: 10.1038/s41467-024-52614-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2024] [Accepted: 09/16/2024] [Indexed: 10/02/2024] Open
Abstract
Throughout evolution, protein families undergo substantial sequence divergence while preserving structure and function. Although most mutations are deleterious, evolution can explore sequence space via epistatic networks of intramolecular interactions that alleviate the harmful mutations. However, comprehensive analysis of such epistatic networks across protein families remains limited. Thus, we conduct a family wide analysis of the B1 metallo-β-lactamases, combining experiments (deep mutational scanning, DMS) on two distant homologs (NDM-1 and VIM-2) and computational analyses (in silico DMS based on Direct Coupling Analysis, DCA) of 100 homologs. The methods jointly reveal and quantify prevalent epistasis, as ~1/3rd of equivalent mutations are epistatic in DMS. From DCA, half of the positions have a >6.5 fold difference in effective number of tolerated mutations across the entire family. Notably, both methods locate residues with the strongest epistasis in regions of intermediate residue burial, suggesting a balance of residue packing and mutational freedom in forming epistatic networks. We identify entrenched WT residues between NDM-1 and VIM-2 in DMS, which display statistically distinct behaviors in DCA from non-entrenched residues. Entrenched residues are not easily compensated by changes in single nearby interactions, reinforcing existing findings where a complex epistatic network compounds smaller effects from many interacting residues.
Collapse
Affiliation(s)
- J Z Chen
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC, Canada
| | - M Bisardi
- Laboratoire de Physique de l'Ecole Normale Supérieure, ENS, Université PSL, CNRS, Sorbonne Université, Université de Paris, F-75005, Paris, France
- Sorbonne Université, CNRS, Institut de Biologie Paris Seine, Biologie Computationnelle et Quantitative LCQB, F-75005, Paris, France
| | - D Lee
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC, Canada
| | - S Cotogno
- Laboratoire de Physique de l'Ecole Normale Supérieure, ENS, Université PSL, CNRS, Sorbonne Université, Université de Paris, F-75005, Paris, France
- Sorbonne Université, CNRS, Institut de Biologie Paris Seine, Biologie Computationnelle et Quantitative LCQB, F-75005, Paris, France
| | - F Zamponi
- Laboratoire de Physique de l'Ecole Normale Supérieure, ENS, Université PSL, CNRS, Sorbonne Université, Université de Paris, F-75005, Paris, France
- Dipartimento di Fisica, Sapienza Università di Roma, I-00185, Rome, Italy
| | - M Weigt
- Sorbonne Université, CNRS, Institut de Biologie Paris Seine, Biologie Computationnelle et Quantitative LCQB, F-75005, Paris, France
| | - N Tokuriki
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC, Canada.
| |
Collapse
|
7
|
Schmidlin K, Apodaca S, Newell D, Sastokas A, Kinsler G, Geiler-Samerotte K. Distinguishing mutants that resist drugs via different mechanisms by examining fitness tradeoffs. eLife 2024; 13:RP94144. [PMID: 39255191 PMCID: PMC11386965 DOI: 10.7554/elife.94144] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/12/2024] Open
Abstract
There is growing interest in designing multidrug therapies that leverage tradeoffs to combat resistance. Tradeoffs are common in evolution and occur when, for example, resistance to one drug results in sensitivity to another. Major questions remain about the extent to which tradeoffs are reliable, specifically, whether the mutants that provide resistance to a given drug all suffer similar tradeoffs. This question is difficult because the drug-resistant mutants observed in the clinic, and even those evolved in controlled laboratory settings, are often biased towards those that provide large fitness benefits. Thus, the mutations (and mechanisms) that provide drug resistance may be more diverse than current data suggests. Here, we perform evolution experiments utilizing lineage-tracking to capture a fuller spectrum of mutations that give yeast cells a fitness advantage in fluconazole, a common antifungal drug. We then quantify fitness tradeoffs for each of 774 evolved mutants across 12 environments, finding these mutants group into classes with characteristically different tradeoffs. Their unique tradeoffs may imply that each group of mutants affects fitness through different underlying mechanisms. Some of the groupings we find are surprising. For example, we find some mutants that resist single drugs do not resist their combination, while others do. And some mutants to the same gene have different tradeoffs than others. These findings, on one hand, demonstrate the difficulty in relying on consistent or intuitive tradeoffs when designing multidrug treatments. On the other hand, by demonstrating that hundreds of adaptive mutations can be reduced to a few groups with characteristic tradeoffs, our findings may yet empower multidrug strategies that leverage tradeoffs to combat resistance. More generally speaking, by grouping mutants that likely affect fitness through similar underlying mechanisms, our work guides efforts to map the phenotypic effects of mutation.
Collapse
Affiliation(s)
- Kara Schmidlin
- Biodesign Center for Mechanisms of Evolution, Arizona State University, Tempe, United States
- School of Life Sciences, Arizona State University, Tempe, United States
| | - Sam Apodaca
- Biodesign Center for Mechanisms of Evolution, Arizona State University, Tempe, United States
- School of Life Sciences, Arizona State University, Tempe, United States
| | - Daphne Newell
- Biodesign Center for Mechanisms of Evolution, Arizona State University, Tempe, United States
- School of Life Sciences, Arizona State University, Tempe, United States
| | - Alexander Sastokas
- Biodesign Center for Mechanisms of Evolution, Arizona State University, Tempe, United States
- School of Life Sciences, Arizona State University, Tempe, United States
| | - Grant Kinsler
- Department of Bioengineering, University of Pennsylvania, Philadelphia, United States
| | - Kerry Geiler-Samerotte
- Biodesign Center for Mechanisms of Evolution, Arizona State University, Tempe, United States
- School of Life Sciences, Arizona State University, Tempe, United States
| |
Collapse
|
8
|
Šakanović A, Kranjc N, Omersa N, Aden S, Kežar A, Kisovec M, Zavec AB, Caserman S, Gilbert RJC, Podobnik M, Crnković A, Anderluh G. In vitro evolution driven by epistasis reveals alternative cholesterol-specific binding motifs of perfringolysin O. J Biol Chem 2024; 300:107664. [PMID: 39128714 PMCID: PMC11416283 DOI: 10.1016/j.jbc.2024.107664] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2024] [Revised: 07/22/2024] [Accepted: 08/05/2024] [Indexed: 08/13/2024] Open
Abstract
The crucial molecular factors that shape the interfaces of lipid-binding proteins with their target ligands and surfaces remain unknown due to the complex makeup of biological membranes. Cholesterol, the major modulator of bilayer structure in mammalian cell membranes, is recognized by various proteins, including the well-studied cholesterol-dependent cytolysins. Here, we use in vitro evolution to investigate the molecular adaptations that preserve the cholesterol specificity of perfringolysin O, the prototypical cholesterol-dependent cytolysin from Clostridium perfringens. We identify variants with altered membrane-binding interfaces whose cholesterol-specific activity exceeds that of the wild-type perfringolysin O. These novel variants represent alternative evolutionary outcomes and have mutations at conserved positions that can only accumulate when epistatic constraints are alleviated. Our results improve the current understanding of the biochemical malleability of the surface of a lipid-binding protein.
Collapse
Affiliation(s)
- Aleksandra Šakanović
- Department of Molecular Biology and Nanobiotechnology, National Institute of Chemistry, Ljubljana, Slovenia
| | - Nace Kranjc
- Department of Molecular Biology and Nanobiotechnology, National Institute of Chemistry, Ljubljana, Slovenia
| | - Neža Omersa
- Department of Molecular Biology and Nanobiotechnology, National Institute of Chemistry, Ljubljana, Slovenia
| | - Saša Aden
- Department of Molecular Biology and Nanobiotechnology, National Institute of Chemistry, Ljubljana, Slovenia
| | - Andreja Kežar
- Department of Molecular Biology and Nanobiotechnology, National Institute of Chemistry, Ljubljana, Slovenia
| | - Matic Kisovec
- Department of Molecular Biology and Nanobiotechnology, National Institute of Chemistry, Ljubljana, Slovenia
| | - Apolonija Bedina Zavec
- Department of Molecular Biology and Nanobiotechnology, National Institute of Chemistry, Ljubljana, Slovenia
| | - Simon Caserman
- Department of Molecular Biology and Nanobiotechnology, National Institute of Chemistry, Ljubljana, Slovenia
| | - Robert J C Gilbert
- Division of Structural Biology, Wellcome Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
| | - Marjetka Podobnik
- Department of Molecular Biology and Nanobiotechnology, National Institute of Chemistry, Ljubljana, Slovenia
| | - Ana Crnković
- Department of Molecular Biology and Nanobiotechnology, National Institute of Chemistry, Ljubljana, Slovenia.
| | - Gregor Anderluh
- Department of Molecular Biology and Nanobiotechnology, National Institute of Chemistry, Ljubljana, Slovenia.
| |
Collapse
|
9
|
Lipsh-Sokolik R, Fleishman SJ. Addressing epistasis in the design of protein function. Proc Natl Acad Sci U S A 2024; 121:e2314999121. [PMID: 39133844 PMCID: PMC11348311 DOI: 10.1073/pnas.2314999121] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/29/2024] Open
Abstract
Mutations in protein active sites can dramatically improve function. The active site, however, is densely packed and extremely sensitive to mutations. Therefore, some mutations may only be tolerated in combination with others in a phenomenon known as epistasis. Epistasis reduces the likelihood of obtaining improved functional variants and dramatically slows natural and lab evolutionary processes. Research has shed light on the molecular origins of epistasis and its role in shaping evolutionary trajectories and outcomes. In addition, sequence- and AI-based strategies that infer epistatic relationships from mutational patterns in natural or experimental evolution data have been used to design functional protein variants. In recent years, combinations of such approaches and atomistic design calculations have successfully predicted highly functional combinatorial mutations in active sites. These were used to design thousands of functional active-site variants, demonstrating that, while our understanding of epistasis remains incomplete, some of the determinants that are critical for accurate design are now sufficiently understood. We conclude that the space of active-site variants that has been explored by evolution may be expanded dramatically to enhance natural activities or discover new ones. Furthermore, design opens the way to systematically exploring sequence and structure space and mutational impacts on function, deepening our understanding and control over protein activity.
Collapse
Affiliation(s)
- Rosalie Lipsh-Sokolik
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot 7610001, Israel
| | - Sarel J Fleishman
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot 7610001, Israel
| |
Collapse
|
10
|
Hayes RL, Cervantes LF, Abad Santos JC, Samadi A, Vilseck JZ, Brooks CL. How to Sample Dozens of Substitutions per Site with λ Dynamics. J Chem Theory Comput 2024; 20:6098-6110. [PMID: 38976796 PMCID: PMC11270746 DOI: 10.1021/acs.jctc.4c00514] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2024] [Revised: 06/18/2024] [Accepted: 06/18/2024] [Indexed: 07/10/2024]
Abstract
Alchemical free energy methods are useful in computer-aided drug design and computational protein design because they provide rigorous statistical mechanics-based estimates of free energy differences from molecular dynamics simulations. λ dynamics is a free energy method with the ability to characterize combinatorial chemical spaces spanning thousands of related systems within a single simulation, which gives it a distinct advantage over other alchemical free energy methods that are mostly limited to pairwise comparisons. Recently developed methods have improved the scalability of λ dynamics to perturbations at many sites; however, the size of chemical space that can be explored at each individual site has previously been limited to fewer than ten substituents. As the number of substituents increases, the volume of alchemical space corresponding to nonphysical alchemical intermediates grows exponentially relative to the size corresponding to the physical states of interest. Beyond nine substituents, λ dynamics simulations become lost in an alchemical morass of intermediate states. In this work, we introduce new biasing potentials that circumvent excessive sampling of intermediate states by favoring sampling of physical end points relative to alchemical intermediates. Additionally, we present a more scalable adaptive landscape flattening algorithm for these larger alchemical spaces. Finally, we show that this potential enables more efficient sampling in both protein and drug design test systems with up to 24 substituents per site, enabling, for the first time, simultaneous simulation of all 20 amino acids.
Collapse
Affiliation(s)
- Ryan L. Hayes
- Department
of Chemical and Biomolecular Engineering, University of California Irvine, Irvine, California 92697, United States
- Department
of Pharmaceutical Sciences, University of
California Irvine, Irvine, California 92697, United States
| | - Luis F. Cervantes
- Department
of Medicinal Chemistry, College of Pharmacy, University of Michigan, Ann Arbor, Michigan 48109, United States
| | - Justin Cruz Abad Santos
- Department
of Chemical and Biomolecular Engineering, University of California Irvine, Irvine, California 92697, United States
| | - Amirmasoud Samadi
- Department
of Chemical and Biomolecular Engineering, University of California Irvine, Irvine, California 92697, United States
| | - Jonah Z. Vilseck
- Department
of Biochemistry and Molecular Biology, Indiana
University School of Medicine, Indianapolis, Indiana 46202, United States
- Center
for Computational Biology and Bioinformatics, Indiana University School of Medicine, Indianapolis, Indiana 46202, United States
| | - Charles L. Brooks
- Department
of Chemistry, University of Michigan, Ann Arbor, Michigan 48109, United States
- Biophysics
Program, University of Michigan, Ann Arbor, Michigan 48109, United States
| |
Collapse
|
11
|
Crandall JG, Zhou X, Rokas A, Hittinger CT. Specialization restricts the evolutionary paths available to yeast sugar transporters. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.22.604696. [PMID: 39091816 PMCID: PMC11291069 DOI: 10.1101/2024.07.22.604696] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/04/2024]
Abstract
Functional innovation at the protein level is a key source of evolutionary novelties. The constraints on functional innovations are likely to be highly specific in different proteins, which are shaped by their unique histories and the extent of global epistasis that arises from their structures and biochemistries. These contextual nuances in the sequence-function relationship have implications both for a basic understanding of the evolutionary process and for engineering proteins with desirable properties. Here, we have investigated the molecular basis of novel function in a model member of an ancient, conserved, and biotechnologically relevant protein family. These Major Facilitator Superfamily sugar porters are a functionally diverse group of proteins that are thought to be highly plastic and evolvable. By dissecting a recent evolutionary innovation in an α-glucoside transporter from the yeast Saccharomyces eubayanus, we show that the ability to transport a novel substrate requires high-order interactions between many protein regions and numerous specific residues proximal to the transport channel. To reconcile the functional diversity of this family with the constrained evolution of this model protein, we generated new, state-of-the-art genome annotations for 332 Saccharomycotina yeast species spanning approximately 400 million years of evolution. By integrating phylogenetic and phenotypic analyses across these species, we show that the model yeast α-glucoside transporters likely evolved from a multifunctional ancestor and became subfunctionalized. The accumulation of additive and epistatic substitutions likely entrenched this subfunction, which made the simultaneous acquisition of multiple interacting substitutions the only reasonably accessible path to novelty.
Collapse
Affiliation(s)
- Johnathan G. Crandall
- Laboratory of Genetics, J. F. Crow Institute for the Study of Evolution, Center for Genomic Science Innovation, DOE Great Lakes Bioenergy Research Center, Wisconsin Energy Institute, University of Wisconsin-Madison, Madison, WI 53726, USA
| | - Xiaofan Zhou
- Guangdong Province Key Laboratory of Microbial Signals and Disease Control, Integrative Microbiology Research Center, South China Agricultural University, Guangzhou 510642, China
- Department of Biological Sciences and Evolutionary Studies Initiative, Vanderbilt University, Nashville, TN 37235, USA
| | - Antonis Rokas
- Department of Biological Sciences and Evolutionary Studies Initiative, Vanderbilt University, Nashville, TN 37235, USA
| | - Chris Todd Hittinger
- Laboratory of Genetics, J. F. Crow Institute for the Study of Evolution, Center for Genomic Science Innovation, DOE Great Lakes Bioenergy Research Center, Wisconsin Energy Institute, University of Wisconsin-Madison, Madison, WI 53726, USA
| |
Collapse
|
12
|
Ardell S, Martsul A, Johnson MS, Kryazhimskiy S. Environment-independent distribution of mutational effects emerges from microscopic epistasis. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.11.18.567655. [PMID: 38014325 PMCID: PMC10680819 DOI: 10.1101/2023.11.18.567655] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2023]
Abstract
Predicting how new mutations alter phenotypes is difficult because mutational effects vary across genotypes and environments. Recently discovered global epistasis, where the fitness effects of mutations scale with the fitness of the background genotype, can improve predictions, but how the environment modulates this scaling is unknown. We measured the fitness effects of ~100 insertion mutations in 42 strains of Saccharomyces cerevisiae in six laboratory environments and found that the global-epistasis scaling is nearly invariant across environments. Instead, the environment tunes one global parameter, the background fitness at which most mutations switch sign. As a consequence, the distribution of mutational effects is predictable across genotypes and environments. Our results suggest that the effective dimensionality of genotype-to-phenotype maps across environments is surprisingly low.
Collapse
Affiliation(s)
- Sarah Ardell
- Department of Ecology, Behavior and Evolution, University of California San Diego, La Jolla, CA 92093
| | - Alena Martsul
- Department of Ecology, Behavior and Evolution, University of California San Diego, La Jolla, CA 92093
| | - Milo S. Johnson
- Department of Integrative Biology, University of California Berkeley, Berkeley, CA 94720
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Sergey Kryazhimskiy
- Department of Ecology, Behavior and Evolution, University of California San Diego, La Jolla, CA 92093
| |
Collapse
|
13
|
Chisholm LO, Orlandi KN, Phillips SR, Shavlik MJ, Harms MJ. Ancestral Reconstruction and the Evolution of Protein Energy Landscapes. Annu Rev Biophys 2024; 53:127-146. [PMID: 38134334 PMCID: PMC11192866 DOI: 10.1146/annurev-biophys-030722-125440] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2023]
Abstract
A protein's sequence determines its conformational energy landscape. This, in turn, determines the protein's function. Understanding the evolution of new protein functions therefore requires understanding how mutations alter the protein energy landscape. Ancestral sequence reconstruction (ASR) has proven a valuable tool for tackling this problem. In ASR, one phylogenetically infers the sequences of ancient proteins, allowing characterization of their properties. When coupled to biophysical, biochemical, and functional characterization, ASR can reveal how historical mutations altered the energy landscape of ancient proteins, allowing the evolution of enzyme activity, altered conformations, binding specificity, oligomerization, and many other protein features. In this article, we review how ASR studies have been used to dissect the evolution of energy landscapes. We also discuss ASR studies that reveal how energy landscapes have shaped protein evolution. Finally, we propose that thinking about evolution from the perspective of an energy landscape can improve how we approach and interpret ASR studies.
Collapse
Affiliation(s)
- Lauren O Chisholm
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon, USA;
- Institute of Molecular Biology, University of Oregon, Eugene, Oregon, USA
| | - Kona N Orlandi
- Institute of Molecular Biology, University of Oregon, Eugene, Oregon, USA
- Department of Biology, University of Oregon, Eugene, Oregon, USA
| | - Sophia R Phillips
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon, USA;
- Institute of Molecular Biology, University of Oregon, Eugene, Oregon, USA
| | - Michael J Shavlik
- Institute of Molecular Biology, University of Oregon, Eugene, Oregon, USA
- Department of Biology, University of Oregon, Eugene, Oregon, USA
| | - Michael J Harms
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon, USA;
- Institute of Molecular Biology, University of Oregon, Eugene, Oregon, USA
| |
Collapse
|
14
|
Chen SK, Liu J, Van Nynatten A, Tudor-Price BM, Chang BSW. Sampling Strategies for Experimentally Mapping Molecular Fitness Landscapes Using High-Throughput Methods. J Mol Evol 2024:10.1007/s00239-024-10179-8. [PMID: 38886207 DOI: 10.1007/s00239-024-10179-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2024] [Accepted: 05/20/2024] [Indexed: 06/20/2024]
Abstract
Empirical studies of genotype-phenotype-fitness maps of proteins are fundamental to understanding the evolutionary process, in elucidating the space of possible genotypes accessible through mutations in a landscape of phenotypes and fitness effects. Yet, comprehensively mapping molecular fitness landscapes remains challenging since all possible combinations of amino acid substitutions for even a few protein sites are encoded by an enormous genotype space. High-throughput mapping of genotype space can be achieved using large-scale screening experiments known as multiplexed assays of variant effect (MAVEs). However, to accommodate such multi-mutational studies, the size of MAVEs has grown to the point where a priori determination of sampling requirements is needed. To address this problem, we propose calculations and simulation methods to approximate minimum sampling requirements for multi-mutational MAVEs, which we combine with a new library construction protocol to experimentally validate our approximation approaches. Analysis of our simulated data reveals how sampling trajectories differ between simulations of nucleotide versus amino acid variants and among mutagenesis schemes. For this, we show quantitatively that marginal gains in sampling efficiency demand increasingly greater sampling effort when sampling for nucleotide sequences over their encoded amino acid equivalents. We present a new library construction protocol that efficiently maximizes sequence variation, and demonstrate using ultradeep sequencing that the library encodes virtually all possible combinations of mutations within the experimental design. Insights learned from our analyses together with the methodological advances reported herein are immediately applicable toward pooled experimental screens of arbitrary design, enabling further assay upscaling and expanded testing of genotype space.
Collapse
Affiliation(s)
- Steven K Chen
- Department of Cell & Systems Biology, University of Toronto, Toronto, ON, Canada
| | - Jing Liu
- Department of Cell & Systems Biology, University of Toronto, Toronto, ON, Canada
| | - Alexander Van Nynatten
- Department of Biological Science, University of Toronto Scarborough, Toronto, ON, Canada
| | | | - Belinda S W Chang
- Department of Cell & Systems Biology, University of Toronto, Toronto, ON, Canada.
- Department of Ecology & Evolutionary Biology, University of Toronto, Toronto, ON, Canada.
- Centre for the Analysis of Genome Evolution & Function, University of Toronto, Toronto, ON, Canada.
| |
Collapse
|
15
|
Alpay BA, Desai MM. Effects of selection stringency on the outcomes of directed evolution. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.09.598029. [PMID: 38895455 PMCID: PMC11185767 DOI: 10.1101/2024.06.09.598029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/21/2024]
Abstract
Directed evolution makes mutant lineages compete in climbing complicated sequence-function landscapes. Given this underlying complexity it is unclear how selection stringency, a ubiquitous parameter of directed evolution, impacts the outcome. Here we approach this question in terms of the fitnesses of the candidate variants at each round and the heterogeneity of their distributions of fitness effects. We show that even if the fittest mutant is most likely to yield the fittest mutants in the next round of selection, diversification can improve outcomes by sampling a larger variety of fitness effects. We find that heterogeneity in fitness effects between variants, larger population sizes, and evolution over a greater number of rounds all encourage diversification.
Collapse
Affiliation(s)
- Berk A. Alpay
- Systems, Synthetic, and Quantitative Biology Program, Harvard University, Cambridge, MA, USA
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA, USA
| | - Michael M. Desai
- Systems, Synthetic, and Quantitative Biology Program, Harvard University, Cambridge, MA, USA
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA, USA
- Department of Physics, Harvard University, Cambridge, MA, USA
| |
Collapse
|
16
|
Metzger BPH, Park Y, Starr TN, Thornton JW. Epistasis facilitates functional evolution in an ancient transcription factor. eLife 2024; 12:RP88737. [PMID: 38767330 PMCID: PMC11105156 DOI: 10.7554/elife.88737] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/22/2024] Open
Abstract
A protein's genetic architecture - the set of causal rules by which its sequence produces its functions - also determines its possible evolutionary trajectories. Prior research has proposed that the genetic architecture of proteins is very complex, with pervasive epistatic interactions that constrain evolution and make function difficult to predict from sequence. Most of this work has analyzed only the direct paths between two proteins of interest - excluding the vast majority of possible genotypes and evolutionary trajectories - and has considered only a single protein function, leaving unaddressed the genetic architecture of functional specificity and its impact on the evolution of new functions. Here, we develop a new method based on ordinal logistic regression to directly characterize the global genetic determinants of multiple protein functions from 20-state combinatorial deep mutational scanning (DMS) experiments. We use it to dissect the genetic architecture and evolution of a transcription factor's specificity for DNA, using data from a combinatorial DMS of an ancient steroid hormone receptor's capacity to activate transcription from two biologically relevant DNA elements. We show that the genetic architecture of DNA recognition consists of a dense set of main and pairwise effects that involve virtually every possible amino acid state in the protein-DNA interface, but higher-order epistasis plays only a tiny role. Pairwise interactions enlarge the set of functional sequences and are the primary determinants of specificity for different DNA elements. They also massively expand the number of opportunities for single-residue mutations to switch specificity from one DNA target to another. By bringing variants with different functions close together in sequence space, pairwise epistasis therefore facilitates rather than constrains the evolution of new functions.
Collapse
Affiliation(s)
- Brian PH Metzger
- Department of Ecology and Evolution, University of ChicagoChicagoUnited States
| | - Yeonwoo Park
- Program in Genetics, Genomics, and Systems Biology, University of ChicagoChicagoUnited States
| | - Tyler N Starr
- Department of Biochemistry and Molecular Biophysics, University of ChicagoChicagoUnited States
| | - Joseph W Thornton
- Department of Ecology and Evolution, University of ChicagoChicagoUnited States
- Department of Human Genetics, University of ChicagoChicagoUnited States
| |
Collapse
|
17
|
Syrén PO. Ancestral terpene cyclases: From fundamental science to applications in biosynthesis. Methods Enzymol 2024; 699:311-341. [PMID: 38942509 DOI: 10.1016/bs.mie.2024.04.025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/30/2024]
Abstract
Terpenes constitute one of the largest family of natural products with potent applications as renewable platform chemicals and medicines. The low activity, selectivity and stability displayed by terpene biosynthetic machineries can constitute an obstacle towards achieving expedient biosynthesis of terpenoids in processes that adhere to the 12 principles of green chemistry. Accordingly, engineering of terpene synthase enzymes is a prerequisite for industrial biotechnology applications, but obstructed by their complex catalysis that depend on reactive carbocationic intermediates that are prone to undergo bifurcation mechanisms. Rational redesign of terpene synthases can be tedious and requires high-resolution structural information, which is not always available. Furthermore, it has proven difficult to link sequence space of terpene synthase enzymes to specific product profiles. Herein, the author shows how ancestral sequence reconstruction (ASR) can favorably be used as a protein engineering tool in the redesign of terpene synthases without the need of a structure, and without excessive screening. A detailed workflow of ASR is presented along with associated limitations, with a focus on applying this methodology on terpene synthases. From selected examples of both class I and II enzymes, the author advocates that ancestral terpene cyclases constitute valuable assets to shed light on terpene-synthase catalysis and in enabling accelerated biosynthesis.
Collapse
Affiliation(s)
- Per-Olof Syrén
- School of Chemistry, Biotechnology and Health, Science for Life Laboratory, KTH Royal Institute of Technology, Solna, Sweden; School of Engineering Sciences in Chemistry, Biotechnology and Health, Department of Fibre and Polymer Technology, KTH Royal Institute of Technology, Stockholm, Sweden.
| |
Collapse
|
18
|
Claussnitzer M, Parikh VN, Wagner AH, Arbesfeld JA, Bult CJ, Firth HV, Muffley LA, Nguyen Ba AN, Riehle K, Roth FP, Tabet D, Bolognesi B, Glazer AM, Rubin AF. Minimum information and guidelines for reporting a multiplexed assay of variant effect. Genome Biol 2024; 25:100. [PMID: 38641812 PMCID: PMC11027375 DOI: 10.1186/s13059-024-03223-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2023] [Accepted: 03/25/2024] [Indexed: 04/21/2024] Open
Abstract
Multiplexed assays of variant effect (MAVEs) have emerged as a powerful approach for interrogating thousands of genetic variants in a single experiment. The flexibility and widespread adoption of these techniques across diverse disciplines have led to a heterogeneous mix of data formats and descriptions, which complicates the downstream use of the resulting datasets. To address these issues and promote reproducibility and reuse of MAVE data, we define a set of minimum information standards for MAVE data and metadata and outline a controlled vocabulary aligned with established biomedical ontologies for describing these experimental designs.
Collapse
Affiliation(s)
- Melina Claussnitzer
- The Novo Nordisk Foundation Center for Genomic Mechanisms of Disease, Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Harvard Medical School, Cambridge, MA, 02142, USA
| | - Victoria N Parikh
- Stanford Center for Inherited Cardiovascular Disease, Stanford University School of Medicine, Stanford, CA, 94305, USA
| | - Alex H Wagner
- The Steve and Cindy Rasmussen Institute for Genomic Medicine, Nationwide Children's Hospital, Columbus, OH, 43215, USA
- Department of Pediatrics, The Ohio State University College of Medicine, Columbus, OH, 43210, USA
| | - Jeremy A Arbesfeld
- The Steve and Cindy Rasmussen Institute for Genomic Medicine, Nationwide Children's Hospital, Columbus, OH, 43215, USA
- Department of Biomedical Informatics, The Ohio State University, Columbus, OH, 43210, USA
| | - Carol J Bult
- The Jackson Laboratory, Bar Harbor, ME, 04609, USA
| | - Helen V Firth
- Wellcome Sanger Institute, Hinxton, Cambridge, UK
- Dept of Medical Genetics, Cambridge University Hospitals NHS Trust, Cambridge, UK
| | - Lara A Muffley
- Department of Genome Sciences, University of Washington, Seattle, WA, 98105, USA
| | - Alex N Nguyen Ba
- Department of Biology, University of Toronto at Mississauga, Mississauga, ON, Canada
| | - Kevin Riehle
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Frederick P Roth
- Donnelly Centre, University of Toronto, Toronto, ON, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
- Lunenfeld-Tanenbaum Research Institute, Sinai Health, Toronto, ON, Canada
| | - Daniel Tabet
- Donnelly Centre, University of Toronto, Toronto, ON, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
- Lunenfeld-Tanenbaum Research Institute, Sinai Health, Toronto, ON, Canada
| | - Benedetta Bolognesi
- Institute for Bioengineering of Catalunya (IBEC), The Barcelona Institute of Science and Technology, Barcelona, Spain.
| | - Andrew M Glazer
- Vanderbilt University Medical Center, Nashville, TN, 37232, USA.
| | - Alan F Rubin
- Bioinformatics Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, VIC, Australia.
- Department of Medical Biology, University of Melbourne, Parkville, VIC, Australia.
| |
Collapse
|
19
|
Meger AT, Spence MA, Sandhu M, Matthews D, Chen J, Jackson CJ, Raman S. Rugged fitness landscapes minimize promiscuity in the evolution of transcriptional repressors. Cell Syst 2024; 15:374-387.e6. [PMID: 38537640 PMCID: PMC11299162 DOI: 10.1016/j.cels.2024.03.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2023] [Revised: 09/08/2023] [Accepted: 03/05/2024] [Indexed: 04/20/2024]
Abstract
How a protein's function influences the shape of its fitness landscape, smooth or rugged, is a fundamental question in evolutionary biochemistry. Smooth landscapes arise when incremental mutational steps lead to a progressive change in function, as commonly seen in enzymes and binding proteins. On the other hand, rugged landscapes are poorly understood because of the inherent unpredictability of how sequence changes affect function. Here, we experimentally characterize the entire sequence phylogeny, comprising 1,158 extant and ancestral sequences, of the DNA-binding domain (DBD) of the LacI/GalR transcriptional repressor family. Our analysis revealed an extremely rugged landscape with rapid switching of specificity, even between adjacent nodes. Further, the ruggedness arises due to the necessity of the repressor to simultaneously evolve specificity for asymmetric operators and disfavors potentially adverse regulatory crosstalk. Our study provides fundamental insight into evolutionary, molecular, and biophysical rules of genetic regulation through the lens of fitness landscapes.
Collapse
Affiliation(s)
- Anthony T Meger
- Department of Biochemistry, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Matthew A Spence
- Research School of Chemistry, Australian National University, Canberra, ACT 2601, Australia
| | - Mahakaran Sandhu
- Research School of Chemistry, Australian National University, Canberra, ACT 2601, Australia
| | - Dana Matthews
- Research School of Biology, Australian National University, Canberra, ACT 2601, Australia
| | - Jackie Chen
- Department of Biochemistry, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Colin J Jackson
- Research School of Chemistry, Australian National University, Canberra, ACT 2601, Australia; ARC Centre of Excellence for Innovations in Peptide & Protein Science, Research School of Chemistry, Australian National University, Canberra, ACT 2601, Australia; ARC Centre of Excellence for Innovations in Synthetic Biology, Research School of Chemistry, Australian National University, Canberra, ACT 2601, Australia.
| | - Srivatsan Raman
- Department of Biochemistry, University of Wisconsin-Madison, Madison, WI 53706, USA; Department of Bacteriology, University of Wisconsin-Madison, Madison, WI 53706, USA; Department of Chemical and Biological Engineering, University of Wisconsin-Madison, Madison, WI 53706, USA.
| |
Collapse
|
20
|
Lee SH, Wang CY, Li IJ, Abe G, Ota KG. Exploring the origin of a unique mutant allele in twin-tail goldfish using CRISPR/Cas9 mutants. Sci Rep 2024; 14:8716. [PMID: 38622170 PMCID: PMC11018756 DOI: 10.1038/s41598-024-58448-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2023] [Accepted: 03/29/2024] [Indexed: 04/17/2024] Open
Abstract
Artificial selection has been widely applied to genetically fix rare phenotypic features in ornamental domesticated animals. For many of these animals, the mutated loci and alleles underlying rare phenotypes are known. However, few studies have explored whether these rare genetic mutations might have been fixed due to competition among related mutated alleles or if the fixation occurred due to contingent stochastic events. Here, we performed genetic crossing with twin-tail ornamental goldfish and CRISPR/Cas9-mutated goldfish to investigate why only a single mutated allele-chdS with a E127X stop codon (also called chdAE127X)-gives rise to the twin-tail phenotype in the modern domesticated goldfish population. Two closely related chdS mutants were generated with CRISPR/Cas9 and compared with the E127X allele in F2 and F3 generations. Both of the CRISPR/Cas9-generated alleles were equivalent to the E127X allele in terms of penetrance/expressivity of the twin-tail phenotype and viability of carriers. These findings indicate that multiple truncating mutations could have produced viable twin-tail goldfish. Therefore, the absence of polymorphic alleles for the twin-tail phenotype in modern goldfish likely stems from stochastic elimination or a lack of competing alleles in the common ancestor. Our study is the first experimental comparison of a singular domestication-derived allele with CRISPR/Cas9-generated alleles to understand how genetic fixation of a unique genotype and phenotype may have occurred. Thus, our work may provide a conceptual framework for future investigations of rare evolutionary events in domesticated animals.
Collapse
Affiliation(s)
- Shu-Hua Lee
- Laboratory of Aquatic Zoology, Marine Research Station, Institute of Cellular and Organismic Biology, Academia Sinica, Yilan, 26242, Taiwan
| | - Chen-Yi Wang
- Laboratory of Aquatic Zoology, Marine Research Station, Institute of Cellular and Organismic Biology, Academia Sinica, Yilan, 26242, Taiwan
| | - Ing-Jia Li
- Laboratory of Aquatic Zoology, Marine Research Station, Institute of Cellular and Organismic Biology, Academia Sinica, Yilan, 26242, Taiwan
| | - Gembu Abe
- Laboratory of Aquatic Zoology, Marine Research Station, Institute of Cellular and Organismic Biology, Academia Sinica, Yilan, 26242, Taiwan
- Division of Developmental Biology, Department of Functional Morphology, Faculty of Medicine, School of Life Science, Tottori University, Nishi-cho 86, Yonago, 683-8503, Japan
| | - Kinya G Ota
- Laboratory of Aquatic Zoology, Marine Research Station, Institute of Cellular and Organismic Biology, Academia Sinica, Yilan, 26242, Taiwan.
| |
Collapse
|
21
|
Dibyachintan S, Dube AK, Bradley D, Lemieux P, Dionne U, Landry CR. Cryptic genetic variation shapes the fate of gene duplicates in a protein interaction network. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.02.23.581840. [PMID: 38464075 PMCID: PMC10925128 DOI: 10.1101/2024.02.23.581840] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/12/2024]
Abstract
Paralogous genes are often redundant for long periods of time before they diverge in function. While their functions are preserved, paralogous proteins can accumulate mutations that, through epistasis, could impact their fate in the future. By quantifying the impact of all single-amino acid substitutions on the binding of two myosin proteins to their interaction partners, we find that the future evolution of these proteins is highly contingent on their regulatory divergence and the mutations that have silently accumulated in their protein binding domains. Differences in the promoter strength of the two paralogs amplify the impact of mutations on binding in the lowly expressed one. While some mutations would be sufficient to non-functionalize one paralog, they would have minimal impact on the other. Our results reveal how functionally equivalent protein domains could be destined to specific fates by regulatory and cryptic coding sequence changes that currently have little to no functional impact.
Collapse
Affiliation(s)
- Soham Dibyachintan
- PROTEO-Regroupement Québécois de Recherche sur la Fonction, l'Ingénierie et les Applications des Protéines, Québec, QC, Canada
- Centre de Recherche en Données Massives de l'Université Laval, Université Laval, Québec, QC, Canada
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada
- Département de Biochimie, de Microbiologie et de Bio-Informatique, Université Laval, Québec, QC, Canada
| | - Alexandre K Dube
- PROTEO-Regroupement Québécois de Recherche sur la Fonction, l'Ingénierie et les Applications des Protéines, Québec, QC, Canada
- Centre de Recherche en Données Massives de l'Université Laval, Université Laval, Québec, QC, Canada
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada
- Département de Biochimie, de Microbiologie et de Bio-Informatique, Université Laval, Québec, QC, Canada
- Département de Biologie, Université Laval, Québec, QC, Canada
| | - David Bradley
- PROTEO-Regroupement Québécois de Recherche sur la Fonction, l'Ingénierie et les Applications des Protéines, Québec, QC, Canada
- Centre de Recherche en Données Massives de l'Université Laval, Université Laval, Québec, QC, Canada
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada
- Département de Biochimie, de Microbiologie et de Bio-Informatique, Université Laval, Québec, QC, Canada
- Département de Biologie, Université Laval, Québec, QC, Canada
| | - Pascale Lemieux
- PROTEO-Regroupement Québécois de Recherche sur la Fonction, l'Ingénierie et les Applications des Protéines, Québec, QC, Canada
- Centre de Recherche en Données Massives de l'Université Laval, Université Laval, Québec, QC, Canada
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada
- Département de Biochimie, de Microbiologie et de Bio-Informatique, Université Laval, Québec, QC, Canada
| | - Ugo Dionne
- PROTEO-Regroupement Québécois de Recherche sur la Fonction, l'Ingénierie et les Applications des Protéines, Québec, QC, Canada
- Centre de Recherche en Données Massives de l'Université Laval, Université Laval, Québec, QC, Canada
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada
- Current affiliation: Lunenfeld-Tanenbaum Research Institute, Sinai Health, Toronto, ON, Canada
| | - Christian R Landry
- PROTEO-Regroupement Québécois de Recherche sur la Fonction, l'Ingénierie et les Applications des Protéines, Québec, QC, Canada
- Centre de Recherche en Données Massives de l'Université Laval, Université Laval, Québec, QC, Canada
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada
- Département de Biochimie, de Microbiologie et de Bio-Informatique, Université Laval, Québec, QC, Canada
- Département de Biologie, Université Laval, Québec, QC, Canada
| |
Collapse
|
22
|
Li ZL, Sun CQ, Qing ZL, Li ZM, Liu HL. Engineering the thermal stability of a polyphosphate kinase by ancestral sequence reconstruction to expand the temperature boundary for an industrially applicable ATP regeneration system. Appl Environ Microbiol 2024; 90:e0157423. [PMID: 38236018 PMCID: PMC10880597 DOI: 10.1128/aem.01574-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2023] [Accepted: 12/06/2023] [Indexed: 01/19/2024] Open
Abstract
ATP-dependent energy-consuming enzymatic reactions are widely used in cell-free biocatalysis. However, the direct addition of large amounts of expensive ATP can greatly increase cost, and enzymatic production is often difficult to achieve as a result. Although a polyphosphate kinase (PPK)-polyphosphate-based ATP regeneration system has the potential to solve this challenge, the generally poor thermal stability of PPKs limits the widespread use of this method. In this paper, we evaluated the thermal stability of a PPK from Sulfurovum lithotrophicum (SlPPK2). After directed evolution and computation-supported design, we found that SlPPK2 is very recalcitrant and cannot acquire beneficial mutations. Inspired by the usually outstanding stability of ancestral enzymes, we reconstructed the ancestral sequence of the PPK family and used it as a guide to construct three heat-stable variants of SlPPK2, of which the L35F/T144S variant has a half-life of more than 14 h at 60°C. Molecular dynamics simulations were performed on all enzymes to analyze the reasons for the increased thermal stability. The results showed that mutations at these two positions act synergistically from the interior and surface of the protein, leading to a more compact structure. Finally, the robustness of the L35F/T144S variant was verified in the synthesis of nucleotides at high temperature. In practice, the use of this high-temperature ATP regeneration system can effectively avoid byproduct accumulation. Our work extends the temperature boundary of ATP regeneration and has great potential for industrial applications.IMPORTANCEATP regeneration is an important basic applied study in the field of cell-free biocatalysis. Polyphosphate kinase (PPK) is an enzyme tool widely used for energy regeneration during enzymatic reactions. However, the thermal stability of the PPKs reported to date that can efficiently regenerate ATP is usually poor, which greatly limits their application. In this study, the thermal stability of a difficult-to-engineer PPK from Sulfurovum lithotrophicum was improved, guided by an ancestral sequence reconstruction strategy. The optimal variant has a 4.5-fold longer half-life at 60°C than the wild-type enzyme, thus enabling the extension of the temperature boundary for ATP regeneration. The ability of this variant to regenerate ATP was well demonstrated during high-temperature enzymatic production of nucleotides.
Collapse
Affiliation(s)
- Zong-Lin Li
- State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology, Shanghai, China
| | - Chuan-Qi Sun
- State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology, Shanghai, China
| | - Zhou-Lei Qing
- State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology, Shanghai, China
| | - Zhi-Min Li
- State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology, Shanghai, China
- Shanghai Collaborative Innovation Center for Biomanufacturing Technology, Shanghai, China
| | - Hong-Lai Liu
- School of Chemistry and Molecular Engineering, East China University of Science and Technology, Shanghai, China
| |
Collapse
|
23
|
Catania EM, Dubs NM, Soumen S, Barkman TJ. The Mutational Road not Taken: Using Ancestral Sequence Resurrection to Evaluate the Evolution of Plant Enzyme Substrate Preferences. Genome Biol Evol 2024; 16:evae016. [PMID: 38290535 PMCID: PMC10853004 DOI: 10.1093/gbe/evae016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2024] [Accepted: 01/19/2024] [Indexed: 02/01/2024] Open
Abstract
We investigated the flowering plant salicylic acid methyl transferase (SAMT) enzyme lineage to understand the evolution of substrate preference change. Previous studies indicated that a single amino acid replacement to the SAMT active site (H150M) was sufficient to change ancestral enzyme substrate preference from benzoic acid to the structurally similar substrate, salicylic acid (SA). Yet, subsequent studies have shown that the H150M function-changing replacement did not likely occur during the historical episode of enzymatic divergence studied. Therefore, we reinvestigated the origin of SA methylation preference here and additionally assessed the extent to which epistasis may act to limit mutational paths. We found that the SAMT lineage of enzymes acquired preference to methylate SA from an ancestor that preferred to methylate benzoic acid as previously reported. In contrast, we found that a different amino acid replacement, Y267Q, was sufficient to change substrate preference with others providing small positive-magnitude epistatic improvements. We show that the kinetic basis for the ancestral enzymatic change in substate preference by Y267Q appears to be due to both a reduced specificity constant, kcat/KM, for benzoic acid and an improvement in KM for SA. Therefore, this lineage of enzymes appears to have had multiple mutational paths available to achieve the same evolutionary divergence. While the reasons remain unclear for why one path was taken, and the other was not, the mutational distance between ancestral and descendant codons may be a factor.
Collapse
Affiliation(s)
- Emily M Catania
- Department of Biological Sciences, Western Michigan University, Kalamazoo, MI 49008, USA
| | - Nicole M Dubs
- Department of Biological Sciences, Western Michigan University, Kalamazoo, MI 49008, USA
| | - Shejal Soumen
- Department of Biological Sciences, Western Michigan University, Kalamazoo, MI 49008, USA
| | - Todd J Barkman
- Department of Biological Sciences, Western Michigan University, Kalamazoo, MI 49008, USA
| |
Collapse
|
24
|
Czerczak-Kwiatkowska K, Kaminska M, Fraczyk J, Majsterek I, Kolesinska B. Searching for EGF Fragments Recreating the Outer Sphere of the Growth Factor Involved in Receptor Interactions. Int J Mol Sci 2024; 25:1470. [PMID: 38338748 PMCID: PMC10855902 DOI: 10.3390/ijms25031470] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2023] [Revised: 01/21/2024] [Accepted: 01/22/2024] [Indexed: 02/12/2024] Open
Abstract
The aims of this study were to determine whether it is possible to use peptide microarrays obtained using the SPOT technique (immobilized on cellulose) and specific polyclonal antibodies to select fragments that reconstruct the outer sphere of proteins and to ascertain whether the selected peptide fragments can be useful in the study of their protein-protein and/or peptide-protein interactions. Using this approach, epidermal growth factor (EGF) fragments responsible for the interaction with the EGF receptor were searched. A library of EGF fragments immobilized on cellulose was obtained using triazine condensing reagents. Experiments on the interactions with EGFR confirmed the high affinity of the selected peptide fragments. Biological tests on cells showed the lack of cytotoxicity of the EGF fragments. Selected EGF fragments can be used in various areas of medicine.
Collapse
Affiliation(s)
- Katarzyna Czerczak-Kwiatkowska
- Faculty of Chemistry, Institute of Organic Chemistry, Lodz University of Technology, Zeromskiego 116, 90-924 Lodz, Poland; (K.C.-K.); (J.F.)
| | - Marta Kaminska
- Division of Biophysics, Institute of Materials Science and Engineering, Lodz University of Technology, Stefanowskiego 1/15, 90-924 Lodz, Poland;
| | - Justyna Fraczyk
- Faculty of Chemistry, Institute of Organic Chemistry, Lodz University of Technology, Zeromskiego 116, 90-924 Lodz, Poland; (K.C.-K.); (J.F.)
| | - Ireneusz Majsterek
- Department of Clinical Chemistry and Biochemistry, Medical University of Lodz, Narutowicza 60, 90-136 Lodz, Poland;
| | - Beata Kolesinska
- Faculty of Chemistry, Institute of Organic Chemistry, Lodz University of Technology, Zeromskiego 116, 90-924 Lodz, Poland; (K.C.-K.); (J.F.)
| |
Collapse
|
25
|
Hayes RL, Nixon CF, Marqusee S, Brooks CL. Selection pressures on evolution of ribonuclease H explored with rigorous free-energy-based design. Proc Natl Acad Sci U S A 2024; 121:e2312029121. [PMID: 38194446 PMCID: PMC10801872 DOI: 10.1073/pnas.2312029121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2023] [Accepted: 11/22/2023] [Indexed: 01/11/2024] Open
Abstract
Understanding natural protein evolution and designing novel proteins are motivating interest in development of high-throughput methods to explore large sequence spaces. In this work, we demonstrate the application of multisite λ dynamics (MSλD), a rigorous free energy simulation method, and chemical denaturation experiments to quantify evolutionary selection pressure from sequence-stability relationships and to address questions of design. This study examines a mesophilic phylogenetic clade of ribonuclease H (RNase H), furthering its extensive characterization in earlier studies, focusing on E. coli RNase H (ecRNH) and a more stable consensus sequence (AncCcons) differing at 15 positions. The stabilities of 32,768 chimeras between these two sequences were computed using the MSλD framework. The most stable and least stable chimeras were predicted and tested along with several other sequences, revealing a designed chimera with approximately the same stability increase as AncCcons, but requiring only half the mutations. Comparing the computed stabilities with experiment for 12 sequences reveals a Pearson correlation of 0.86 and root mean squared error of 1.18 kcal/mol, an unprecedented level of accuracy well beyond less rigorous computational design methods. We then quantified selection pressure using a simple evolutionary model in which sequences are selected according to the Boltzmann factor of their stability. Selection temperatures from 110 to 168 K are estimated in three ways by comparing experimental and computational results to evolutionary models. These estimates indicate selection pressure is high, which has implications for evolutionary dynamics and for the accuracy required for design, and suggests accurate high-throughput computational methods like MSλD may enable more effective protein design.
Collapse
Affiliation(s)
- Ryan L. Hayes
- Department of Chemical and Biomolecular Engineering, University of California, Irvine, CA92697
- Department of Chemistry, University of Michigan, Ann Arbor, MI48109
| | - Charlotte F. Nixon
- Department of Molecular and Cell Biology, University of California, Berkeley, CA94720
| | - Susan Marqusee
- Department of Molecular and Cell Biology, University of California, Berkeley, CA94720
- California Institute for Quantitative Biosciences, University of California, Berkeley, CA94720
- Department of Chemistry, University of California, Berkeley, CA94720
| | - Charles L. Brooks
- Department of Chemistry, University of Michigan, Ann Arbor, MI48109
- Biophysics Program, University of Michigan, Ann Arbor, MI48109
| |
Collapse
|
26
|
Avizemer Z, Martí-Gómez C, Hoch SY, McCandlish DM, Fleishman SJ. Evolutionary paths that link orthogonal pairs of binding proteins. RESEARCH SQUARE 2023:rs.3.rs-2836905. [PMID: 37131620 PMCID: PMC10153392 DOI: 10.21203/rs.3.rs-2836905/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Some protein binding pairs exhibit extreme specificities that functionally insulate them from homologs. Such pairs evolve mostly by accumulating single-point mutations, and mutants are selected if their affinity exceeds the threshold required for function1-4. Thus, homologous and high-specificity binding pairs bring to light an evolutionary conundrum: how does a new specificity evolve while maintaining the required affinity in each intermediate5,6? Until now, a fully functional single-mutation path that connects two orthogonal pairs has only been described where the pairs were mutationally close thus enabling experimental enumeration of all intermediates2. We present an atomistic and graph-theoretical framework for discovering low molecular strain single-mutation paths that connect two extant pairs, enabling enumeration beyond experimental capability. We apply it to two orthogonal bacterial colicin endonuclease-immunity pairs separated by 17 interface mutations7. We were not able to find a strain-free and functional path in the sequence space defined by the two extant pairs. But including mutations that bridge amino acids that cannot be exchanged through single-nucleotide mutations led us to a strain-free 19-mutation trajectory that is completely viable in vivo. Our experiments show that the specificity switch is remarkably abrupt, resulting from only one radical mutation on each partner. Furthermore, each of the critical specificity-switch mutations increases fitness, demonstrating that functional divergence could be driven by positive Darwinian selection. These results reveal how even radical functional changes in an epistatic fitness landscape may evolve.
Collapse
Affiliation(s)
- Ziv Avizemer
- Department of Biomolecular Sciences, Weizmann Institute of Science, 7610001, Rehovot, Israel
| | - Carlos Martí-Gómez
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724
| | - Shlomo Yakir Hoch
- Department of Biomolecular Sciences, Weizmann Institute of Science, 7610001, Rehovot, Israel
| | - David M. McCandlish
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724
| | - Sarel J. Fleishman
- Department of Biomolecular Sciences, Weizmann Institute of Science, 7610001, Rehovot, Israel
| |
Collapse
|
27
|
Maes S, Deploey N, Peelman F, Eyckerman S. Deep mutational scanning of proteins in mammalian cells. CELL REPORTS METHODS 2023; 3:100641. [PMID: 37963462 PMCID: PMC10694495 DOI: 10.1016/j.crmeth.2023.100641] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/12/2023] [Revised: 07/06/2023] [Accepted: 10/20/2023] [Indexed: 11/16/2023]
Abstract
Protein mutagenesis is essential for unveiling the molecular mechanisms underlying protein function in health, disease, and evolution. In the past decade, deep mutational scanning methods have evolved to support the functional analysis of nearly all possible single-amino acid changes in a protein of interest. While historically these methods were developed in lower organisms such as E. coli and yeast, recent technological advancements have resulted in the increased use of mammalian cells, particularly for studying proteins involved in human disease. These advancements will aid significantly in the classification and interpretation of variants of unknown significance, which are being discovered at large scale due to the current surge in the use of whole-genome sequencing in clinical contexts. Here, we explore the experimental aspects of deep mutational scanning studies in mammalian cells and report the different methods used in each step of the workflow, ultimately providing a useful guide toward the design of such studies.
Collapse
Affiliation(s)
- Stefanie Maes
- VIB Center for Medical Biotechnology (CMB), Technologiepark-Zwijnaarde 75, 9052 Ghent, Belgium; Department of Biochemistry and Microbiology, Ghent University, Technologiepark-Zwijnaarde 75, 9052 Ghent, Belgium; Department of Biomolecular Medicine, Ghent University, Technologiepark-Zwijnaarde 75, 9052 Ghent, Belgium
| | - Nick Deploey
- VIB Center for Medical Biotechnology (CMB), Technologiepark-Zwijnaarde 75, 9052 Ghent, Belgium; Department of Biomolecular Medicine, Ghent University, Technologiepark-Zwijnaarde 75, 9052 Ghent, Belgium
| | - Frank Peelman
- VIB Center for Medical Biotechnology (CMB), Technologiepark-Zwijnaarde 75, 9052 Ghent, Belgium; Department of Biomolecular Medicine, Ghent University, Technologiepark-Zwijnaarde 75, 9052 Ghent, Belgium
| | - Sven Eyckerman
- VIB Center for Medical Biotechnology (CMB), Technologiepark-Zwijnaarde 75, 9052 Ghent, Belgium; Department of Biomolecular Medicine, Ghent University, Technologiepark-Zwijnaarde 75, 9052 Ghent, Belgium.
| |
Collapse
|
28
|
Cellier MFM. Slc11 Synapomorphy: A Conserved 3D Framework Articulating Carrier Conformation Switch. Int J Mol Sci 2023; 24:15076. [PMID: 37894758 PMCID: PMC10606218 DOI: 10.3390/ijms242015076] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2023] [Revised: 10/05/2023] [Accepted: 10/07/2023] [Indexed: 10/29/2023] Open
Abstract
Transmembrane carriers of the Slc11 family catalyze proton (H+)-dependent uptake of divalent metal ions (Me2+) such as manganese and iron-vital elements coveted during infection. The Slc11 mechanism of high-affinity Me2+ cell import is selective and conserved between prokaryotic (MntH) and eukaryotic (Nramp) homologs, though processes coupling the use of the proton motive force to Me2+ uptake evolved repeatedly. Adding bacterial piracy of Nramp genes spread in distinct environmental niches suggests selective gain of function that may benefit opportunistic pathogens. To better understand Slc11 evolution, Alphafold (AF2)/Colabfold (CF) 3D predictions for bacterial sequences from sister clades of eukaryotic descent (MCb and MCg) were compared using both native and mutant templates. AF2/CF model an array of native MCb intermediates spanning the transition from outwardly open (OO) to inwardly open (IO) carriers. In silico mutagenesis targeting (i) a set of (evolutionarily coupled) sites that may define Slc11 function (putative synapomorphy) and (ii) residues from networked communities evolving during MCb transition indicates that Slc11 synapomorphy primarily instructs a Me2+-selective conformation switch which unlocks carrier inner gate and contributes to Me2+ binding site occlusion and outer gate locking. Inner gate opening apparently proceeds from interaction between transmembrane helix (h) h5, h8 and h1a. MCg1 xenologs revealed marked differences in carrier shape and plasticity, owing partly to an altered intramolecular H+ network. Yet, targeting Slc11 synapomorphy also converted MCg1 IO models to an OO state, apparently mobilizing the same residues to control gates. But MCg1 response to mutagenesis differed, with extensive divergence within this clade correlating with MCb-like modeling properties. Notably, MCg1 divergent epistasis marks the emergence of the genus Bordetella-Achromobacter. Slc11 synapomorphy localizes to the 3D areas that deviate least among MCb and MCg1 models (either IO or OO) implying that it constitutes a 3D network of residues articulating a Me2+-selective carrier conformation switch which is maintained in fast-evolving clades at the cost of divergent epistatic interactions impacting carrier shape and dynamics.
Collapse
Affiliation(s)
- Mathieu F M Cellier
- Centre Armand-Frappier Santé Biotechnologie, Institut National de la Recherche Scientifique (INRS), Laval, QC H7V 1B7, Canada
| |
Collapse
|
29
|
Nicoll CR, Massari M, Fraaije MW, Mascotti ML, Mattevi A. Impact of ancestral sequence reconstruction on mechanistic and structural enzymology. Curr Opin Struct Biol 2023; 82:102669. [PMID: 37544113 DOI: 10.1016/j.sbi.2023.102669] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Revised: 06/19/2023] [Accepted: 07/10/2023] [Indexed: 08/08/2023]
Abstract
Ancestral sequence reconstruction (ASR) provides insight into the changes within a protein sequence across evolution. More specifically, it can illustrate how specific amino acid changes give rise to different phenotypes within a protein family. Over the last few decades it has established itself as a powerful technique for revealing molecular common denominators that govern enzyme function. Here, we describe the strength of ASR in unveiling catalytic mechanisms and emerging phenotypes for a range of different proteins, also highlighting biotechnological applications the methodology can provide.
Collapse
Affiliation(s)
- Callum R Nicoll
- Department of Biology and Biotechnology Lazzaro Spallanzani, University of Pavia, Via Ferrata 9, 27100, Pavia, Italy
| | - Marta Massari
- Department of Biology and Biotechnology Lazzaro Spallanzani, University of Pavia, Via Ferrata 9, 27100, Pavia, Italy
| | - Marco W Fraaije
- Molecular Enzymology, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Nijenborgh 4, 9747, AG Groningen, the Netherlands. https://twitter.com/fraaije1
| | - Maria Laura Mascotti
- Molecular Enzymology, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Nijenborgh 4, 9747, AG Groningen, the Netherlands; IMIBIO-SL CONICET, Facultad de Química Bioquímica y Farmacia, Universidad Nacional de San Luis, Ejército de los Andes 950, D5700HHW, San Luis, Argentina
| | - Andrea Mattevi
- Department of Biology and Biotechnology Lazzaro Spallanzani, University of Pavia, Via Ferrata 9, 27100, Pavia, Italy.
| |
Collapse
|
30
|
Buda K, Miton CM, Fan XC, Tokuriki N. Molecular determinants of protein evolvability. Trends Biochem Sci 2023; 48:751-760. [PMID: 37330341 DOI: 10.1016/j.tibs.2023.05.009] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2023] [Revised: 05/18/2023] [Accepted: 05/23/2023] [Indexed: 06/19/2023]
Abstract
The plethora of biological functions that sustain life is rooted in the remarkable evolvability of proteins. An emerging view highlights the importance of a protein's initial state in dictating evolutionary success. A deeper comprehension of the mechanisms that govern the evolvability of these initial states can provide invaluable insights into protein evolution. In this review, we describe several molecular determinants of protein evolvability, unveiled by experimental evolution and ancestral sequence reconstruction studies. We further discuss how genetic variation and epistasis can promote or constrain functional innovation and suggest putative underlying mechanisms. By establishing a clear framework for these determinants, we provide potential indicators enabling the forecast of suitable evolutionary starting points and delineate molecular mechanisms in need of deeper exploration.
Collapse
Affiliation(s)
- Karol Buda
- Michael Smith Laboratories, University of British Columbia, Vancouver, Canada
| | - Charlotte M Miton
- Michael Smith Laboratories, University of British Columbia, Vancouver, Canada
| | - Xingyu Cara Fan
- Michael Smith Laboratories, University of British Columbia, Vancouver, Canada
| | - Nobuhiko Tokuriki
- Michael Smith Laboratories, University of British Columbia, Vancouver, Canada.
| |
Collapse
|
31
|
Haddox HK, Galloway JG, Dadonaite B, Bloom JD, Matsen IV FA, DeWitt WS. Jointly modeling deep mutational scans identifies shifted mutational effects among SARS-CoV-2 spike homologs. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.31.551037. [PMID: 37577604 PMCID: PMC10418112 DOI: 10.1101/2023.07.31.551037] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/15/2023]
Abstract
Deep mutational scanning (DMS) is a high-throughput experimental technique that measures the effects of thousands of mutations to a protein. These experiments can be performed on multiple homologs of a protein or on the same protein selected under multiple conditions. It is often of biological interest to identify mutations with shifted effects across homologs or conditions. However, it is challenging to determine if observed shifts arise from biological signal or experimental noise. Here, we describe a method for jointly inferring mutational effects across multiple DMS experiments while also identifying mutations that have shifted in their effects among experiments. A key aspect of our method is to regularize the inferred shifts, so that they are nonzero only when strongly supported by the data. We apply this method to DMS experiments that measure how mutations to spike proteins from SARS-CoV-2 variants (Delta, Omicron BA.1, and Omicron BA.2) affect cell entry. Most mutational effects are conserved between these spike homologs, but a fraction have markedly shifted. We experimentally validate a subset of the mutations inferred to have shifted effects, and confirm differences of > 1,000-fold in the impact of the same mutation on spike-mediated viral infection across spikes from different SARS-CoV-2 variants. Overall, our work establishes a general approach for comparing sets of DMS experiments to identify biologically important shifts in mutational effects.
Collapse
Affiliation(s)
- Hugh K. Haddox
- Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, WA 98102, USA
| | - Jared G. Galloway
- Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, WA 98102, USA
| | - Bernadeta Dadonaite
- Basic Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Jesse D. Bloom
- Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, WA 98102, USA
- Basic Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
- Howard Hughes Medical Institute, Seattle, WA 98109, USA
| | - Frederick A. Matsen IV
- Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, WA 98102, USA
- Basic Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
- Howard Hughes Medical Institute, Seattle, WA 98109, USA
- Department of Statistics, University of Washington, Seattle, WA 98195, USA
| | - William S. DeWitt
- Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, CA 94720, USA
| |
Collapse
|
32
|
Fowler DM, Adams DJ, Gloyn AL, Hahn WC, Marks DS, Muffley LA, Neal JT, Roth FP, Rubin AF, Starita LM, Hurles ME. An Atlas of Variant Effects to understand the genome at nucleotide resolution. Genome Biol 2023; 24:147. [PMID: 37394429 PMCID: PMC10316620 DOI: 10.1186/s13059-023-02986-x] [Citation(s) in RCA: 44] [Impact Index Per Article: 22.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2023] [Accepted: 06/13/2023] [Indexed: 07/04/2023] Open
Abstract
Sequencing has revealed hundreds of millions of human genetic variants, and continued efforts will only add to this variant avalanche. Insufficient information exists to interpret the effects of most variants, limiting opportunities for precision medicine and comprehension of genome function. A solution lies in experimental assessment of the functional effect of variants, which can reveal their biological and clinical impact. However, variant effect assays have generally been undertaken reactively for individual variants only after and, in most cases long after, their first observation. Now, multiplexed assays of variant effect can characterise massive numbers of variants simultaneously, yielding variant effect maps that reveal the function of every possible single nucleotide change in a gene or regulatory element. Generating maps for every protein encoding gene and regulatory element in the human genome would create an 'Atlas' of variant effect maps and transform our understanding of genetics and usher in a new era of nucleotide-resolution functional knowledge of the genome. An Atlas would reveal the fundamental biology of the human genome, inform human evolution, empower the development and use of therapeutics and maximize the utility of genomics for diagnosing and treating disease. The Atlas of Variant Effects Alliance is an international collaborative group comprising hundreds of researchers, technologists and clinicians dedicated to realising an Atlas of Variant Effects to help deliver on the promise of genomics.
Collapse
Affiliation(s)
- Douglas M. Fowler
- Department of Genome Sciences, University of Washington, Seattle, WA USA
- Department of Bioengineering, University of Washington, Seattle, WA USA
- Brotman Baty Institute for Precision Medicine, Seattle, WA USA
| | | | - Anna L. Gloyn
- Department of Pediatrics & Department of Genetics, Division of Endocrinology, Stanford School of Medicine, Stanford University, Stanford, CA USA
| | - William C. Hahn
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA USA
- Broad Institute of MIT and Harvard, Cambridge, MA USA
| | - Debora S. Marks
- Broad Institute of MIT and Harvard, Cambridge, MA USA
- Department of Systems Biology, Harvard Medical School, Cambridge, USA
| | - Lara A. Muffley
- Department of Genome Sciences, University of Washington, Seattle, WA USA
| | - James T. Neal
- Broad Institute of MIT and Harvard, Cambridge, MA USA
- Novo Nordisk Foundation Center for Genomic Mechanisms of Disease at Broad Institute, Cambridge, MA USA
| | - Frederick P. Roth
- Donnelly Centre and Departments of Molecular Genetics and Computer Science, University of Toronto, Toronto, ON Canada
- Lunenfeld-Tanenbaum Research Institute, Sinai Health System, Toronto, ON Canada
| | - Alan F. Rubin
- Bioinformatics Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, VIC Australia
- Department of Medical Biology, University of Melbourne, Melbourne, VIC Australia
| | - Lea M. Starita
- Department of Genome Sciences, University of Washington, Seattle, WA USA
- Department of Bioengineering, University of Washington, Seattle, WA USA
- Brotman Baty Institute for Precision Medicine, Seattle, WA USA
| | | |
Collapse
|
33
|
Moulana A, Dupic T, Phillips AM, Desai MM. Genotype-phenotype landscapes for immune-pathogen coevolution. Trends Immunol 2023; 44:384-396. [PMID: 37024340 PMCID: PMC10147585 DOI: 10.1016/j.it.2023.03.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2023] [Revised: 03/08/2023] [Accepted: 03/09/2023] [Indexed: 04/07/2023]
Abstract
Our immune systems constantly coevolve with the pathogens that challenge them, as pathogens adapt to evade our defense responses, with our immune repertoires shifting in turn. These coevolutionary dynamics take place across a vast and high-dimensional landscape of potential pathogen and immune receptor sequence variants. Mapping the relationship between these genotypes and the phenotypes that determine immune-pathogen interactions is crucial for understanding, predicting, and controlling disease. Here, we review recent developments applying high-throughput methods to create large libraries of immune receptor and pathogen protein sequence variants and measure relevant phenotypes. We describe several approaches that probe different regions of the high-dimensional sequence space and comment on how combinations of these methods may offer novel insight into immune-pathogen coevolution.
Collapse
Affiliation(s)
- Alief Moulana
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA
| | - Thomas Dupic
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA
| | - Angela M Phillips
- Department of Microbiology and Immunology, University of California at San Francisco, San Francisco, CA 94143, USA
| | - Michael M Desai
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA; Department of Physics, Harvard University, Cambridge, MA 02138, USA; NSF-Simons Center for Mathematical and Statistical Analysis of Biology, Harvard University, Cambridge, MA 02138, USA; Quantitative Biology Initiative, Harvard University, Cambridge, MA 02138, USA.
| |
Collapse
|
34
|
Wei H, Li X. Deep mutational scanning: A versatile tool in systematically mapping genotypes to phenotypes. Front Genet 2023; 14:1087267. [PMID: 36713072 PMCID: PMC9878224 DOI: 10.3389/fgene.2023.1087267] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2022] [Accepted: 01/02/2023] [Indexed: 01/13/2023] Open
Abstract
Unveiling how genetic variations lead to phenotypic variations is one of the key questions in evolutionary biology, genetics, and biomedical research. Deep mutational scanning (DMS) technology has allowed the mapping of tens of thousands of genetic variations to phenotypic variations efficiently and economically. Since its first systematic introduction about a decade ago, we have witnessed the use of deep mutational scanning in many research areas leading to scientific breakthroughs. Also, the methods in each step of deep mutational scanning have become much more versatile thanks to the oligo-synthesizing technology, high-throughput phenotyping methods and deep sequencing technology. However, each specific possible step of deep mutational scanning has its pros and cons, and some limitations still await further technological development. Here, we discuss recent scientific accomplishments achieved through the deep mutational scanning and describe widely used methods in each step of deep mutational scanning. We also compare these different methods and analyze their advantages and disadvantages, providing insight into how to design a deep mutational scanning study that best suits the aims of the readers' projects.
Collapse
Affiliation(s)
- Huijin Wei
- Zhejiang University—University of Edinburgh Institute, Zhejiang University, Haining, Zhejiang, China
| | - Xianghua Li
- Zhejiang University—University of Edinburgh Institute, Zhejiang University, Haining, Zhejiang, China
- Deanery of Biomedical Sciences, University of Edinburgh, Edinburgh, United Kingdom
- The Second Affiliated Hospital of Zhejiang University, Hangzhou, Zhejiang, China
- Biomedical and Health Translational Centre of Zhejiang Province, Haining, Zhejiang, China
| |
Collapse
|
35
|
Phillips AM, Maurer DP, Brooks C, Dupic T, Schmidt AG, Desai MM. Hierarchical sequence-affinity landscapes shape the evolution of breadth in an anti-influenza receptor binding site antibody. eLife 2023; 12:83628. [PMID: 36625542 PMCID: PMC9995116 DOI: 10.7554/elife.83628] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2022] [Accepted: 01/09/2023] [Indexed: 01/11/2023] Open
Abstract
Broadly neutralizing antibodies (bnAbs) that neutralize diverse variants of a particular virus are of considerable therapeutic interest. Recent advances have enabled us to isolate and engineer these antibodies as therapeutics, but eliciting them through vaccination remains challenging, in part due to our limited understanding of how antibodies evolve breadth. Here, we analyze the landscape by which an anti-influenza receptor binding site (RBS) bnAb, CH65, evolved broad affinity to diverse H1 influenza strains. We do this by generating an antibody library of all possible evolutionary intermediates between the unmutated common ancestor (UCA) and the affinity-matured CH65 antibody and measure the affinity of each intermediate to three distinct H1 antigens. We find that affinity to each antigen requires a specific set of mutations - distributed across the variable light and heavy chains - that interact non-additively (i.e., epistatically). These sets of mutations form a hierarchical pattern across the antigens, with increasingly divergent antigens requiring additional epistatic mutations beyond those required to bind less divergent antigens. We investigate the underlying biochemical and structural basis for these hierarchical sets of epistatic mutations and find that epistasis between heavy chain mutations and a mutation in the light chain at the VH-VL interface is essential for binding a divergent H1. Collectively, this is the first work to comprehensively characterize epistasis between heavy and light chain mutations and shows that such interactions are both strong and widespread. Together with our previous study analyzing a different class of anti-influenza antibodies, our results implicate epistasis as a general feature of antibody sequence-affinity landscapes that can potentiate and constrain the evolution of breadth.
Collapse
Affiliation(s)
- Angela M Phillips
- Department of Organismic and Evolutionary Biology, Harvard UniversityCambridgeUnited States
- Department of Microbiology and Immunology, University of California, San FranciscoSan FranciscoUnited States
| | - Daniel P Maurer
- Ragon Institute of MGH, MIT, and HarvardCambridgeUnited States
- Department of Microbiology, Harvard Medical SchoolBostonUnited States
| | - Caelan Brooks
- Department of Physics, Harvard UniversityCambridgeUnited States
| | - Thomas Dupic
- Department of Organismic and Evolutionary Biology, Harvard UniversityCambridgeUnited States
| | - Aaron G Schmidt
- Ragon Institute of MGH, MIT, and HarvardCambridgeUnited States
- Department of Microbiology, Harvard Medical SchoolBostonUnited States
| | - Michael M Desai
- Department of Organismic and Evolutionary Biology, Harvard UniversityCambridgeUnited States
- Department of Physics, Harvard UniversityCambridgeUnited States
- NSF-Simons Center for Mathematical and Statistical Analysis of Biology, Harvard UniversityCambridgeUnited States
- Quantitative Biology Initiative, Harvard UniversityCambridgeUnited States
| |
Collapse
|
36
|
Kennedy EN, Foster CA, Barr SA, Bourret RB. General strategies for using amino acid sequence data to guide biochemical investigation of protein function. Biochem Soc Trans 2022; 50:1847-1858. [PMID: 36416676 PMCID: PMC10257402 DOI: 10.1042/bst20220849] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2022] [Revised: 11/04/2022] [Accepted: 11/09/2022] [Indexed: 11/24/2022]
Abstract
The rapid increase of '-omics' data warrants the reconsideration of experimental strategies to investigate general protein function. Studying individual members of a protein family is likely insufficient to provide a complete mechanistic understanding of family functions, especially for diverse families with thousands of known members. Strategies that exploit large amounts of available amino acid sequence data can inspire and guide biochemical experiments, generating broadly applicable insights into a given family. Here we review several methods that utilize abundant sequence data to focus experimental efforts and identify features truly representative of a protein family or domain. First, coevolutionary relationships between residues within primary sequences can be successfully exploited to identify structurally and/or functionally important positions for experimental investigation. Second, functionally important variable residue positions typically occupy a limited sequence space, a property useful for guiding biochemical characterization of the effects of the most physiologically and evolutionarily relevant amino acids. Third, amino acid sequence variation within domains shared between different protein families can be used to sort a particular domain into multiple subtypes, inspiring further experimental designs. Although generally applicable to any kind of protein domain because they depend solely on amino acid sequences, the second and third approaches are reviewed in detail because they appear to have been used infrequently and offer immediate opportunities for new advances. Finally, we speculate that future technologies capable of analyzing and manipulating conserved and variable aspects of the three-dimensional structures of a protein family could lead to broad insights not attainable by current methods.
Collapse
Affiliation(s)
- Emily N. Kennedy
- Department of Microbiology & Immunology, University of North Carolina, Chapel Hill, NC, United States of America
| | - Clay A. Foster
- Department of Pediatrics, Section Hematology/Oncology, University of Oklahoma Health Sciences Center, Oklahoma City, Oklahoma, United States of America
| | - Sarah A. Barr
- Department of Microbiology & Immunology, University of North Carolina, Chapel Hill, NC, United States of America
| | - Robert B. Bourret
- Department of Microbiology & Immunology, University of North Carolina, Chapel Hill, NC, United States of America
| |
Collapse
|
37
|
Wagner A. Adaptive evolvability through direct selection instead of indirect, second-order selection. JOURNAL OF EXPERIMENTAL ZOOLOGY. PART B, MOLECULAR AND DEVELOPMENTAL EVOLUTION 2022; 338:395-404. [PMID: 34254439 PMCID: PMC9786751 DOI: 10.1002/jez.b.23071] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/30/2020] [Revised: 05/11/2021] [Accepted: 06/04/2021] [Indexed: 12/30/2022]
Abstract
Can evolvability itself be the product of adaptive evolution? To answer this question is challenging, because any DNA mutation that alters only evolvability is subject to indirect, "second order" selection on the future effects of this mutation. Such indirect selection is weaker than "first-order" selection on mutations that alter fitness, in the sense that it can operate only under restrictive conditions. Here I discuss a route to adaptive evolvability that overcomes this challenge. Specifically, a recent evolution experiment showed that some mutations can enhance both fitness and evolvability through a combination of direct and indirect selection. Unrelated evidence from gene duplication and the evolution of gene regulation suggests that mutations with such dual effects may not be rare. Through such mutations, evolvability may increase at least in part because it provides an adaptive advantage. These observations suggest a research program on the adaptive evolution of evolvability, which aims to identify such mutations and to disentangle their direct fitness effects from their indirect effects on evolvability. If evolvability is itself adaptive, Darwinian evolution may have created more than life's diversity. It may also have helped create the very conditions that made the success of Darwinian evolution possible.
Collapse
Affiliation(s)
- Andreas Wagner
- Department of Evolutionary Biology and Environmental StudiesUniversity of ZurichZurichSwitzerland,Swiss Institute of BioinformaticsQuartier Sorge‐Batiment GenopodeLausanneSwitzerland,The Santa Fe InstituteSanta FeNew MexicoUSA,Stellenbosch Institute for Advanced Study, Wallenberg Research Centre at Stellenbosch UniversityStellenboschSouth Africa
| |
Collapse
|
38
|
Pillai AS, Hochberg GK, Thornton JW. Simple mechanisms for the evolution of protein complexity. Protein Sci 2022; 31:e4449. [PMID: 36107026 PMCID: PMC9601886 DOI: 10.1002/pro.4449] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2022] [Revised: 09/01/2022] [Accepted: 09/10/2022] [Indexed: 01/26/2023]
Abstract
Proteins are tiny models of biological complexity: specific interactions among their many amino acids cause proteins to fold into elaborate structures, assemble with other proteins into higher-order complexes, and change their functions and structures upon binding other molecules. These complex features are classically thought to evolve via long and gradual trajectories driven by persistent natural selection. But a growing body of evidence from biochemistry, protein engineering, and molecular evolution shows that naturally occurring proteins often exist at or near the genetic edge of multimerization, allostery, and even new folds, so just one or a few mutations can trigger acquisition of these properties. These sudden transitions can occur because many of the physical properties that underlie these features are present in simpler proteins as fortuitous by-products of their architecture. Moreover, complex features of proteins can be encoded by huge arrays of sequences, so they are accessible from many different starting points via many possible paths. Because the bridges to these features are both short and numerous, random chance can join selection as a key factor in explaining the evolution of molecular complexity.
Collapse
Affiliation(s)
- Arvind S. Pillai
- Department of Ecology and EvolutionUniversity of ChicagoChicagoIllinoisUSA
- Institute for Protein DesignUniversity of WashingtonSeattleWAUSA
| | - Georg K.A. Hochberg
- Max Planck Institute for Terrestrial MicrobiologyMarburgGermany
- Department of Chemistry, Center for Synthetic MicrobiologyPhilipps University MarburgMarburgGermany
| | - Joseph W. Thornton
- Department of Ecology and EvolutionUniversity of ChicagoChicagoIllinoisUSA
- Departments of Human Genetics and Ecology and EvolutionUniversity of ChicagoChicagoIllinoisUSA
| |
Collapse
|
39
|
Azbukina N, Zharikova A, Ramensky V. Intragenic compensation through the lens of deep mutational scanning. Biophys Rev 2022; 14:1161-1182. [PMID: 36345285 PMCID: PMC9636336 DOI: 10.1007/s12551-022-01005-w] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2022] [Accepted: 09/26/2022] [Indexed: 12/20/2022] Open
Abstract
A significant fraction of mutations in proteins are deleterious and result in adverse consequences for protein function, stability, or interaction with other molecules. Intragenic compensation is a specific case of positive epistasis when a neutral missense mutation cancels effect of a deleterious mutation in the same protein. Permissive compensatory mutations facilitate protein evolution, since without them all sequences would be extremely conserved. Understanding compensatory mechanisms is an important scientific challenge at the intersection of protein biophysics and evolution. In human genetics, intragenic compensatory interactions are important since they may result in variable penetrance of pathogenic mutations or fixation of pathogenic human alleles in orthologous proteins from related species. The latter phenomenon complicates computational and clinical inference of an allele's pathogenicity. Deep mutational scanning is a relatively new technique that enables experimental studies of functional effects of thousands of mutations in proteins. We review the important aspects of the field and discuss existing limitations of current datasets. We reviewed ten published DMS datasets with quantified functional effects of single and double mutations and described rates and patterns of intragenic compensation in eight of them. Supplementary Information The online version contains supplementary material available at 10.1007/s12551-022-01005-w.
Collapse
Affiliation(s)
- Nadezhda Azbukina
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, 1-73, Leninskie Gory, 119991 Moscow, Russia
| | - Anastasia Zharikova
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, 1-73, Leninskie Gory, 119991 Moscow, Russia
- National Medical Research Center for Therapy and Preventive Medicine, Petroverigsky per., 10, Bld.3, 101000 Moscow, Russia
| | - Vasily Ramensky
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, 1-73, Leninskie Gory, 119991 Moscow, Russia
- National Medical Research Center for Therapy and Preventive Medicine, Petroverigsky per., 10, Bld.3, 101000 Moscow, Russia
| |
Collapse
|
40
|
Abstract
One core goal of genetics is to systematically understand the mapping between the DNA sequence of an organism (genotype) and its measurable characteristics (phenotype). Understanding this mapping is often challenging because of interactions between mutations, where the result of combining several different mutations can be very different than the sum of their individual effects. Here we provide a statistical framework for modeling complex genetic interactions of this type. The key idea is to ask how fast the effects of mutations change when introducing the same mutation in increasingly distant genetic backgrounds. We then propose a model for phenotypic prediction that takes into account this tendency for the effects of mutations to be more similar in nearby genetic backgrounds. Contemporary high-throughput mutagenesis experiments are providing an increasingly detailed view of the complex patterns of genetic interaction that occur between multiple mutations within a single protein or regulatory element. By simultaneously measuring the effects of thousands of combinations of mutations, these experiments have revealed that the genotype–phenotype relationship typically reflects not only genetic interactions between pairs of sites but also higher-order interactions among larger numbers of sites. However, modeling and understanding these higher-order interactions remains challenging. Here we present a method for reconstructing sequence-to-function mappings from partially observed data that can accommodate all orders of genetic interaction. The main idea is to make predictions for unobserved genotypes that match the type and extent of epistasis found in the observed data. This information on the type and extent of epistasis can be extracted by considering how phenotypic correlations change as a function of mutational distance, which is equivalent to estimating the fraction of phenotypic variance due to each order of genetic interaction (additive, pairwise, three-way, etc.). Using these estimated variance components, we then define an empirical Bayes prior that in expectation matches the observed pattern of epistasis and reconstruct the genotype–phenotype mapping by conducting Gaussian process regression under this prior. To demonstrate the power of this approach, we present an application to the antibody-binding domain GB1 and also provide a detailed exploration of a dataset consisting of high-throughput measurements for the splicing efficiency of human pre-mRNA 5′ splice sites, for which we also validate our model predictions via additional low-throughput experiments.
Collapse
|
41
|
Cox RM, Hale MD, Wittman TN, Robinson CD, Cox CL. Evolution of hormone-phenotype couplings and hormone-genome interactions. Horm Behav 2022; 144:105216. [PMID: 35777215 DOI: 10.1016/j.yhbeh.2022.105216] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/31/2021] [Revised: 06/05/2022] [Accepted: 06/20/2022] [Indexed: 12/22/2022]
Abstract
When selection favors a new relationship between a cue and a hormonally mediated response, adaptation can proceed by altering the hormonal signal that is produced or by altering the phenotypic response to the hormonal signal. The field of evolutionary endocrinology has made considerable progress toward understanding the evolution of hormonal signals, but we know much less about the evolution of hormone-phenotype couplings, particularly at the hormone-genome interface. We briefly review and classify the mechanisms through which these hormone-phenotype couplings likely evolve, using androgens and their receptors and genomic response elements to illustrate our view. We then present two empirical studies of hormone-phenotype couplings, one rooted in evolutionary quantitative genetics and another in comparative transcriptomics, each focused on the regulation of sexually dimorphic phenotypes by testosterone (T) in the brown anole lizard (Anolis sagrei). First, we illustrate the potential for hormone-phenotype couplings to evolve by showing that coloration of the dewlap (an ornament used in behavioral displays) exhibits significant heritability in its responsiveness to T, implying that anoles harbor genetic variance in the architecture of hormonal pleiotropy. Second, we combine T manipulations with analyses of the liver transcriptome to ask whether and how statistical methods for characterizing modules of co-expressed genes and in silico techniques for identifying androgen response elements (AREs) can improve our understanding of hormone-genome interactions. We conclude by emphasizing important avenues for future work at the hormone-genome interface, particularly those conducted in a comparative evolutionary framework.
Collapse
Affiliation(s)
- Robert M Cox
- Department of Biology, University of Virginia, Charlottesville, VA, USA.
| | - Matthew D Hale
- Department of Biology, University of Virginia, Charlottesville, VA, USA
| | - Tyler N Wittman
- Department of Biology, University of Virginia, Charlottesville, VA, USA
| | | | - Christian L Cox
- Department of Biology, University of Virginia, Charlottesville, VA, USA; Biological Sciences, Florida International University, Miami, FL, USA
| |
Collapse
|
42
|
Nocedal I, Laub MT. Ancestral reconstruction of duplicated signaling proteins reveals the evolution of signaling specificity. eLife 2022; 11:e77346. [PMID: 35686729 PMCID: PMC9208753 DOI: 10.7554/elife.77346] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2022] [Accepted: 05/27/2022] [Indexed: 01/30/2023] Open
Abstract
Gene duplication is crucial to generating novel signaling pathways during evolution. However, it remains unclear how the redundant proteins produced by gene duplication ultimately acquire new interaction specificities to establish insulated paralogous signaling pathways. Here, we used ancestral sequence reconstruction to resurrect and characterize a bacterial two-component signaling system that duplicated in α-proteobacteria. We determined the interaction specificities of the signaling proteins that existed before and immediately after this duplication event and then identified key mutations responsible for establishing specificity in the two systems. Just three mutations, in only two of the four interacting proteins, were sufficient to establish specificity of the extant systems. Some of these mutations weakened interactions between paralogous systems to limit crosstalk. However, others strengthened interactions within a system, indicating that the ancestral interaction, although functional, had the potential to be strengthened. Our work suggests that protein-protein interactions with such latent potential may be highly amenable to duplication and divergence.
Collapse
Affiliation(s)
- Isabel Nocedal
- Department of Biology, Massachusetts Institute of TechnologyCambridgeUnited States
| | - Michael T Laub
- Department of Biology, Massachusetts Institute of TechnologyCambridgeUnited States
- Howard Hughes Medical Institute, Massachusetts Institute of TechnologyCambridgeUnited States
| |
Collapse
|
43
|
Park Y, Metzger BPH, Thornton JW. Epistatic drift causes gradual decay of predictability in protein evolution. Science 2022; 376:823-830. [PMID: 35587978 DOI: 10.1126/science.abn6895] [Citation(s) in RCA: 42] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Epistatic interactions can make the outcomes of evolution unpredictable, but no comprehensive data are available on the extent and temporal dynamics of changes in the effects of mutations as protein sequences evolve. Here, we use phylogenetic deep mutational scanning to measure the functional effect of every possible amino acid mutation in a series of ancestral and extant steroid receptor DNA binding domains. Across 700 million years of evolution, epistatic interactions caused the effects of most mutations to become decorrelated from their initial effects and their windows of evolutionary accessibility to open and close transiently. Most effects changed gradually and without bias at rates that were largely constant across time, indicating a neutral process caused by many weak epistatic interactions. Our findings show that protein sequences drift inexorably into contingency and unpredictability, but that the process is statistically predictable, given sufficient phylogenetic and experimental data.
Collapse
Affiliation(s)
- Yeonwoo Park
- Committee on Genetics, Genomics, and Systems Biology, University of Chicago, Chicago, IL, USA
| | - Brian P H Metzger
- Department of Ecology and Evolution, University of Chicago, Chicago, IL, USA
| | - Joseph W Thornton
- Committee on Genetics, Genomics, and Systems Biology, University of Chicago, Chicago, IL, USA.,Department of Ecology and Evolution, University of Chicago, Chicago, IL, USA.,Department of Human Genetics, University of Chicago, Chicago, IL, USA
| |
Collapse
|
44
|
Ding D, Green AG, Wang B, Lite TLV, Weinstein EN, Marks DS, Laub MT. Co-evolution of interacting proteins through non-contacting and non-specific mutations. Nat Ecol Evol 2022; 6:590-603. [PMID: 35361892 PMCID: PMC9090974 DOI: 10.1038/s41559-022-01688-0] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2021] [Accepted: 01/31/2022] [Indexed: 01/08/2023]
Abstract
Proteins often accumulate neutral mutations that do not affect current functions but can profoundly influence future mutational possibilities and functions. Understanding such hidden potential has major implications for protein design and evolutionary forecasting but has been limited by a lack of systematic efforts to identify potentiating mutations. Here, through the comprehensive analysis of a bacterial toxin-antitoxin system, we identified all possible single substitutions in the toxin that enable it to tolerate otherwise interface-disrupting mutations in its antitoxin. Strikingly, the majority of enabling mutations in the toxin do not contact and promote tolerance non-specifically to many different antitoxin mutations, despite covariation in homologues occurring primarily between specific pairs of contacting residues across the interface. In addition, the enabling mutations we identified expand future mutational paths that both maintain old toxin-antitoxin interactions and form new ones. These non-specific mutations are missed by widely used covariation and machine learning methods. Identifying such enabling mutations will be critical for ensuring continued binding of therapeutically relevant proteins, such as antibodies, aimed at evolving targets.
Collapse
Affiliation(s)
- David Ding
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA
| | - Anna G Green
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Boyuan Wang
- Department of Pharmacology, UT Southwestern Medical Center, Dallas, TX, USA
| | - Thuy-Lan Vo Lite
- Harvard-MIT Division of Health Sciences and Technology, Harvard Medical School, Boston, MA, USA
| | | | - Debora S Marks
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA
| | - Michael T Laub
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA.
- Howard Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, MA, USA.
| |
Collapse
|
45
|
Xie VC, Styles MJ, Dickinson BC. Methods for the directed evolution of biomolecular interactions. Trends Biochem Sci 2022; 47:403-416. [PMID: 35427479 PMCID: PMC9022280 DOI: 10.1016/j.tibs.2022.01.001] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2021] [Revised: 12/27/2021] [Accepted: 01/13/2022] [Indexed: 02/06/2023]
Abstract
Noncovalent interactions between biomolecules such as proteins and nucleic acids coordinate all cellular processes through changes in proximity. Tools that perturb these interactions are and will continue to be highly valuable for basic and translational scientific endeavors. By taking cues from natural systems, such as the adaptive immune system, we can design directed evolution platforms that can generate proteins that bind to biomolecules of interest. In recent years, the platforms used to direct the evolution of biomolecular binders have greatly expanded the range of types of interactions one can evolve. Herein, we review recent advances in methods to evolve protein-protein, protein-RNA, and protein-DNA interactions.
Collapse
Affiliation(s)
| | - Matthew J Styles
- Department of Chemistry, The University of Chicago, Chicago, IL 60637, USA
| | - Bryan C Dickinson
- Department of Chemistry, The University of Chicago, Chicago, IL 60637, USA.
| |
Collapse
|
46
|
Hayes RL, Vilseck JZ, Brooks CL. Addressing Intersite Coupling Unlocks Large Combinatorial Chemical Spaces for Alchemical Free Energy Methods. J Chem Theory Comput 2022; 18:2114-2123. [PMID: 35255214 PMCID: PMC9700482 DOI: 10.1021/acs.jctc.1c00948] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Alchemical free energy methods are playing a growing role in molecular design, both for computer-aided drug design of small molecules and for computational protein design. Multisite λ dynamics (MSλD) is a uniquely scalable alchemical free energy method that enables more efficient exploration of combinatorial alchemical spaces encountered in molecular design, but simulations have typically been limited to a few hundred ligands or sequences. Here, we focus on coupling between sites to enable scaling to larger alchemical spaces. We first discuss updates to the biasing potentials that facilitate MSλD sampling to include coupling terms and show that this can provide more thorough sampling of alchemical states. We then harness coupling between sites by developing a new free energy estimator based on the Potts models underlying direct coupling analysis, a method for predicting contacts from sequence coevolution, and find it yields more accurate free energies than previous estimators. The sampling requirements of the Potts model estimator scale with the square of the number of sites, a substantial improvement over the exponential scaling of the standard estimator. This opens up exploration of much larger alchemical spaces with MSλD for molecular design.
Collapse
Affiliation(s)
- Ryan L Hayes
- Department of Chemistry, University of Michigan, Ann Arbor, Michigan 48109, United States
- Biophysics Program, University of Michigan, Ann Arbor, Michigan 48109, United States
| | - Jonah Z Vilseck
- Department of Biochemistry and Molecular Biology, Indiana University School of Medicine, Indianapolis, Indiana 46202, United States
- Center for Computational Biology and Bioinformatics, Indiana University School of Medicine, Indianapolis, Indiana 46202, United States
| | - Charles L Brooks
- Department of Chemistry, University of Michigan, Ann Arbor, Michigan 48109, United States
- Biophysics Program, University of Michigan, Ann Arbor, Michigan 48109, United States
| |
Collapse
|
47
|
Environmental selection and epistasis in an empirical phenotype-environment-fitness landscape. Nat Ecol Evol 2022; 6:427-438. [PMID: 35210579 DOI: 10.1038/s41559-022-01675-5] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2021] [Accepted: 12/14/2021] [Indexed: 11/08/2022]
Abstract
Fitness landscapes, mappings of genotype/phenotype to their effects on fitness, are invaluable concepts in evolutionary biochemistry. Although widely discussed, measurements of phenotype-fitness landscapes in proteins remain scarce. Here, we quantify all single mutational effects on fitness and phenotype (EC50) of VIM-2 β-lactamase across a 64-fold range of ampicillin concentrations. We then construct a phenotype-fitness landscape that takes variations in environmental selection pressure into account. We found that a simple, empirical landscape accurately models the ~39,000 mutational data points, suggesting that the evolution of VIM-2 can be predicted on the basis of the selection environment. Our landscape provides new quantitative knowledge on the evolution of the β-lactamases and proteins in general, particularly their evolutionary dynamics under subinhibitory antibiotic concentrations, as well as the mechanisms and environmental dependence of non-specific epistasis.
Collapse
|
48
|
Mascotti ML. Resurrecting Enzymes by Ancestral Sequence Reconstruction. METHODS IN MOLECULAR BIOLOGY (CLIFTON, N.J.) 2022; 2397:111-136. [PMID: 34813062 DOI: 10.1007/978-1-0716-1826-4_7] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Ancestral Sequence Reconstruction (ASR) allows one to infer the sequences of extinct proteins using the phylogeny of extant proteins. It consists of disclosing the evolutionary history-i.e., the phylogeny-of a protein family of interest and then inferring the sequences of its ancestors-i.e., the nodes in the phylogeny. Assisted by gene synthesis, the selected ancestors can be resurrected in the lab and experimentally characterized. The crucial step to succeed with ASR is starting from a reliable phylogeny. At the same time, it is of the utmost importance to have a clear idea on the evolutionary history of the family under study and the events that influenced it. This allows us to implement ASR with well-defined hypotheses and to apply the appropriate experimental methods. In the last years, ASR has become popular to test hypotheses about the origin of functionalities, changes in activities, understanding physicochemical properties of proteins, among others. In this context, the aim of this chapter is to present the ASR approach applied to the reconstruction of enzymes-i.e., proteins with catalytic roles. The spirit of this contribution is to provide a basic, hands-to-work guide for biochemists and biologists who are unfamiliar with molecular phylogenetics.
Collapse
Affiliation(s)
- Maria Laura Mascotti
- Molecular Enzymology group, University of Groningen, Groningen, The Netherlands. .,IMIBIO-SL CONICET, Facultad de Química Bioquímica y Farmacia, Universidad Nacional de San Luis, San Luis, Argentina.
| |
Collapse
|
49
|
Truong DP, Rousseau S, Machala BW, Huddleston JP, Zhu M, Hull KG, Romo D, Raushel FM, Sacchettini JC, Glasner ME. Second-Shell Amino Acid R266 Helps Determine N-Succinylamino Acid Racemase Reaction Specificity in Promiscuous N-Succinylamino Acid Racemase/ o-Succinylbenzoate Synthase Enzymes. Biochemistry 2021; 60:3829-3840. [PMID: 34845903 DOI: 10.1021/acs.biochem.1c00627] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Catalytic promiscuity is the coincidental ability to catalyze nonbiological reactions in the same active site as the native biological reaction. Several lines of evidence show that catalytic promiscuity plays a role in the evolution of new enzyme functions. Thus, studying catalytic promiscuity can help identify structural features that predispose an enzyme to evolve new functions. This study identifies a potentially preadaptive residue in a promiscuous N-succinylamino acid racemase/o-succinylbenzoate synthase (NSAR/OSBS) enzyme from Amycolatopsis sp. T-1-60. This enzyme belongs to a branch of the OSBS family which includes many catalytically promiscuous NSAR/OSBS enzymes. R266 is conserved in all members of the NSAR/OSBS subfamily. However, the homologous position is usually hydrophobic in other OSBS subfamilies, whose enzymes lack NSAR activity. The second-shell amino acid R266 is close to the catalytic acid/base K263, but it does not contact the substrate, suggesting that R266 could affect the catalytic mechanism. Mutating R266 to glutamine in Amycolatopsis NSAR/OSBS profoundly reduces NSAR activity but moderately reduces OSBS activity. This is due to a 1000-fold decrease in the rate of proton exchange between the substrate and the general acid/base catalyst K263. This mutation is less deleterious for the OSBS reaction because K263 forms a cation-π interaction with the OSBS substrate and/or the intermediate, rather than acting as a general acid/base catalyst. Together, the data explain how R266 contributes to NSAR reaction specificity and was likely an essential preadaptation for the evolution of NSAR activity.
Collapse
Affiliation(s)
- Dat P Truong
- Department of Biochemistry and Biophysics, Texas A&M University, 2128 TAMU, College Station, Texas 77843-2128, United States
| | - Simon Rousseau
- Department of Biochemistry and Biophysics, Texas A&M University, 2128 TAMU, College Station, Texas 77843-2128, United States
| | - Benjamin W Machala
- Department of Biochemistry and Biophysics, Texas A&M University, 2128 TAMU, College Station, Texas 77843-2128, United States
| | - Jamison P Huddleston
- Department of Chemistry, Texas A&M University, 3255 TAMU, College Station, Texas 77843-3255, United States
| | - Mingzhao Zhu
- Baylor Synthesis and Drug-Lead Discovery Laboratory, Department of Chemistry and Biochemistry, Baylor University, One Bear Place, Waco, Texas 76798-7348, United States
| | - Kenneth G Hull
- Baylor Synthesis and Drug-Lead Discovery Laboratory, Department of Chemistry and Biochemistry, Baylor University, One Bear Place, Waco, Texas 76798-7348, United States
| | - Daniel Romo
- Baylor Synthesis and Drug-Lead Discovery Laboratory, Department of Chemistry and Biochemistry, Baylor University, One Bear Place, Waco, Texas 76798-7348, United States
| | - Frank M Raushel
- Department of Biochemistry and Biophysics, Texas A&M University, 2128 TAMU, College Station, Texas 77843-2128, United States.,Department of Chemistry, Texas A&M University, 3255 TAMU, College Station, Texas 77843-3255, United States
| | - James C Sacchettini
- Department of Chemistry, Texas A&M University, 3255 TAMU, College Station, Texas 77843-3255, United States
| | - Margaret E Glasner
- Department of Biochemistry and Biophysics, Texas A&M University, 2128 TAMU, College Station, Texas 77843-2128, United States
| |
Collapse
|
50
|
Castiglione GM, Zhou L, Xu Z, Neiman Z, Hung CF, Duh EJ. Evolutionary pathways to SARS-CoV-2 resistance are opened and closed by epistasis acting on ACE2. PLoS Biol 2021; 19:e3001510. [PMID: 34932561 PMCID: PMC8730403 DOI: 10.1371/journal.pbio.3001510] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2021] [Revised: 01/05/2022] [Accepted: 12/08/2021] [Indexed: 02/06/2023] Open
Abstract
Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) infects a broader range of mammalian species than previously predicted, binding a diversity of angiotensin converting enzyme 2 (ACE2) orthologs despite extensive sequence divergence. Within this sequence degeneracy, we identify a rare sequence combination capable of conferring SARS-CoV-2 resistance. We demonstrate that this sequence was likely unattainable during human evolution due to deleterious effects on ACE2 carboxypeptidase activity, which has vasodilatory and cardioprotective functions in vivo. Across the 25 ACE2 sites implicated in viral binding, we identify 6 amino acid substitutions unique to mouse-one of the only known mammalian species resistant to SARS-CoV-2. Substituting human variants at these positions is sufficient to confer binding of the SARS-CoV-2 S protein to mouse ACE2, facilitating cellular infection. Conversely, substituting mouse variants into either human or dog ACE2 abolishes viral binding, diminishing cellular infection. However, these same substitutions decrease human ACE2 activity by 50% and are predicted as pathogenic, consistent with the extreme rarity of human polymorphisms at these sites. This trade-off can be avoided, however, depending on genetic background; if substituted simultaneously, these same mutations have no deleterious effect on dog ACE2 nor that of the rodent ancestor estimated to exist 70 million years ago. This genetic contingency (epistasis) may have therefore opened the road to resistance for some species, while making humans susceptible to viruses that use these ACE2 surfaces for binding, as does SARS-CoV-2.
Collapse
Affiliation(s)
- Gianni M. Castiglione
- Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, Maryland, United States of America
| | - Lingli Zhou
- Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, Maryland, United States of America
| | - Zhenhua Xu
- Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, Maryland, United States of America
| | - Zachary Neiman
- Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, Maryland, United States of America
| | - Chien-Fu Hung
- Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, Maryland, United States of America
| | - Elia J. Duh
- Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, Maryland, United States of America
| |
Collapse
|