1
|
Tripp A, Braun M, Wieser F, Oberdorfer G, Lechner H. Click, Compute, Create: A Review of Web-based Tools for Enzyme Engineering. Chembiochem 2024; 25:e202400092. [PMID: 38634409 DOI: 10.1002/cbic.202400092] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2024] [Revised: 04/14/2024] [Accepted: 04/15/2024] [Indexed: 04/19/2024]
Abstract
Enzyme engineering, though pivotal across various biotechnological domains, is often plagued by its time-consuming and labor-intensive nature. This review aims to offer an overview of supportive in silico methodologies for this demanding endeavor. Starting from methods to predict protein structures, to classification of their activity and even the discovery of new enzymes we continue with describing tools used to increase thermostability and production yields of selected targets. Subsequently, we discuss computational methods to modulate both, the activity as well as selectivity of enzymes. Last, we present recent approaches based on cutting-edge machine learning methods to redesign enzymes. With exception of the last chapter, there is a strong focus on methods easily accessible via web-interfaces or simple Python-scripts, therefore readily useable for a diverse and broad community.
Collapse
Affiliation(s)
- Adrian Tripp
- Institute of Biochemistry, Graz University of Technology, Petersgasse 12/2, 8010, Graz, Austria
| | - Markus Braun
- Institute of Biochemistry, Graz University of Technology, Petersgasse 12/2, 8010, Graz, Austria
| | - Florian Wieser
- Institute of Biochemistry, Graz University of Technology, Petersgasse 12/2, 8010, Graz, Austria
| | - Gustav Oberdorfer
- Institute of Biochemistry, Graz University of Technology, Petersgasse 12/2, 8010, Graz, Austria
- BioTechMed, Graz, Austria
| | - Horst Lechner
- Institute of Biochemistry, Graz University of Technology, Petersgasse 12/2, 8010, Graz, Austria
- BioTechMed, Graz, Austria
| |
Collapse
|
2
|
Hollmann F, Sanchis J, Reetz MT. Learning from Protein Engineering by Deconvolution of Multi-Mutational Variants. Angew Chem Int Ed Engl 2024; 63:e202404880. [PMID: 38884594 DOI: 10.1002/anie.202404880] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2024] [Revised: 06/05/2024] [Accepted: 06/06/2024] [Indexed: 06/18/2024]
Abstract
This review analyzes a development in biochemistry, enzymology and biotechnology that originally came as a surprise. Following the establishment of directed evolution of stereoselective enzymes in organic chemistry, the concept of partial or complete deconvolution of selective multi-mutational variants was introduced. Early deconvolution experiments of stereoselective variants led to the finding that mutations can interact cooperatively or antagonistically with one another, not just additively. During the past decade, this phenomenon was shown to be general. In some studies, molecular dynamics (MD) and quantum mechanics/molecular mechanics (QM/MM) computations were performed in order to shed light on the origin of non-additivity at all stages of an evolutionary upward climb. Data of complete deconvolution can be used to construct unique multi-dimensional rugged fitness pathway landscapes, which provide mechanistic insights different from traditional fitness landscapes. Along a related line, biochemists have long tested the result of introducing two point mutations in an enzyme for mechanistic reasons, followed by a comparison of the respective double mutant in so-called double mutant cycles, which originally showed only additive effects, but more recently also uncovered cooperative and antagonistic non-additive effects. We conclude with suggestions for future work, and call for a unified overall picture of non-additivity and epistasis.
Collapse
Affiliation(s)
- Frank Hollmann
- Department of Biotechnology, Delft University of Technology, Van der Maasweg 9, 2629HZ, Delft, Netherlands
| | - Joaquin Sanchis
- Monash Institute of Pharmaceutical Sciences, Monash University, Parkville, Victoria, 3052, Australia
| | - Manfred T Reetz
- Max-Plank-Institut für Kohlenforschung, Kaiser-Wilhelm-Platz 1, 45481, Mülheim, Germany
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, 300308, China
| |
Collapse
|
3
|
Lipsh-Sokolik R, Fleishman SJ. Addressing epistasis in the design of protein function. Proc Natl Acad Sci U S A 2024; 121:e2314999121. [PMID: 39133844 PMCID: PMC11348311 DOI: 10.1073/pnas.2314999121] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/29/2024] Open
Abstract
Mutations in protein active sites can dramatically improve function. The active site, however, is densely packed and extremely sensitive to mutations. Therefore, some mutations may only be tolerated in combination with others in a phenomenon known as epistasis. Epistasis reduces the likelihood of obtaining improved functional variants and dramatically slows natural and lab evolutionary processes. Research has shed light on the molecular origins of epistasis and its role in shaping evolutionary trajectories and outcomes. In addition, sequence- and AI-based strategies that infer epistatic relationships from mutational patterns in natural or experimental evolution data have been used to design functional protein variants. In recent years, combinations of such approaches and atomistic design calculations have successfully predicted highly functional combinatorial mutations in active sites. These were used to design thousands of functional active-site variants, demonstrating that, while our understanding of epistasis remains incomplete, some of the determinants that are critical for accurate design are now sufficiently understood. We conclude that the space of active-site variants that has been explored by evolution may be expanded dramatically to enhance natural activities or discover new ones. Furthermore, design opens the way to systematically exploring sequence and structure space and mutational impacts on function, deepening our understanding and control over protein activity.
Collapse
Affiliation(s)
- Rosalie Lipsh-Sokolik
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot 7610001, Israel
| | - Sarel J Fleishman
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot 7610001, Israel
| |
Collapse
|
4
|
Johnston KE, Almhjell PJ, Watkins-Dulaney EJ, Liu G, Porter NJ, Yang J, Arnold FH. A combinatorially complete epistatic fitness landscape in an enzyme active site. Proc Natl Acad Sci U S A 2024; 121:e2400439121. [PMID: 39074291 PMCID: PMC11317637 DOI: 10.1073/pnas.2400439121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2024] [Accepted: 06/17/2024] [Indexed: 07/31/2024] Open
Abstract
Protein engineering often targets binding pockets or active sites which are enriched in epistasis-nonadditive interactions between amino acid substitutions-and where the combined effects of multiple single substitutions are difficult to predict. Few existing sequence-fitness datasets capture epistasis at large scale, especially for enzyme catalysis, limiting the development and assessment of model-guided enzyme engineering approaches. We present here a combinatorially complete, 160,000-variant fitness landscape across four residues in the active site of an enzyme. Assaying the native reaction of a thermostable β-subunit of tryptophan synthase (TrpB) in a nonnative environment yielded a landscape characterized by significant epistasis and many local optima. These effects prevent simulated directed evolution approaches from efficiently reaching the global optimum. There is nonetheless wide variability in the effectiveness of different directed evolution approaches, which together provide experimental benchmarks for computational and machine learning workflows. The most-fit TrpB variants contain a substitution that is nearly absent in natural TrpB sequences-a result that conservation-based predictions would not capture. Thus, although fitness prediction using evolutionary data can enrich in more-active variants, these approaches struggle to identify and differentiate among the most-active variants, even for this near-native function. Overall, this work presents a large-scale testing ground for model-guided enzyme engineering and suggests that efficient navigation of epistatic fitness landscapes can be improved by advances in both machine learning and physical modeling.
Collapse
Affiliation(s)
- Kadina E. Johnston
- Division of Biology and Bioengineering, California Institute of Technology, Pasadena, CA91125
| | - Patrick J. Almhjell
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA91125
| | - Ella J. Watkins-Dulaney
- Division of Biology and Bioengineering, California Institute of Technology, Pasadena, CA91125
| | - Grace Liu
- Division of Biology and Bioengineering, California Institute of Technology, Pasadena, CA91125
| | - Nicholas J. Porter
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA91125
| | - Jason Yang
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA91125
| | - Frances H. Arnold
- Division of Biology and Bioengineering, California Institute of Technology, Pasadena, CA91125
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA91125
| |
Collapse
|
5
|
Listov D, Goverde CA, Correia BE, Fleishman SJ. Opportunities and challenges in design and optimization of protein function. Nat Rev Mol Cell Biol 2024; 25:639-653. [PMID: 38565617 PMCID: PMC7616297 DOI: 10.1038/s41580-024-00718-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/27/2024] [Indexed: 04/04/2024]
Abstract
The field of protein design has made remarkable progress over the past decade. Historically, the low reliability of purely structure-based design methods limited their application, but recent strategies that combine structure-based and sequence-based calculations, as well as machine learning tools, have dramatically improved protein engineering and design. In this Review, we discuss how these methods have enabled the design of increasingly complex structures and therapeutically relevant activities. Additionally, protein optimization methods have improved the stability and activity of complex eukaryotic proteins. Thanks to their increased reliability, computational design methods have been applied to improve therapeutics and enzymes for green chemistry and have generated vaccine antigens, antivirals and drug-delivery nano-vehicles. Moreover, the high success of design methods reflects an increased understanding of basic rules that govern the relationships among protein sequence, structure and function. However, de novo design is still limited mostly to α-helix bundles, restricting its potential to generate sophisticated enzymes and diverse protein and small-molecule binders. Designing complex protein structures is a challenging but necessary next step if we are to realize our objective of generating new-to-nature activities.
Collapse
Affiliation(s)
- Dina Listov
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel
| | - Casper A Goverde
- Institute of Bioengineering, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| | - Bruno E Correia
- Institute of Bioengineering, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland.
| | - Sarel Jacob Fleishman
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel.
| |
Collapse
|
6
|
Chitra U, Arnold BJ, Raphael BJ. Quantifying higher-order epistasis: beware the chimera. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.17.603976. [PMID: 39071303 PMCID: PMC11275791 DOI: 10.1101/2024.07.17.603976] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/30/2024]
Abstract
Epistasis, or interactions in which alleles at one locus modify the fitness effects of alleles at other loci, plays a fundamental role in genetics, protein evolution, and many other areas of biology. Epistasis is typically quantified by computing the deviation from the expected fitness under an additive or multiplicative model using one of several formulae. However, these formulae are not all equivalent. Importantly, one widely used formula - which we call the chimeric formula - measures deviations from a multiplicative fitness model on an additive scale, thus mixing two measurement scales. We show that for pairwise interactions, the chimeric formula yields a different magnitude, but the same sign (synergistic vs. antagonistic) of epistasis compared to the multiplicative formula that measures both fitness and deviations on a multiplicative scale. However, for higher-order interactions, we show that the chimeric formula can have both different magnitude and sign compared to the multiplicative formula - thus confusing negative epistatic interactions with positive interactions, and vice versa. We resolve these inconsistencies by deriving fundamental connections between the different epistasis formulae and the parameters of the multivariate Bernoulli distribution . Our results demonstrate that the additive and multiplicative epistasis formulae are more mathematically sound than the chimeric formula. Moreover, we demonstrate that the mathematical issues with the chimeric epistasis formula lead to markedly different biological interpretations of real data. Analyzing multi-gene knockout data in yeast, multi-way drug interactions in E. coli , and deep mutational scanning (DMS) of several proteins, we find that 10 - 60% of higher-order interactions have a change in sign with the multiplicative or additive epistasis formula. These sign changes result in qualitatively different findings on functional divergence in the yeast genome, synergistic vs. antagonistic drug interactions, and and epistasis between protein mutations. In particular, in the yeast data, the more appropriate multiplicative formula identifies nearly 500 additional negative three-way interactions, thus extending the trigenic interaction network by 25%.
Collapse
|
7
|
Schnettler JD, Wang MS, Gantz M, Bunzel HA, Karas C, Hollfelder F, Hecht MH. Selection of a promiscuous minimalist cAMP phosphodiesterase from a library of de novo designed proteins. Nat Chem 2024; 16:1200-1208. [PMID: 38702405 PMCID: PMC11230910 DOI: 10.1038/s41557-024-01490-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2023] [Accepted: 02/27/2024] [Indexed: 05/06/2024]
Abstract
The ability of unevolved amino acid sequences to become biological catalysts was key to the emergence of life on Earth. However, billions of years of evolution separate complex modern enzymes from their simpler early ancestors. To probe how unevolved sequences can develop new functions, we use ultrahigh-throughput droplet microfluidics to screen for phosphoesterase activity amidst a library of more than one million sequences based on a de novo designed 4-helix bundle. Characterization of hits revealed that acquisition of function involved a large jump in sequence space enriching for truncations that removed >40% of the protein chain. Biophysical characterization of a catalytically active truncated protein revealed that it dimerizes into an α-helical structure, with the gain of function accompanied by increased structural dynamics. The identified phosphodiesterase is a manganese-dependent metalloenzyme that hydrolyses a range of phosphodiesters. It is most active towards cyclic AMP, with a rate acceleration of ~109 and a catalytic proficiency of >1014 M-1, comparable to larger enzymes shaped by billions of years of evolution.
Collapse
Affiliation(s)
| | - Michael S Wang
- Department of Chemistry, Princeton University, Princeton, USA
| | - Maximilian Gantz
- Department of Biochemistry, University of Cambridge, Cambridge, UK
| | - H Adrian Bunzel
- Department of Biosystems Science and Engineering, ETH Zürich, Basel, Switzerland
| | - Christina Karas
- Department of Molecular Biology, Princeton University, Princeton, USA
| | | | - Michael H Hecht
- Department of Chemistry, Princeton University, Princeton, USA.
| |
Collapse
|
8
|
Fram B, Su Y, Truebridge I, Riesselman AJ, Ingraham JB, Passera A, Napier E, Thadani NN, Lim S, Roberts K, Kaur G, Stiffler MA, Marks DS, Bahl CD, Khan AR, Sander C, Gauthier NP. Simultaneous enhancement of multiple functional properties using evolution-informed protein design. Nat Commun 2024; 15:5141. [PMID: 38902262 PMCID: PMC11190266 DOI: 10.1038/s41467-024-49119-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2023] [Accepted: 05/24/2024] [Indexed: 06/22/2024] Open
Abstract
A major challenge in protein design is to augment existing functional proteins with multiple property enhancements. Altering several properties likely necessitates numerous primary sequence changes, and novel methods are needed to accurately predict combinations of mutations that maintain or enhance function. Models of sequence co-variation (e.g., EVcouplings), which leverage extensive information about various protein properties and activities from homologous protein sequences, have proven effective for many applications including structure determination and mutation effect prediction. We apply EVcouplings to computationally design variants of the model protein TEM-1 β-lactamase. Nearly all the 14 experimentally characterized designs were functional, including one with 84 mutations from the nearest natural homolog. The designs also had large increases in thermostability, increased activity on multiple substrates, and nearly identical structure to the wild type enzyme. This study highlights the efficacy of evolutionary models in guiding large sequence alterations to generate functional diversity for protein design applications.
Collapse
Affiliation(s)
- Benjamin Fram
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA.
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA.
| | - Yang Su
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA
| | - Ian Truebridge
- Institute for Protein Innovation, Boston, MA, USA
- Division of Hematology/Oncology, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA
- AI Proteins, Boston, MA, USA
| | - Adam J Riesselman
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA
- Program in Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - John B Ingraham
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA
| | - Alessandro Passera
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA
- Research Institute of Molecular Pathology (IMP), Vienna BioCenter (VBC), Campus-Vienna-Biocenter 1, 1030, Vienna, Austria
| | - Eve Napier
- School of Biochemistry and Immunology, Trinity College Dublin, Dublin 2, Ireland
| | - Nicole N Thadani
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA
- Apriori Bio, Cambridge, MA, USA
| | - Samuel Lim
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA
| | - Kristen Roberts
- Selux Diagnostics Inc., 56 Roland Street, Charlestown, MA, USA
| | - Gurleen Kaur
- Selux Diagnostics Inc., 56 Roland Street, Charlestown, MA, USA
| | - Michael A Stiffler
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA
- Dyno Therapeutics, 343 Arsenal Street, Watertown, MA, USA
| | - Debora S Marks
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Christopher D Bahl
- Institute for Protein Innovation, Boston, MA, USA
- Division of Hematology/Oncology, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA
- AI Proteins, Boston, MA, USA
| | - Amir R Khan
- School of Biochemistry and Immunology, Trinity College Dublin, Dublin 2, Ireland
- Division of Newborn Medicine, Boston Children's Hospital, Boston, MA, USA
| | - Chris Sander
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Nicholas P Gauthier
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA.
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA.
- Broad Institute of MIT and Harvard, Cambridge, MA, USA.
| |
Collapse
|
9
|
Ose NJ, Campitelli P, Modi T, Kazan IC, Kumar S, Ozkan SB. Some mechanistic underpinnings of molecular adaptations of SARS-COV-2 spike protein by integrating candidate adaptive polymorphisms with protein dynamics. eLife 2024; 12:RP92063. [PMID: 38713502 PMCID: PMC11076047 DOI: 10.7554/elife.92063] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/08/2024] Open
Abstract
We integrate evolutionary predictions based on the neutral theory of molecular evolution with protein dynamics to generate mechanistic insight into the molecular adaptations of the SARS-COV-2 spike (S) protein. With this approach, we first identified candidate adaptive polymorphisms (CAPs) of the SARS-CoV-2 S protein and assessed the impact of these CAPs through dynamics analysis. Not only have we found that CAPs frequently overlap with well-known functional sites, but also, using several different dynamics-based metrics, we reveal the critical allosteric interplay between SARS-CoV-2 CAPs and the S protein binding sites with the human ACE2 (hACE2) protein. CAPs interact far differently with the hACE2 binding site residues in the open conformation of the S protein compared to the closed form. In particular, the CAP sites control the dynamics of binding residues in the open state, suggesting an allosteric control of hACE2 binding. We also explored the characteristic mutations of different SARS-CoV-2 strains to find dynamic hallmarks and potential effects of future mutations. Our analyses reveal that Delta strain-specific variants have non-additive (i.e., epistatic) interactions with CAP sites, whereas the less pathogenic Omicron strains have mostly additive mutations. Finally, our dynamics-based analysis suggests that the novel mutations observed in the Omicron strain epistatically interact with the CAP sites to help escape antibody binding.
Collapse
Affiliation(s)
- Nicholas James Ose
- Department of Physics and Center for Biological Physics, Arizona State UniversityTempeUnited States
| | - Paul Campitelli
- Department of Physics and Center for Biological Physics, Arizona State UniversityTempeUnited States
| | - Tushar Modi
- Department of Physics and Center for Biological Physics, Arizona State UniversityTempeUnited States
| | - I Can Kazan
- Department of Physics and Center for Biological Physics, Arizona State UniversityTempeUnited States
| | - Sudhir Kumar
- Institute for Genomics and Evolutionary Medicine, Temple UniversityPhiladelphiaUnited States
- Department of Biology, Temple UniversityPhiladelphiaUnited States
- Center for Genomic Medicine Research, King Abdulaziz UniversityJeddahSaudi Arabia
| | - Sefika Banu Ozkan
- Department of Physics and Center for Biological Physics, Arizona State UniversityTempeUnited States
| |
Collapse
|
10
|
Lou J, Liang W, Cao L, Hu I, Zhao S, Chen Z, Chan RWY, Cheung PPH, Zheng H, Liu C, Li Q, Chong MKC, Zhang Y, Yeoh EK, Chan PKS, Zee BCY, Mok CKP, Wang MH. Predictive evolutionary modelling for influenza virus by site-based dynamics of mutations. Nat Commun 2024; 15:2546. [PMID: 38514647 PMCID: PMC10958014 DOI: 10.1038/s41467-024-46918-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2023] [Accepted: 03/12/2024] [Indexed: 03/23/2024] Open
Abstract
Influenza virus continuously evolves to escape human adaptive immunity and generates seasonal epidemics. Therefore, influenza vaccine strains need to be updated annually for the upcoming flu season to ensure vaccine effectiveness. We develop a computational approach, beth-1, to forecast virus evolution and select representative virus for influenza vaccine. The method involves modelling site-wise mutation fitness. Informed by virus genome and population sero-positivity, we calibrate transition time of mutations and project the fitness landscape to future time, based on which beth-1 selects the optimal vaccine strain. In season-to-season prediction in historical data for the influenza A pH1N1 and H3N2 viruses, beth-1 demonstrates superior genetic matching compared to existing approaches. In prospective validations, the model shows superior or non-inferior genetic matching and neutralization against circulating virus in mice immunization experiments compared to the current vaccine. The method offers a promising and ready-to-use tool to facilitate vaccine strain selection for the influenza virus through capturing heterogeneous evolutionary dynamics over genome space-time and linking molecular variants to population immune response.
Collapse
Affiliation(s)
- Jingzhi Lou
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China
- Beth Bioinformatics Co. Ltd, Hong Kong SAR, China
| | - Weiwen Liang
- HKU-Pasteur Research Pole, School of Public Health, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
| | - Lirong Cao
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China
- CUHK Shenzhen Research Institute, Shenzhen, China
| | - Inchi Hu
- Department of Statistics, George Mason University, Fairfax, VA, USA
| | - Shi Zhao
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China
- School of Public Health, Tianjin Medical University, Tianjin, China
| | - Zigui Chen
- Department of Microbiology, CUHK, Hong Kong SAR, China
| | - Renee Wan Yi Chan
- Department of Paediatrics, CUHK, Hong Kong SAR, China
- Hong Kong Hub of Paediatric Excellence, CUHK, Hong Kong SAR, China
| | | | - Hong Zheng
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China
| | - Caiqi Liu
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China
| | - Qi Li
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China
| | - Marc Ka Chun Chong
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China
- CUHK Shenzhen Research Institute, Shenzhen, China
| | - Yexian Zhang
- Beth Bioinformatics Co. Ltd, Hong Kong SAR, China
- CUHK Shenzhen Research Institute, Shenzhen, China
| | - Eng-Kiong Yeoh
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China
- Centre for Health Systems and Policy Research, CUHK, Hong Kong SAR, China
| | - Paul Kay-Sheung Chan
- Department of Microbiology, CUHK, Hong Kong SAR, China
- Stanley Ho Centre for Emerging Infectious Diseases, CUHK, Hong Kong SAR, China
| | - Benny Chung Ying Zee
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China
- CUHK Shenzhen Research Institute, Shenzhen, China
| | - Chris Ka Pun Mok
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China.
- Li Ka Shing Institute of Health Sciences, Faculty of Medicine, CUHK, Hong Kong SAR, China.
| | - Maggie Haitian Wang
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China.
- CUHK Shenzhen Research Institute, Shenzhen, China.
| |
Collapse
|
11
|
Coleman T, Shin J, Silberg JJ, Shamoo Y, Atkinson JT. The Biochemical Impact of Extracting an Embedded Adenylate Kinase Domain Using Circular Permutation. Biochemistry 2024; 63:599-609. [PMID: 38357768 DOI: 10.1021/acs.biochem.3c00605] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/16/2024]
Abstract
Adenylate kinases (AKs) have evolved AMP-binding and lid domains that are encoded as continuous polypeptides embedded at different locations within the discontinuous polypeptide encoding the core domain. A prior study showed that AK homologues of different stabilities consistently retain cellular activity following circular permutation that splits a region with high energetic frustration within the AMP-binding domain into discontinuous fragments. Herein, we show that mesophilic and thermophilic AKs having this topological restructuring retain activity and substrate-binding characteristics of the parental AK. While permutation decreased the activity of both AK homologues at physiological temperatures, the catalytic activity of the thermophilic AK increased upon permutation when assayed >30 °C below the melting temperature of the native AK. The thermostabilities of the permuted AKs were uniformly lower than those of native AKs, and they exhibited multiphasic unfolding transitions, unlike the native AKs, which presented cooperative thermal unfolding. In addition, proteolytic digestion revealed that permutation destabilized each AK in differing manners, and mass spectrometry suggested that the new termini within the AMP-binding domain were responsible for the increased proteolysis sensitivity. These findings illustrate how changes in contact order can be used to tune enzyme activity and alter folding dynamics in multidomain enzymes.
Collapse
Affiliation(s)
- Tom Coleman
- Department of BioSciences, Rice University, MS-140, 6100 Main Street, Houston, Texas 77005, United States
| | - John Shin
- Department of BioSciences, Rice University, MS-140, 6100 Main Street, Houston, Texas 77005, United States
| | - Jonathan J Silberg
- Department of BioSciences, Rice University, MS-140, 6100 Main Street, Houston, Texas 77005, United States
- Department of Chemical and Biomolecular Engineering, Rice University, MS-362, 6100 Main Street, Houston, Texas 77005, United States
- Department of Bioengineering, Rice University, MS-142, 6100 Main Street, Houston, Texas 77005, United States
| | - Yousif Shamoo
- Department of BioSciences, Rice University, MS-140, 6100 Main Street, Houston, Texas 77005, United States
| | - Joshua T Atkinson
- Department of BioSciences, Rice University, MS-140, 6100 Main Street, Houston, Texas 77005, United States
- Department of Physics and Astronomy, University of Southern California, Los Angeles, California 90007, United States
- Department of Civil and Environmental Engineering, Princeton University, Princeton, New Jersey 08544, United States
- Omenn-Darling Bioengineering Institute, Princeton University, Princeton, New Jersey 08544, United States
| |
Collapse
|
12
|
Radojković M, Ubbink M. Positive epistasis drives clavulanic acid resistance in double mutant libraries of BlaC β-lactamase. Commun Biol 2024; 7:197. [PMID: 38368480 PMCID: PMC10874438 DOI: 10.1038/s42003-024-05868-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2023] [Accepted: 01/26/2024] [Indexed: 02/19/2024] Open
Abstract
Phenotypic effects of mutations are highly dependent on the genetic backgrounds in which they occur, due to epistatic effects. To test how easily the loss of enzyme activity can be compensated for, we screen mutant libraries of BlaC, a β-lactamase from Mycobacterium tuberculosis, for fitness in the presence of carbenicillin and the inhibitor clavulanic acid. Using a semi-rational approach and deep sequencing, we prepare four double-site saturation libraries and determine the relative fitness effect for 1534/1540 (99.6%) of the unique library members at two temperatures. Each library comprises variants of a residue known to be relevant for clavulanic acid resistance as well as residue 105, which regulates access to the active site. Variants with greatly improved fitness were identified within each library, demonstrating that compensatory mutations for loss of activity can be readily found. In most cases, the fittest variants are a result of positive epistasis, indicating strong synergistic effects between the chosen residue pairs. Our study sheds light on a role of epistasis in the evolution of functional residues and underlines the highly adaptive potential of BlaC.
Collapse
Affiliation(s)
- Marko Radojković
- Leiden Institute of Chemistry, Leiden University, Einsteinweg 55, 2333 CC, Leiden, The Netherlands
| | - Marcellus Ubbink
- Leiden Institute of Chemistry, Leiden University, Einsteinweg 55, 2333 CC, Leiden, The Netherlands.
| |
Collapse
|
13
|
Alvarez S, Nartey CM, Mercado N, de la Paz JA, Huseinbegovic T, Morcos F. In vivo functional phenotypes from a computational epistatic model of evolution. Proc Natl Acad Sci U S A 2024; 121:e2308895121. [PMID: 38285950 PMCID: PMC10861889 DOI: 10.1073/pnas.2308895121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2023] [Accepted: 12/19/2023] [Indexed: 01/31/2024] Open
Abstract
Computational models of evolution are valuable for understanding the dynamics of sequence variation, to infer phylogenetic relationships or potential evolutionary pathways and for biomedical and industrial applications. Despite these benefits, few have validated their propensities to generate outputs with in vivo functionality, which would enhance their value as accurate and interpretable evolutionary algorithms. We demonstrate the power of epistasis inferred from natural protein families to evolve sequence variants in an algorithm we developed called sequence evolution with epistatic contributions (SEEC). Utilizing the Hamiltonian of the joint probability of sequences in the family as fitness metric, we sampled and experimentally tested for in vivo [Formula: see text]-lactamase activity in Escherichia coli TEM-1 variants. These evolved proteins can have dozens of mutations dispersed across the structure while preserving sites essential for both catalysis and interactions. Remarkably, these variants retain family-like functionality while being more active than their wild-type predecessor. We found that depending on the inference method used to generate the epistatic constraints, different parameters simulate diverse selection strengths. Under weaker selection, local Hamiltonian fluctuations reliably predict relative changes to variant fitness, recapitulating neutral evolution. SEEC has the potential to explore the dynamics of neofunctionalization, characterize viral fitness landscapes, and facilitate vaccine development.
Collapse
Affiliation(s)
- Sophia Alvarez
- Department of Biological Sciences, University of Texas at Dallas, Richardson, TX75080
| | - Charisse M. Nartey
- Department of Biological Sciences, University of Texas at Dallas, Richardson, TX75080
| | - Nicholas Mercado
- Department of Biological Sciences, University of Texas at Dallas, Richardson, TX75080
| | | | - Tea Huseinbegovic
- Department of Biological Sciences, University of Texas at Dallas, Richardson, TX75080
| | - Faruck Morcos
- Department of Biological Sciences, University of Texas at Dallas, Richardson, TX75080
- Department of Bioengineering, University of Texas at Dallas, Richardson, TX75080
- Center for Systems Biology, University of Texas at Dallas, Richardson, TX75080
| |
Collapse
|
14
|
Ose NJ, Campitelli P, Modi T, Can Kazan I, Kumar S, Banu Ozkan S. Some mechanistic underpinnings of molecular adaptations of SARS-COV-2 spike protein by integrating candidate adaptive polymorphisms with protein dynamics. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.09.14.557827. [PMID: 37745560 PMCID: PMC10515954 DOI: 10.1101/2023.09.14.557827] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/26/2023]
Abstract
We integrate evolutionary predictions based on the neutral theory of molecular evolution with protein dynamics to generate mechanistic insight into the molecular adaptations of the SARS-COV-2 Spike (S) protein. With this approach, we first identified Candidate Adaptive Polymorphisms (CAPs) of the SARS-CoV-2 Spike protein and assessed the impact of these CAPs through dynamics analysis. Not only have we found that CAPs frequently overlap with well-known functional sites, but also, using several different dynamics-based metrics, we reveal the critical allosteric interplay between SARS-CoV-2 CAPs and the S protein binding sites with the human ACE2 (hACE2) protein. CAPs interact far differently with the hACE2 binding site residues in the open conformation of the S protein compared to the closed form. In particular, the CAP sites control the dynamics of binding residues in the open state, suggesting an allosteric control of hACE2 binding. We also explored the characteristic mutations of different SARS-CoV-2 strains to find dynamic hallmarks and potential effects of future mutations. Our analyses reveal that Delta strain-specific variants have non-additive (i.e., epistatic) interactions with CAP sites, whereas the less pathogenic Omicron strains have mostly additive mutations. Finally, our dynamics-based analysis suggests that the novel mutations observed in the Omicron strain epistatically interact with the CAP sites to help escape antibody binding.
Collapse
Affiliation(s)
- Nicholas J. Ose
- Department of Physics and Center for Biological Physics, Arizona State University, Tempe, Arizona, United States of America
| | - Paul Campitelli
- Department of Physics and Center for Biological Physics, Arizona State University, Tempe, Arizona, United States of America
| | - Tushar Modi
- Department of Physics and Center for Biological Physics, Arizona State University, Tempe, Arizona, United States of America
| | - I. Can Kazan
- Department of Physics and Center for Biological Physics, Arizona State University, Tempe, Arizona, United States of America
| | - Sudhir Kumar
- Institute for Genomics and Evolutionary Medicine, Temple University, Philadelphia, Pennsylvania, United States of America
- Department of Biology, Temple University, Philadelphia, Pennsylvania, United States of America
- Center for Genomic Medicine Research, King Abdulaziz University, Jeddah, Saudi Arabia
| | - S. Banu Ozkan
- Department of Physics and Center for Biological Physics, Arizona State University, Tempe, Arizona, United States of America
| |
Collapse
|
15
|
Hoffmann MD, Zdechlik AC, He Y, Nedrud D, Aslanidi G, Gordon W, Schmidt D. Multiparametric domain insertional profiling of adeno-associated virus VP1. Mol Ther Methods Clin Dev 2023; 31:101143. [PMID: 38027057 PMCID: PMC10661864 DOI: 10.1016/j.omtm.2023.101143] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2023] [Accepted: 10/21/2023] [Indexed: 12/01/2023]
Abstract
Several evolved properties of adeno-associated virus (AAV), such as broad tropism and immunogenicity in humans, are barriers to AAV-based gene therapy. Most efforts to re-engineer these properties have focused on variable regions near AAV's 3-fold protrusions and capsid protein termini. To comprehensively survey AAV capsids for engineerable hotspots, we determined multiple AAV fitness phenotypes upon insertion of six structured protein domains into the entire AAV-DJ capsid protein VP1. This is the largest and most comprehensive AAV domain insertion dataset to date. Our data revealed a surprising robustness of AAV capsids to accommodate large domain insertions. Insertion permissibility depended strongly on insertion position, domain type, and measured fitness phenotype, which clustered into contiguous structural units that we could link to distinct roles in AAV assembly, stability, and infectivity. We also identified engineerable hotspots of AAV that facilitate the covalent attachment of binding scaffolds, which may represent an alternative approach to re-direct AAV tropism.
Collapse
Affiliation(s)
- Mareike D. Hoffmann
- Department of Genetics, Cell Biology & Development, University of Minnesota, Minneapolis, MN 55455, USA
| | - Alina C. Zdechlik
- Department of Biochemistry, Molecular Biology & Biophysics, University of Minnesota, Minneapolis, MN 55455, USA
| | - Yungui He
- Department of Genetics, Cell Biology & Development, University of Minnesota, Minneapolis, MN 55455, USA
| | - David Nedrud
- Department of Biochemistry, Molecular Biology & Biophysics, University of Minnesota, Minneapolis, MN 55455, USA
| | | | - Wendy Gordon
- Department of Biochemistry, Molecular Biology & Biophysics, University of Minnesota, Minneapolis, MN 55455, USA
| | - Daniel Schmidt
- Department of Genetics, Cell Biology & Development, University of Minnesota, Minneapolis, MN 55455, USA
| |
Collapse
|
16
|
González LJ, Bahr G, González MM, Bonomo RA, Vila AJ. In-cell kinetic stability is an essential trait in metallo-β-lactamase evolution. Nat Chem Biol 2023; 19:1116-1126. [PMID: 37188957 DOI: 10.1038/s41589-023-01319-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2022] [Accepted: 03/21/2023] [Indexed: 05/17/2023]
Abstract
Protein stability is an essential property for biological function. In contrast to the vast knowledge on protein stability in vitro, little is known about the factors governing in-cell stability. Here we show that the metallo-β-lactamase (MBL) New Delhi MBL-1 (NDM-1) is a kinetically unstable protein on metal restriction that has evolved by acquiring different biochemical traits that optimize its in-cell stability. The nonmetalated (apo) NDM-1 is degraded by the periplasmic protease Prc that recognizes its partially unstructured C-terminal domain. Zn(II) binding renders the protein refractory to degradation by quenching the flexibility of this region. Membrane anchoring makes apo-NDM-1 less accessible to Prc and protects it from DegP, a cellular protease degrading misfolded, nonmetalated NDM-1 precursors. NDM variants accumulate substitutions at the C terminus that quench its flexibility, enhancing their kinetic stability and bypassing proteolysis. These observations link MBL-mediated resistance with the essential periplasmic metabolism, highlighting the importance of the cellular protein homeostasis.
Collapse
Affiliation(s)
- Lisandro J González
- Instituto de Biología Molecular y Celular de Rosario (IBR, CONICET-UNR), Rosario, Argentina
- Área Biofísica, Facultad de Ciencias Bioquímicas y Farmacéuticas, Universidad Nacional de Rosario, Rosario, Argentina
| | - Guillermo Bahr
- Instituto de Biología Molecular y Celular de Rosario (IBR, CONICET-UNR), Rosario, Argentina
- Área Biofísica, Facultad de Ciencias Bioquímicas y Farmacéuticas, Universidad Nacional de Rosario, Rosario, Argentina
| | - Mariano M González
- Instituto de Biología Molecular y Celular de Rosario (IBR, CONICET-UNR), Rosario, Argentina
- Área Biofísica, Facultad de Ciencias Bioquímicas y Farmacéuticas, Universidad Nacional de Rosario, Rosario, Argentina
| | - Robert A Bonomo
- Research Service, Veterans Affairs Northeast Ohio Healthcare System, Cleveland, OH, USA
- Department of Medicine, Case Western Reserve University School of Medicine, Cleveland, OH, USA
- Medical Service and GRECC, Veterans Affairs Northeast Ohio Healthcare System, Cleveland, OH, USA
- CWRU-Cleveland VAMC Center for Antimicrobial Resistance and Epidemiology (Case VA CARES), Cleveland, OH, USA
- Department of Pharmacology, Case Western Reserve University School of Medicine, Cleveland, OH, USA
- Department of Molecular Biology and Microbiology, Case Western Reserve University School of Medicine, Cleveland, OH, USA
- Department of Biochemistry, Case Western Reserve University School of Medicine, Cleveland, OH, USA
- Department of Proteomics and Bioinformatics, Case Western Reserve University School of Medicine, Cleveland, OH, USA
| | - Alejandro J Vila
- Instituto de Biología Molecular y Celular de Rosario (IBR, CONICET-UNR), Rosario, Argentina.
- Área Biofísica, Facultad de Ciencias Bioquímicas y Farmacéuticas, Universidad Nacional de Rosario, Rosario, Argentina.
- CWRU-Cleveland VAMC Center for Antimicrobial Resistance and Epidemiology (Case VA CARES), Cleveland, OH, USA.
| |
Collapse
|
17
|
Chen L, Zhang Z, Li Z, Li R, Huo R, Chen L, Wang D, Luo X, Chen K, Liao C, Zheng M. Learning protein fitness landscapes with deep mutational scanning data from multiple sources. Cell Syst 2023; 14:706-721.e5. [PMID: 37591206 DOI: 10.1016/j.cels.2023.07.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2023] [Revised: 05/30/2023] [Accepted: 07/18/2023] [Indexed: 08/19/2023]
Abstract
One of the key points of machine learning-assisted directed evolution (MLDE) is the accurate learning of the fitness landscape, a conceptual mapping from sequence variants to the desired function. Here, we describe a multi-protein training scheme that leverages the existing deep mutational scanning data from diverse proteins to aid in understanding the fitness landscape of a new protein. Proof-of-concept trials are designed to validate this training scheme in three aspects: random and positional extrapolation for single-variant effects, zero-shot fitness predictions for new proteins, and extrapolation for higher-order variant effects from single-variant effects. Moreover, our study identified previously overlooked strong baselines, and their unexpectedly good performance brings our attention to the pitfalls of MLDE. Overall, these results may improve our understanding of the association between different protein fitness profiles and shed light on developing better machine learning-assisted approaches to the directed evolution of proteins. A record of this paper's transparent peer review process is included in the supplemental information.
Collapse
Affiliation(s)
- Lin Chen
- Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai 201203, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Zehong Zhang
- Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai 201203, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Zhenghao Li
- Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai 201203, China; Shanghai Institute for Advanced Immunochemical Studies, School of Life Science and Technology, ShanghaiTech University, Shanghai 201210, China
| | - Rui Li
- Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai 201203, China; School of Pharmacy, China Pharmaceutical University, Nanjing 211198, China
| | - Ruifeng Huo
- School of Chinese Materia Medica, Nanjing University of Chinese Medicine, Nanjing 210023, China
| | - Lifan Chen
- Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai 201203, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | | | - Xiaomin Luo
- Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai 201203, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Kaixian Chen
- Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai 201203, China; University of Chinese Academy of Sciences, Beijing 100049, China; School of Pharmacy, China Pharmaceutical University, Nanjing 211198, China
| | - Cangsong Liao
- University of Chinese Academy of Sciences, Beijing 100049, China; Chemical Biology Research Center, Shanghai Institute of Materia Medica, Chinese Academy of Science, Shanghai 201203, China.
| | - Mingyue Zheng
- Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai 201203, China; University of Chinese Academy of Sciences, Beijing 100049, China; School of Pharmacy, China Pharmaceutical University, Nanjing 211198, China; School of Chinese Materia Medica, Nanjing University of Chinese Medicine, Nanjing 210023, China.
| |
Collapse
|
18
|
Haddox HK, Galloway JG, Dadonaite B, Bloom JD, Matsen IV FA, DeWitt WS. Jointly modeling deep mutational scans identifies shifted mutational effects among SARS-CoV-2 spike homologs. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.31.551037. [PMID: 37577604 PMCID: PMC10418112 DOI: 10.1101/2023.07.31.551037] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/15/2023]
Abstract
Deep mutational scanning (DMS) is a high-throughput experimental technique that measures the effects of thousands of mutations to a protein. These experiments can be performed on multiple homologs of a protein or on the same protein selected under multiple conditions. It is often of biological interest to identify mutations with shifted effects across homologs or conditions. However, it is challenging to determine if observed shifts arise from biological signal or experimental noise. Here, we describe a method for jointly inferring mutational effects across multiple DMS experiments while also identifying mutations that have shifted in their effects among experiments. A key aspect of our method is to regularize the inferred shifts, so that they are nonzero only when strongly supported by the data. We apply this method to DMS experiments that measure how mutations to spike proteins from SARS-CoV-2 variants (Delta, Omicron BA.1, and Omicron BA.2) affect cell entry. Most mutational effects are conserved between these spike homologs, but a fraction have markedly shifted. We experimentally validate a subset of the mutations inferred to have shifted effects, and confirm differences of > 1,000-fold in the impact of the same mutation on spike-mediated viral infection across spikes from different SARS-CoV-2 variants. Overall, our work establishes a general approach for comparing sets of DMS experiments to identify biologically important shifts in mutational effects.
Collapse
Affiliation(s)
- Hugh K. Haddox
- Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, WA 98102, USA
| | - Jared G. Galloway
- Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, WA 98102, USA
| | - Bernadeta Dadonaite
- Basic Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Jesse D. Bloom
- Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, WA 98102, USA
- Basic Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
- Howard Hughes Medical Institute, Seattle, WA 98109, USA
| | - Frederick A. Matsen IV
- Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, WA 98102, USA
- Basic Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
- Howard Hughes Medical Institute, Seattle, WA 98109, USA
- Department of Statistics, University of Washington, Seattle, WA 98195, USA
| | - William S. DeWitt
- Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, CA 94720, USA
| |
Collapse
|
19
|
Ose NJ, Campitelli P, Patel R, Kumar S, Ozkan SB. Protein dynamics provide mechanistic insights about epistasis among common missense polymorphisms. Biophys J 2023; 122:2938-2947. [PMID: 36726312 PMCID: PMC10398253 DOI: 10.1016/j.bpj.2023.01.037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2022] [Revised: 12/20/2022] [Accepted: 01/26/2023] [Indexed: 02/03/2023] Open
Abstract
Sequencing of the protein coding genome has revealed many different missense mutations of human proteins and different population frequencies of corresponding haplotypes, which consist of different sets of those mutations. Here, we present evidence for pairwise intramolecular epistasis (i.e., nonadditive interactions) between many such mutations through an analysis of protein dynamics. We suggest that functional compensation for conserving protein dynamics is a likely evolutionary mechanism that maintains high-frequency mutations that are individually nonneutral but epistatically compensating within proteins. This analysis is the first of its type to look at human proteins with specific high population frequency mutations and examine the relationship between mutations that make up that observed high-frequency protein haplotype. Importantly, protein dynamics revealed a separation between high and low frequency haplotypes within a target protein cytochrome P450 2A7, with the high-frequency haplotypes showing behavior closer to the wild-type protein. Common protein haplotypes containing two mutations display dynamic compensation in which one mutation can correct for the dynamic effects of the other. We also utilize a dynamics-based metric, EpiScore, that evaluates the epistatic interactions and allows us to see dynamic compensation within many other proteins.
Collapse
Affiliation(s)
- Nicholas J Ose
- Department of Physics and Center for Biological Physics, Arizona State University, Tempe, Arizona
| | - Paul Campitelli
- Department of Physics and Center for Biological Physics, Arizona State University, Tempe, Arizona
| | - Ravi Patel
- Institute for Genomics and Evolutionary Medicine, Temple University, Philadelphia, Pennsylvania; Department of Biology, Temple University, Philadelphia, Pennsylvania
| | - Sudhir Kumar
- Institute for Genomics and Evolutionary Medicine, Temple University, Philadelphia, Pennsylvania; Department of Biology, Temple University, Philadelphia, Pennsylvania; Center for Genomic Medicine Research, King Abdulaziz University, Jeddah, Saudi Arabia.
| | - S Banu Ozkan
- Department of Physics and Center for Biological Physics, Arizona State University, Tempe, Arizona.
| |
Collapse
|
20
|
Alvarez S, Nartey CM, Mercado N, de la Paz A, Huseinbegovic T, Morcos F. In vivo functional phenotypes from a computational epistatic model of evolution. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.24.542176. [PMID: 37292895 PMCID: PMC10245989 DOI: 10.1101/2023.05.24.542176] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Computational models of evolution are valuable for understanding the dynamics of sequence variation, to infer phylogenetic relationships or potential evolutionary pathways and for biomedical and industrial applications. Despite these benefits, few have validated their propensities to generate outputs with in vivo functionality, which would enhance their value as accurate and interpretable evolutionary algorithms. We demonstrate the power of epistasis inferred from natural protein families to evolve sequence variants in an algorithm we developed called Sequence Evolution with Epistatic Contributions. Utilizing the Hamiltonian of the joint probability of sequences in the family as fitness metric, we sampled and experimentally tested for in vivo β -lactamase activity in E. coli TEM-1 variants. These evolved proteins can have dozens of mutations dispersed across the structure while preserving sites essential for both catalysis and interactions. Remarkably, these variants retain family-like functionality while being more active than their WT predecessor. We found that depending on the inference method used to generate the epistatic constraints, different parameters simulate diverse selection strengths. Under weaker selection, local Hamiltonian fluctuations reliably predict relative changes to variant fitness, recapitulating neutral evolution. SEEC has the potential to explore the dynamics of neofunctionalization, characterize viral fitness landscapes and facilitate vaccine development.
Collapse
Affiliation(s)
- Sophia Alvarez
- Department of Biological Sciences, University of Texas at Dallas, Richardson, TX 75080, USA
| | - Charisse M. Nartey
- Department of Biological Sciences, University of Texas at Dallas, Richardson, TX 75080, USA
| | - Nicholas Mercado
- Department of Biological Sciences, University of Texas at Dallas, Richardson, TX 75080, USA
| | - Alberto de la Paz
- Department of Biological Sciences, University of Texas at Dallas, Richardson, TX 75080, USA
| | - Tea Huseinbegovic
- School of Natural Sciences and Mathematics, University of Texas at Dallas, Richardson, TX 75080, USA
| | - Faruck Morcos
- Department of Biological Sciences, University of Texas at Dallas, Richardson, TX 75080, USA
- Department of Bioengineering, University of Texas at Dallas, Richardson, TX 75080, USA
- Center for Systems Biology, University of Texas at Dallas, Richardson, TX 75080, USA
| |
Collapse
|
21
|
Weinstein JY, Martí-Gómez C, Lipsh-Sokolik R, Hoch SY, Liebermann D, Nevo R, Weissman H, Petrovich-Kopitman E, Margulies D, Ivankov D, McCandlish DM, Fleishman SJ. Designed active-site library reveals thousands of functional GFP variants. Nat Commun 2023; 14:2890. [PMID: 37210560 PMCID: PMC10199939 DOI: 10.1038/s41467-023-38099-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2022] [Accepted: 04/13/2023] [Indexed: 05/22/2023] Open
Abstract
Mutations in a protein active site can lead to dramatic and useful changes in protein activity. The active site, however, is sensitive to mutations due to a high density of molecular interactions, substantially reducing the likelihood of obtaining functional multipoint mutants. We introduce an atomistic and machine-learning-based approach, called high-throughput Functional Libraries (htFuncLib), that designs a sequence space in which mutations form low-energy combinations that mitigate the risk of incompatible interactions. We apply htFuncLib to the GFP chromophore-binding pocket, and, using fluorescence readout, recover >16,000 unique designs encoding as many as eight active-site mutations. Many designs exhibit substantial and useful diversity in functional thermostability (up to 96 °C), fluorescence lifetime, and quantum yield. By eliminating incompatible active-site mutations, htFuncLib generates a large diversity of functional sequences. We envision that htFuncLib will be used in one-shot optimization of activity in enzymes, binders, and other proteins.
Collapse
Affiliation(s)
| | - Carlos Martí-Gómez
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA
| | - Rosalie Lipsh-Sokolik
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, 7610001, Israel
| | - Shlomo Yakir Hoch
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, 7610001, Israel
| | - Demian Liebermann
- Department of Chemical and Biological Physics, Weizmann Institute of Science, Rehovot, 7610001, Israel
| | - Reinat Nevo
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, 7610001, Israel
| | - Haim Weissman
- Department of Molecular Chemistry and Materials Science, Weizmann Institute of Science, Rehovot, 7610001, Israel
| | | | - David Margulies
- Department of Chemical and Structural Biology, Weizmann Institute of Science, Rehovot, 7610001, Israel
| | - Dmitry Ivankov
- Center of Life Sciences, Skolkovo Institute of Science and Technology, Moscow, Russia
| | - David M McCandlish
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA
| | - Sarel J Fleishman
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, 7610001, Israel.
| |
Collapse
|
22
|
Fram B, Truebridge I, Su Y, Riesselman AJ, Ingraham JB, Passera A, Napier E, Thadani NN, Lim S, Roberts K, Kaur G, Stiffler M, Marks DS, Bahl CD, Khan AR, Sander C, Gauthier NP. Simultaneous enhancement of multiple functional properties using evolution-informed protein design. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.09.539914. [PMID: 37214973 PMCID: PMC10197589 DOI: 10.1101/2023.05.09.539914] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Designing optimized proteins is important for a range of practical applications. Protein design is a rapidly developing field that would benefit from approaches that enable many changes in the amino acid primary sequence, rather than a small number of mutations, while maintaining structure and enhancing function. Homologous protein sequences contain extensive information about various protein properties and activities that have emerged over billions of years of evolution. Evolutionary models of sequence co-variation, derived from a set of homologous sequences, have proven effective in a range of applications including structure determination and mutation effect prediction. In this work we apply one of these models (EVcouplings) to computationally design highly divergent variants of the model protein TEM-1 β-lactamase, and characterize these designs experimentally using multiple biochemical and biophysical assays. Nearly all designed variants were functional, including one with 84 mutations from the nearest natural homolog. Surprisingly, all functional designs had large increases in thermostability and most had a broadening of available substrates. These property enhancements occurred while maintaining a nearly identical structure to the wild type enzyme. Collectively, this work demonstrates that evolutionary models of sequence co-variation (1) are able to capture complex epistatic interactions that successfully guide large sequence departures from natural contexts, and (2) can be applied to generate functional diversity useful for many applications in protein design.
Collapse
Affiliation(s)
- Benjamin Fram
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA
| | - Ian Truebridge
- Institute for Protein Innovation, Boston, Massachusetts, Boston, MA, USA
- Division of Hematology/Oncology, Boston Children’s Hospital, Harvard Medical School; Boston, MA, USA
- current address: AI Proteins; Boston, MA, USA
| | - Yang Su
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA
| | - Adam J. Riesselman
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA
- Program in Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - John B. Ingraham
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA
| | - Alessandro Passera
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA
- current address: Research Institute of Molecular Pathology (IMP), Vienna BioCenter (VBC), Campus-Vienna-Biocenter 1, 1030 Vienna, Austria
| | - Eve Napier
- School of Biochemistry and Immunology, Trinity College Dublin, Dublin 2, Ireland
| | - Nicole N. Thadani
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA
| | - Samuel Lim
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA
| | - Kristen Roberts
- Selux Diagnostics, Inc., 56 Roland Street, Charlestown, MA, USA
| | - Gurleen Kaur
- Selux Diagnostics, Inc., 56 Roland Street, Charlestown, MA, USA
| | - Michael Stiffler
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Debora S. Marks
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA
| | - Christopher D. Bahl
- Institute for Protein Innovation, Boston, Massachusetts, Boston, MA, USA
- Division of Hematology/Oncology, Boston Children’s Hospital, Harvard Medical School; Boston, MA, USA
- current address: AI Proteins; Boston, MA, USA
| | - Amir R. Khan
- School of Biochemistry and Immunology, Trinity College Dublin, Dublin 2, Ireland
- Division of Newborn Medicine, Boston Children’s Hospital, Boston, MA, USA
| | - Chris Sander
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA
| | - Nicholas P. Gauthier
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA
| |
Collapse
|
23
|
Hoffmann MD, Zdechlik AC, He Y, Nedrud D, Aslanidi G, Gordon W, Schmidt D. Multiparametric domain insertional profiling of Adeno-Associated Virus VP1. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.04.19.537549. [PMID: 37131661 PMCID: PMC10153220 DOI: 10.1101/2023.04.19.537549] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Evolved properties of Adeno-Associated Virus (AAV), such as broad tropism and immunogenicity in humans, are barriers to AAV-based gene therapy. Previous efforts to re-engineer these properties have focused on variable regions near AAV’s 3-fold protrusions and capsid protein termini. To comprehensively survey AAV capsids for engineerable hotspots, we determined multiple AAV fitness phenotypes upon insertion of large, structured protein domains into the entire AAV-DJ capsid protein VP1. This is the largest and most comprehensive AAV domain insertion dataset to date. Our data revealed a surprising robustness of AAV capsids to accommodate large domain insertions. There was strong positional, domain-type, and fitness phenotype dependence of insertion permissibility, which clustered into correlated structural units that we could link to distinct roles in AAV assembly, stability, and infectivity. We also identified new engineerable hotspots of AAV that facilitate the covalent attachment of binding scaffolds, which may represent an alternative approach to re-direct AAV tropism.
Collapse
|
24
|
Qiao J, Sheng Y, Wang M, Li A, Li X, Huang H. Evolving Robust and Interpretable Enzymes for the Bioethanol Industry. Angew Chem Int Ed Engl 2023; 62:e202300320. [PMID: 36701239 DOI: 10.1002/anie.202300320] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2023] [Revised: 01/19/2023] [Accepted: 01/26/2023] [Indexed: 01/27/2023]
Abstract
Obtaining a robust and applicable enzyme for bioethanol production is a dream for biorefinery engineers. Herein, we describe a general method to evolve an all-round and interpretable enzyme that can be directly employed in the bioethanol industry. By integrating the transferable protein evolution strategy InSiReP 2.0 (In Silico guided Recombination Process), enzymatic characterization for actual production, and computational molecular understanding, the model cellulase PvCel5A (endoglucanase II Cel5A from Penicillium verruculosum) was successfully evolved to overcome the remaining challenges of low ethanol and temperature tolerance, which primarily limited biomass transformation and bioethanol yield. Remarkably, application of the PvCel5A variants in both first- and second-generation bioethanol production processes (i. Conventional corn ethanol fermentation combined with the in situ pretreatment process; ii. cellulosic ethanol fermentation process) resulted in a 5.7-10.1 % increase in the ethanol yield, which was unlikely to be achieved by other optimization techniques.
Collapse
Affiliation(s)
- Jie Qiao
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, No. 2 Xuelin Road, Nanjing, 210097, China
| | - Yijie Sheng
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, No. 2 Xuelin Road, Nanjing, 210097, China
| | - Minghui Wang
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, No. 2 Xuelin Road, Nanjing, 210097, China
| | - Anni Li
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, No. 2 Xuelin Road, Nanjing, 210097, China
| | - Xiujuan Li
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, No. 2 Xuelin Road, Nanjing, 210097, China
| | - He Huang
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, No. 2 Xuelin Road, Nanjing, 210097, China.,School of Pharmaceutical Science, Nanjing Tech University, Nanjing, 211816, China
| |
Collapse
|
25
|
do Nascimento MA, Leão RA, Froidevaux R, Wojcieszak R, de Souza ROA, Itabaiana I. A new approach for the direct acylation of bio-oil enriched with levoglucosan: kinetic study and lipase thermostability. Biochem Eng J 2023. [DOI: 10.1016/j.bej.2023.108915] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/31/2023]
|
26
|
Inferring protein fitness landscapes from laboratory evolution experiments. PLoS Comput Biol 2023; 19:e1010956. [PMID: 36857380 PMCID: PMC10010530 DOI: 10.1371/journal.pcbi.1010956] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2022] [Revised: 03/13/2023] [Accepted: 02/16/2023] [Indexed: 03/02/2023] Open
Abstract
Directed laboratory evolution applies iterative rounds of mutation and selection to explore the protein fitness landscape and provides rich information regarding the underlying relationships between protein sequence, structure, and function. Laboratory evolution data consist of protein sequences sampled from evolving populations over multiple generations and this data type does not fit into established supervised and unsupervised machine learning approaches. We develop a statistical learning framework that models the evolutionary process and can infer the protein fitness landscape from multiple snapshots along an evolutionary trajectory. We apply our modeling approach to dihydrofolate reductase (DHFR) laboratory evolution data and the resulting landscape parameters capture important aspects of DHFR structure and function. We use the resulting model to understand the structure of the fitness landscape and find numerous examples of epistasis but an overall global peak that is evolutionarily accessible from most starting sequences. Finally, we use the model to perform an in silico extrapolation of the DHFR laboratory evolution trajectory and computationally design proteins from future evolutionary rounds.
Collapse
|
27
|
Densi A, Iyer RS, Bhat PJ. Synonymous and Nonsynonymous Substitutions in Dictyostelium discoideum Ammonium Transporter amtA Are Necessary for Functional Complementation in Saccharomyces cerevisiae. Microbiol Spectr 2023; 11:e0384722. [PMID: 36840598 PMCID: PMC10100761 DOI: 10.1128/spectrum.03847-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2022] [Accepted: 01/24/2023] [Indexed: 02/24/2023] Open
Abstract
Ammonium transporters are present in all three domains of life. They have undergone extensive horizontal gene transfer (HGT), gene duplication, and functional diversification and therefore offer an excellent paradigm to study protein evolution. We attempted to complement a mep1Δmep2Δmep3Δ strain of Saccharomyces cerevisiae (triple-deletion strain), which otherwise cannot grow on ammonium as a sole nitrogen source at concentrations of <3 mM, with amtA of Dictyostelium discoideum, an orthologue of S. cerevisiae MEP2. We observed that amtA did not complement the triple-deletion strain of S. cerevisiae for growth on low-ammonium medium. We isolated two mutant derivatives of amtA (amtA M1 and amtA M2) from a PCR-generated mutant plasmid library that complemented the triple-deletion strain of S. cerevisiae. amtA M1 bears three nonsynonymous and two synonymous substitutions, which are necessary for its functionality. amtA M2 bears two nonsynonymous substitutions and one synonymous substitution, all of which are necessary for functionality. Interestingly, AmtA M1 transports ammonium but does not confer methylamine toxicity, while AmtA M2 transports ammonium and confers methylamine toxicity, demonstrating functional diversification. Preliminary biochemical analyses indicated that the mutants differ in their conformations as well as their mechanisms of ammonium transport. These intriguing results clearly point out that protein evolution cannot be fathomed by studying nonsynonymous and synonymous substitutions in isolation. The above-described observations have significant implications for various facets of biological processes and are discussed in detail. IMPORTANCE Functional diversification following gene duplication is one of the major driving forces of protein evolution. While the role of nonsynonymous substitutions in the functional diversification of proteins is well recognized, knowledge of the role of synonymous substitutions in protein evolution is in its infancy. Using functional complementation, we isolated two functional alleles of the D. discoideum ammonium transporter gene (amtA), which otherwise does not function in S. cerevisiae as an ammonium transporters. One of them is an ammonium transporter, while the other is an ammonium transporter that also confers methylammonium (ammonium analogue) toxicity, suggesting functional diversification. Surprisingly, both alleles require a combination of synonymous and nonsynonymous substitutions for their functionality. These results bring out a hitherto-unknown pathway of protein evolution and pave the way for not only understanding protein evolution but also interpreting single nucleotide polymorphisms (SNPs).
Collapse
Affiliation(s)
- Asha Densi
- Department of Biosciences and Bioengineering, Indian Institute of Technology Bombay, Mumbai, India
| | - Revathi S. Iyer
- Department of Biosciences and Bioengineering, Indian Institute of Technology Bombay, Mumbai, India
| | - Paike Jayadeva Bhat
- Department of Biosciences and Bioengineering, Indian Institute of Technology Bombay, Mumbai, India
| |
Collapse
|
28
|
Keskin Karakoyun H, Yüksel ŞK, Amanoglu I, Naserikhojasteh L, Yeşilyurt A, Yakıcıer C, Timuçin E, Akyerli CB. Evaluation of AlphaFold structure-based protein stability prediction on missense variations in cancer. Front Genet 2023; 14:1052383. [PMID: 36896237 PMCID: PMC9988940 DOI: 10.3389/fgene.2023.1052383] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Accepted: 02/08/2023] [Indexed: 02/23/2023] Open
Abstract
Identifying pathogenic missense variants in hereditary cancer is critical to the efforts of patient surveillance and risk-reduction strategies. For this purpose, many different gene panels consisting of different number and/or set of genes are available and we are particularly interested in a panel of 26 genes with a varying degree of hereditary cancer risk consisting of ABRAXAS1, ATM, BARD1, BLM, BRCA1, BRCA2, BRIP1, CDH1, CHEK2, EPCAM, MEN1, MLH1, MRE11, MSH2, MSH6, MUTYH, NBN, PALB2, PMS2, PTEN, RAD50, RAD51C, RAD51D, STK11, TP53, and XRCC2. In this study, we have compiled a collection of the missense variations reported in any of these 26 genes. More than a thousand missense variants were collected from ClinVar and the targeted screen of a breast cancer cohort of 355 patients which contributed to this set with 160 novel missense variations. We analyzed the impact of the missense variations on protein stability by five different predictors including both sequence- (SAAF2EC and MUpro) and structure-based (Maestro, mCSM, CUPSAT) predictors. For the structure-based tools, we have utilized the AlphaFold (AF2) protein structures which comprise the first structural analysis of this hereditary cancer proteins. Our results agreed with the recent benchmarks that computed the power of stability predictors in discriminating the pathogenic variants. Overall, we reported a low-to-medium-level performance for the stability predictors in discriminating pathogenic variants, except MUpro which had an AUROC of 0.534 (95% CI [0.499-0.570]). The AUROC values ranged between 0.614-0.719 for the total set and 0.596-0.682 for the set with high AF2 confidence regions. Furthermore, our findings revealed that the confidence score for a given variant in the AF2 structure could alone predict pathogenicity more robustly than any of the tested stability predictors with an AUROC of 0.852. Altogether, this study represents the first structural analysis of the 26 hereditary cancer genes underscoring 1) the thermodynamic stability predicted from AF2 structures as a moderate and 2) the confidence score of AF2 as a strong descriptor for variant pathogenicity.
Collapse
Affiliation(s)
- Hilal Keskin Karakoyun
- Department of Biochemistry and Molecular Biology, Institute of Health Sciences, Acibadem Mehmet Ali Aydinlar University, Istanbul, Türkiye
| | - Şirin K. Yüksel
- Department of Biochemistry and Molecular Biology, Institute of Health Sciences, Acibadem Mehmet Ali Aydinlar University, Istanbul, Türkiye
| | - Ilayda Amanoglu
- Department of Biostatistics and Bioinformatics, Institute of Health Sciences, Acibadem Mehmet Ali Aydinlar University, Istanbul, Türkiye
| | - Lara Naserikhojasteh
- Department of Biostatistics and Bioinformatics, Institute of Health Sciences, Acibadem Mehmet Ali Aydinlar University, Istanbul, Türkiye
| | - Ahmet Yeşilyurt
- Acibadem Labgen Genetic Diagnosis Centre, Acibadem Health Group, Istanbul, Türkiye
| | - Cengiz Yakıcıer
- Acibadem Pathology Laboratories, Acibadem Health Group, Istanbul, Türkiye
| | - Emel Timuçin
- Department of Biostatistics and Medical Informatics, School of Medicine, Acibadem Mehmet Ali Aydinlar University, Istanbul, Türkiye
| | - Cemaliye B. Akyerli
- Department of Medical Biology, School of Medicine, Acibadem Mehmet Ali Aydinlar University, Istanbul, Türkiye
| |
Collapse
|
29
|
Alejaldre L, Lemay-St-Denis C, Pelletier JN, Quaglia D. Tuning Selectivity in CalA Lipase: Beyond Tunnel Engineering. Biochemistry 2023; 62:396-409. [PMID: 36580299 PMCID: PMC9851156 DOI: 10.1021/acs.biochem.2c00513] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2022] [Revised: 12/15/2022] [Indexed: 12/30/2022]
Abstract
Engineering studies of Candida (Pseudozyma) antarctica lipase A (CalA) have demonstrated the potential of this enzyme in the selective hydrolysis of fatty acid esters of different chain lengths. CalA has been shown to bind substrates preferentially through an acyl-chain binding tunnel accessed via the hydrolytic active site; it has also been shown that selectivity for substrates of longer or shorter chain length can be tuned, for instance by modulating steric hindrance within the tunnel. Here we demonstrate that, whereas the tunnel region is certainly of paramount importance for substrate recognition, residues in distal regions of the enzyme can also modulate substrate selectivity. To this end, we investigate variants that carry one or more substitutions within the substrate tunnel as well as in distal regions. Combining experimental determination of the substrate selectivity using natural and synthetic substrates with computational characterization of protein dynamics and of tunnels, we deconvolute the effect of key substitutions and demonstrate that epistatic interactions contribute to procuring selectivity toward either long-chain or short/medium-chain fatty acid esters. We demonstrate that various mechanisms contribute to the diverse selectivity profiles, ranging from reshaping tunnel morphology and tunnel stabilization to obstructing the main substrate-binding tunnel, highlighting the dynamic nature of the substrate-binding region. This work provides important insights into the versatility of this robust lipase toward diverse applications.
Collapse
Affiliation(s)
- Lorea Alejaldre
- PROTEO,
The Québec Network for Research on Protein, Function, Engineering
and Applications, https://proteo.ca/en/
- CGCC, Center
in Green Chemistry and Catalysis, Montréal, QC, CanadaG1V 0A6
- Department
of Biochemistry and Molecular Medicine, Université de Montréal, Montréal, QC, CanadaH3T 1J4
| | - Claudèle Lemay-St-Denis
- PROTEO,
The Québec Network for Research on Protein, Function, Engineering
and Applications, https://proteo.ca/en/
- CGCC, Center
in Green Chemistry and Catalysis, Montréal, QC, CanadaG1V 0A6
- Department
of Biochemistry and Molecular Medicine, Université de Montréal, Montréal, QC, CanadaH3T 1J4
| | - Joelle N. Pelletier
- PROTEO,
The Québec Network for Research on Protein, Function, Engineering
and Applications, https://proteo.ca/en/
- CGCC, Center
in Green Chemistry and Catalysis, Montréal, QC, CanadaG1V 0A6
- Department
of Biochemistry and Molecular Medicine, Université de Montréal, Montréal, QC, CanadaH3T 1J4
- Department
of Chemistry, Université de Montréal, Montréal, QC, CanadaH2V 0B3
| | - Daniela Quaglia
- PROTEO,
The Québec Network for Research on Protein, Function, Engineering
and Applications, https://proteo.ca/en/
- CGCC, Center
in Green Chemistry and Catalysis, Montréal, QC, CanadaG1V 0A6
- Department
of Chemistry, Université de Montréal, Montréal, QC, CanadaH2V 0B3
- Department
of Chemistry, Carleton University, Ottawa, ON, CanadaK1S 5B6
| |
Collapse
|
30
|
Clifton BE, Kozome D, Laurino P. Efficient Exploration of Sequence Space by Sequence-Guided Protein Engineering and Design. Biochemistry 2023; 62:210-220. [PMID: 35245020 DOI: 10.1021/acs.biochem.1c00757] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
The rapid growth of sequence databases over the past two decades means that protein engineers faced with optimizing a protein for any given task will often have immediate access to a vast number of related protein sequences. These sequences encode information about the evolutionary history of the protein and the underlying sequence requirements to produce folded, stable, and functional protein variants. Methods that can take advantage of this information are an increasingly important part of the protein engineering tool kit. In this Perspective, we discuss the utility of sequence data in protein engineering and design, focusing on recent advances in three main areas: the use of ancestral sequence reconstruction as an engineering tool to generate thermostable and multifunctional proteins, the use of sequence data to guide engineering of multipoint mutants by structure-based computational protein design, and the use of unlabeled sequence data for unsupervised and semisupervised machine learning, allowing the generation of diverse and functional protein sequences in unexplored regions of sequence space. Altogether, these methods enable the rapid exploration of sequence space within regions enriched with functional proteins and therefore have great potential for accelerating the engineering of stable, functional, and diverse proteins for industrial and biomedical applications.
Collapse
Affiliation(s)
- Ben E Clifton
- Protein Engineering and Evolution Unit, Okinawa Institute of Science and Technology, 1919-1 Tancha, Onna, Okinawa 904-0495, Japan
| | - Dan Kozome
- Protein Engineering and Evolution Unit, Okinawa Institute of Science and Technology, 1919-1 Tancha, Onna, Okinawa 904-0495, Japan
| | - Paola Laurino
- Protein Engineering and Evolution Unit, Okinawa Institute of Science and Technology, 1919-1 Tancha, Onna, Okinawa 904-0495, Japan
| |
Collapse
|
31
|
Pak MA, Markhieva KA, Novikova MS, Petrov DS, Vorobyev IS, Maksimova ES, Kondrashov FA, Ivankov DN. Using AlphaFold to predict the impact of single mutations on protein stability and function. PLoS One 2023; 18:e0282689. [PMID: 36928239 PMCID: PMC10019719 DOI: 10.1371/journal.pone.0282689] [Citation(s) in RCA: 72] [Impact Index Per Article: 72.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2021] [Accepted: 02/21/2023] [Indexed: 03/17/2023] Open
Abstract
AlphaFold changed the field of structural biology by achieving three-dimensional (3D) structure prediction from protein sequence at experimental quality. The astounding success even led to claims that the protein folding problem is "solved". However, protein folding problem is more than just structure prediction from sequence. Presently, it is unknown if the AlphaFold-triggered revolution could help to solve other problems related to protein folding. Here we assay the ability of AlphaFold to predict the impact of single mutations on protein stability (ΔΔG) and function. To study the question we extracted the pLDDT and <pLDDT> metrics from AlphaFold predictions before and after single mutation in a protein and correlated the predicted change with the experimentally known ΔΔG values. Additionally, we correlated the same AlphaFold pLDDT metrics with the impact of a single mutation on structure using a large scale dataset of single mutations in GFP with the experimentally assayed levels of fluorescence. We found a very weak or no correlation between AlphaFold output metrics and change of protein stability or fluorescence. Our results imply that AlphaFold may not be immediately applied to other problems or applications in protein folding.
Collapse
Affiliation(s)
- Marina A. Pak
- Center of Life Sciences, Skolkovo Institute of Science and Technology, Moscow, Russia
| | | | - Mariia S. Novikova
- Armand Hammer United World College of the American West, Montezuma, New Mexico, United Stated of America
| | - Dmitry S. Petrov
- Specialized Educational and Scientific Center of UrFU (SUNC UrFU), Ekaterinburg, Russia
| | - Ilya S. Vorobyev
- Center of Life Sciences, Skolkovo Institute of Science and Technology, Moscow, Russia
| | | | - Fyodor A. Kondrashov
- Institute of Science and Technology Austria, Maria Gugging, Austria
- Evolutionary and Synthetic Biology Unit, Okinawa Institute of Science and Technology Graduate University, Onna, Okinawa, Japan
| | - Dmitry N. Ivankov
- Center of Life Sciences, Skolkovo Institute of Science and Technology, Moscow, Russia
- * E-mail:
| |
Collapse
|
32
|
Carpentier M, Chomilier J. Analyses of Mutation Displacements from Homology Models. Methods Mol Biol 2023; 2627:195-210. [PMID: 36959449 DOI: 10.1007/978-1-0716-2974-1_11] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/25/2023]
Abstract
Evaluation of the structural perturbations introduced by a single amino acid mutation is the main issue for protein structural biology. We propose here to present some recent advances in methods, allowing the splitting of distortion between the actual substitution effect and the contribution of the local flexibility of the position where the mutation occurs. Its main drawback is the need of many structures with a single mutation in each of them. To bypass this difficulty, we propose to use molecular modeling tools, with several software enabling us to build a model from a template, given the sequence. As a proof of concept, we rely on a gold standard, the human lysozyme. Both wild-type and three mutant structures are available in the PDB. Two of these mutations result in amyloid fibril formation, and the last one is neutral. As a conclusion, irrespective of the algorithm used for modeling, side chain conformations at the site of mutation are reliable, although long-range effects are out of reach of these tools.
Collapse
Affiliation(s)
- Mathilde Carpentier
- Institut Systématique Evolution Biodiversité (ISYEB), Sorbonne Université, MNHN, CNRS, EPHE, Paris, France.
| | - Jacques Chomilier
- Sorbonne Université, BiBiP, IMPMC, UMR 7590, CNRS, MNHN, Paris, France
| |
Collapse
|
33
|
Standley M, Blay V, Beleva Guthrie V, Kim J, Lyman A, Moya A, Karchin R, Camps M. Experimental and In Silico Analysis of TEM β-Lactamase Adaptive Evolution. ACS Infect Dis 2022; 8:2451-2463. [PMID: 36377311 PMCID: PMC9745794 DOI: 10.1021/acsinfecdis.2c00216] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Multiple mutations often have non-additive (epistatic) phenotypic effects. Epistasis is of fundamental biological relevance but is not well understood mechanistically. Adaptive evolution, i.e., the evolution of new biochemical activities, is rich in epistatic interactions. To better understand the principles underlying epistasis during genetic adaptation, we studied the evolution of TEM-1 β-lactamase variants exhibiting cefotaxime resistance. We report the collection of a library of 487 observed evolutionary trajectories for TEM-1 and determine the epistasis status based on cefotaxime resistance phenotype for 206 combinations of 2-3 TEM-1 mutations involving 17 positions under adaptive selective pressure. Gain-of-function (GOF) mutations are gatekeepers for adaptation. To see if GOF phenotypes can be inferred based solely on sequence data, we calculated the enrichment of GOF mutations in the different categories of epistatic pairs. Our results suggest that this is possible because GOF mutations are particularly enriched in sign and reciprocal sign epistasis, which leave a major imprint on the sequence space accessible to evolution. We also used FoldX to explore the relationship between thermodynamic stability and epistasis. We found that mutations in observed evolutionary trajectories tend to destabilize the folded structure of the protein, albeit their cumulative effects are consistently below the protein's free energy of folding. The destabilizing effect is stronger for epistatic pairs, suggesting that modest or local alterations in folding stability can modulate catalysis. Finally, we report a significant relationship between epistasis and the degree to which two protein positions are structurally and dynamically coupled, even in the absence of ligand.
Collapse
Affiliation(s)
- Melissa Standley
- Department
of Microbiology and Environmental Toxicology, University of California, Santa
Cruz, California95064, United States
| | - Vincent Blay
- Department
of Microbiology and Environmental Toxicology, University of California, Santa
Cruz, California95064, United States,Institute
for Integrative Systems Biology (I2Sysbio), Universitat de València and Spanish Research Council (CSIC), 46980Valencia, Spain,
| | - Violeta Beleva Guthrie
- Department
of Biomedical Engineering and Institute for Computational Medicine, The Johns Hopkins University, Baltimore, Maryland21218, United States
| | - Jay Kim
- Department
of Microbiology and Environmental Toxicology, University of California, Santa
Cruz, California95064, United States
| | - Audrey Lyman
- Department
of Microbiology and Environmental Toxicology, University of California, Santa
Cruz, California95064, United States
| | - Andrés Moya
- Institute
for Integrative Systems Biology (I2Sysbio), Universitat de València and Spanish Research Council (CSIC), 46980Valencia, Spain,Foundation
for the Promotion of Sanitary and Biomedical Research of Valencia
Region (FISABIO), 46021Valencia, Spain,CIBER
in Epidemiology and Public Health (CIBEResp), 28029Madrid, Spain
| | - Rachel Karchin
- Department
of Biomedical Engineering and Institute for Computational Medicine, The Johns Hopkins University, Baltimore, Maryland21218, United States
| | - Manel Camps
- Department
of Microbiology and Environmental Toxicology, University of California, Santa
Cruz, California95064, United States,
| |
Collapse
|
34
|
Iyengar BR, Wagner A. Bacterial Hsp90 predominantly buffers but does not potentiate the phenotypic effects of deleterious mutations during fluorescent protein evolution. Genetics 2022; 222:iyac154. [PMID: 36227141 PMCID: PMC9713429 DOI: 10.1093/genetics/iyac154] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2022] [Accepted: 09/26/2022] [Indexed: 12/13/2022] Open
Abstract
Chaperones facilitate the folding of other ("client") proteins and can thus affect the adaptive evolution of these clients. Specifically, chaperones affect the phenotype of proteins via two opposing mechanisms. On the one hand, they can buffer the effects of mutations in proteins and thus help preserve an ancestral, premutation phenotype. On the other hand, they can potentiate the effects of mutations and thus enhance the phenotypic changes caused by a mutation. We study that how the bacterial Hsp90 chaperone (HtpG) affects the evolution of green fluorescent protein. To this end, we performed directed evolution of green fluorescent protein under low and high cellular concentrations of Hsp90. Specifically, we evolved green fluorescent protein under both stabilizing selection for its ancestral (green) phenotype and directional selection toward a new (cyan) phenotype. While Hsp90 did only affect the rate of adaptive evolution transiently, it did affect the phenotypic effects of mutations that occurred during adaptive evolution. Specifically, Hsp90 allowed strongly deleterious mutations to accumulate in evolving populations by buffering their effects. Our observations show that the role of a chaperone for adaptive evolution depends on the organism and the trait being studied.
Collapse
Affiliation(s)
- Bharat Ravi Iyengar
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, 8057 Zurich, Switzerland
- Swiss Institute of Bioinformatics, Quartier Sorge-Batiment Genopode, 1015 Lausanne, Switzerland
- Institute for Evolution and Biodiversity, Westfalian Wilhelms—University of Münster, 48149 Münster, Germany
| | - Andreas Wagner
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, 8057 Zurich, Switzerland
- Swiss Institute of Bioinformatics, Quartier Sorge-Batiment Genopode, 1015 Lausanne, Switzerland
- The Santa Fe Institute, Santa Fe, NM 87501, USA
- Stellenbosch Institute for Advanced Study (STIAS), Wallenberg Research Centre at Stellenbosch University, 7600 Stellenbosch, South Africa
| |
Collapse
|
35
|
Colque CA, albarracín Orio AG, Tomatis PE, Dotta G, Moreno DM, Hedemann LG, Hickman RA, Sommer LM, Feliziani S, Moyano AJ, Bonomo RA, K. Johansen H, Molin S, Vila AJ, Smania AM. Longitudinal Evolution of the Pseudomonas-Derived Cephalosporinase (PDC) Structure and Activity in a Cystic Fibrosis Patient Treated with β-Lactams. mBio 2022; 13:e0166322. [PMID: 36073814 PMCID: PMC9600753 DOI: 10.1128/mbio.01663-22] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2022] [Accepted: 07/11/2022] [Indexed: 11/24/2022] Open
Abstract
Traditional studies on the evolution of antibiotic resistance development use approaches that can range from laboratory-based experimental studies, to epidemiological surveillance, to sequencing of clinical isolates. However, evolutionary trajectories also depend on the environment in which selection takes place, compelling the need to more deeply investigate the impact of environmental complexities and their dynamics over time. Herein, we explored the within-patient adaptive long-term evolution of a Pseudomonas aeruginosa hypermutator lineage in the airways of a cystic fibrosis (CF) patient by performing a chronological tracking of mutations that occurred in different subpopulations; our results demonstrated parallel evolution events in the chromosomally encoded class C β-lactamase (blaPDC). These multiple mutations within blaPDC shaped diverse coexisting alleles, whose frequency dynamics responded to the changing antibiotic selective pressures for more than 26 years of chronic infection. Importantly, the combination of the cumulative mutations in blaPDC provided structural and functional protein changes that resulted in a continuous enhancement of its catalytic efficiency and high level of cephalosporin resistance. This evolution was linked to the persistent treatment with ceftazidime, which we demonstrated selected for variants with robust catalytic activity against this expanded-spectrum cephalosporin. A "gain of function" of collateral resistance toward ceftolozane, a more recently introduced cephalosporin that was not prescribed to this patient, was also observed, and the biochemical basis of this cross-resistance phenomenon was elucidated. This work unveils the evolutionary trajectories paved by bacteria toward a multidrug-resistant phenotype, driven by decades of antibiotic treatment in the natural CF environmental setting. IMPORTANCE Antibiotics are becoming increasingly ineffective to treat bacterial infections. It has been consequently predicted that infectious diseases will become the biggest challenge to human health in the near future. Pseudomonas aeruginosa is considered a paradigm in antimicrobial resistance as it exploits intrinsic and acquired resistance mechanisms to resist virtually all antibiotics known. AmpC β-lactamase is the main mechanism driving resistance in this notorious pathogen to β-lactams, one of the most widely used classes of antibiotics for cystic fibrosis infections. Here, we focus on the β-lactamase gene as a model resistance determinant and unveil the trajectory P. aeruginosa undertakes on the path toward a multidrug-resistant phenotype during the course of two and a half decades of chronic infection in the airways of a cystic fibrosis patient. Integrating genetic and biochemical studies in the natural environment where evolution occurs, we provide a unique perspective on this challenging landscape, addressing fundamental molecular mechanisms of resistance.
Collapse
Affiliation(s)
- Claudia A. Colque
- Universidad Nacional de Córdoba, Facultad de Ciencias Químicas, Departamento de Química Biológica Ranwel Caputto, Córdoba, Argentina
- CONICET, Universidad Nacional de Córdoba, Centro de Investigaciones en Química Biológica de Córdoba (CIQUIBIC), Córdoba, Argentina
| | - Andrea G. albarracín Orio
- Universidad Nacional de Córdoba, Facultad de Ciencias Químicas, Departamento de Química Biológica Ranwel Caputto, Córdoba, Argentina
- CONICET, Universidad Nacional de Córdoba, Centro de Investigaciones en Química Biológica de Córdoba (CIQUIBIC), Córdoba, Argentina
- IRNASUS, Universidad Católica de Córdoba, CONICET, Facultad de Ciencias Agropecuarias, Córdoba, Argentina
| | - Pablo E. Tomatis
- Instituto de Biología Molecular y Celular de Rosario (IBR), CONICET, Universidad Nacional de Rosario, Rosario, Argentina
- Facultad de Ciencias Bioquímicas y Farmacéuticas, Universidad Nacional de Rosario, Rosario, Argentina
| | - Gina Dotta
- Instituto de Biología Molecular y Celular de Rosario (IBR), CONICET, Universidad Nacional de Rosario, Rosario, Argentina
| | - Diego M. Moreno
- Facultad de Ciencias Bioquímicas y Farmacéuticas, Universidad Nacional de Rosario, Rosario, Argentina
- IQUIR, Instituto de Química de Rosario, CONICET, Universidad Nacional de Rosario, Rosario, Argentina
| | - Laura G. Hedemann
- Universidad Nacional de Córdoba, Facultad de Ciencias Químicas, Departamento de Química Biológica Ranwel Caputto, Córdoba, Argentina
- CONICET, Universidad Nacional de Córdoba, Centro de Investigaciones en Química Biológica de Córdoba (CIQUIBIC), Córdoba, Argentina
| | - Rachel A. Hickman
- Department of Clinical Microbiology, Rigshospitalet, Copenhagen, Denmark
- Novo Nordisk Foundation Centre for Biosustainability, Technical University of Denmark, Lyngby, Denmark
| | - Lea M. Sommer
- Department of Clinical Microbiology, Rigshospitalet, Copenhagen, Denmark
- Novo Nordisk Foundation Centre for Biosustainability, Technical University of Denmark, Lyngby, Denmark
| | - Sofía Feliziani
- Universidad Nacional de Córdoba, Facultad de Ciencias Químicas, Departamento de Química Biológica Ranwel Caputto, Córdoba, Argentina
- CONICET, Universidad Nacional de Córdoba, Centro de Investigaciones en Química Biológica de Córdoba (CIQUIBIC), Córdoba, Argentina
| | - Alejandro J. Moyano
- Universidad Nacional de Córdoba, Facultad de Ciencias Químicas, Departamento de Química Biológica Ranwel Caputto, Córdoba, Argentina
- CONICET, Universidad Nacional de Córdoba, Centro de Investigaciones en Química Biológica de Córdoba (CIQUIBIC), Córdoba, Argentina
| | - Robert A. Bonomo
- Departments of Molecular Biology and Microbiology, Medicine, Biochemistry, Pharmacology, and Proteomics and Bioinformatics, Case Western Reserve University, Cleveland, Ohio, USA
- Senior Clinical Scientist Investigator, Louis Stokes Cleveland Department of Veterans Affairs, Cleveland, Ohio, USA
| | - Helle K. Johansen
- Department of Clinical Microbiology, Rigshospitalet, Copenhagen, Denmark
- Novo Nordisk Foundation Centre for Biosustainability, Technical University of Denmark, Lyngby, Denmark
- Department of Clinical Medicine, University of Copenhagen, Copenhagen, Denmark
| | - Søren Molin
- Novo Nordisk Foundation Centre for Biosustainability, Technical University of Denmark, Lyngby, Denmark
| | - Alejandro J. Vila
- Instituto de Biología Molecular y Celular de Rosario (IBR), CONICET, Universidad Nacional de Rosario, Rosario, Argentina
- Facultad de Ciencias Bioquímicas y Farmacéuticas, Universidad Nacional de Rosario, Rosario, Argentina
| | - Andrea M. Smania
- Universidad Nacional de Córdoba, Facultad de Ciencias Químicas, Departamento de Química Biológica Ranwel Caputto, Córdoba, Argentina
- CONICET, Universidad Nacional de Córdoba, Centro de Investigaciones en Química Biológica de Córdoba (CIQUIBIC), Córdoba, Argentina
| |
Collapse
|
36
|
Azbukina N, Zharikova A, Ramensky V. Intragenic compensation through the lens of deep mutational scanning. Biophys Rev 2022; 14:1161-1182. [PMID: 36345285 PMCID: PMC9636336 DOI: 10.1007/s12551-022-01005-w] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2022] [Accepted: 09/26/2022] [Indexed: 12/20/2022] Open
Abstract
A significant fraction of mutations in proteins are deleterious and result in adverse consequences for protein function, stability, or interaction with other molecules. Intragenic compensation is a specific case of positive epistasis when a neutral missense mutation cancels effect of a deleterious mutation in the same protein. Permissive compensatory mutations facilitate protein evolution, since without them all sequences would be extremely conserved. Understanding compensatory mechanisms is an important scientific challenge at the intersection of protein biophysics and evolution. In human genetics, intragenic compensatory interactions are important since they may result in variable penetrance of pathogenic mutations or fixation of pathogenic human alleles in orthologous proteins from related species. The latter phenomenon complicates computational and clinical inference of an allele's pathogenicity. Deep mutational scanning is a relatively new technique that enables experimental studies of functional effects of thousands of mutations in proteins. We review the important aspects of the field and discuss existing limitations of current datasets. We reviewed ten published DMS datasets with quantified functional effects of single and double mutations and described rates and patterns of intragenic compensation in eight of them. Supplementary Information The online version contains supplementary material available at 10.1007/s12551-022-01005-w.
Collapse
Affiliation(s)
- Nadezhda Azbukina
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, 1-73, Leninskie Gory, 119991 Moscow, Russia
| | - Anastasia Zharikova
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, 1-73, Leninskie Gory, 119991 Moscow, Russia
- National Medical Research Center for Therapy and Preventive Medicine, Petroverigsky per., 10, Bld.3, 101000 Moscow, Russia
| | - Vasily Ramensky
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, 1-73, Leninskie Gory, 119991 Moscow, Russia
- National Medical Research Center for Therapy and Preventive Medicine, Petroverigsky per., 10, Bld.3, 101000 Moscow, Russia
| |
Collapse
|
37
|
Abstract
One core goal of genetics is to systematically understand the mapping between the DNA sequence of an organism (genotype) and its measurable characteristics (phenotype). Understanding this mapping is often challenging because of interactions between mutations, where the result of combining several different mutations can be very different than the sum of their individual effects. Here we provide a statistical framework for modeling complex genetic interactions of this type. The key idea is to ask how fast the effects of mutations change when introducing the same mutation in increasingly distant genetic backgrounds. We then propose a model for phenotypic prediction that takes into account this tendency for the effects of mutations to be more similar in nearby genetic backgrounds. Contemporary high-throughput mutagenesis experiments are providing an increasingly detailed view of the complex patterns of genetic interaction that occur between multiple mutations within a single protein or regulatory element. By simultaneously measuring the effects of thousands of combinations of mutations, these experiments have revealed that the genotype–phenotype relationship typically reflects not only genetic interactions between pairs of sites but also higher-order interactions among larger numbers of sites. However, modeling and understanding these higher-order interactions remains challenging. Here we present a method for reconstructing sequence-to-function mappings from partially observed data that can accommodate all orders of genetic interaction. The main idea is to make predictions for unobserved genotypes that match the type and extent of epistasis found in the observed data. This information on the type and extent of epistasis can be extracted by considering how phenotypic correlations change as a function of mutational distance, which is equivalent to estimating the fraction of phenotypic variance due to each order of genetic interaction (additive, pairwise, three-way, etc.). Using these estimated variance components, we then define an empirical Bayes prior that in expectation matches the observed pattern of epistasis and reconstruct the genotype–phenotype mapping by conducting Gaussian process regression under this prior. To demonstrate the power of this approach, we present an application to the antibody-binding domain GB1 and also provide a detailed exploration of a dataset consisting of high-throughput measurements for the splicing efficiency of human pre-mRNA 5′ splice sites, for which we also validate our model predictions via additional low-throughput experiments.
Collapse
|
38
|
Sohail MS, Louie RHY, Hong Z, Barton JP, McKay MR. Inferring Epistasis from Genetic Time-series Data. Mol Biol Evol 2022; 39:6710201. [PMID: 36130322 PMCID: PMC9558069 DOI: 10.1093/molbev/msac199] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open
Abstract
Epistasis refers to fitness or functional effects of mutations that depend on the sequence background in which these mutations arise. Epistasis is prevalent in nature, including populations of viruses, bacteria, and cancers, and can contribute to the evolution of drug resistance and immune escape. However, it is difficult to directly estimate epistatic effects from sampled observations of a population. At present, there are very few methods that can disentangle the effects of selection (including epistasis), mutation, recombination, genetic drift, and genetic linkage in evolving populations. Here we develop a method to infer epistasis, along with the fitness effects of individual mutations, from observed evolutionary histories. Simulations show that we can accurately infer pairwise epistatic interactions provided that there is sufficient genetic diversity in the data. Our method also allows us to identify which fitness parameters can be reliably inferred from a particular data set and which ones are unidentifiable. Our approach therefore allows for the inference of more complex models of selection from time-series genetic data, while also quantifying uncertainty in the inferred parameters.
Collapse
Affiliation(s)
- Muhammad Saqib Sohail
- Department of Electronic and Computer Engineering, Hong Kong University of Science and Technology, Hong Kong SAR, People’s Republic of China
| | - Raymond H Y Louie
- The Kirby Institute, University of New South Wales, Sydney, New South Wales, Australia
| | - Zhenchen Hong
- Department of Physics and Astronomy, University of California, Riverside, CA, USA
| | | | | |
Collapse
|
39
|
Analysis of the Contribution of Intrinsic Disorder in Shaping Potyvirus Genetic Diversity. Viruses 2022; 14:v14091959. [PMID: 36146764 PMCID: PMC9504506 DOI: 10.3390/v14091959] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2022] [Revised: 08/30/2022] [Accepted: 08/31/2022] [Indexed: 12/30/2022] Open
Abstract
Intrinsically disordered regions (IDRs) are abundant in the proteome of RNA viruses. The multifunctional properties of these regions are widely documented and their structural flexibility is associated with the low constraint in their amino acid positions. Therefore, from an evolutionary stand point, these regions could have a greater propensity to accumulate non-synonymous mutations (NS) than highly structured regions (ORs, or 'ordered regions'). To address this hypothesis, we compared the distribution of non-synonymous mutations (NS), which we relate here to mutational robustness, in IDRs and ORs in the genome of potyviruses, a major genus of plant viruses. For this purpose, a simulation model was built and used to distinguish a possible selection phenomenon in the biological datasets from randomly generated mutations. We analyzed several short-term experimental evolution datasets. An analysis was also performed on the natural diversity of three different species of potyviruses reflecting their long-term evolution. We observed that the mutational robustness of IDRs is significantly higher than that of ORs. Moreover, the substitutions in the ORs are very constrained by the conservation of the physico-chemical properties of the amino acids. This feature is not found in the IDRs where the substitutions tend to be more random. This reflects the weak structural constraints in these regions, wherein an amino acid polymorphism is naturally conserved. In the course of evolution, potyvirus IDRs and ORs follow different evolutive paths with respect to their mutational robustness. These results have forced the authors to consider the hypothesis that IDRs and their associated amino acid polymorphism could constitute a potential adaptive reservoir.
Collapse
|
40
|
Romero Romero ML, Landerer C, Poehls J, Toth‐Petroczy A. Phenotypic mutations contribute to protein diversity and shape protein evolution. Protein Sci 2022; 31:e4397. [PMID: 36040266 PMCID: PMC9375231 DOI: 10.1002/pro.4397] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2022] [Revised: 06/14/2022] [Accepted: 07/04/2022] [Indexed: 11/16/2022]
Abstract
Errors in DNA replication generate genetic mutations, while errors in transcription and translation lead to phenotypic mutations. Phenotypic mutations are orders of magnitude more frequent than genetic ones, yet they are less understood. Here, we review the types of phenotypic mutations, their quantifications, and their role in protein evolution and disease. The diversity generated by phenotypic mutation can facilitate adaptive evolution. Indeed, phenotypic mutations, such as ribosomal frameshift and stop codon readthrough, sometimes serve to regulate protein expression and function. Phenotypic mutations have often been linked to fitness decrease and diseases. Thus, understanding the protein heterogeneity and phenotypic diversity caused by phenotypic mutations will advance our understanding of protein evolution and have implications on human health and diseases.
Collapse
Affiliation(s)
- Maria Luisa Romero Romero
- Max Planck Institute of Molecular Cell Biology and GeneticsDresdenGermany
- Center for Systems Biology DresdenDresdenGermany
| | - Cedric Landerer
- Max Planck Institute of Molecular Cell Biology and GeneticsDresdenGermany
- Center for Systems Biology DresdenDresdenGermany
| | - Jonas Poehls
- Max Planck Institute of Molecular Cell Biology and GeneticsDresdenGermany
- Center for Systems Biology DresdenDresdenGermany
| | - Agnes Toth‐Petroczy
- Max Planck Institute of Molecular Cell Biology and GeneticsDresdenGermany
- Center for Systems Biology DresdenDresdenGermany
- Cluster of Excellence Physics of LifeTU DresdenDresdenGermany
| |
Collapse
|
41
|
Jayaraman V, Toledo‐Patiño S, Noda‐García L, Laurino P. Mechanisms of protein evolution. Protein Sci 2022; 31:e4362. [PMID: 35762715 PMCID: PMC9214755 DOI: 10.1002/pro.4362] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2022] [Revised: 05/11/2022] [Accepted: 05/14/2022] [Indexed: 11/06/2022]
Abstract
How do proteins evolve? How do changes in sequence mediate changes in protein structure, and in turn in function? This question has multiple angles, ranging from biochemistry and biophysics to evolutionary biology. This review provides a brief integrated view of some key mechanistic aspects of protein evolution. First, we explain how protein evolution is primarily driven by randomly acquired genetic mutations and selection for function, and how these mutations can even give rise to completely new folds. Then, we also comment on how phenotypic protein variability, including promiscuity, transcriptional and translational errors, may also accelerate this process, possibly via "plasticity-first" mechanisms. Finally, we highlight open questions in the field of protein evolution, with respect to the emergence of more sophisticated protein systems such as protein complexes, pathways, and the emergence of pre-LUCA enzymes.
Collapse
Affiliation(s)
- Vijay Jayaraman
- Department of Molecular Cell BiologyWeizmann Institute of ScienceRehovotIsrael
| | - Saacnicteh Toledo‐Patiño
- Protein Engineering and Evolution UnitOkinawa Institute of Science and Technology Graduate UniversityOkinawaJapan
| | - Lianet Noda‐García
- Department of Plant Pathology and Microbiology, Institute of Environmental Sciences, Robert H. Smith Faculty of Agriculture, Food and EnvironmentHebrew University of JerusalemRehovotIsrael
| | - Paola Laurino
- Protein Engineering and Evolution UnitOkinawa Institute of Science and Technology Graduate UniversityOkinawaJapan
| |
Collapse
|
42
|
Chen DS, Clark AG, Wolfner MF. Octopaminergic/tyraminergic Tdc2 neurons regulate biased sperm usage in female Drosophila melanogaster. Genetics 2022; 221:6613932. [PMID: 35736370 DOI: 10.1093/genetics/iyac097] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2021] [Accepted: 06/04/2022] [Indexed: 11/14/2022] Open
Abstract
In polyandrous internally fertilizing species, a multiply-mated female can use stored sperm from different males in a biased manner to fertilize her eggs. The female's ability to assess sperm quality and compatibility is essential for her reproductive success, and represents an important aspect of postcopulatory sexual selection. In Drosophila melanogaster, previous studies demonstrated that the female nervous system plays an active role in influencing progeny paternity proportion, and suggested a role for octopaminergic/tyraminergic Tdc2 neurons in this process. Here, we report that inhibiting Tdc2 neuronal activity causes females to produce a higher-than-normal proportion of first-male progeny. This difference is not due to differences in sperm storage or release, but instead is attributable to the suppression of second-male sperm usage bias that normally occurs in control females. We further show that a subset of Tdc2 neurons innervating the female reproductive tract is largely responsible for the progeny proportion phenotype that is observed when Tdc2 neurons are inhibited globally. On the contrary, overactivation of Tdc2 neurons does not further affect sperm storage and release or progeny proportion. These results suggest that octopaminergic/tyraminergic signaling allows a multiply-mated female to bias sperm usage, and identify a new role for the female nervous system in postcopulatory sexual selection.
Collapse
Affiliation(s)
- Dawn S Chen
- Department of Molecular Biology and Genetics, Cornell University, Ithaca NY 14853, USA
| | - Andrew G Clark
- Department of Molecular Biology and Genetics, Cornell University, Ithaca NY 14853, USA
| | - Mariana F Wolfner
- Department of Molecular Biology and Genetics, Cornell University, Ithaca NY 14853, USA
| |
Collapse
|
43
|
Abstract
Using a neural network to predict how green fluorescent proteins respond to genetic mutations illuminates properties that could help design new proteins.
Collapse
Affiliation(s)
- Grzegorz Kudla
- MRC Human Genetics Unit, The University of Edinburgh, Edinburgh, United Kingdom
| | - Marcin Plech
- MRC Human Genetics Unit, The University of Edinburgh, Edinburgh, United Kingdom
| |
Collapse
|
44
|
Bakerlee CW, Nguyen Ba AN, Shulgina Y, Rojas Echenique JI, Desai MM. Idiosyncratic epistasis leads to global fitness-correlated trends. Science 2022; 376:630-635. [PMID: 35511982 PMCID: PMC10124986 DOI: 10.1126/science.abm4774] [Citation(s) in RCA: 21] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Epistasis can markedly affect evolutionary trajectories. In recent decades, protein-level fitness landscapes have revealed extensive idiosyncratic epistasis among specific mutations. By contrast, other work has found ubiquitous and apparently nonspecific patterns of global diminishing-returns and increasing-costs epistasis among mutations across the genome. Here, we used a hierarchical CRISPR gene drive system to construct all combinations of 10 missense mutations from across the genome in budding yeast and measured their fitness in six environments. We show that the resulting fitness landscapes exhibit global fitness-correlated trends but that these trends emerge from specific idiosyncratic interactions. We thus provide experimental validation of recent theoretical work arguing that fitness-correlated trends can emerge as the generic consequence of idiosyncratic epistasis.
Collapse
Affiliation(s)
- Christopher W Bakerlee
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA, USA.,Quantitative Biology Initiative, Harvard University, Cambridge, MA, USA.,Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA, USA
| | - Alex N Nguyen Ba
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA, USA.,Quantitative Biology Initiative, Harvard University, Cambridge, MA, USA.,Department of Cell and Systems Biology, University of Toronto, Toronto, Ontario, Canada.,Department of Biology, University of Toronto Mississauga, Mississauga, Ontario, Canada
| | - Yekaterina Shulgina
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA, USA
| | - Jose I Rojas Echenique
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA, USA.,Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
| | - Michael M Desai
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA, USA.,Quantitative Biology Initiative, Harvard University, Cambridge, MA, USA.,NSF-Simons Center for Mathematical and Statistical Analysis of Biology, Harvard University, Cambridge, MA, USA.,Department of Physics, Harvard University, Cambridge, MA, USA
| |
Collapse
|
45
|
Noby N, Johnson RL, Tyzack JD, Embaby AM, Saeed H, Hussein A, Khattab SN, Rizkallah PJ, Jones DD. Structure-Guided Engineering of a Family IV Cold-Adapted Esterase Expands Its Substrate Range. Int J Mol Sci 2022; 23:ijms23094703. [PMID: 35563094 PMCID: PMC9100969 DOI: 10.3390/ijms23094703] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2022] [Revised: 04/15/2022] [Accepted: 04/16/2022] [Indexed: 11/16/2022] Open
Abstract
Cold active esterases have gained great interest in several industries. The recently determined structure of a family IV cold active esterase (EstN7) from Bacillus cohnii strain N1 was used to expand its substrate range and to probe its commercially valuable substrates. Database mining suggested that triacetin was a potential commercially valuable substrate for EstN7, which was subsequently proved experimentally with the final product being a single isomeric product, 1,2-glyceryl diacetate. Enzyme kinetics revealed that EstN7’s activity is restricted to C2 and C4 substrates due to a plug at the end of the acyl binding pocket that blocks access to a buried water-filled cavity. Residues M187, N211 and W206 were identified as key plug forming residues. N211A stabilised EstN7 allowing incorporation of the destabilising M187A mutation. The M187A-N211A double mutant had the broadest substrate range, capable of hydrolysing a C8 substrate. W206A did not appear to have any significant effect on substrate range either alone or when combined with the double mutant. Thus, the enzyme kinetics and engineering together with a recently determined structure of EstN7 provide new insights into substrate specificity and the role of acyl binding pocket plug residues in determining family IV esterase stability and substrate range.
Collapse
Affiliation(s)
- Nehad Noby
- Department of Biotechnology, Institute of Graduate Studies and Research, Alexandria University, Alexandria 21526, Egypt; (A.M.E.); (H.S.); (A.H.)
- Correspondence: (N.N.); (D.D.J.)
| | - Rachel L. Johnson
- Molecular Biosciences Division, School of Biosciences, Cardiff University, Cardiff CF10 3AX, UK;
| | - Jonathan D. Tyzack
- European Molecular Biology Laboratory-European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, UK;
| | - Amira M. Embaby
- Department of Biotechnology, Institute of Graduate Studies and Research, Alexandria University, Alexandria 21526, Egypt; (A.M.E.); (H.S.); (A.H.)
| | - Hesham Saeed
- Department of Biotechnology, Institute of Graduate Studies and Research, Alexandria University, Alexandria 21526, Egypt; (A.M.E.); (H.S.); (A.H.)
| | - Ahmed Hussein
- Department of Biotechnology, Institute of Graduate Studies and Research, Alexandria University, Alexandria 21526, Egypt; (A.M.E.); (H.S.); (A.H.)
| | - Sherine N. Khattab
- Department of Chemistry, Faculty of Science, Alexandria University, Alexandria 21321, Egypt;
- Cancer Nanotechnology Research Laboratory (CNRL), Faculty of Pharmacy, Alexandria University, Alexandria 21521, Egypt
| | | | - D. Dafydd Jones
- Molecular Biosciences Division, School of Biosciences, Cardiff University, Cardiff CF10 3AX, UK;
- Correspondence: (N.N.); (D.D.J.)
| |
Collapse
|
46
|
Environmental selection and epistasis in an empirical phenotype-environment-fitness landscape. Nat Ecol Evol 2022; 6:427-438. [PMID: 35210579 DOI: 10.1038/s41559-022-01675-5] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2021] [Accepted: 12/14/2021] [Indexed: 11/08/2022]
Abstract
Fitness landscapes, mappings of genotype/phenotype to their effects on fitness, are invaluable concepts in evolutionary biochemistry. Although widely discussed, measurements of phenotype-fitness landscapes in proteins remain scarce. Here, we quantify all single mutational effects on fitness and phenotype (EC50) of VIM-2 β-lactamase across a 64-fold range of ampicillin concentrations. We then construct a phenotype-fitness landscape that takes variations in environmental selection pressure into account. We found that a simple, empirical landscape accurately models the ~39,000 mutational data points, suggesting that the evolution of VIM-2 can be predicted on the basis of the selection environment. Our landscape provides new quantitative knowledge on the evolution of the β-lactamases and proteins in general, particularly their evolutionary dynamics under subinhibitory antibiotic concentrations, as well as the mechanisms and environmental dependence of non-specific epistasis.
Collapse
|
47
|
Evolutionary dynamics, evolutionary forces, and robustness: A nonequilibrium statistical mechanics perspective. Proc Natl Acad Sci U S A 2022; 119:e2112083119. [PMID: 35312370 PMCID: PMC9060472 DOI: 10.1073/pnas.2112083119] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Evolution through natural selection is an overwhelmingly complex process, and it is not surprising that theoretical approaches are strongly simplifying it. For instance, population genetics considers mainly dynamics of gene allele frequencies. Here, we develop a complementary approach to evolutionary dynamics based on three elements—organism reproduction, variations, and selection—that are essential for any evolutionary theory. By considering such general dynamics as a stochastic thermodynamic process, we clarify the nature and action of the evolutionary forces. We show that some of the forces cannot be described solely in terms of fitness landscapes. We also find that one force contribution can make organism reproduction insensitive (robust) to variations. Any realistic evolutionary theory has to consider 1) the dynamics of organisms that reproduce and possess heritable traits, 2) the appearance of stochastic variations in these traits, and 3) the selection of those organisms that better survive and reproduce. These elements shape the “evolutionary forces” that characterize the evolutionary dynamics. Here, we introduce a general model of reproduction–variation–selection dynamics. By treating these dynamics as a nonequilibrium thermodynamic process, we make precise the notion of the forces that characterize evolution. One of these forces, in particular, can be associated with the robustness of reproduction to variations. Some of the detailed predictions of our model can be tested by quantitative laboratory experiments, similar to those performed in the past on evolving populations of proteins or viruses.
Collapse
|
48
|
Teufl M, Zajc CU, Traxlmayr MW. Engineering Strategies to Overcome the Stability-Function Trade-Off in Proteins. ACS Synth Biol 2022; 11:1030-1039. [PMID: 35258287 PMCID: PMC8938945 DOI: 10.1021/acssynbio.1c00512] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
![]()
In addition to its
biological function, the stability of a protein
is a major determinant for its applicability. Unfortunately, engineering
proteins for improved functionality usually results in destabilization
of the protein. This so-called stability–function trade-off
can be explained by the simple fact that the generation of a novel
protein function—or the improvement of an existing one—necessitates
the insertion of mutations, i.e., deviations from
the evolutionarily optimized wild-type sequence. In fact, it was demonstrated
that gain-of-function mutations are not more destabilizing than other
random mutations. The stability–function trade-off is a universal
phenomenon during protein evolution that has been observed with completely
different types of proteins, including enzymes, antibodies, and engineered
binding scaffolds. In this review, we discuss three types of strategies
that have been successfully deployed to overcome this omnipresent
obstacle in protein engineering approaches: (i) using highly stable
parental proteins, (ii) minimizing the extent of destabilization during
functional engineering (by library optimization and/or coselection
for stability and function), and (iii) repairing damaged mutants through
stability engineering. The implementation of these strategies in protein
engineering campaigns will facilitate the efficient generation of
protein variants that are not only functional but also stable and
therefore better-suited for subsequent applications.
Collapse
Affiliation(s)
- Magdalena Teufl
- Department of Chemistry, Institute of Biochemistry, University of Natural Resources and Life Sciences, 1190 Vienna, Austria
- CD Laboratory for Next Generation CAR T Cells, 1190 Vienna, Austria
| | - Charlotte U. Zajc
- Department of Chemistry, Institute of Biochemistry, University of Natural Resources and Life Sciences, 1190 Vienna, Austria
- CD Laboratory for Next Generation CAR T Cells, 1190 Vienna, Austria
| | - Michael W. Traxlmayr
- Department of Chemistry, Institute of Biochemistry, University of Natural Resources and Life Sciences, 1190 Vienna, Austria
- CD Laboratory for Next Generation CAR T Cells, 1190 Vienna, Austria
| |
Collapse
|
49
|
Hidalgo F, Nocka LM, Shah NH, Gorday K, Latorraca NR, Bandaru P, Templeton S, Lee D, Karandur D, Pelton JG, Marqusee S, Wemmer D, Kuriyan J. A saturation-mutagenesis analysis of the interplay between stability and activation in Ras. eLife 2022; 11:e76595. [PMID: 35272765 PMCID: PMC8916776 DOI: 10.7554/elife.76595] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2022] [Accepted: 01/25/2022] [Indexed: 12/31/2022] Open
Abstract
Cancer mutations in Ras occur predominantly at three hotspots: Gly 12, Gly 13, and Gln 61. Previously, we reported that deep mutagenesis of H-Ras using a bacterial assay identified many other activating mutations (Bandaru et al., 2017). We now show that the results of saturation mutagenesis of H-Ras in mammalian Ba/F3 cells correlate well with the results of bacterial experiments in which H-Ras or K-Ras are co-expressed with a GTPase-activating protein (GAP). The prominent cancer hotspots are not dominant in the Ba/F3 data. We used the bacterial system to mutagenize Ras constructs of different stabilities and discovered a feature that distinguishes the cancer hotspots. While mutations at the cancer hotspots activate Ras regardless of construct stability, mutations at lower-frequency sites (e.g. at Val 14 or Asp 119) can be activating or deleterious, depending on the stability of the Ras construct. We characterized the dynamics of three non-hotspot activating Ras mutants by using NMR to monitor hydrogen-deuterium exchange (HDX). These mutations result in global increases in HDX rates, consistent with destabilization of Ras. An explanation for these observations is that mutations that destabilize Ras increase nucleotide dissociation rates, enabling activation by spontaneous nucleotide exchange. A further stability decrease can lead to insufficient levels of folded Ras - and subsequent loss of function. In contrast, the cancer hotspot mutations are mechanism-based activators of Ras that interfere directly with the action of GAPs. Our results demonstrate the importance of GAP surveillance and protein stability in determining the sensitivity of Ras to mutational activation.
Collapse
Affiliation(s)
- Frank Hidalgo
- California Institute for Quantitative Biosciences (QB3), University of California, BerkeleyBerkeleyUnited States
- Howard Hughes Medical Institute, University of California, BerkeleyBerkeleyUnited States
- Department of Chemistry, University of California, BerkeleyBerkeleyUnited States
| | - Laura M Nocka
- California Institute for Quantitative Biosciences (QB3), University of California, BerkeleyBerkeleyUnited States
- Howard Hughes Medical Institute, University of California, BerkeleyBerkeleyUnited States
- Department of Chemistry, University of California, BerkeleyBerkeleyUnited States
| | - Neel H Shah
- California Institute for Quantitative Biosciences (QB3), University of California, BerkeleyBerkeleyUnited States
- Howard Hughes Medical Institute, University of California, BerkeleyBerkeleyUnited States
- Department of Molecular and Cell Biology, University of California, BerkeleyBerkeleyUnited States
- Department of Chemistry, Columbia UniversityNew YorkUnited States
| | - Kent Gorday
- California Institute for Quantitative Biosciences (QB3), University of California, BerkeleyBerkeleyUnited States
- Howard Hughes Medical Institute, University of California, BerkeleyBerkeleyUnited States
- Biophysics Graduate Group, University of California, BerkeleyBerkeleyUnited States
| | - Naomi R Latorraca
- California Institute for Quantitative Biosciences (QB3), University of California, BerkeleyBerkeleyUnited States
- Department of Molecular and Cell Biology, University of California, BerkeleyBerkeleyUnited States
| | - Pradeep Bandaru
- California Institute for Quantitative Biosciences (QB3), University of California, BerkeleyBerkeleyUnited States
- Howard Hughes Medical Institute, University of California, BerkeleyBerkeleyUnited States
- Department of Molecular and Cell Biology, University of California, BerkeleyBerkeleyUnited States
| | - Sage Templeton
- California Institute for Quantitative Biosciences (QB3), University of California, BerkeleyBerkeleyUnited States
- Howard Hughes Medical Institute, University of California, BerkeleyBerkeleyUnited States
| | - David Lee
- California Institute for Quantitative Biosciences (QB3), University of California, BerkeleyBerkeleyUnited States
- Howard Hughes Medical Institute, University of California, BerkeleyBerkeleyUnited States
| | - Deepti Karandur
- California Institute for Quantitative Biosciences (QB3), University of California, BerkeleyBerkeleyUnited States
- Howard Hughes Medical Institute, University of California, BerkeleyBerkeleyUnited States
- Department of Molecular and Cell Biology, University of California, BerkeleyBerkeleyUnited States
| | - Jeffrey G Pelton
- California Institute for Quantitative Biosciences (QB3), University of California, BerkeleyBerkeleyUnited States
| | - Susan Marqusee
- California Institute for Quantitative Biosciences (QB3), University of California, BerkeleyBerkeleyUnited States
- Department of Chemistry, University of California, BerkeleyBerkeleyUnited States
- Department of Molecular and Cell Biology, University of California, BerkeleyBerkeleyUnited States
| | - David Wemmer
- California Institute for Quantitative Biosciences (QB3), University of California, BerkeleyBerkeleyUnited States
- Department of Chemistry, University of California, BerkeleyBerkeleyUnited States
| | - John Kuriyan
- California Institute for Quantitative Biosciences (QB3), University of California, BerkeleyBerkeleyUnited States
- Howard Hughes Medical Institute, University of California, BerkeleyBerkeleyUnited States
- Department of Chemistry, University of California, BerkeleyBerkeleyUnited States
- Department of Molecular and Cell Biology, University of California, BerkeleyBerkeleyUnited States
| |
Collapse
|
50
|
Iyengar BR, Wagner A. GroEL/S overexpression helps to purge deleterious mutations and reduce genetic diversity during adaptive protein evolution. Mol Biol Evol 2022; 39:6540901. [PMID: 35234895 PMCID: PMC9188349 DOI: 10.1093/molbev/msac047] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Chaperones are proteins that help other proteins fold. They also affect the adaptive evolution of their client proteins by buffering the effect of deleterious mutations and increasing the genetic diversity of evolving proteins. We study how the bacterial chaperone GroE (GroEL + GroES) affects the evolution of green fluorescent protein (GFP). To this end we subjected GFP to multiple rounds of mutation and selection for its color phenotype in four replicate E. coli populations, and studied its evolutionary dynamics through high-throughput sequencing and mutant engineering. We evolved GFP both under stabilizing selection for its ancestral (green) phenotype, and to directional selection for a new (cyan) phenotype. We did so both under low and high expression of the chaperone GroE. In contrast to previous work, we observe that GroE does not just buffer but also helps purge deleterious (fluorescence reducing) mutations from evolving populations. In doing so, GroE helps reduce the genetic diversity of evolving populations. In addition, it causes phenotypic heterogeneity in mutants with the same genotype, helping to enhance their fluorescence in some cells, and reducing it in others. Our observations show that chaperones can affect adaptive evolution in more than one way.
Collapse
|