1
|
Rao J, Xin R, Macdonald C, Howard MK, Estevam GO, Yee SW, Wang M, Fraser JS, Coyote-Maestas W, Pimentel H. Rosace: a robust deep mutational scanning analysis framework employing position and mean-variance shrinkage. Genome Biol 2024; 25:138. [PMID: 38789982 PMCID: PMC11127319 DOI: 10.1186/s13059-024-03279-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Accepted: 05/14/2024] [Indexed: 05/26/2024] Open
Abstract
Deep mutational scanning (DMS) measures the effects of thousands of genetic variants in a protein simultaneously. The small sample size renders classical statistical methods ineffective. For example, p-values cannot be correctly calibrated when treating variants independently. We propose Rosace, a Bayesian framework for analyzing growth-based DMS data. Rosace leverages amino acid position information to increase power and control the false discovery rate by sharing information across parameters via shrinkage. We also developed Rosette for simulating the distributional properties of DMS. We show that Rosace is robust to the violation of model assumptions and is more powerful than existing tools.
Collapse
Affiliation(s)
- Jingyou Rao
- Department of Computer Science, UCLA, Los Angeles, CA, USA
| | - Ruiqi Xin
- Computational and Systems Biology Interdepartmental Program, UCLA, Los Angeles, CA, USA
| | - Christian Macdonald
- Department of Bioengineering and Therapeutic Sciences, UCSF, San Francisco, CA, USA
| | - Matthew K Howard
- Department of Bioengineering and Therapeutic Sciences, UCSF, San Francisco, CA, USA
- Tetrad Graduate Program, UCSF, San Francisco, CA, USA
- Department of Pharmaceutical Chemistry, UCSF, San Francisco, CA, USA
| | - Gabriella O Estevam
- Department of Bioengineering and Therapeutic Sciences, UCSF, San Francisco, CA, USA
- Tetrad Graduate Program, UCSF, San Francisco, CA, USA
| | - Sook Wah Yee
- Department of Bioengineering and Therapeutic Sciences, UCSF, San Francisco, CA, USA
| | - Mingsen Wang
- Department of Mathematics, Baruch College, CUNY, New York, NY, USA
| | - James S Fraser
- Department of Bioengineering and Therapeutic Sciences, UCSF, San Francisco, CA, USA
- Quantitative Biosciences Institute, UCSF, San Francisco, CA, USA
| | - Willow Coyote-Maestas
- Department of Bioengineering and Therapeutic Sciences, UCSF, San Francisco, CA, USA.
- Quantitative Biosciences Institute, UCSF, San Francisco, CA, USA.
| | - Harold Pimentel
- Department of Computer Science, UCLA, Los Angeles, CA, USA.
- Department of Computational Medicine, David Geffen School of Medicine, UCLA, Los Angeles, CA, USA.
- Department of Human Genetics, David Geffen School of Medicine, UCLA, Los Angeles, CA, USA.
| |
Collapse
|
2
|
Judge A, Sankaran B, Hu L, Palaniappan M, Birgy A, Prasad BVV, Palzkill T. Network of epistatic interactions in an enzyme active site revealed by large-scale deep mutational scanning. Proc Natl Acad Sci U S A 2024; 121:e2313513121. [PMID: 38483989 PMCID: PMC10962969 DOI: 10.1073/pnas.2313513121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2023] [Accepted: 02/14/2024] [Indexed: 03/19/2024] Open
Abstract
Cooperative interactions between amino acids are critical for protein function. A genetic reflection of cooperativity is epistasis, which is when a change in the amino acid at one position changes the sequence requirements at another position. To assess epistasis within an enzyme active site, we utilized CTX-M β-lactamase as a model system. CTX-M hydrolyzes β-lactam antibiotics to provide antibiotic resistance, allowing a simple functional selection for rapid sorting of modified enzymes. We created all pairwise mutations across 17 active site positions in the β-lactamase enzyme and quantitated the function of variants against two β-lactam antibiotics using next-generation sequencing. Context-dependent sequence requirements were determined by comparing the antibiotic resistance function of double mutations across the CTX-M active site to their predicted function based on the constituent single mutations, revealing both positive epistasis (synergistic interactions) and negative epistasis (antagonistic interactions) between amino acid substitutions. The resulting trends demonstrate that positive epistasis is present throughout the active site, that epistasis between residues is mediated through substrate interactions, and that residues more tolerant to substitutions serve as generic compensators which are responsible for many cases of positive epistasis. Additionally, we show that a key catalytic residue (Glu166) is amenable to compensatory mutations, and we characterize one such double mutant (E166Y/N170G) that acts by an altered catalytic mechanism. These findings shed light on the unique biochemical factors that drive epistasis within an enzyme active site and will inform enzyme engineering efforts by bridging the gap between amino acid sequence and catalytic function.
Collapse
Affiliation(s)
- Allison Judge
- Verna and Marrs McLean Department of Biochemistry and Molecular Pharmacology, Baylor College of Medicine, Houston, TX77030
| | - Banumathi Sankaran
- Department of Molecular Biophysics and Integrated Bioimaging, Berkeley Center for Structural Biology Lawrence Berkeley National Laboratory, Berkeley, CA94720
| | - Liya Hu
- Verna and Marrs McLean Department of Biochemistry and Molecular Pharmacology, Baylor College of Medicine, Houston, TX77030
| | - Murugesan Palaniappan
- Department of Pathology and Immunology, Center for Drug Discovery, Baylor College of Medicine, Houston, TX77030
| | - André Birgy
- Verna and Marrs McLean Department of Biochemistry and Molecular Pharmacology, Baylor College of Medicine, Houston, TX77030
- Infections, Antimicrobials, Modelling, Evolution, UMR 1137, French Insitute for Medical Research (INSERM), Faculty of Health, Université Paris Cité, Paris75006, France
| | - B. V. Venkataram Prasad
- Verna and Marrs McLean Department of Biochemistry and Molecular Pharmacology, Baylor College of Medicine, Houston, TX77030
| | - Timothy Palzkill
- Verna and Marrs McLean Department of Biochemistry and Molecular Pharmacology, Baylor College of Medicine, Houston, TX77030
| |
Collapse
|
3
|
Radojković M, Ubbink M. Positive epistasis drives clavulanic acid resistance in double mutant libraries of BlaC β-lactamase. Commun Biol 2024; 7:197. [PMID: 38368480 PMCID: PMC10874438 DOI: 10.1038/s42003-024-05868-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2023] [Accepted: 01/26/2024] [Indexed: 02/19/2024] Open
Abstract
Phenotypic effects of mutations are highly dependent on the genetic backgrounds in which they occur, due to epistatic effects. To test how easily the loss of enzyme activity can be compensated for, we screen mutant libraries of BlaC, a β-lactamase from Mycobacterium tuberculosis, for fitness in the presence of carbenicillin and the inhibitor clavulanic acid. Using a semi-rational approach and deep sequencing, we prepare four double-site saturation libraries and determine the relative fitness effect for 1534/1540 (99.6%) of the unique library members at two temperatures. Each library comprises variants of a residue known to be relevant for clavulanic acid resistance as well as residue 105, which regulates access to the active site. Variants with greatly improved fitness were identified within each library, demonstrating that compensatory mutations for loss of activity can be readily found. In most cases, the fittest variants are a result of positive epistasis, indicating strong synergistic effects between the chosen residue pairs. Our study sheds light on a role of epistasis in the evolution of functional residues and underlines the highly adaptive potential of BlaC.
Collapse
Affiliation(s)
- Marko Radojković
- Leiden Institute of Chemistry, Leiden University, Einsteinweg 55, 2333 CC, Leiden, The Netherlands
| | - Marcellus Ubbink
- Leiden Institute of Chemistry, Leiden University, Einsteinweg 55, 2333 CC, Leiden, The Netherlands.
| |
Collapse
|
4
|
Buda K, Miton CM, Fan XC, Tokuriki N. Molecular determinants of protein evolvability. Trends Biochem Sci 2023; 48:751-760. [PMID: 37330341 DOI: 10.1016/j.tibs.2023.05.009] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2023] [Revised: 05/18/2023] [Accepted: 05/23/2023] [Indexed: 06/19/2023]
Abstract
The plethora of biological functions that sustain life is rooted in the remarkable evolvability of proteins. An emerging view highlights the importance of a protein's initial state in dictating evolutionary success. A deeper comprehension of the mechanisms that govern the evolvability of these initial states can provide invaluable insights into protein evolution. In this review, we describe several molecular determinants of protein evolvability, unveiled by experimental evolution and ancestral sequence reconstruction studies. We further discuss how genetic variation and epistasis can promote or constrain functional innovation and suggest putative underlying mechanisms. By establishing a clear framework for these determinants, we provide potential indicators enabling the forecast of suitable evolutionary starting points and delineate molecular mechanisms in need of deeper exploration.
Collapse
Affiliation(s)
- Karol Buda
- Michael Smith Laboratories, University of British Columbia, Vancouver, Canada
| | - Charlotte M Miton
- Michael Smith Laboratories, University of British Columbia, Vancouver, Canada
| | - Xingyu Cara Fan
- Michael Smith Laboratories, University of British Columbia, Vancouver, Canada
| | - Nobuhiko Tokuriki
- Michael Smith Laboratories, University of British Columbia, Vancouver, Canada.
| |
Collapse
|
5
|
Standley M, Blay V, Beleva Guthrie V, Kim J, Lyman A, Moya A, Karchin R, Camps M. Experimental and In Silico Analysis of TEM β-Lactamase Adaptive Evolution. ACS Infect Dis 2022; 8:2451-2463. [PMID: 36377311 PMCID: PMC9745794 DOI: 10.1021/acsinfecdis.2c00216] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Multiple mutations often have non-additive (epistatic) phenotypic effects. Epistasis is of fundamental biological relevance but is not well understood mechanistically. Adaptive evolution, i.e., the evolution of new biochemical activities, is rich in epistatic interactions. To better understand the principles underlying epistasis during genetic adaptation, we studied the evolution of TEM-1 β-lactamase variants exhibiting cefotaxime resistance. We report the collection of a library of 487 observed evolutionary trajectories for TEM-1 and determine the epistasis status based on cefotaxime resistance phenotype for 206 combinations of 2-3 TEM-1 mutations involving 17 positions under adaptive selective pressure. Gain-of-function (GOF) mutations are gatekeepers for adaptation. To see if GOF phenotypes can be inferred based solely on sequence data, we calculated the enrichment of GOF mutations in the different categories of epistatic pairs. Our results suggest that this is possible because GOF mutations are particularly enriched in sign and reciprocal sign epistasis, which leave a major imprint on the sequence space accessible to evolution. We also used FoldX to explore the relationship between thermodynamic stability and epistasis. We found that mutations in observed evolutionary trajectories tend to destabilize the folded structure of the protein, albeit their cumulative effects are consistently below the protein's free energy of folding. The destabilizing effect is stronger for epistatic pairs, suggesting that modest or local alterations in folding stability can modulate catalysis. Finally, we report a significant relationship between epistasis and the degree to which two protein positions are structurally and dynamically coupled, even in the absence of ligand.
Collapse
Affiliation(s)
- Melissa Standley
- Department
of Microbiology and Environmental Toxicology, University of California, Santa
Cruz, California95064, United States
| | - Vincent Blay
- Department
of Microbiology and Environmental Toxicology, University of California, Santa
Cruz, California95064, United States,Institute
for Integrative Systems Biology (I2Sysbio), Universitat de València and Spanish Research Council (CSIC), 46980Valencia, Spain,
| | - Violeta Beleva Guthrie
- Department
of Biomedical Engineering and Institute for Computational Medicine, The Johns Hopkins University, Baltimore, Maryland21218, United States
| | - Jay Kim
- Department
of Microbiology and Environmental Toxicology, University of California, Santa
Cruz, California95064, United States
| | - Audrey Lyman
- Department
of Microbiology and Environmental Toxicology, University of California, Santa
Cruz, California95064, United States
| | - Andrés Moya
- Institute
for Integrative Systems Biology (I2Sysbio), Universitat de València and Spanish Research Council (CSIC), 46980Valencia, Spain,Foundation
for the Promotion of Sanitary and Biomedical Research of Valencia
Region (FISABIO), 46021Valencia, Spain,CIBER
in Epidemiology and Public Health (CIBEResp), 28029Madrid, Spain
| | - Rachel Karchin
- Department
of Biomedical Engineering and Institute for Computational Medicine, The Johns Hopkins University, Baltimore, Maryland21218, United States
| | - Manel Camps
- Department
of Microbiology and Environmental Toxicology, University of California, Santa
Cruz, California95064, United States,
| |
Collapse
|
6
|
Abstract
One core goal of genetics is to systematically understand the mapping between the DNA sequence of an organism (genotype) and its measurable characteristics (phenotype). Understanding this mapping is often challenging because of interactions between mutations, where the result of combining several different mutations can be very different than the sum of their individual effects. Here we provide a statistical framework for modeling complex genetic interactions of this type. The key idea is to ask how fast the effects of mutations change when introducing the same mutation in increasingly distant genetic backgrounds. We then propose a model for phenotypic prediction that takes into account this tendency for the effects of mutations to be more similar in nearby genetic backgrounds. Contemporary high-throughput mutagenesis experiments are providing an increasingly detailed view of the complex patterns of genetic interaction that occur between multiple mutations within a single protein or regulatory element. By simultaneously measuring the effects of thousands of combinations of mutations, these experiments have revealed that the genotype–phenotype relationship typically reflects not only genetic interactions between pairs of sites but also higher-order interactions among larger numbers of sites. However, modeling and understanding these higher-order interactions remains challenging. Here we present a method for reconstructing sequence-to-function mappings from partially observed data that can accommodate all orders of genetic interaction. The main idea is to make predictions for unobserved genotypes that match the type and extent of epistasis found in the observed data. This information on the type and extent of epistasis can be extracted by considering how phenotypic correlations change as a function of mutational distance, which is equivalent to estimating the fraction of phenotypic variance due to each order of genetic interaction (additive, pairwise, three-way, etc.). Using these estimated variance components, we then define an empirical Bayes prior that in expectation matches the observed pattern of epistasis and reconstruct the genotype–phenotype mapping by conducting Gaussian process regression under this prior. To demonstrate the power of this approach, we present an application to the antibody-binding domain GB1 and also provide a detailed exploration of a dataset consisting of high-throughput measurements for the splicing efficiency of human pre-mRNA 5′ splice sites, for which we also validate our model predictions via additional low-throughput experiments.
Collapse
|
7
|
Chen DS, Clark AG, Wolfner MF. Octopaminergic/tyraminergic Tdc2 neurons regulate biased sperm usage in female Drosophila melanogaster. Genetics 2022; 221:6613932. [PMID: 35736370 DOI: 10.1093/genetics/iyac097] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2021] [Accepted: 06/04/2022] [Indexed: 11/14/2022] Open
Abstract
In polyandrous internally fertilizing species, a multiply-mated female can use stored sperm from different males in a biased manner to fertilize her eggs. The female's ability to assess sperm quality and compatibility is essential for her reproductive success, and represents an important aspect of postcopulatory sexual selection. In Drosophila melanogaster, previous studies demonstrated that the female nervous system plays an active role in influencing progeny paternity proportion, and suggested a role for octopaminergic/tyraminergic Tdc2 neurons in this process. Here, we report that inhibiting Tdc2 neuronal activity causes females to produce a higher-than-normal proportion of first-male progeny. This difference is not due to differences in sperm storage or release, but instead is attributable to the suppression of second-male sperm usage bias that normally occurs in control females. We further show that a subset of Tdc2 neurons innervating the female reproductive tract is largely responsible for the progeny proportion phenotype that is observed when Tdc2 neurons are inhibited globally. On the contrary, overactivation of Tdc2 neurons does not further affect sperm storage and release or progeny proportion. These results suggest that octopaminergic/tyraminergic signaling allows a multiply-mated female to bias sperm usage, and identify a new role for the female nervous system in postcopulatory sexual selection.
Collapse
Affiliation(s)
- Dawn S Chen
- Department of Molecular Biology and Genetics, Cornell University, Ithaca NY 14853, USA
| | - Andrew G Clark
- Department of Molecular Biology and Genetics, Cornell University, Ithaca NY 14853, USA
| | - Mariana F Wolfner
- Department of Molecular Biology and Genetics, Cornell University, Ithaca NY 14853, USA
| |
Collapse
|
8
|
Schneider S, Kozuch J, Boxer SG. The Interplay of Electrostatics and Chemical Positioning in the Evolution of Antibiotic Resistance in TEM β-Lactamases. ACS CENTRAL SCIENCE 2021; 7:1996-2008. [PMID: 34963893 PMCID: PMC8704030 DOI: 10.1021/acscentsci.1c00880] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/21/2021] [Indexed: 05/25/2023]
Abstract
The interplay of enzyme active site electrostatics and chemical positioning is important for understanding the origin(s) of enzyme catalysis and the design of novel catalysts. We reconstruct the evolutionary trajectory of TEM-1 β-lactamase to TEM-52 toward extended-spectrum activity to better understand the emergence of antibiotic resistance and to provide insights into the structure-function paradigm and noncovalent interactions involved in catalysis. Utilizing a detailed kinetic analysis and the vibrational Stark effect, we quantify the changes in rates and electric fields in the Michaelis and acyl-enzyme complexes for penicillin G and cefotaxime to ascertain the evolutionary role of electric fields to modulate function. These data are combined with MD simulations to interpret and quantify the substrate-dependent structural changes during evolution. We observe that this evolutionary trajectory utilizes a large preorganized electric field and substrate-dependent chemical positioning to facilitate catalysis. This governs the evolvability, substrate promiscuity, and protein fitness landscape in TEM β-lactamase antibiotic resistance.
Collapse
Affiliation(s)
| | | | - Steven G. Boxer
- Chemistry Department, Stanford University, Stanford, California 94305, United States
| |
Collapse
|
9
|
Baquero F, Martínez JL, F. Lanza V, Rodríguez-Beltrán J, Galán JC, San Millán A, Cantón R, Coque TM. Evolutionary Pathways and Trajectories in Antibiotic Resistance. Clin Microbiol Rev 2021; 34:e0005019. [PMID: 34190572 PMCID: PMC8404696 DOI: 10.1128/cmr.00050-19] [Citation(s) in RCA: 95] [Impact Index Per Article: 23.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Evolution is the hallmark of life. Descriptions of the evolution of microorganisms have provided a wealth of information, but knowledge regarding "what happened" has precluded a deeper understanding of "how" evolution has proceeded, as in the case of antimicrobial resistance. The difficulty in answering the "how" question lies in the multihierarchical dimensions of evolutionary processes, nested in complex networks, encompassing all units of selection, from genes to communities and ecosystems. At the simplest ontological level (as resistance genes), evolution proceeds by random (mutation and drift) and directional (natural selection) processes; however, sequential pathways of adaptive variation can occasionally be observed, and under fixed circumstances (particular fitness landscapes), evolution is predictable. At the highest level (such as that of plasmids, clones, species, microbiotas), the systems' degrees of freedom increase dramatically, related to the variable dispersal, fragmentation, relatedness, or coalescence of bacterial populations, depending on heterogeneous and changing niches and selective gradients in complex environments. Evolutionary trajectories of antibiotic resistance find their way in these changing landscapes subjected to random variations, becoming highly entropic and therefore unpredictable. However, experimental, phylogenetic, and ecogenetic analyses reveal preferential frequented paths (highways) where antibiotic resistance flows and propagates, allowing some understanding of evolutionary dynamics, modeling and designing interventions. Studies on antibiotic resistance have an applied aspect in improving individual health, One Health, and Global Health, as well as an academic value for understanding evolution. Most importantly, they have a heuristic significance as a model to reduce the negative influence of anthropogenic effects on the environment.
Collapse
Affiliation(s)
- F. Baquero
- Department of Microbiology, Ramón y Cajal University Hospital, Ramón y Cajal Institute for Health Research (IRYCIS), Network Center for Research in Epidemiology and Public Health (CIBERESP), Madrid, Spain
| | - J. L. Martínez
- National Center for Biotechnology (CNB-CSIC), Madrid, Spain
| | - V. F. Lanza
- Department of Microbiology, Ramón y Cajal University Hospital, Ramón y Cajal Institute for Health Research (IRYCIS), Network Center for Research in Epidemiology and Public Health (CIBERESP), Madrid, Spain
- Central Bioinformatics Unit, Ramón y Cajal Institute for Health Research (IRYCIS), Madrid, Spain
| | - J. Rodríguez-Beltrán
- Department of Microbiology, Ramón y Cajal University Hospital, Ramón y Cajal Institute for Health Research (IRYCIS), Network Center for Research in Epidemiology and Public Health (CIBERESP), Madrid, Spain
| | - J. C. Galán
- Department of Microbiology, Ramón y Cajal University Hospital, Ramón y Cajal Institute for Health Research (IRYCIS), Network Center for Research in Epidemiology and Public Health (CIBERESP), Madrid, Spain
| | - A. San Millán
- National Center for Biotechnology (CNB-CSIC), Madrid, Spain
| | - R. Cantón
- Department of Microbiology, Ramón y Cajal University Hospital, Ramón y Cajal Institute for Health Research (IRYCIS), Network Center for Research in Epidemiology and Public Health (CIBERESP), Madrid, Spain
| | - T. M. Coque
- Department of Microbiology, Ramón y Cajal University Hospital, Ramón y Cajal Institute for Health Research (IRYCIS), Network Center for Research in Epidemiology and Public Health (CIBERESP), Madrid, Spain
| |
Collapse
|
10
|
Routh S, Acharyya A, Dhar R. A two-step PCR assembly for construction of gene variants across large mutational distances. Biol Methods Protoc 2021; 6:bpab007. [PMID: 33928191 PMCID: PMC8062255 DOI: 10.1093/biomethods/bpab007] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2021] [Revised: 03/09/2021] [Accepted: 04/01/2021] [Indexed: 11/14/2022] Open
Abstract
Construction of empirical fitness landscapes has transformed our understanding of genotype-phenotype relationships across genes. However, most empirical fitness landscapes have been constrained to the local genotype neighbourhood of a gene primarily due to our limited ability to systematically construct genotypes that differ by a large number of mutations. Although a few methods have been proposed in the literature, these techniques are complex owing to several steps of construction or contain a large number of amplification cycles that increase chances of non-specific mutations. A few other described methods require amplification of the whole vector, thereby increasing the chances of vector backbone mutations that can have unintended consequences for study of fitness landscapes. Thus, this has substantially constrained us from traversing large mutational distances in the genotype network, thereby limiting our understanding of the interactions between multiple mutations and the role these interactions play in evolution of novel phenotypes. In the current work, we present a simple but powerful approach that allows us to systematically and accurately construct gene variants at large mutational distances. Our approach relies on building-up small fragments containing targeted mutations in the first step followed by assembly of these fragments into the complete gene fragment by polymerase chain reaction (PCR). We demonstrate the utility of our approach by constructing variants that differ by up to 11 mutations in a model gene. Our work thus provides an accurate method for construction of multi-mutant variants of genes and therefore will transform the studies of empirical fitness landscapes by enabling exploration of genotypes that are far away from a starting genotype.
Collapse
Affiliation(s)
- Shreya Routh
- Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur 721302, West Bengal, India
| | - Anamika Acharyya
- Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur 721302, West Bengal, India
| | - Riddhiman Dhar
- Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur 721302, West Bengal, India
| |
Collapse
|
11
|
Nedrud D, Coyote-Maestas W, Schmidt D. A large-scale survey of pairwise epistasis reveals a mechanism for evolutionary expansion and specialization of PDZ domains. Proteins 2021; 89:899-914. [PMID: 33620761 DOI: 10.1002/prot.26067] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2020] [Revised: 02/02/2021] [Accepted: 02/18/2021] [Indexed: 12/21/2022]
Abstract
Deep mutational scanning (DMS) facilitates data-driven models of protein structure and function. Here, we adapted Saturated Programmable Insertion Engineering (SPINE) as a programmable DMS technique. We validate SPINE with a reference single mutant dataset in the PSD95 PDZ3 domain and then characterize most pairwise double mutants to study epistasis. We observe wide-spread proximal negative epistasis, which we attribute to mutations affecting thermodynamic stability, and strong long-range positive epistasis, which is enriched in an evolutionarily conserved and function-defining network of "sector" and clade-specifying residues. Conditional neutrality of mutations in clade-specifying residues compensates for deleterious mutations in sector positions. This suggests that epistatic interactions between these position pairs facilitated the evolutionary expansion and specialization of PDZ domains. We propose that SPINE provides easy experimental access to reveal epistasis signatures in proteins that will improve our understanding of the structural basis for protein function and adaptation.
Collapse
Affiliation(s)
- David Nedrud
- Department of Biochemistry, Molecular Biology & Biophysics, University of Minnesota, Minneapolis, Minnesota, USA
| | - Willow Coyote-Maestas
- Department of Biochemistry, Molecular Biology & Biophysics, University of Minnesota, Minneapolis, Minnesota, USA
| | - Daniel Schmidt
- Department of Genetics, Cell Biology & Development, University of Minnesota, Minneapolis, Minnesota, USA
| |
Collapse
|
12
|
Strokach A, Lu TY, Kim PM. ELASPIC2 (EL2): Combining Contextualized Language Models and Graph Neural Networks to Predict Effects of Mutations. J Mol Biol 2021; 433:166810. [PMID: 33450251 DOI: 10.1016/j.jmb.2021.166810] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2020] [Revised: 12/19/2020] [Accepted: 01/03/2021] [Indexed: 12/21/2022]
Abstract
The ELASPIC web server allows users to evaluate the effect of mutations on protein folding and protein-protein interaction on a proteome-wide scale. It uses homology models of proteins and protein-protein interactions, which have been precalculated for several proteomes, and machine learning models, which integrate structural information with sequence conservation scores, in order to make its predictions. Since the original publication of the ELASPIC web server, several advances have motivated a revisiting of the problem of mutation effect prediction. First, progress in neural network architectures and self-supervised pre-trained has resulted in models which provide more informative embeddings of protein sequence and structure than those used by the original version of ELASPIC. Second, the amount of training data has increased several-fold, largely driven by advances in deep mutation scanning and other multiplexed assays of variant effect. Here, we describe two machine learning models which leverage the recent advances in order to achieve superior accuracy in predicting the effect of mutation on protein folding and protein-protein interaction. The models incorporate features generated using pre-trained transformer- and graph convolution-based neural networks, and are trained to optimize a ranking objective function, which permits the use of heterogeneous training data. The outputs from the new models have been incorporated into the ELASPIC web server, available at http://elaspic.kimlab.org.
Collapse
Affiliation(s)
- Alexey Strokach
- Department of Computer Science, University of Toronto, Toronto, ON M5S 3E1, Canada
| | - Tian Yu Lu
- Department of Computer Science, University of Toronto, Toronto, ON M5S 3E1, Canada
| | - Philip M Kim
- Department of Computer Science, University of Toronto, Toronto, ON M5S 3E1, Canada; Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, ON M5S 3E1, Canada; Department of Molecular Genetics, University of Toronto, Toronto, ON M5S 3E1, Canada.
| |
Collapse
|
13
|
Bons E, Leemann C, Metzner KJ, Regoes RR. Long-term experimental evolution of HIV-1 reveals effects of environment and mutational history. PLoS Biol 2020; 18:e3001010. [PMID: 33370289 PMCID: PMC7793244 DOI: 10.1371/journal.pbio.3001010] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2020] [Revised: 01/08/2021] [Accepted: 11/30/2020] [Indexed: 11/21/2022] Open
Abstract
An often-returning question for not only HIV-1, but also other organisms, is how predictable evolutionary paths are. The environment, mutational history, and random processes can all impact the exact evolutionary paths, but to which extent these factors contribute to the evolutionary dynamics of a particular system is an open question. Especially in a virus like HIV-1, with a large mutation rate and large population sizes, evolution is expected to be highly predictable if the impact of environment and history is low, and evolution is not neutral. We investigated the effect of environment and mutational history by analyzing sequences from a long-term evolution experiment, in which HIV-1 was passaged on 2 different cell types in 8 independent evolutionary lines and 8 derived lines, 4 of which involved a switch of the environment. The experiments lasted for 240–300 passages, corresponding to approximately 400–600 generations or almost 3 years. The sequences show signs of extensive parallel evolution—the majority of mutations that are shared between independent lines appear in both cell types, but we also find that both environment and mutational history significantly impact the evolutionary paths. We conclude that HIV-1 evolution is robust to small changes in the environment, similar to a transmission event in the absence of an immune response or drug pressure. We also find that the fitness landscape of HIV-1 is largely smooth, although we find some evidence for both positive and negative epistatic interactions between mutations. Analysis of the longest evolutionary experiment with HIV-1 to-date reveals continuous viral adaptation over several years. The authors quantify the environment-specific mutations that arise and determine the fraction of mutations that co-occur with significantly different frequencies than expected by chance.
Collapse
Affiliation(s)
- Eva Bons
- Department of Environmental Systems Sciences, Institute of Integrative Biology, ETH Zurich, Zurich, Switzerland
| | - Christine Leemann
- Division of Infectious Diseases and Hospital Epidemiology, University Hospital Zurich, Zurich, Switzerland
- Institute of Medical Virology, University of Zurich, Zurich, Switzerland
| | - Karin J. Metzner
- Division of Infectious Diseases and Hospital Epidemiology, University Hospital Zurich, Zurich, Switzerland
- Institute of Medical Virology, University of Zurich, Zurich, Switzerland
- * E-mail: (KJM); (RRR)
| | - Roland R. Regoes
- Department of Environmental Systems Sciences, Institute of Integrative Biology, ETH Zurich, Zurich, Switzerland
- * E-mail: (KJM); (RRR)
| |
Collapse
|
14
|
Kurahashi R, Tanaka SI, Takano K. Highly active enzymes produced by directed evolution with stability-based selection. Enzyme Microb Technol 2020; 140:109626. [DOI: 10.1016/j.enzmictec.2020.109626] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2020] [Revised: 06/12/2020] [Accepted: 06/12/2020] [Indexed: 12/22/2022]
|
15
|
Thompson S, Zhang Y, Ingle C, Reynolds KA, Kortemme T. Altered expression of a quality control protease in E. coli reshapes the in vivo mutational landscape of a model enzyme. eLife 2020; 9:53476. [PMID: 32701056 PMCID: PMC7377907 DOI: 10.7554/elife.53476] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2019] [Accepted: 07/09/2020] [Indexed: 12/03/2022] Open
Abstract
Protein mutational landscapes are shaped by the cellular environment, but key factors and their quantitative effects are often unknown. Here we show that Lon, a quality control protease naturally absent in common E. coli expression strains, drastically reshapes the mutational landscape of the metabolic enzyme dihydrofolate reductase (DHFR). Selection under conditions that resolve highly active mutants reveals that 23.3% of all single point mutations in DHFR are advantageous in the absence of Lon, but advantageous mutations are largely suppressed when Lon is reintroduced. Protein stability measurements demonstrate extensive activity-stability tradeoffs for the advantageous mutants and provide a mechanistic explanation for Lon’s widespread impact. Our findings suggest possibilities for tuning mutational landscapes by modulating the cellular environment, with implications for protein design and combatting antibiotic resistance.
Collapse
Affiliation(s)
- Samuel Thompson
- Graduate Group in Biophysics, University of California San Francisco, San Francisco, United States
| | - Yang Zhang
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, United States
| | - Christine Ingle
- The Green Center for Systems Biology, University of Texas Southwestern Medical Center, Dallas, United States
| | - Kimberly A Reynolds
- The Green Center for Systems Biology, University of Texas Southwestern Medical Center, Dallas, United States.,Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, United States
| | - Tanja Kortemme
- Graduate Group in Biophysics, University of California San Francisco, San Francisco, United States.,Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, United States.,Chan Zuckerberg Biohub, San Francisco, United States
| |
Collapse
|
16
|
Chen JZ, Fowler DM, Tokuriki N. Comprehensive exploration of the translocation, stability and substrate recognition requirements in VIM-2 lactamase. eLife 2020; 9:e56707. [PMID: 32510322 PMCID: PMC7308095 DOI: 10.7554/elife.56707] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2020] [Accepted: 06/06/2020] [Indexed: 12/12/2022] Open
Abstract
Metallo-β-lactamases (MBLs) degrade a broad spectrum of β-lactam antibiotics, and are a major disseminating source for multidrug resistant bacteria. Despite many biochemical studies in diverse MBLs, molecular understanding of the roles of residues in the enzyme's stability and function, and especially substrate specificity, is lacking. Here, we employ deep mutational scanning (DMS) to generate comprehensive single amino acid variant data on a major clinical MBL, VIM-2, by measuring the effect of thousands of VIM-2 mutants on the degradation of three representative classes of β-lactams (ampicillin, cefotaxime, and meropenem) and at two different temperatures (25°C and 37°C). We revealed residues responsible for expression and translocation, and mutations that increase resistance and/or alter substrate specificity. The distribution of specificity-altering mutations unveiled distinct molecular recognition of the three substrates. Moreover, these function-altering mutations are frequently observed among naturally occurring variants, suggesting that the enzymes have continuously evolved to become more potent resistance genes.
Collapse
Affiliation(s)
- John Z Chen
- Michael Smith Laboratories, University of British ColumbiaVancouverCanada
| | - Douglas M Fowler
- Department of Genome Sciences, University of WashingtonSeattleUnited States
- Department of Bioengineering, University of WashingtonSeattleUnited States
| | - Nobuhiko Tokuriki
- Michael Smith Laboratories, University of British ColumbiaVancouverCanada
| |
Collapse
|
17
|
Zhou J, McCandlish DM. Minimum epistasis interpolation for sequence-function relationships. Nat Commun 2020; 11:1782. [PMID: 32286265 PMCID: PMC7156698 DOI: 10.1038/s41467-020-15512-5] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2019] [Accepted: 03/12/2020] [Indexed: 12/17/2022] Open
Abstract
Massively parallel phenotyping assays have provided unprecedented insight into how multiple mutations combine to determine biological function. While such assays can measure phenotypes for thousands to millions of genotypes in a single experiment, in practice these measurements are not exhaustive, so that there is a need for techniques to impute values for genotypes whose phenotypes have not been directly assayed. Here, we present an imputation method based on inferring the least epistatic possible sequence-function relationship compatible with the data. In particular, we infer the reconstruction where mutational effects change as little as possible across adjacent genetic backgrounds. The resulting models can capture complex higher-order genetic interactions near the data, but approach additivity where data is sparse or absent. We apply the method to high-throughput transcription factor binding assays and use it to explore a fitness landscape for protein G.
Collapse
Affiliation(s)
- Juannan Zhou
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA
| | - David M McCandlish
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA.
| |
Collapse
|
18
|
Esposito D, Weile J, Shendure J, Starita LM, Papenfuss AT, Roth FP, Fowler DM, Rubin AF. MaveDB: an open-source platform to distribute and interpret data from multiplexed assays of variant effect. Genome Biol 2019; 20:223. [PMID: 31679514 PMCID: PMC6827219 DOI: 10.1186/s13059-019-1845-6] [Citation(s) in RCA: 146] [Impact Index Per Article: 24.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2019] [Accepted: 10/01/2019] [Indexed: 11/10/2022] Open
Abstract
Multiplex assays of variant effect (MAVEs), such as deep mutational scans and massively parallel reporter assays, test thousands of sequence variants in a single experiment. Despite the importance of MAVE data for basic and clinical research, there is no standard resource for their discovery and distribution. Here, we present MaveDB ( https://www.mavedb.org ), a public repository for large-scale measurements of sequence variant impact, designed for interoperability with applications to interpret these datasets. We also describe the first such application, MaveVis, which retrieves, visualizes, and contextualizes variant effect maps. Together, the database and applications will empower the community to mine these powerful datasets.
Collapse
Affiliation(s)
- Daniel Esposito
- Bioinformatics Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, VIC, Australia
| | - Jochen Weile
- The Donnelly Centre, University of Toronto, Toronto, ON, Canada
- Lunenfeld-Tanenbaum Research Institute, Sinai Health System, Toronto, ON, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
| | - Jay Shendure
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
- Brotman Baty Institute for Precision Medicine, Seattle, WA, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
| | - Lea M Starita
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
- Brotman Baty Institute for Precision Medicine, Seattle, WA, USA
| | - Anthony T Papenfuss
- Bioinformatics Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, VIC, Australia
- Department of Medical Biology, University of Melbourne, Melbourne, VIC, Australia
- Bioinformatics and Cancer Genomics Laboratory, Peter MacCallum Cancer Centre, Melbourne, VIC, Australia
- Sir Peter MacCallum Department of Oncology, University of Melbourne, Melbourne, VIC, Australia
- Department of Mathematics and Statistics, University of Melbourne, Melbourne, VIC, Australia
| | - Frederick P Roth
- The Donnelly Centre, University of Toronto, Toronto, ON, Canada.
- Lunenfeld-Tanenbaum Research Institute, Sinai Health System, Toronto, ON, Canada.
- Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada.
- Department of Computer Science, University of Toronto, Toronto, ON, Canada.
- Canadian Institute for Advanced Research, Toronto, ON, Canada.
| | - Douglas M Fowler
- Department of Genome Sciences, University of Washington, Seattle, WA, USA.
- Canadian Institute for Advanced Research, Toronto, ON, Canada.
- Department of Bioengineering, University of Washington, Seattle, WA, USA.
| | - Alan F Rubin
- Bioinformatics Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, VIC, Australia.
- Department of Medical Biology, University of Melbourne, Melbourne, VIC, Australia.
- Bioinformatics and Cancer Genomics Laboratory, Peter MacCallum Cancer Centre, Melbourne, VIC, Australia.
| |
Collapse
|
19
|
Gonzalez CE, Roberts P, Ostermeier M. Fitness Effects of Single Amino Acid Insertions and Deletions in TEM-1 β-Lactamase. J Mol Biol 2019; 431:2320-2330. [PMID: 31034887 DOI: 10.1016/j.jmb.2019.04.030] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/01/2019] [Revised: 04/17/2019] [Accepted: 04/18/2019] [Indexed: 11/16/2022]
Abstract
Short insertions and deletions (InDels) are a common type of mutation found in nature and a useful source of variation in protein engineering. InDel events have important consequences in protein evolution, often opening new pathways for adaptation. However, much less is known about the effects of InDels compared to point mutations and amino acid substitutions. In particular, deep mutagenesis studies on the distribution of fitness effects of mutations have focused almost exclusively on amino acid substitutions. Here, we present a near-comprehensive analysis of the fitness effects of single amino acid InDels in TEM-1 β-lactamase. While we found InDels to be largely deleterious, partially overlapping deletion-tolerant and insertion-tolerant regions were observed throughout the protein, especially in unstructured regions and at the end of helices. The signal sequence of TEM-1 tolerated InDels more than the mature protein. Most regions of the protein tolerated insertions more than deletions, but a few regions tolerated deletions more than insertions. We examined the relationship between InDel tolerance and a variety of measures to help understand its origin. These measures included evolutionary variation in β-lactamases, secondary structure identity, tolerance to amino acid substitutions, solvent accessibility, and side-chain weighted contact number. We found secondary structure, weighted contact number, and evolutionary variation in class A beta-lactamases to be the somewhat predictive of InDel fitness effects.
Collapse
Affiliation(s)
- Courtney E Gonzalez
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, 3400 N. Charles St., Baltimore, MD 21218, USA
| | - Paul Roberts
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, 3400 N. Charles St., Baltimore, MD 21218, USA
| | - Marc Ostermeier
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, 3400 N. Charles St., Baltimore, MD 21218, USA.
| |
Collapse
|
20
|
Noda-García L, Davidi D, Korenblum E, Elazar A, Putintseva E, Aharoni A, Tawfik DS. Chance and pleiotropy dominate genetic diversity in complex bacterial environments. Nat Microbiol 2019; 4:1221-1230. [PMID: 30936490 DOI: 10.1038/s41564-019-0412-y] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2018] [Accepted: 02/14/2019] [Indexed: 12/18/2022]
Abstract
How does environmental complexity affect the evolution of single genes? Here, we measured the effects of a set of Bacillus subtilis glutamate dehydrogenase mutants across 19 different environments-from phenotypically homogeneous single-cell populations in liquid media to heterogeneous biofilms, plant roots and soil populations. The effects of individual gene mutations on organismal fitness were highly reproducible in liquid cultures. However, 84% of the tested alleles showed opposing fitness effects under different growth conditions (sign environmental pleiotropy). In colony biofilms and soil samples, different alleles dominated in parallel replica experiments. Accordingly, we found that in these heterogeneous cell populations the fate of mutations was dictated by a combination of selection and drift. The latter relates to programmed prophage excisions that occurred during biofilm development. Overall, for each condition, a wide range of glutamate dehydrogenase mutations persisted and sometimes fixated as a result of the combined action of selection, pleiotropy and chance. However, over longer periods and in multiple environments, nearly all of this diversity would be lost-across all the environments and conditions that we tested, the wild type was the fittest allele.
Collapse
Affiliation(s)
- Lianet Noda-García
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel
| | - Dan Davidi
- Department of Plant and Environmental Sciences, Weizmann Institute of Science, Rehovot, Israel
| | - Elisa Korenblum
- Department of Plant and Environmental Sciences, Weizmann Institute of Science, Rehovot, Israel
| | - Assaf Elazar
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel
| | | | - Asaph Aharoni
- Department of Plant and Environmental Sciences, Weizmann Institute of Science, Rehovot, Israel
| | - Dan S Tawfik
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel.
| |
Collapse
|
21
|
Gonzalez CE, Ostermeier M. Pervasive Pairwise Intragenic Epistasis among Sequential Mutations in TEM-1 β-Lactamase. J Mol Biol 2019; 431:1981-1992. [PMID: 30922874 DOI: 10.1016/j.jmb.2019.03.020] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2018] [Revised: 02/25/2019] [Accepted: 03/13/2019] [Indexed: 12/25/2022]
Abstract
Interactions between mutations play a central role in shaping the fitness landscape, but a clear picture of intragenic epistasis has yet to emerge. To further reveal the prevalence and patterns of intragenic epistasis, we present a survey of epistatic interactions between sequential mutations in TEM-1 β-lactamase. We measured the fitness effect of ~12,000 pairs of consecutive amino acid substitutions and used our previous study of the fitness effects of single amino acid substitutions to calculate epistasis for over 8000 mutation pairs. Since sequential mutations are prone to physically interact, we postulated that our study would be surveying specific epistasis instead of nonspecific epistasis. We found widespread negative epistasis, especially in beta-strands, and a high frequency of negative sign epistasis among individually beneficial mutations. Negative epistasis (52%) occurred 7.6 times as frequently as positive epistasis (6.8%). Buried residues experienced more negative epistasis that surface-exposed residues. However, TEM-1 exhibited a couple of hotspots for positive epistasis, most notably L221/ R222 at which many combinations of mutations positively interacted. This study is the first to systematically examine pairwise epistasis throughout an entire protein performing its native function in its native host.
Collapse
Affiliation(s)
- Courtney E Gonzalez
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, 3400 N. Charles St., Baltimore, MD 21218, USA
| | - Marc Ostermeier
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, 3400 N. Charles St., Baltimore, MD 21218, USA.
| |
Collapse
|
22
|
Hilton SK, Bloom JD. Modeling site-specific amino-acid preferences deepens phylogenetic estimates of viral sequence divergence. Virus Evol 2018; 4:vey033. [PMID: 30425841 PMCID: PMC6220371 DOI: 10.1093/ve/vey033] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
Molecular phylogenetics is often used to estimate the time since the divergence of modern gene sequences. For highly diverged sequences, such phylogenetic techniques sometimes estimate surprisingly recent divergence times. In the case of viruses, independent evidence indicates that the estimates of deep divergence times from molecular phylogenetics are sometimes too recent. This discrepancy is caused in part by inadequate models of purifying selection leading to branch-length underestimation. Here we examine the effect on branch-length estimation of using models that incorporate experimental measurements of purifying selection. We find that models informed by experimentally measured site-specific amino-acid preferences estimate longer deep branches on phylogenies of influenza virus hemagglutinin. This lengthening of branches is due to more realistic stationary states of the models, and is mostly independent of the branch-length extension from modeling site-to-site variation in amino-acid substitution rate. The branch-length extension from experimentally informed site-specific models is similar to that achieved by other approaches that allow the stationary state to vary across sites. However, the improvements from all of these site-specific but time homogeneous and site independent models are limited by the fact that a protein’s amino-acid preferences gradually shift as it evolves. Overall, our work underscores the importance of modeling site-specific amino-acid preferences when estimating deep divergence times—but also shows the inherent limitations of approaches that fail to account for how these preferences shift over time.
Collapse
Affiliation(s)
- Sarah K Hilton
- Basic Sciences and Computational Biology Program, Fred Hutchinson Cancer Research Center.,Department of Genome Sciences, University of Washington, USA
| | - Jesse D Bloom
- Basic Sciences and Computational Biology Program, Fred Hutchinson Cancer Research Center.,Department of Genome Sciences, University of Washington, USA.,Howard Hughes Medical Institute, Seattle, WA, USA
| |
Collapse
|
23
|
Cortina GA, Kasson PM. Predicting allostery and microbial drug resistance with molecular simulations. Curr Opin Struct Biol 2018; 52:80-86. [PMID: 30243041 PMCID: PMC6296865 DOI: 10.1016/j.sbi.2018.09.001] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2018] [Revised: 08/31/2018] [Accepted: 09/07/2018] [Indexed: 11/30/2022]
Abstract
Beta-lactamase enzymes mediate the most common forms of gram-negative antibiotic resistance affecting clinical treatment. They also constitute an excellent model system for the difficult problem of understanding how allosteric mutations can augment catalytic activity of already-competent enzymes. Multiple allosteric mutations have been identified that alter catalytic activity or drug-resistance spectrum in class A beta lactamases, but predicting these in advance continues to be challenging. Here, we review computational techniques based on structure and/or molecular simulation to predict such mutations. Structure-based techniques have been particularly helpful in developing graph algorithms for analyzing critical residues in beta-lactamase function, while classical molecular simulation has recently shown the ability to prospectively predict allosteric mutations increasing beta-lactamase activity and drug resistance. These will ultimately achieve the greatest power when combined with simulation methods that model reactive chemistry to calculate activation free energies directly.
Collapse
Affiliation(s)
- George A Cortina
- Departments of Molecular Physiology and of Biomedical Engineering, University of Virginia, Charlottesville, VA 22908, United States
| | - Peter M Kasson
- Departments of Molecular Physiology and of Biomedical Engineering, University of Virginia, Charlottesville, VA 22908, United States; Science for Life Laboratory, Department of Cell and Molecular Biology, Uppsala University, Uppsala 75146, Sweden.
| |
Collapse
|
24
|
Multiplexed assays of variant effects contribute to a growing genotype-phenotype atlas. Hum Genet 2018; 137:665-678. [PMID: 30073413 PMCID: PMC6153521 DOI: 10.1007/s00439-018-1916-x] [Citation(s) in RCA: 84] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2018] [Accepted: 07/21/2018] [Indexed: 12/12/2022]
Abstract
Given the constantly improving cost and speed of genome sequencing, it is reasonable to expect that personal genomes will soon be known for many millions of humans. This stands in stark contrast with our limited ability to interpret the sequence variants which we find. Although it is, perhaps, easiest to interpret variants in coding regions, knowledge of functional impact is unknown for the vast majority of missense variants. While many computational approaches can predict the impact of coding variants, they are given a little weight in the current guidelines for interpreting clinical variants. Laboratory assays produce comparatively more trustworthy results, but until recently did not scale to the space of all possible mutations. The development of deep mutational scanning and other multiplexed assays of variant effect has now brought feasibility of this endeavour within view. Here, we review progress in this field over the last decade, break down the different approaches into their components, and compare methodological differences.
Collapse
|
25
|
The direction of protein evolution is destined by the stability. Biochimie 2018; 150:100-109. [DOI: 10.1016/j.biochi.2018.05.006] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2018] [Accepted: 05/15/2018] [Indexed: 01/29/2023]
|
26
|
Protein Evolution is Potentially Governed by Protein Stability: Directed Evolution of an Esterase from the Hyperthermophilic Archaeon Sulfolobus tokodaii. J Mol Evol 2018; 86:283-292. [DOI: 10.1007/s00239-018-9843-y] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2018] [Accepted: 04/18/2018] [Indexed: 11/27/2022]
|
27
|
Gupta K, Varadarajan R. Insights into protein structure, stability and function from saturation mutagenesis. Curr Opin Struct Biol 2018; 50:117-125. [PMID: 29505936 DOI: 10.1016/j.sbi.2018.02.006] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2017] [Revised: 02/09/2018] [Accepted: 02/17/2018] [Indexed: 12/20/2022]
Abstract
Where convenient phenotypic readouts are available, saturation mutagenesis coupled to deep sequencing provides a rapid and facile method to infer sequence determinants of protein structure, stability and function. We provide brief descriptions and currently available options for the various steps involved, and mention limitations of current implementations. We also highlight recent applications such as estimating relative stabilities and affinities of protein variants, mapping epitopes, protein model discrimination and prediction of mutant phenotypes. Most mutational scans have so far been applied to single genes and proteins. Additional methodological improvements are required to expand the scope to study intergenic epistasis and intermolecular interactions in macromolecular complexes.
Collapse
Affiliation(s)
- Kritika Gupta
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore 560 012, India
| | - Raghavan Varadarajan
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore 560 012, India; Jawaharlal Nehru Center for Advanced Scientific Research, Jakkur P.O., Bangalore 560 004, India.
| |
Collapse
|
28
|
Evolutionary mechanisms studied through protein fitness landscapes. Curr Opin Struct Biol 2018; 48:141-148. [DOI: 10.1016/j.sbi.2018.01.001] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2017] [Revised: 12/26/2017] [Accepted: 01/01/2018] [Indexed: 12/15/2022]
|
29
|
Tyzack JD, Furnham N, Sillitoe I, Orengo CM, Thornton JM. Understanding enzyme function evolution from a computational perspective. Curr Opin Struct Biol 2017; 47:131-139. [PMID: 28892668 DOI: 10.1016/j.sbi.2017.08.003] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2017] [Revised: 07/08/2017] [Accepted: 08/13/2017] [Indexed: 10/18/2022]
Abstract
In this review, we will explore recent computational approaches to understand enzyme evolution from the perspective of protein structure, dynamics and promiscuity. We will present quantitative methods to measure the size of evolutionary steps within a structural domain, allowing the correlation between change in substrate and domain structure to be assessed, and giving insights into the evolvability of different domains in terms of the number, types and sizes of evolutionary steps observed. These approaches will help to understand the evolution of new catalytic and non-catalytic functionality in response to environmental demands, showing potential to guide de novoenzyme design and directed evolution experiments.
Collapse
Affiliation(s)
| | - Nicholas Furnham
- London School of Hygiene and Tropical Medicine, Keppel Street, London, WC1E 7HT, United Kingdom
| | - Ian Sillitoe
- Institute of Structural and Molecular Biology, University College London, Gower Street, London, WC1E 6BT, United Kingdom
| | - Christine M Orengo
- Institute of Structural and Molecular Biology, University College London, Gower Street, London, WC1E 6BT, United Kingdom
| | | |
Collapse
|
30
|
Zheng X, Xing XH, Zhang C. Targeted mutagenesis: A sniper-like diversity generator in microbial engineering. Synth Syst Biotechnol 2017; 2:75-86. [PMID: 29062964 PMCID: PMC5636951 DOI: 10.1016/j.synbio.2017.07.001] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2017] [Revised: 06/30/2017] [Accepted: 07/03/2017] [Indexed: 12/26/2022] Open
Abstract
Mutations, serving as the raw materials of evolution, have been extensively utilized to increase the chances of engineering molecules or microbes with tailor-made functions. Global and targeted mutagenesis are two main methods of obtaining various mutations, distinguished by the range of action they can cover. While the former one stresses the mining of novel genetic loci within the whole genomic background, targeted mutagenesis performs in a more straightforward manner, bringing evolutionary escape and error catastrophe under control. In this review, we classify the existing techniques of targeted mutagenesis into two categories in terms of whether the diversity is generated in vitro or in vivo, and briefly introduce the mechanisms and applications of them separately. The inherent connections and development trends of the two classes are also discussed to provide an insight into the next generation evolution research.
Collapse
Key Words
- 3′-LTR, 3’-long terminal repeat
- 5-FOA, 5-fluoro-orotic acid
- CRISPR/Cas9, clustered regularly interspaced short palindromic repeats and associated protein 9
- DNA Pol III, DNA polymerase III
- DNA PolI, DNA polymerase I
- DSB, double strand break
- Evolution
- FLASH, fast ligation-based automatable solid-phase high-throughput
- HDR, homology-directed repair
- HIV, human immunodeficiency virus
- ICE, in vivo continuous evolution
- LIC, ligation-independent cloning
- MAGE, multiplex automated genome engineering
- MMEJ, microhomology-mediated end-joining
- Mutations
- NHEJ, error-prone non-homologous end-joining
- ORF, open reading frame
- PAM, protospacer-adjacent motif
- RVD, repeat variable di-residue
- Synthetic biology
- TALE, transcription activator-like effector
- TALEN, transcription activator-like effector nuclease
- TP, terminal protein
- TP-DNAP, TP-DNA polymerase fusion
- TaGTEAM, targeting glycosylase to embedded arrays for mutagenesis
- Targeted mutagenesis
- YOGE, yeast oligo-mediated genome engineering
- ZF, zinc-finger protein
- ZFN, zinc-finger nuclease
- dCas9, catalytically dead Cas9
- dNTP, deoxy-ribonucleoside triphosphate
- dsDNA, double-stranded DNA
- error-prone PCR, error-prone polymerase chain reaction
- non-GMO, non-genetically modified organism
- pre-crRNA, pre-CRISPR RNA
- sctetR, single chain tetR
- sgRNA, single-guide RNA
- ssDNA, single-stranded DNA
- tracrRNA, trans-encoded RNA
Collapse
Affiliation(s)
| | | | - Chong Zhang
- Key Laboratory for Industrial Biocatalysis, Ministry of Education, Institute of Biochemical Engineering, Department of Chemical Engineering, Center for Synthetic & Systems Biology, Tsinghua University, Beijing 100084, China
| |
Collapse
|
31
|
Smith CIE. Enigmas in tumor resistance to kinase inhibitors and calculation of the drug resistance index for cancer (DRIC). Semin Cancer Biol 2016; 45:36-49. [PMID: 27865897 DOI: 10.1016/j.semcancer.2016.11.008] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2016] [Accepted: 11/09/2016] [Indexed: 12/11/2022]
Abstract
Darwinian selection is also applicable when antibiotics, the immune system or other host factors shape the repertoire of microorganisms, and similarly, clonal selection is the hallmark of tumor evolution. The ongoing revolution in new anti-cancer treatment modalities, combined with an unprecedented precision in characterizing malignant clones at the level below one percent, profoundly improves the understanding of repertoire-tuning mechanisms. There is no fundamental difference between selection of the tumor cells in the presence, or absence, of therapy. However, under treatment the influence of a single agent can be measured, simplifying the analysis. Because of their beneficial and selective therapeutic effect, the focus in this review is set on protein kinase inhibitors (PKIs), predominantly tyrosine kinase inhibitors (TKIs). This is one of the most rapidly growing families of novel cancer medicines. In order to limit the number of drugs, the following representative target kinases are included: ALK, BCR-ABL, BRAF, BTK, and EGFR. A key therapeutic challenge is how to reduce tumor growth after treatment, since this is rate-limiting for the generation and expansion of more malignant escape mutants. Thus, upon efficient treatment, tumor cell loss often enables a profoundly increased growth rate among resistant cells. Strategies to reduce this risk, such as concomitant, competitive outgrowth of non-transformed cells, are described. Seven parameters: 1. Drug type, 2. tumor type, 3. presence of metastases or phenotypic change, 4. tumor cell number, 5. net growth rate (proliferation minus cell death), 6. inherited genetic- and 7. epigenetic- variations are crucial for drug responses. It is envisaged that it might become possible to calculate a clinically relevant Drug Resistance Index for Cancer (DRIC) for each patient.
Collapse
Affiliation(s)
- C I Edvard Smith
- Clinical Research Center, Dept. of Laboratory Medicine, Karolinska Institutet, Karolinska University Hospital Huddinge, SE-14186, Huddinge, Sweden.
| |
Collapse
|
32
|
Bershtein S, Serohijos AW, Shakhnovich EI. Bridging the physical scales in evolutionary biology: from protein sequence space to fitness of organisms and populations. Curr Opin Struct Biol 2016; 42:31-40. [PMID: 27810574 DOI: 10.1016/j.sbi.2016.10.013] [Citation(s) in RCA: 46] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2016] [Accepted: 10/14/2016] [Indexed: 01/11/2023]
Abstract
Bridging the gap between the molecular properties of proteins and organismal/population fitness is essential for understanding evolutionary processes. This task requires the integration of the several physical scales of biological organization, each defined by a distinct set of mechanisms and constraints, into a single unifying model. The molecular scale is dominated by the constraints imposed by the physico-chemical properties of proteins and their substrates, which give rise to trade-offs and epistatic (non-additive) effects of mutations. At the systems scale, biological networks modulate protein expression and can either buffer or enhance the fitness effects of mutations. The population scale is influenced by the mutational input, selection regimes, and stochastic changes affecting the size and structure of populations, which eventually determine the evolutionary fate of mutations. Here, we summarize the recent advances in theory, computer simulations, and experiments that advance our understanding of the links between various physical scales in biology.
Collapse
Affiliation(s)
- Shimon Bershtein
- Department of Life Sciences, Ben-Gurion University of the Negev, Beer-Sheva 84501, Israel
| | - Adrian Wr Serohijos
- Département de Biochimie, Centre Robert-Cedergren en Bioinformatique & Génomique, Université de Montréal, Montréal, QC H3T 1J4, Canada
| | - Eugene I Shakhnovich
- Department of Chemistry and Chemical Biology, Harvard University, 12 Oxford Street, Cambridge, MA 02138, United States.
| |
Collapse
|