1
|
Pinzón-Reyes EH, Sierra-Bueno DA, Suarez-Barrera MO, Rueda-Forero NJ, Abaunza-Villamizar S, Rondón-Villareal P. Generation of Cry11 Variants of Bacillus thuringiensis by Heuristic Computational Modeling. Evol Bioinform Online 2020; 16:1176934320924681. [PMID: 32782424 PMCID: PMC7385851 DOI: 10.1177/1176934320924681] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2019] [Accepted: 04/07/2020] [Indexed: 12/19/2022] Open
Abstract
Directed evolution methods mimic in vitro Darwinian evolution, inducing random mutations and selective pressure in genes to obtain proteins with enhanced characteristics. These techniques are developed using trial-and-error testing at an experimental level with a high degree of uncertainty. Therefore, in silico modeling of directed evolution is required to support experimental assays. Several in silico approaches have reproduced directed evolution, using statistical, thermodynamic, and kinetic models in an attempt to recreate experimental conditions. Likewise, optimization techniques using heuristic models have been used to understand and find the best scenarios of directed evolution. Our study uses an in silico model named HeurIstics DirecteD EvolutioN, which is based on a genetic algorithm designed to generate chimeric libraries from 2 parental genes, cry11Aa and cry11Ba, of Bacillus thuringiensis. These genes encode crystal-shaped δ-endotoxins with 3 conserved domains. Cry11 toxins are of biotechnological interest because they have shown to be effective as biopesticides for disease-spreading vectors. With our heuristic model, we considered experimental parameters such as DNA fragmentation length, number of generations or simulation cycles, and mutation rate, to get characteristics of Cry11 chimeric libraries such as percentage of population identity, truncation of variants obtained from the presence of internal stop codons, percentage of thermodynamic diversity, and stability of variants. Our study allowed us to focus on experimental conditions that may be useful for the design of in vitro and in silico experiments of directed evolution with Cry toxins of 3 conserved domains. Furthermore, we obtained in silico libraries of Cry11 variants, in which structural characteristics of wild Cry families were observed in a review of a sample of in silico sequences. We consider that future studies could use our in silico libraries and heuristic computational models, as the one suggested here, to support in vitro experiments of directed evolution.
Collapse
Affiliation(s)
- Efraín Hernando Pinzón-Reyes
- Universidad de Santander, Faculty of Health Sciences, Laboratory of Molecular Biology and Biotechnology, Bucaramanga, Colombia.,Centro de Bioinformática Simulación y Modelado (CBSM), School of Bioinformatic, Universidad de Talca, Talca, Chile
| | | | - Miguel Orlando Suarez-Barrera
- Universidad de Santander, Faculty of Health Sciences, Laboratory of Molecular Biology and Biotechnology, Bucaramanga, Colombia
| | - Nohora Juliana Rueda-Forero
- Universidad de Santander, Faculty of Health Sciences, Laboratory of Molecular Biology and Biotechnology, Bucaramanga, Colombia
| | - Sebastián Abaunza-Villamizar
- Universidad de Santander, Faculty of Health Sciences, Laboratory of Molecular Biology and Biotechnology, Bucaramanga, Colombia
| | - Paola Rondón-Villareal
- Universidad de Santander, Faculty of Health Sciences, Laboratory of Molecular Biology and Biotechnology, Bucaramanga, Colombia
| |
Collapse
|
2
|
Cabanes-Creus M, Ginn SL, Amaya AK, Liao SHY, Westhaus A, Hallwirth CV, Wilmott P, Ward J, Dilworth KL, Santilli G, Rybicki A, Nakai H, Thrasher AJ, Filip AC, Alexander IE, Lisowski L. Codon-Optimization of Wild-Type Adeno-Associated Virus Capsid Sequences Enhances DNA Family Shuffling while Conserving Functionality. MOLECULAR THERAPY-METHODS & CLINICAL DEVELOPMENT 2018; 12:71-84. [PMID: 30534580 PMCID: PMC6279885 DOI: 10.1016/j.omtm.2018.10.016] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/31/2018] [Accepted: 10/29/2018] [Indexed: 12/22/2022]
Abstract
Adeno-associated virus (AAV) vectors have become one of the most widely used gene transfer tools in human gene therapy. Considerable effort is currently being focused on AAV capsid engineering strategies with the aim of developing novel variants with enhanced tropism for specific human cell types, decreased human seroreactivity, and increased manufacturability. Selection strategies based on directed evolution rely on the generation of highly variable AAV capsid libraries using methods such as DNA-family shuffling, a technique reliant on stretches of high DNA sequence identity between input parental capsid sequences. This identity dependence for reassembly of shuffled capsids is inherently limiting and results in decreased shuffling efficiency as the phylogenetic distance between parental AAV capsids increases. To overcome this limitation, we have developed a novel codon-optimization algorithm that exploits evolutionarily defined codon usage at each amino acid residue in the parental sequences. This method increases average sequence identity between capsids, while enhancing the probability of retaining capsid functionality, and facilitates incorporation of phylogenetically distant serotypes into the DNA-shuffled libraries. This technology will help accelerate the discovery of an increasingly powerful repertoire of AAV capsid variants for cell-type and disease-specific applications.
Collapse
Affiliation(s)
- Marti Cabanes-Creus
- Translational Vectorology Group, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Sydney, NSW 2006, Australia.,Great Ormond Street Institute of Child Health, University College London, London, UK
| | - Samantha L Ginn
- Gene Therapy Research Unit, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney and Sydney Children's Hospitals Network, Sydney, NSW 2006, Australia
| | - Anais K Amaya
- Gene Therapy Research Unit, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney and Sydney Children's Hospitals Network, Sydney, NSW 2006, Australia
| | - Sophia H Y Liao
- Translational Vectorology Group, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Sydney, NSW 2006, Australia.,Gene Therapy Research Unit, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney and Sydney Children's Hospitals Network, Sydney, NSW 2006, Australia
| | - Adrian Westhaus
- Translational Vectorology Group, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Sydney, NSW 2006, Australia
| | - Claus V Hallwirth
- Gene Therapy Research Unit, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney and Sydney Children's Hospitals Network, Sydney, NSW 2006, Australia
| | - Patrick Wilmott
- Translational Vectorology Group, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Sydney, NSW 2006, Australia
| | - Jason Ward
- Translational Vectorology Group, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Sydney, NSW 2006, Australia
| | - Kimberley L Dilworth
- Vector and Genome Engineering Facility, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Sydney, NSW 2006, Australia
| | - Giorgia Santilli
- Great Ormond Street Institute of Child Health, University College London, London, UK
| | - Arkadiusz Rybicki
- Vector and Genome Engineering Facility, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Sydney, NSW 2006, Australia
| | - Hiroyuki Nakai
- Oregon Health & Science University, Portland, OR 97239, USA
| | - Adrian J Thrasher
- Great Ormond Street Institute of Child Health, University College London, London, UK
| | - Adrian C Filip
- Translational Vectorology Group, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Sydney, NSW 2006, Australia
| | - Ian E Alexander
- Gene Therapy Research Unit, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney and Sydney Children's Hospitals Network, Sydney, NSW 2006, Australia.,Discipline of Child and Adolescent Health, Sydney Medical School, Faculty of Medicine and Health, The University of Sydney, Sydney, NSW 2145, Australia
| | - Leszek Lisowski
- Translational Vectorology Group, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Sydney, NSW 2006, Australia.,Vector and Genome Engineering Facility, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Sydney, NSW 2006, Australia.,Military Institute of Hygiene and Epidemiology, The Biological Threats Identification and Countermeasure Centre, 24-100 Puławy, Poland
| |
Collapse
|
3
|
Jimenez-Rosales A, Flores-Merino MV. Tailoring Proteins to Re-Evolve Nature: A Short Review. Mol Biotechnol 2018; 60:946-974. [DOI: 10.1007/s12033-018-0122-3] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
|
4
|
Milligan JN, Garry DJ. Shuffle Optimizer: A Program to Optimize DNA Shuffling for Protein Engineering. Methods Mol Biol 2018; 1472:35-45. [PMID: 27671930 DOI: 10.1007/978-1-4939-6343-0_3] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
DNA shuffling is a powerful tool to develop libraries of variants for protein engineering. Here, we present a protocol to use our freely available and easy-to-use computer program, Shuffle Optimizer. Shuffle Optimizer is written in the Python computer language and increases the nucleotide homology between two pieces of DNA desired to be shuffled together without changing the amino acid sequence. In addition we also include sections on optimal primer design for DNA shuffling and library construction, a small-volume ultrasonicator method to create sheared DNA, and finally a method to reassemble the sheared fragments and recover and clone the library. The Shuffle Optimizer program and these protocols will be useful to anyone desiring to perform any of the nucleotide homology-dependent shuffling methods.
Collapse
Affiliation(s)
- John N Milligan
- The Department of Molecular Biosciences, The University of Texas at Austin, 2506 Speedway STOP A5000, Austin, TX, 78712, USA.
| | - Daniel J Garry
- The Department of Molecular Biosciences, The University of Texas at Austin, 2506 Speedway STOP A5000, Austin, TX, 78712, USA.
| |
Collapse
|
5
|
Swainston N, Currin A, Day PJ, Kell DB. GeneGenie: optimized oligomer design for directed evolution. Nucleic Acids Res 2014; 42:W395-400. [PMID: 24782527 PMCID: PMC4086129 DOI: 10.1093/nar/gku336] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
GeneGenie, a new online tool available at http://www.gene-genie.org, is introduced to support the design and self-assembly of synthetic genes and constructs. GeneGenie allows for the design of oligonucleotide cohorts encoding the gene sequence optimized for expression in any suitable host through an intuitive, easy-to-use web interface. The tool ensures consistent oligomer overlapping melting temperatures, minimizes the likelihood of misannealing, optimizes codon usage for expression in a selected host, allows for specification of forward and reverse cloning sequences (for downstream ligation) and also provides support for mutagenesis or directed evolution studies. Directed evolution studies are enabled through the construction of variant libraries via the optional specification of ‘variant codons’, containing mixtures of bases, at any position. For example, specifying the variant codon TNT (where N is any nucleotide) will generate an equimolar mixture of the codons TAT, TCT, TGT and TTT at that position, encoding a mixture of the amino acids Tyr, Ser, Cys and Phe. This facility is demonstrated through the use of GeneGenie to develop and synthesize a library of enhanced green fluorescent protein variants.
Collapse
Affiliation(s)
- Neil Swainston
- Manchester Institute of Biotechnology, The University of Manchester, Manchester M1 7DN, UK School of Computer Science, The University of Manchester, Manchester M13 9PL, UK
| | - Andrew Currin
- Manchester Institute of Biotechnology, The University of Manchester, Manchester M1 7DN, UK School of Chemistry, The University of Manchester, Manchester M13 9PL, UK
| | - Philip J Day
- Manchester Institute of Biotechnology, The University of Manchester, Manchester M1 7DN, UK Faculty of Medical and Human Sciences, The University of Manchester, Manchester M13 9PT, UK
| | - Douglas B Kell
- Manchester Institute of Biotechnology, The University of Manchester, Manchester M1 7DN, UK School of Chemistry, The University of Manchester, Manchester M13 9PL, UK
| |
Collapse
|
6
|
Abstract
BACKGROUND DNA shuffling generates combinatorial libraries of chimeric genes by stochastically recombining parent genes. The resulting libraries are subjected to large-scale genetic selection or screening to identify those chimeras with favorable properties (e.g., enhanced stability or enzymatic activity). While DNA shuffling has been applied quite successfully, it is limited by its homology-dependent, stochastic nature. Consequently, it is used only with parents of sufficient overall sequence identity, and provides no control over the resulting chimeric library. RESULTS This paper presents efficient methods to extend the scope of DNA shuffling to handle significantly more diverse parents and to generate more predictable, optimized libraries. Our CODNS (cross-over optimization for DNA shuffling) approach employs polynomial-time dynamic programming algorithms to select codons for the parental amino acids, allowing for zero or a fixed number of conservative substitutions. We first present efficient algorithms to optimize the local sequence identity or the nearest-neighbor approximation of the change in free energy upon annealing, objectives that were previously optimized by computationally-expensive integer programming methods. We then present efficient algorithms for more powerful objectives that seek to localize and enhance the frequency of recombination by producing "runs" of common nucleotides either overall or according to the sequence diversity of the resulting chimeras. We demonstrate the effectiveness of CODNS in choosing codons and allocating substitutions to promote recombination between parents targeted in earlier studies: two GAR transformylases (41% amino acid sequence identity), two very distantly related DNA polymerases, Pol X and β (15%), and beta-lactamases of varying identity (26-47%). CONCLUSIONS Our methods provide the protein engineer with a new approach to DNA shuffling that supports substantially more diverse parents, is more deterministic, and generates more predictable and more diverse chimeric libraries.
Collapse
Affiliation(s)
- Lu He
- Dept of Computer Science, Dartmouth College, 6211 Sudikoff Laboratory, Hanover, NH 03755, USA
| | - Alan M Friedman
- Dept of Biological Sciences, Markey Center for Structural Biology, Purdue Cancer Center, and Bindley Bioscience Center, Purdue University, West Lafayette, IN 47907, USA
| | - Chris Bailey-Kellogg
- Dept of Computer Science, Dartmouth College, 6211 Sudikoff Laboratory, Hanover, NH 03755, USA
| |
Collapse
|
7
|
Eisenbeis S, Höcker B. Evolutionary mechanism as a template for protein engineering. J Pept Sci 2010; 16:538-44. [DOI: 10.1002/psc.1233] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
8
|
Larsen LSZ, Wassman CD, Hatfield GW, Lathrop RH. Computationally Optimised DNA Assembly of synthetic genes. ACTA ACUST UNITED AC 2008; 4:324-36. [PMID: 18640907 DOI: 10.1504/ijbra.2008.019578] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Gene synthesis is hampered by two obstacles: improper assembly of oligonucleotides; oligonucleotide defects incurred during chemical synthesis. To overcome the first problem, we describe the employment of a Computationally Optimised DNA Assembly (CODA) algorithm that uses the degeneracy of the genetic code to design overlapping oligonucleotides with thermodynamic properties for self-assembly into a single, linear, DNA product. To address the second problem, we describe a hierarchical assembly strategy that reduces the incorporation of defective oligonucleotides into full-length gene constructs. The CODA algorithm and these biological methods enable fast, simple and reliable assemblies of sequence-correct full-length genes.
Collapse
|
9
|
Wong TS, Roccatano D, Schwaneberg U. Steering directed protein evolution: strategies to manage combinatorial complexity of mutant libraries. Environ Microbiol 2007; 9:2645-59. [DOI: 10.1111/j.1462-2920.2007.01411.x] [Citation(s) in RCA: 53] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
10
|
Tan T, Bogarad LD, Deem MW. Modulation of base-specific mutation and recombination rates enables functional adaptation within the context of the genetic code. J Mol Evol 2005; 59:385-99. [PMID: 15553092 DOI: 10.1007/s00239-004-2633-8] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
Abstract
The persistence of life requires populations to adapt at a rate commensurate with the dynamics of their environment. Successful populations that inhabit highly variable environments have evolved mechanisms to increase the likelihood of successful adaptation. We introduce a 64 x 64 matrix to quantify base-specific mutation potential, analyzing four different replicative systems, error-prone PCR, mouse antibodies, a nematode, and Drosophila. Mutational tendencies are correlated with the structural evolution of proteins. In systems under strong selective pressure, mutational biases are shown to favor the adaptive search of space, either by base mutation or by recombination. Such adaptability is discussed within the context of the genetic code at the levels of replication and codon usage.
Collapse
Affiliation(s)
- Taison Tan
- Department of Bioengineering and Department of Physics & Astronomy, Rice University, Houston, TX 77005-1892, USA
| | | | | |
Collapse
|
11
|
Neylon C. Chemical and biochemical strategies for the randomization of protein encoding DNA sequences: library construction methods for directed evolution. Nucleic Acids Res 2004; 32:1448-59. [PMID: 14990750 PMCID: PMC390300 DOI: 10.1093/nar/gkh315] [Citation(s) in RCA: 171] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2004] [Revised: 02/06/2004] [Accepted: 02/06/2004] [Indexed: 11/14/2022] Open
Abstract
Directed molecular evolution and combinatorial methodologies are playing an increasingly important role in the field of protein engineering. The general approach of generating a library of partially randomized genes, expressing the gene library to generate the proteins the library encodes and then screening the proteins for improved or modified characteristics has successfully been applied in the areas of protein-ligand binding, improving protein stability and modifying enzyme selectivity. A wide range of techniques are now available for generating gene libraries with different characteristics. This review will discuss these different methodologies, their accessibility and applicability to non-expert laboratories and the characteristics of the libraries they produce. The aim is to provide an up to date resource to allow groups interested in using directed evolution to identify the most appropriate methods for their purposes and to guide those moving on from initial experiments to more ambitious targets in the selection of library construction techniques. References are provided to original methodology papers and other recent examples from the primary literature that provide details of experimental methods.
Collapse
Affiliation(s)
- Cameron Neylon
- School of Chemistry, University of Southampton, Highfield SO17 1BJ, UK.
| |
Collapse
|
12
|
Moore GL, Maranas CD. Computational challenges in combinatorial library design for protein engineering. AIChE J 2004. [DOI: 10.1002/aic.10025] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
13
|
Abstract
Thanks to biotechnology, proteins are becoming increasingly important tools to fight disease, both as therapeutics in their own right and as catalysts for the synthesis of small molecule drugs. However, the properties of these proteins are not necessarily optimal for their intended tasks. In vitro evolution is a set of technologies useful to address their shortcomings. Moreover, in vitro evolution can help illuminate natural evolutionary pathways, thus potentially enabling prediction of drug resistance evolution. We consider here recent developments in the area of in vitro evolution, as well as its application to proteins of interest to medical science.
Collapse
Affiliation(s)
- Simon Delagrave
- Center for Molecular Biotechnology, Fraunhofer USA, 9 Innovation Way, Suite 200, Newark, DE 19711, USA.
| | | |
Collapse
|