1
|
Jiang R, Yuan S, Zhou Y, Wei Y, Li F, Wang M, Chen B, Yu H. Strategies to overcome the challenges of low or no expression of heterologous proteins in Escherichia coli. Biotechnol Adv 2024; 75:108417. [PMID: 39038691 DOI: 10.1016/j.biotechadv.2024.108417] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2024] [Revised: 07/18/2024] [Accepted: 07/19/2024] [Indexed: 07/24/2024]
Abstract
Protein expression is a critical process in diverse biological systems. For Escherichia coli, a widely employed microbial host in industrial catalysis and healthcare, researchers often face significant challenges in constructing recombinant expression systems. To maximize the potential of E. coli expression systems, it is essential to address problems regarding the low or absent production of certain target proteins. This article presents viable solutions to the main factors posing challenges to heterologous protein expression in E. coli, which includes protein toxicity, the intrinsic influence of gene sequences, and mRNA structure. These strategies include specialized approaches for managing toxic protein expression, addressing issues related to mRNA structure and codon bias, advanced codon optimization methodologies that consider multiple factors, and emerging optimization techniques facilitated by big data and machine learning.
Collapse
Affiliation(s)
- Ruizhao Jiang
- Department of Chemical Engineering, Tsinghua University, Beijing 100084, China; Key Laboratory of Industrial Biocatalysis (Tsinghua University), the Ministry of Education, Beijing 100084, China
| | - Shuting Yuan
- Department of Chemical Engineering, Tsinghua University, Beijing 100084, China; Key Laboratory of Industrial Biocatalysis (Tsinghua University), the Ministry of Education, Beijing 100084, China
| | - Yilong Zhou
- Tanwei College, Tsinghua University, Beijing 100084, China
| | - Yuwen Wei
- Department of Chemical Engineering, Tsinghua University, Beijing 100084, China; Key Laboratory of Industrial Biocatalysis (Tsinghua University), the Ministry of Education, Beijing 100084, China
| | - Fulong Li
- Beijing Evolyzer Co.,Ltd., 100176, China
| | | | - Bo Chen
- Beijing Evolyzer Co.,Ltd., 100176, China
| | - Huimin Yu
- Department of Chemical Engineering, Tsinghua University, Beijing 100084, China; Key Laboratory of Industrial Biocatalysis (Tsinghua University), the Ministry of Education, Beijing 100084, China; Center for Synthetic and Systems Biology, Tsinghua University, Beijing 100084, China.
| |
Collapse
|
2
|
Luebbert L, Sullivan DK, Carilli M, Hjörleifsson KE, Winnett AV, Chari T, Pachter L. Efficient and accurate detection of viral sequences at single-cell resolution reveals putative novel viruses perturbing host gene expression. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.12.11.571168. [PMID: 38168363 PMCID: PMC10760059 DOI: 10.1101/2023.12.11.571168] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/05/2024]
Abstract
There are an estimated 300,000 mammalian viruses from which infectious diseases in humans may arise. They inhabit human tissues such as the lungs, blood, and brain and often remain undetected. Efficient and accurate detection of viral infection is vital to understanding its impact on human health and to make accurate predictions to limit adverse effects, such as future epidemics. The increasing use of high-throughput sequencing methods in research, agriculture, and healthcare provides an opportunity for the cost-effective surveillance of viral diversity and investigation of virus-disease correlation. However, existing methods for identifying viruses in sequencing data rely on and are limited to reference genomes or cannot retain single-cell resolution through cell barcode tracking. We introduce a method that accurately and rapidly detects viral sequences in bulk and single-cell transcriptomics data based on highly conserved amino acid domains, which enables the detection of RNA viruses covering up to 1012 virus species. The analysis of viral presence and host gene expression in parallel at single-cell resolution allows for the characterization of host viromes and the identification of viral tropism and host responses. We applied our method to identify putative novel viruses in rhesus macaque PBMC data that display cell type specificity and whose presence correlates with altered host gene expression.
Collapse
Affiliation(s)
- Laura Luebbert
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California
| | - Delaney K. Sullivan
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California
- UCLA-Caltech Medical Scientist Training Program, David Geffen School of Medicine, University of California, Los Angeles, California
| | - Maria Carilli
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California
| | | | - Alexander Viloria Winnett
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California
- UCLA-Caltech Medical Scientist Training Program, David Geffen School of Medicine, University of California, Los Angeles, California
| | - Tara Chari
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California
| | - Lior Pachter
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California
- Department of Computing and Mathematical Sciences, California Institute of Technology, Pasadena, California
| |
Collapse
|
3
|
Aubel M, Buchel F, Heames B, Jones A, Honc O, Bornberg-Bauer E, Hlouchova K. High-throughput Selection of Human de novo-emerged sORFs with High Folding Potential. Genome Biol Evol 2024; 16:evae069. [PMID: 38597156 PMCID: PMC11024478 DOI: 10.1093/gbe/evae069] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2024] [Revised: 03/11/2024] [Accepted: 03/23/2024] [Indexed: 04/11/2024] Open
Abstract
De novo genes emerge from previously noncoding stretches of the genome. Their encoded de novo proteins are generally expected to be similar to random sequences and, accordingly, with no stable tertiary fold and high predicted disorder. However, structural properties of de novo proteins and whether they differ during the stages of emergence and fixation have not been studied in depth and rely heavily on predictions. Here we generated a library of short human putative de novo proteins of varying lengths and ages and sorted the candidates according to their structural compactness and disorder propensity. Using Förster resonance energy transfer combined with Fluorescence-activated cell sorting, we were able to screen the library for most compact protein structures, as well as most elongated and flexible structures. We find that compact de novo proteins are on average slightly shorter and contain lower predicted disorder than less compact ones. The predicted structures for most and least compact de novo proteins correspond to expectations in that they contain more secondary structure content or higher disorder content, respectively. Our experiments indicate that older de novo proteins have higher compactness and structural propensity compared with young ones. We discuss possible evolutionary scenarios and their implications underlying the age-dependencies of compactness and structural content of putative de novo proteins.
Collapse
Affiliation(s)
- Margaux Aubel
- Institute for Evolution and Biodiversity, University of Muenster, Muenster, Germany
| | - Filip Buchel
- Department of Cell Biology, Faculty of Science, Charles University, Prague, Czech Republic
- Department of Biochemistry, Faculty of Science, Charles University, Prague, Czech Republic
| | - Brennen Heames
- Institute for Evolution and Biodiversity, University of Muenster, Muenster, Germany
| | - Alun Jones
- Institute for Evolution and Biodiversity, University of Muenster, Muenster, Germany
| | - Ondrej Honc
- Imaging Methods Core Facility, BIOCEV, Prague, Czech Republic
| | - Erich Bornberg-Bauer
- Institute for Evolution and Biodiversity, University of Muenster, Muenster, Germany
- Department of Protein Evolution, Max Planck-Institute for Biology Tuebingen, Tuebingen, Germany
| | - Klara Hlouchova
- Department of Cell Biology, Faculty of Science, Charles University, Prague, Czech Republic
- Institute of Organic Chemistry and Biochemistry, Czech Academy of Sciences, Prague, Czech Republic
| |
Collapse
|
4
|
Lund S, Potapov V, Johnson SR, Buss J, Tanner NA. Highly Parallelized Construction of DNA from Low-Cost Oligonucleotide Mixtures Using Data-Optimized Assembly Design and Golden Gate. ACS Synth Biol 2024; 13:745-751. [PMID: 38377591 PMCID: PMC10949349 DOI: 10.1021/acssynbio.3c00694] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2023] [Revised: 02/12/2024] [Accepted: 02/13/2024] [Indexed: 02/22/2024]
Abstract
Commercially synthesized genes are typically made using variations of homology-based cloning techniques, including polymerase cycling assembly from chemically synthesized microarray-derived oligonucleotides. Here, we apply Data-optimized Assembly Design (DAD) to the synthesis of hundreds of codon-optimized genes in both constitutive and inducible vectors using Golden Gate Assembly. Starting from oligonucleotide pools, we synthesize genes in three simple steps: (1) amplification of parts belonging to individual assemblies in parallel from a single pool; (2) Golden Gate Assembly of parts for each construct; and (3) transformation. We construct genes from receiving DNA to sequence confirmed isolates in as little as 4 days. By leveraging the ligation fidelity afforded by T4 DNA ligase, we expect to be able to construct a larger breadth of sequences not currently supported by homology-based methods, which require stability of extensive single-stranded DNA overhangs.
Collapse
Affiliation(s)
- Sean Lund
- Research
Department, New England Biolabs, Ipswich, Massachusetts 01938, United States
| | - Vladimir Potapov
- Research
Department, New England Biolabs, Ipswich, Massachusetts 01938, United States
| | - Sean R. Johnson
- Research
Department, New England Biolabs, Ipswich, Massachusetts 01938, United States
| | - Jackson Buss
- Research
Department, New England Biolabs, Ipswich, Massachusetts 01938, United States
| | - Nathan A. Tanner
- Research
Department, New England Biolabs, Ipswich, Massachusetts 01938, United States
| |
Collapse
|
5
|
Arbel-Groissman M, Menuhin-Gruman I, Yehezkeli H, Naki D, Bergman S, Udi Y, Tuller T. The Causes for Genomic Instability and How to Try and Reduce Them Through Rational Design of Synthetic DNA. Methods Mol Biol 2024; 2760:371-392. [PMID: 38468099 DOI: 10.1007/978-1-0716-3658-9_21] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/13/2024]
Abstract
Genetic engineering has revolutionized our ability to manipulate DNA and engineer organisms for various applications. However, this approach can lead to genomic instability, which can result in unwanted effects such as toxicity, mutagenesis, and reduced productivity. To overcome these challenges, smart design of synthetic DNA has emerged as a promising solution. By taking into consideration the intricate relationships between gene expression and cellular metabolism, researchers can design synthetic constructs that minimize metabolic stress on the host cell, reduce mutagenesis, and increase protein yield. In this chapter, we summarize the main challenges of genomic instability in genetic engineering and address the dangers of unknowingly incorporating genomically unstable sequences in synthetic DNA. We also demonstrate the instability of those sequences by the fact that they are selected against conserved sequences in nature. We highlight the benefits of using ESO, a tool for the rational design of DNA for avoiding genetically unstable sequences, and also summarize the main principles and working parameters of the software that allow maximizing its benefits and impact.
Collapse
Affiliation(s)
- Matan Arbel-Groissman
- Shmunis School of Biomedicine and Cancer Research, The George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Itamar Menuhin-Gruman
- School of Mathematical Sciences, The Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Hader Yehezkeli
- Department of Biomedical Engineering, Tel Aviv University, Tel Aviv, Israel
| | - Doron Naki
- Shmunis School of Biomedicine and Cancer Research, The George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Shaked Bergman
- Department of Biomedical Engineering, Tel Aviv University, Tel Aviv, Israel
| | - Yarin Udi
- Department of Biomedical Engineering, Tel Aviv University, Tel Aviv, Israel
| | - Tamir Tuller
- Department of Biomedical Engineering, Tel Aviv University, Tel Aviv, Israel.
- The Sagol School of Neuroscience, Tel-Aviv University, Tel Aviv, Israel.
| |
Collapse
|
6
|
Schmidt M, Lee N, Zhan C, Roberts JB, Nava AA, Keiser LS, Vilchez AA, Chen Y, Petzold CJ, Haushalter RW, Blank LM, Keasling JD. Maximizing Heterologous Expression of Engineered Type I Polyketide Synthases: Investigating Codon Optimization Strategies. ACS Synth Biol 2023; 12:3366-3380. [PMID: 37851920 PMCID: PMC10661030 DOI: 10.1021/acssynbio.3c00367] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Indexed: 10/20/2023]
Abstract
Type I polyketide synthases (T1PKSs) hold enormous potential as a rational production platform for the biosynthesis of specialty chemicals. However, despite great progress in this field, the heterologous expression of PKSs remains a major challenge. One of the first measures to improve heterologous gene expression can be codon optimization. Although controversial, choosing the wrong codon optimization strategy can have detrimental effects on the protein and product levels. In this study, we analyzed 11 different codon variants of an engineered T1PKS and investigated in a systematic approach their influence on heterologous expression in Corynebacterium glutamicum, Escherichia coli, and Pseudomonas putida. Our best performing codon variants exhibited a minimum 50-fold increase in PKS protein levels, which also enabled the production of an unnatural polyketide in each of these hosts. Furthermore, we developed a free online tool (https://basebuddy.lbl.gov) that offers transparent and highly customizable codon optimization with up-to-date codon usage tables. In this work, we not only highlight the significance of codon optimization but also establish the groundwork for the high-throughput assembly and characterization of PKS pathways in alternative hosts.
Collapse
Affiliation(s)
- Matthias Schmidt
- Joint
BioEnergy Institute, 5885 Hollis Street, Emeryville, California 94608, United States
- Biological
Systems & Engineering Division, Lawrence
Berkeley National Laboratory, Berkeley, California 94720, United States
- Institute
of Applied Microbiology (iAMB), Aachen Biology and Biotechnology (ABBt), RWTH Aachen University, 52062 Aachen, Germany
- California
Institute for Quantitative Biosciences (QB3), University of California, Berkeley, California 94720, United States
| | - Namil Lee
- Joint
BioEnergy Institute, 5885 Hollis Street, Emeryville, California 94608, United States
- Biological
Systems & Engineering Division, Lawrence
Berkeley National Laboratory, Berkeley, California 94720, United States
- California
Institute for Quantitative Biosciences (QB3), University of California, Berkeley, California 94720, United States
| | - Chunjun Zhan
- Joint
BioEnergy Institute, 5885 Hollis Street, Emeryville, California 94608, United States
- Biological
Systems & Engineering Division, Lawrence
Berkeley National Laboratory, Berkeley, California 94720, United States
| | - Jacob B. Roberts
- Joint
BioEnergy Institute, 5885 Hollis Street, Emeryville, California 94608, United States
- Biological
Systems & Engineering Division, Lawrence
Berkeley National Laboratory, Berkeley, California 94720, United States
- Joint
Program in Bioengineering, University of
California, Berkeley, California 94720, United States
| | - Alberto A. Nava
- Joint
BioEnergy Institute, 5885 Hollis Street, Emeryville, California 94608, United States
- Biological
Systems & Engineering Division, Lawrence
Berkeley National Laboratory, Berkeley, California 94720, United States
- Department
of Chemical and Biomolecular Engineering, University of California, Berkeley, California 94720, United States
| | - Leah S. Keiser
- Joint
BioEnergy Institute, 5885 Hollis Street, Emeryville, California 94608, United States
- Biological
Systems & Engineering Division, Lawrence
Berkeley National Laboratory, Berkeley, California 94720, United States
- Department
of Chemical and Biomolecular Engineering, University of California, Berkeley, California 94720, United States
| | - Aaron A. Vilchez
- Joint
BioEnergy Institute, 5885 Hollis Street, Emeryville, California 94608, United States
- Biological
Systems & Engineering Division, Lawrence
Berkeley National Laboratory, Berkeley, California 94720, United States
- Department
of Chemical and Biomolecular Engineering, University of California, Berkeley, California 94720, United States
| | - Yan Chen
- Joint
BioEnergy Institute, 5885 Hollis Street, Emeryville, California 94608, United States
- Biological
Systems & Engineering Division, Lawrence
Berkeley National Laboratory, Berkeley, California 94720, United States
| | - Christopher J. Petzold
- Joint
BioEnergy Institute, 5885 Hollis Street, Emeryville, California 94608, United States
- Biological
Systems & Engineering Division, Lawrence
Berkeley National Laboratory, Berkeley, California 94720, United States
| | - Robert W. Haushalter
- Joint
BioEnergy Institute, 5885 Hollis Street, Emeryville, California 94608, United States
- Biological
Systems & Engineering Division, Lawrence
Berkeley National Laboratory, Berkeley, California 94720, United States
| | - Lars M. Blank
- Institute
of Applied Microbiology (iAMB), Aachen Biology and Biotechnology (ABBt), RWTH Aachen University, 52062 Aachen, Germany
| | - Jay D. Keasling
- Joint
BioEnergy Institute, 5885 Hollis Street, Emeryville, California 94608, United States
- Biological
Systems & Engineering Division, Lawrence
Berkeley National Laboratory, Berkeley, California 94720, United States
- Joint
Program in Bioengineering, University of
California, Berkeley, California 94720, United States
- Department
of Chemistry, University of California, Berkeley, California 94720, United States
- Environmental
Genomics and Systems Biology Division, Lawrence
Berkeley National Laboratory, Berkeley, California 94720, United States
- Department
of Chemical and Biomolecular Engineering, University of California, Berkeley, California 94720, United States
- The
Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, 2800 Kgs. Lyngby, Denmark
- Center
for Synthetic Biochemistry, Institute for
Synthetic Biology, Shenzhen Institutes for Advanced Technologies, Shenzhen 518071, China
| |
Collapse
|
7
|
Mukund AX, Tycko J, Allen SJ, Robinson SA, Andrews C, Sinha J, Ludwig CH, Spees K, Bassik MC, Bintu L. High-throughput functional characterization of combinations of transcriptional activators and repressors. Cell Syst 2023; 14:746-763.e5. [PMID: 37543039 PMCID: PMC10642976 DOI: 10.1016/j.cels.2023.07.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2022] [Revised: 06/26/2023] [Accepted: 07/06/2023] [Indexed: 08/07/2023]
Abstract
Despite growing knowledge of the functions of individual human transcriptional effector domains, much less is understood about how multiple effector domains within the same protein combine to regulate gene expression. Here, we measure transcriptional activity for 8,400 effector domain combinations by recruiting them to reporter genes in human cells. In our assay, weak and moderate activation domains synergize to drive strong gene expression, whereas combining strong activators often results in weaker activation. In contrast, repressors combine linearly and produce full gene silencing, and repressor domains often overpower activation domains. We use this information to build a synthetic transcription factor whose function can be tuned between repression and activation independent of recruitment to target genes by using a small-molecule drug. Altogether, we outline the basic principles of how effector domains combine to regulate gene expression and demonstrate their value in building precise and flexible synthetic biology tools. A record of this paper's transparent peer review process is included in the supplemental information.
Collapse
Affiliation(s)
- Adi X Mukund
- Biophysics Program, Stanford University, Stanford, CA 94305, USA
| | - Josh Tycko
- Department of Genetics, Stanford University, Stanford, CA 94305, USA
| | - Sage J Allen
- Department of Bioengineering, Stanford University, Stanford, CA 94305, USA
| | | | - Cecelia Andrews
- Department of Developmental Biology, Stanford University, Stanford, CA 94305, USA
| | - Joydeb Sinha
- Department of Chemical and Systems Biology, Stanford University, Stanford, CA 94305, USA
| | - Connor H Ludwig
- Department of Bioengineering, Stanford University, Stanford, CA 94305, USA
| | - Kaitlyn Spees
- Department of Genetics, Stanford University, Stanford, CA 94305, USA
| | - Michael C Bassik
- Department of Genetics, Stanford University, Stanford, CA 94305, USA
| | - Lacramioara Bintu
- Department of Bioengineering, Stanford University, Stanford, CA 94305, USA.
| |
Collapse
|
8
|
Spirgel R, Comolli J, Guido NJ. A Machine Learning Method for Genome Engineering Design Tool Attribution. Health Secur 2023; 21:407-414. [PMID: 37594776 DOI: 10.1089/hs.2022.0152] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/19/2023] Open
Abstract
As the ability to engineer biological systems improves with increasingly advanced technology, the risk of accidental or intentional release of a dangerous genetically modified organism becomes greater. It is important that authorities can carry out attribution for the source of a genetically modified biological agent release. In the absence of evidence that ties a release directly to the individuals responsible, attribution can be carried out in part by discovering the in silico tools used to design the engineered genetic components, which can leave a signature in the DNA of the organism. Previous attribution methods have focused on identifying the laboratory of origin of an engineered organism using machine learning on plasmid signatures. The next logical step is to address attribution using signatures from the tools that are used to create the engineered modifications. A random forest classifier was developed that discriminates between design tools used to optimize coding regions for incorporation into the genome of another organism. To this end, tens of thousands of genes were optimized with 4 different codon optimization methods and relevant features from these sequences were generated for a machine learning classifier. This method achieves more than 97% accuracy in predicting which tools were used to design codon optimized genes for expression in other organisms. The methods presented here lay the groundwork for the creation of effective organism engineering attribution techniques. Such methods can act both as deterrents for future attempts at creating dangerous organisms as well as tools for forensic science.
Collapse
Affiliation(s)
- Rebecca Spirgel
- Rebecca Spirgel, MS, is Associate Technical Staff, Group 23, MIT Lincoln Laboratory, Lexington, MA
| | - James Comolli
- James Comolli, PhD, Group 23, MIT Lincoln Laboratory, Lexington, MA
| | - Nicholas J Guido
- Nicholas J. Guido, PhD, are Technical Staff, Group 23, MIT Lincoln Laboratory, Lexington, MA
| |
Collapse
|
9
|
DelRosso N, Tycko J, Suzuki P, Andrews C, Aradhana, Mukund A, Liongson I, Ludwig C, Spees K, Fordyce P, Bassik MC, Bintu L. Large-scale mapping and mutagenesis of human transcriptional effector domains. Nature 2023; 616:365-372. [PMID: 37020022 PMCID: PMC10484233 DOI: 10.1038/s41586-023-05906-y] [Citation(s) in RCA: 39] [Impact Index Per Article: 39.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2022] [Accepted: 03/01/2023] [Indexed: 04/07/2023]
Abstract
Human gene expression is regulated by more than 2,000 transcription factors and chromatin regulators1,2. Effector domains within these proteins can activate or repress transcription. However, for many of these regulators we do not know what type of effector domains they contain, their location in the protein, their activation and repression strengths, and the sequences that are necessary for their functions. Here, we systematically measure the effector activity of more than 100,000 protein fragments tiling across most chromatin regulators and transcription factors in human cells (2,047 proteins). By testing the effect they have when recruited at reporter genes, we annotate 374 activation domains and 715 repression domains, roughly 80% of which are new and have not been previously annotated3-5. Rational mutagenesis and deletion scans across all the effector domains reveal aromatic and/or leucine residues interspersed with acidic, proline, serine and/or glutamine residues are necessary for activation domain activity. Furthermore, most repression domain sequences contain sites for small ubiquitin-like modifier (SUMO)ylation, short interaction motifs for recruiting corepressors or are structured binding domains for recruiting other repressive proteins. We discover bifunctional domains that can both activate and repress, some of which dynamically split a cell population into high- and low-expression subpopulations. Our systematic annotation and characterization of effector domains provide a rich resource for understanding the function of human transcription factors and chromatin regulators, engineering compact tools for controlling gene expression and refining predictive models of effector domain function.
Collapse
Affiliation(s)
| | - Josh Tycko
- Department of Genetics, Stanford University, Stanford, CA, USA
| | - Peter Suzuki
- Department of Bioengineering, Stanford University, Stanford, CA, USA
| | - Cecelia Andrews
- Department of Developmental Biology, Stanford University, Stanford, CA, USA
| | - Aradhana
- Department of Genetics, Stanford University, Stanford, CA, USA
| | - Adi Mukund
- Biophysics Program, Stanford University, Stanford, CA, USA
| | - Ivan Liongson
- Department of Biology, Stanford University, Stanford, CA, USA
| | - Connor Ludwig
- Department of Bioengineering, Stanford University, Stanford, CA, USA
| | - Kaitlyn Spees
- Department of Genetics, Stanford University, Stanford, CA, USA
| | - Polly Fordyce
- Department of Genetics, Stanford University, Stanford, CA, USA
- Department of Bioengineering, Stanford University, Stanford, CA, USA
- ChEM-H Institute, Stanford University, Stanford, CA, USA
- Chan Zuckerberg Biohub, San Francisco, CA, USA
| | | | - Lacramioara Bintu
- Department of Bioengineering, Stanford University, Stanford, CA, USA.
| |
Collapse
|
10
|
Heames B, Buchel F, Aubel M, Tretyachenko V, Loginov D, Novák P, Lange A, Bornberg-Bauer E, Hlouchová K. Experimental characterization of de novo proteins and their unevolved random-sequence counterparts. Nat Ecol Evol 2023; 7:570-580. [PMID: 37024625 PMCID: PMC10089919 DOI: 10.1038/s41559-023-02010-2] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2022] [Accepted: 02/10/2023] [Indexed: 04/08/2023]
Abstract
De novo gene emergence provides a route for new proteins to be formed from previously non-coding DNA. Proteins born in this way are considered random sequences and typically assumed to lack defined structure. While it remains unclear how likely a de novo protein is to assume a soluble and stable tertiary structure, intersecting evidence from random sequence and de novo-designed proteins suggests that native-like biophysical properties are abundant in sequence space. Taking putative de novo proteins identified in human and fly, we experimentally characterize a library of these sequences to assess their solubility and structure propensity. We compare this library to a set of synthetic random proteins with no evolutionary history. Bioinformatic prediction suggests that de novo proteins may have remarkably similar distributions of biophysical properties to unevolved random sequences of a given length and amino acid composition. However, upon expression in vitro, de novo proteins exhibit moderately higher solubility which is further induced by the DnaK chaperone system. We suggest that while synthetic random sequences are a useful proxy for de novo proteins in terms of structure propensity, de novo proteins may be better integrated in the cellular system than random expectation, given their higher solubility.
Collapse
Affiliation(s)
- Brennen Heames
- Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
| | - Filip Buchel
- Department of Cell Biology, Charles University, BIOCEV, Prague, Czech Republic
- Department of Biochemistry, Charles University, Prague, Czech Republic
| | - Margaux Aubel
- Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
| | | | - Dmitry Loginov
- Institute of Microbiology, Czech Academy of Sciences, Prague, Czech Republic
| | - Petr Novák
- Institute of Microbiology, Czech Academy of Sciences, Prague, Czech Republic
| | - Andreas Lange
- Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
| | - Erich Bornberg-Bauer
- Institute for Evolution and Biodiversity, University of Münster, Münster, Germany.
- Department of Protein Evolution, MPI for Developmental Biology, Tübingen, Germany.
| | - Klára Hlouchová
- Department of Cell Biology, Charles University, BIOCEV, Prague, Czech Republic.
- Institute of Organic Chemistry and Biochemistry, Czech Academy of Sciences, Prague, Czech Republic.
| |
Collapse
|
11
|
Haines MC, Carling B, Marshall J, Shenshin VA, Baldwin GS, Freemont P, Storch M. basicsynbio and the BASIC SEVA collection: software and vectors for an established DNA assembly method. Synth Biol (Oxf) 2022; 7:ysac023. [PMID: 36381610 PMCID: PMC9664905 DOI: 10.1093/synbio/ysac023] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2022] [Revised: 09/06/2022] [Accepted: 10/10/2022] [Indexed: 10/19/2023] Open
Abstract
Standardized deoxyribonucleic acid (DNA) assembly methods utilizing modular components provide a powerful framework to explore designs and iterate through Design-Build-Test-Learn cycles. Biopart Assembly Standard for Idempotent Cloning (BASIC) DNA assembly uses modular parts and linkers, is highly accurate, easy to automate, free for academic and commercial use and enables hierarchical assemblies through an idempotent format. These features enable applications including pathway engineering, ribosome binding site (RBS) tuning, fusion protein engineering and multiplexed guide ribonucleic acid (RNA) expression. In this work, we present basicsynbio, open-source software encompassing a Web App (https://basicsynbio.web.app/) and Python Package (https://github.com/LondonBiofoundry/basicsynbio), enabling BASIC construct design via simple drag-and-drop operations or programmatically. With basicsynbio, users can access commonly used BASIC parts and linkers while designing new parts and assemblies with exception handling for common errors. Users can export sequence data and create instructions for manual or acoustic liquid-handling platforms. Instruction generation relies on the BasicBuild Open Standard, which is parsed for bespoke workflows and is serializable in JavaScript Object Notation for transfer and storage. We demonstrate basicsynbio, assembling 30 vectors using sequences including modules from the Standard European Vector Architecture (SEVA). The BASIC SEVA vector collection is compatible with BASIC and Golden Gate using BsaI. Vectors contain one of six antibiotic resistance markers and five origins of replication from different compatibility groups. The collection is available via Addgene under an OpenMTA agreement. Furthermore, vector sequences are available from within the basicsynbio application programming interface with other collections of parts and linkers, providing a powerful environment for designing assemblies for bioengineering applications. Graphical Abstract.
Collapse
Affiliation(s)
- Matthew C Haines
- Department of Infectious Disease, Sir Alexander Fleming Building, South Kensington Campus, Imperial College London, London SW7 2AZ, UK
- London Biofoundry, Imperial College Translation and Innovation Hub, London W12 0BZ, UK
| | - Benedict Carling
- Department of Bioengineering, Imperial College London, London, Westminster SW7 2AZ, UK
| | - James Marshall
- Department of Bioengineering, Imperial College London, London, Westminster SW7 2AZ, UK
| | - Vasily A Shenshin
- Department of Life Sciences, Imperial College London, London, Westminster SW7 2AZ, UK
| | - Geoff S Baldwin
- Department of Life Sciences, Imperial College London, London, Westminster SW7 2AZ, UK
- Imperial College Centre for Synthetic Biology, Imperial College London, London SW7 2AZ, UK
| | - Paul Freemont
- Department of Infectious Disease, Sir Alexander Fleming Building, South Kensington Campus, Imperial College London, London SW7 2AZ, UK
- London Biofoundry, Imperial College Translation and Innovation Hub, London W12 0BZ, UK
- UK DRI Care Research and Technology Centre, Imperial College London, Hammersmith Campus, Du Cane Road, London W12 0NN, UK
| | - Marko Storch
- Department of Infectious Disease, Sir Alexander Fleming Building, South Kensington Campus, Imperial College London, London SW7 2AZ, UK
- London Biofoundry, Imperial College Translation and Innovation Hub, London W12 0BZ, UK
| |
Collapse
|
12
|
Luo Y, Liu L, He Z, Zhang S, Huo P, Wang Z, Jiaxin Q, Zhao L, Wu Y, Zhang D, Bu D, Chen R, Zhao Y. TREAT: Therapeutic RNAs exploration inspired by artificial intelligence technology. Comput Struct Biotechnol J 2022; 20:5680-5689. [PMID: 36320935 PMCID: PMC9589171 DOI: 10.1016/j.csbj.2022.10.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2022] [Revised: 10/04/2022] [Accepted: 10/05/2022] [Indexed: 11/08/2022] Open
Abstract
Recent advances in RNA engineering have enabled the development of RNA-based therapeutics for a broad spectrum of applications. Developing RNA therapeutics start with targeted RNA screening and move to the drug design and optimization. However, existing target screening tools ignore noncoding RNAs and their disease-relevant regulatory relationships. And designing therapeutic RNAs encounters high computational complexity of multi-objective optimization to overcome the immunogenicity, instability and inefficient translational production. To unlock the therapeutic potential of noncoding RNAs and enable one-stop screening and design of therapeutic RNAs, we have built the platform TREAT. It incorporates 43,087,953 regulatory relationships between coding and noncoding genes from 81 biological networks under different physiological conditions. TREAT introduces graph representation learning with Random Walk Diffusions to perform disease-relevant target screening, in addition to the commonly utilized Topological Degree and PageRank algorithms. Design and optimization of large RNAs or interfering RNAs are both available. To reduce the computational complexity of multi-objective optimization for large RNA, we stratified the features into local and global features. The local features are evaluated on the fixed-length or dynamic-length local bins, whereas the latter are inspired by AI language models of protein sequence. Then the global assessment is performed on refined candidates, thus reducing the enormous search space. Overall, TREAT is a one-stop platform for the screening and designing of therapeutic RNAs, with particular attention to noncoding RNAs and cutting-edge AI technology embedded, leading the progress of innovative therapeutics for challenging diseases. TREAT is freely accessible at https://rna.org.cn/treat.
Collapse
Affiliation(s)
- Yufan Luo
- Research Center for Ubiquitous Computing Systems, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China,University of Chinese Academy of Sciences, Beijing 100049, China
| | - Liu Liu
- Research Center for Ubiquitous Computing Systems, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
| | - Zihao He
- Research Center for Ubiquitous Computing Systems, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
| | - Shanshan Zhang
- Luoyang Zhongke Information Industry Research Institute, Luoyang, China
| | - Peipei Huo
- Luoyang Zhongke Information Industry Research Institute, Luoyang, China
| | - Zhihao Wang
- Luoyang Zhongke Information Industry Research Institute, Luoyang, China
| | - Qin Jiaxin
- Research Center for Ubiquitous Computing Systems, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
| | - Lianhe Zhao
- Research Center for Ubiquitous Computing Systems, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
| | - Yang Wu
- Research Center for Ubiquitous Computing Systems, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
| | - Dongdong Zhang
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
| | - Dechao Bu
- Research Center for Ubiquitous Computing Systems, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China,Hwa Mei Hospital, University of Chinese Academy of Sciences, China,Correspondence authors at: Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China (Y. Zhao).
| | - Runsheng Chen
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China,Shenzhen Institute of Nucleic Acid Drug Research, Shenzhen Bay Laboratory Pingshan Translational Medicine Center, Shenzhen 510800, China,Correspondence authors at: Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China (Y. Zhao).
| | - Yi Zhao
- Research Center for Ubiquitous Computing Systems, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China,Correspondence authors at: Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China (Y. Zhao).
| |
Collapse
|
13
|
Chitayat Levi L, Rippin I, Ben Tulila M, Galron R, Tuller T. Modulating Gene Expression within a Microbiome Based on Computational Models. BIOLOGY 2022; 11:biology11091301. [PMID: 36138780 PMCID: PMC9495703 DOI: 10.3390/biology11091301] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/15/2022] [Revised: 08/20/2022] [Accepted: 08/26/2022] [Indexed: 11/16/2022]
Abstract
Simple Summary Development of computational biology methodologies has provided comprehensive understanding of the complexity of microbiomes, and the extensive ways in which they influence their environment. This has awakened a new research goal, aiming to not only understand the mechanisms in which microbiomes function, but to actively modulate and engineer them for various purposes. However, current microbiome engineering techniques are usually manually tailored for a specific system and neglect the different interactions between the new genetic information and the bacterial population, turning a blind eye to processes such as horizontal gene transfer, mutations, and other genetic alterations. In this work, we developed a generic computational method to automatically tune the expression of heterologous genes within a microbiome according to given preferences, to allow the functionality of the engineering process to propagate in longer periods of time. This goal was achieved by treating each part of the gene individually and considering long term fitness effects on the environment, providing computational and experimental evidence for this approach. Abstract Recent research in the field of bioinformatics and molecular biology has revealed the immense complexity and uniqueness of microbiomes, while also showcasing the impact of the symbiosis between a microbiome and its host or environment. A core property influencing this process is horizontal gene transfer between members of the bacterial community used to maintain genetic variation. The essential effect of this mechanism is the exposure of genetic information to a wide array of members of the community, creating an additional “layer” of information in the microbiome named the “plasmidome”. From an engineering perspective, introduction of genetic information to an environment must be facilitated into chosen species which will be able to carry out the desired effect instead of competing and inhibiting it. Moreover, this process of information transfer imposes concerns for the biosafety of genetic engineering of microbiomes as exposure of genetic information into unwanted hosts can have unprecedented ecological impacts. Current technologies are usually experimentally developed for a specific host/environment, and only deal with the transformation process itself at best, ignoring the impact of horizontal gene transfer and gene-microbiome interactions that occur over larger periods of time in uncontrolled environments. The goal of this research was to design new microbiome-specific versions of engineered genetic information, providing an additional layer of compatibility to existing engineering techniques. The engineering framework is entirely computational and is agnostic to the selected microbiome or gene by reducing the problem into the following set up: microbiome species can be defined as wanted or unwanted hosts of the modification. Then, every element related to gene expression (e.g., promoters, coding regions, etc.) and regulation is individually examined and engineered by novel algorithms to provide the defined expression preferences. Additionally, the synergistic effect of the combination of engineered gene blocks facilitates robustness to random mutations that might occur over time. This method has been validated using both computational and experimental tools, stemming from the research done in the iGEM 2021 competition, by the TAU group.
Collapse
Affiliation(s)
- Liyam Chitayat Levi
- Department of Biomedical Engineering, Tel-Aviv University, Tel Aviv 997801, Israel
| | - Ido Rippin
- Sackler Faculty of Medicine, Tel-Aviv University, Tel Aviv 997801, Israel
| | - Moran Ben Tulila
- Department of Biomedical Engineering, Tel-Aviv University, Tel Aviv 997801, Israel
| | - Rotem Galron
- Department of Biomedical Engineering, Tel-Aviv University, Tel Aviv 997801, Israel
| | - Tamir Tuller
- Department of Biomedical Engineering, Tel-Aviv University, Tel Aviv 997801, Israel
- The Sagol School of Neuroscience, Tel-Aviv University, Tel Aviv 997801, Israel
- Correspondence:
| |
Collapse
|
14
|
Malcı K, Watts E, Roberts TM, Auxillos JY, Nowrouzi B, Boll HO, Nascimento CZSD, Andreou A, Vegh P, Donovan S, Fragkoudis R, Panke S, Wallace E, Elfick A, Rios-Solis L. Standardization of Synthetic Biology Tools and Assembly Methods for Saccharomyces cerevisiae and Emerging Yeast Species. ACS Synth Biol 2022; 11:2527-2547. [PMID: 35939789 PMCID: PMC9396660 DOI: 10.1021/acssynbio.1c00442] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
![]()
As redesigning organisms using engineering principles
is one of
the purposes of synthetic biology (SynBio), the standardization of
experimental methods and DNA parts is becoming increasingly a necessity.
The synthetic biology community focusing on the engineering of Saccharomyces cerevisiae has been in the foreground in this
area, conceiving several well-characterized SynBio toolkits widely
adopted by the community. In this review, the molecular methods and
toolkits developed for S. cerevisiae are discussed
in terms of their contributions to the required standardization efforts.
In addition, the toolkits designed for emerging nonconventional yeast
species including Yarrowia lipolytica, Komagataella
phaffii, and Kluyveromyces marxianus are
also reviewed. Without a doubt, the characterized DNA parts combined
with the standardized assembly strategies highlighted in these toolkits
have greatly contributed to the rapid development of many metabolic
engineering and diagnostics applications among others. Despite the
growing capacity in deploying synthetic biology for common yeast genome
engineering works, the yeast community has a long journey to go to
exploit it in more sophisticated and delicate applications like bioautomation.
Collapse
Affiliation(s)
- Koray Malcı
- Institute for Bioengineering, School of Engineering, University of Edinburgh, Kings Buildings, EH9 3BF Edinburgh, United Kingdom.,Centre for Synthetic and Systems Biology (SynthSys), University of Edinburgh, Kings Buildings, EH9 3BD Edinburgh, United Kingdom
| | - Emma Watts
- School of Biological Sciences, University of Edinburgh, Kings Buildings, EH9 3JW Edinburgh, United Kingdom
| | | | - Jamie Yam Auxillos
- Centre for Synthetic and Systems Biology (SynthSys), University of Edinburgh, Kings Buildings, EH9 3BD Edinburgh, United Kingdom.,Institute of Cell Biology, School of Biological Sciences, University of Edinburgh, Kings Buildings, EH9 3FF Edinburgh, United Kingdom
| | - Behnaz Nowrouzi
- Institute for Bioengineering, School of Engineering, University of Edinburgh, Kings Buildings, EH9 3BF Edinburgh, United Kingdom.,Centre for Synthetic and Systems Biology (SynthSys), University of Edinburgh, Kings Buildings, EH9 3BD Edinburgh, United Kingdom
| | - Heloísa Oss Boll
- Department of Genetics and Morphology, Institute of Biological Sciences, University of Brasília, Brasília, Federal District 70910-900, Brazil
| | | | - Andreas Andreou
- Centre for Synthetic and Systems Biology (SynthSys), University of Edinburgh, Kings Buildings, EH9 3BD Edinburgh, United Kingdom
| | - Peter Vegh
- Edinburgh Genome Foundry, University of Edinburgh, Kings Buildings, Edinburgh EH9 3BF, United Kingdom
| | - Sophie Donovan
- Edinburgh Genome Foundry, University of Edinburgh, Kings Buildings, Edinburgh EH9 3BF, United Kingdom
| | - Rennos Fragkoudis
- Edinburgh Genome Foundry, University of Edinburgh, Kings Buildings, Edinburgh EH9 3BF, United Kingdom
| | - Sven Panke
- Department of Biosystems Science and Engineering, ETH Zürich, 4058 Basel, Switzerland
| | - Edward Wallace
- Centre for Synthetic and Systems Biology (SynthSys), University of Edinburgh, Kings Buildings, EH9 3BD Edinburgh, United Kingdom.,Institute of Cell Biology, School of Biological Sciences, University of Edinburgh, Kings Buildings, EH9 3FF Edinburgh, United Kingdom
| | - Alistair Elfick
- Institute for Bioengineering, School of Engineering, University of Edinburgh, Kings Buildings, EH9 3BF Edinburgh, United Kingdom.,Centre for Synthetic and Systems Biology (SynthSys), University of Edinburgh, Kings Buildings, EH9 3BD Edinburgh, United Kingdom
| | - Leonardo Rios-Solis
- Institute for Bioengineering, School of Engineering, University of Edinburgh, Kings Buildings, EH9 3BF Edinburgh, United Kingdom.,Centre for Synthetic and Systems Biology (SynthSys), University of Edinburgh, Kings Buildings, EH9 3BD Edinburgh, United Kingdom.,School of Natural and Environmental Sciences, Newcastle University, Newcastle upon Tyne NE1 7RU, United Kingdom
| |
Collapse
|
15
|
Wang Y, Jiang B, Wu Y, He X, Liu L. Rapid intraspecies evolution of fitness effects of yeast genes. Genome Biol Evol 2022; 14:6575331. [PMID: 35482054 PMCID: PMC9113246 DOI: 10.1093/gbe/evac061] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/21/2022] [Indexed: 11/14/2022] Open
Abstract
Organisms within species have numerous genetic and phenotypic variations. Growing evidences show intraspecies variation of mutant phenotypes may be more complicated than expected. Current studies on intraspecies variations of mutant phenotypes are limited to just a few strains. This study investigated the intraspecies variation of fitness effects of 5,630 gene mutants in ten Saccharomyces cerevisiae strains using CRISPR–Cas9 screening. We found that the variability of fitness effects induced by gene disruptions is very large across different strains. Over 75% of genes affected cell fitness in a strain-specific manner to varying degrees. The strain specificity of the fitness effect of a gene is related to its evolutionary and functional properties. Subsequent analysis revealed that younger genes, especially those newly acquired in S. cerevisiae species, are more likely to be strongly strain-specific. Intriguingly, there seems to exist a ceiling of fitness effect size for strong strain-specific genes, and among them, the newly acquired genes are still evolving and have yet to reach this ceiling. Additionally, for a large proportion of protein complexes, the strain specificity profile is inconsistent among genes encoding the same complex. Taken together, these results offer a genome-wide map of intraspecies variation for fitness effect as a mutant phenotype and provide an updated insight on intraspecies phenotypic evolution.
Collapse
Affiliation(s)
- Yayu Wang
- MOE Key Laboratory of Gene Function and Regulation, State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, Guangzhou 510275, China
| | - Bei Jiang
- MOE Key Laboratory of Gene Function and Regulation, State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, Guangzhou 510275, China
| | - Yue Wu
- MOE Key Laboratory of Gene Function and Regulation, State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, Guangzhou 510275, China
| | - Xionglei He
- MOE Key Laboratory of Gene Function and Regulation, State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, Guangzhou 510275, China
| | - Li Liu
- MOE Key Laboratory of Gene Function and Regulation, State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, Guangzhou 510275, China
| |
Collapse
|
16
|
Menuhin-Gruman I, Arbel M, Amitay N, Sionov K, Naki D, Katzir I, Edgar O, Bergman S, Tuller T. Evolutionary Stability Optimizer (ESO): A Novel Approach to Identify and Avoid Mutational Hotspots in DNA Sequences While Maintaining High Expression Levels. ACS Synth Biol 2022; 11:1142-1151. [PMID: 34928133 PMCID: PMC8938948 DOI: 10.1021/acssynbio.1c00426] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022]
Abstract
![]()
Modern
synthetic biology procedures rely on the ability to generate
stable genetic constructs that keep their functionality over long
periods of time. However, maintenance of these constructs requires
energy from the cell and thus reduces the host’s fitness. Natural
selection results in loss-of-functionality mutations that negate the
expression of the construct in the population. Current approaches
for the prevention of this phenomenon focus on either small-scale,
manual design of evolutionary stable constructs or the detection of
mutational sites with unstable tendencies. We designed the Evolutionary
Stability Optimizer (ESO), a software tool that enables the large-scale
automatic design of evolutionarily stable constructs with respect
to both mutational and epigenetic hotspots and allows users to define
custom hotspots to avoid. Furthermore, our tool takes the expression
of the input constructs into account by considering the guanine-cytosine
(GC) content and codon usage of the host organism, balancing the trade-off
between stability and gene expression, allowing to increase evolutionary
stability while maintaining the high expression. In this study, we
present the many features of the ESO and show that it accurately predicts
the evolutionary stability of endogenous genes. The ESO was created
as an easy-to-use, flexible platform based on the notion that directed
genetic stability research will continue to evolve and revolutionize
current applications of synthetic biology. The ESO is available at
the following link: https://www.cs.tau.ac.il/~tamirtul/ESO/.
Collapse
Affiliation(s)
- Itamar Menuhin-Gruman
- School of Mathematical Sciences, The Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, Tel Aviv, Israel 6997801
| | - Matan Arbel
- Shmunis School of Biomedicine and Cancer Research, The George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel 6997801
| | - Niv Amitay
- School of Electrical Engineering, The Iby and Aladar Fleischman Faculty of Engineering, Tel Aviv University, Tel Aviv, Israel 6997801
| | - Karin Sionov
- Department of Biomedical Engineering, Tel Aviv University, Tel Aviv, Israel 6997801
| | - Doron Naki
- Shmunis School of Biomedicine and Cancer Research, The George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel 6997801
| | - Itai Katzir
- Shmunis School of Biomedicine and Cancer Research, The George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel 6997801
| | - Omer Edgar
- School of Medicine, The Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel 6997801
| | - Shaked Bergman
- Department of Biomedical Engineering, Tel Aviv University, Tel Aviv, Israel 6997801
| | - Tamir Tuller
- Department of Biomedical Engineering, Tel Aviv University, Tel Aviv, Israel 6997801
- The Sagol School of Neuroscience, Tel Aviv University, Tel Aviv, Israel 6997801
| |
Collapse
|
17
|
Cantoia A, Aguilar Lucero D, Ceccarelli EA, Rosano GL. From the notebook to recombinant protein production in Escherichia coli: Design of expression vectors and gene cloning. Methods Enzymol 2021; 659:19-35. [PMID: 34752286 DOI: 10.1016/bs.mie.2021.07.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
Research in recombinant protein expression in microorganism hosts spans half a century. The field has evolved from mostly trial-and-error approaches to more rational strategies, including careful design of the expression vectors and the coding sequence for the protein of interest. It is important to reflect on many aspects about vector construction, such as codon usage, integration site, coding sequence mutagenesis and many others. In this chapter, we overview methods and considerations to generate a suitable construct and anticipate possible experimental roadblocks.
Collapse
Affiliation(s)
- Alejo Cantoia
- Instituto de Biología Molecular y Celular de Rosario (IBR), CONICET, Facultad de Ciencias Bioquímicas y Farmacéuticas, Universidad Nacional de Rosario, Rosario, Argentina
| | - Dianela Aguilar Lucero
- Instituto de Biología Molecular y Celular de Rosario (IBR), CONICET, Facultad de Ciencias Bioquímicas y Farmacéuticas, Universidad Nacional de Rosario, Rosario, Argentina
| | - Eduardo A Ceccarelli
- Instituto de Biología Molecular y Celular de Rosario (IBR), CONICET, Facultad de Ciencias Bioquímicas y Farmacéuticas, Universidad Nacional de Rosario, Rosario, Argentina
| | - Germán L Rosano
- Instituto de Biología Molecular y Celular de Rosario (IBR), CONICET, Facultad de Ciencias Bioquímicas y Farmacéuticas, Universidad Nacional de Rosario, Rosario, Argentina.
| |
Collapse
|
18
|
Rapid single-molecule detection of COVID-19 and MERS antigens via nanobody-functionalized organic electrochemical transistors. Nat Biomed Eng 2021; 5:666-677. [PMID: 34031558 DOI: 10.1038/s41551-021-00734-9] [Citation(s) in RCA: 155] [Impact Index Per Article: 51.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2020] [Accepted: 04/16/2021] [Indexed: 02/07/2023]
Abstract
The coronavirus disease 2019 (COVID-19) pandemic has highlighted the need for rapid and sensitive protein detection and quantification in simple and robust formats for widespread point-of-care applications. Here, we report on nanobody-functionalized organic electrochemical transistors with a modular architecture for the rapid quantification of single-molecule-to-nanomolar levels of specific antigens in complex bodily fluids. The sensors combine a solution-processable conjugated polymer in the transistor channel and high-density and orientation-controlled bioconjugation of nanobody-SpyCatcher fusion proteins on disposable gate electrodes. The devices provide results after 10 min of exposure to 5 μl of unprocessed samples, maintain high specificity and single-molecule sensitivity in human saliva and serum, and can be reprogrammed to detect any protein antigen if a corresponding specific nanobody is available. We used the sensors to detect green fluorescent protein, and severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) and Middle East respiratory syndrome coronavirus (MERS-CoV) spike proteins, and for the COVID-19 screening of unprocessed clinical nasopharyngeal swab and saliva samples with a wide range of viral loads.
Collapse
|
19
|
Sanborn AL, Yeh BT, Feigerle JT, Hao CV, Townshend RJ, Lieberman Aiden E, Dror RO, Kornberg RD. Simple biochemical features underlie transcriptional activation domain diversity and dynamic, fuzzy binding to Mediator. eLife 2021; 10:68068. [PMID: 33904398 PMCID: PMC8137143 DOI: 10.7554/elife.68068] [Citation(s) in RCA: 66] [Impact Index Per Article: 22.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2021] [Accepted: 04/25/2021] [Indexed: 01/07/2023] Open
Abstract
Gene activator proteins comprise distinct DNA-binding and transcriptional activation domains (ADs). Because few ADs have been described, we tested domains tiling all yeast transcription factors for activation in vivo and identified 150 ADs. By mRNA display, we showed that 73% of ADs bound the Med15 subunit of Mediator, and that binding strength was correlated with activation. AD-Mediator interaction in vitro was unaffected by a large excess of free activator protein, pointing to a dynamic mechanism of interaction. Structural modeling showed that ADs interact with Med15 without shape complementarity (‘fuzzy’ binding). ADs shared no sequence motifs, but mutagenesis revealed biochemical and structural constraints. Finally, a neural network trained on AD sequences accurately predicted ADs in human proteins and in other yeast proteins, including chromosomal proteins and chromatin remodeling complexes. These findings solve the longstanding enigma of AD structure and function and provide a rationale for their role in biology. Cells adapt and respond to changes by regulating the activity of their genes. To turn genes on or off, they use a family of proteins called transcription factors. Transcription factors influence specific but overlapping groups of genes, so that each gene is controlled by several transcription factors that act together like a dimmer switch to regulate gene activity. The presence of transcription factors attracts proteins such as the Mediator complex, which activates genes by gathering the protein machines that read the genes. The more transcription factors are found near a specific gene, the more strongly they attract Mediator and the more active the gene is. A specific region on the transcription factor called the activation domain is necessary for this process. The biochemical sequences of these domains vary greatly between species, yet activation domains from, for example, yeast and human proteins are often interchangeable. To understand why this is the case, Sanborn et al. analyzed the genome of baker’s yeast and identified 150 activation domains, each very different in sequence. Three-quarters of them bound to a subunit of the Mediator complex called Med15. Sanborn et al. then developed a machine learning algorithm to predict activation domains in both yeast and humans. This algorithm also showed that negatively charged and greasy regions on the activation domains were essential to be activated by the Mediator complex. Further analyses revealed that activation domains used different poses to bind multiple sites on Med15, a behavior known as ‘fuzzy’ binding. This creates a high overall affinity even though the binding strength at each individual site is low, enabling the protein complexes to remain dynamic. These weak interactions together permit fine control over the activity of several genes, allowing cells to respond quickly and precisely to many changes. The computer algorithm used here provides a new way to identify activation domains across species and could improve our understanding of how living things grow, adapt and evolve. It could also give new insights into mechanisms of disease, particularly cancer, where transcription factors are often faulty.
Collapse
Affiliation(s)
- Adrian L Sanborn
- Department of Structural Biology, Stanford University School of Medicine, Stanford, United States.,Department of Computer Science, Stanford University, Stanford, United States
| | - Benjamin T Yeh
- Department of Computer Science, Stanford University, Stanford, United States
| | - Jordan T Feigerle
- Department of Structural Biology, Stanford University School of Medicine, Stanford, United States
| | - Cynthia V Hao
- Department of Structural Biology, Stanford University School of Medicine, Stanford, United States
| | | | - Erez Lieberman Aiden
- The Center for Genome Architecture, Baylor College of Medicine, Houston, United States.,Center for Theoretical Biological Physics, Rice University, Houston, United States
| | - Ron O Dror
- Department of Computer Science, Stanford University, Stanford, United States
| | - Roger D Kornberg
- Department of Structural Biology, Stanford University School of Medicine, Stanford, United States
| |
Collapse
|
20
|
Yeoh JW, Swainston N, Vegh P, Zulkower V, Carbonell P, Holowko MB, Peddinti G, Poh CL. SynBiopython: an open-source software library for Synthetic Biology. Synth Biol (Oxf) 2021. [PMCID: PMC8063678 DOI: 10.1093/synbio/ysab001] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
Advances in hardware automation in synthetic biology laboratories are not yet fully matched by those of their software counterparts. Such automated laboratories, now commonly called biofoundries, require software solutions that would help with many specialized tasks such as batch DNA design, sample and data tracking, and data analysis, among others. Typically, many of the challenges facing biofoundries are shared, yet there is frequent wheel-reinvention where many labs develop similar software solutions in parallel. In this article, we present the first attempt at creating a standardized, open-source Python package. A number of tools will be integrated and developed that we envisage will become the obvious starting point for software development projects within biofoundries globally. Specifically, we describe the current state of available software, present usage scenarios and case studies for common problems, and finally describe plans for future development. SynBiopython is publicly available at the following address: http://synbiopython.org.
Collapse
Affiliation(s)
- Jing Wui Yeoh
- NUS Synthetic Biology for Clinical and Technological Innovation (SynCTI), Life Sciences Institute, National University of Singapore, Singapore, Singapore
| | - Neil Swainston
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, UK
| | - Peter Vegh
- Edinburgh Genome Foundry, University of Edinburgh, Edinburgh, UK
| | | | - Pablo Carbonell
- Instituto Universitario de Automática e Informática Industrial, Universitat Politècnica de València, Valencia, Spain
- Manchester Synthetic Biology Research Centre for Fine and Speciality Chemicals (SYNBIOCHEM), Manchester Institute of Biotechnology, The University of Manchester, Manchester, UK
| | - Maciej B Holowko
- CSIRO Synthetic Biology Future Science Platform, Canberra, ACT, Australia
| | - Gopal Peddinti
- VTT Technical Research Center of Finland, Espoo, Finland
| | - Chueh Loo Poh
- NUS Synthetic Biology for Clinical and Technological Innovation (SynCTI), Life Sciences Institute, National University of Singapore, Singapore, Singapore
| |
Collapse
|
21
|
Tycko J, DelRosso N, Hess GT, Aradhana, Banerjee A, Mukund A, Van MV, Ego BK, Yao D, Spees K, Suzuki P, Marinov GK, Kundaje A, Bassik MC, Bintu L. High-Throughput Discovery and Characterization of Human Transcriptional Effectors. Cell 2020; 183:2020-2035.e16. [PMID: 33326746 PMCID: PMC8178797 DOI: 10.1016/j.cell.2020.11.024] [Citation(s) in RCA: 62] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2020] [Revised: 09/22/2020] [Accepted: 11/13/2020] [Indexed: 02/07/2023]
Abstract
Thousands of proteins localize to the nucleus; however, it remains unclear which contain transcriptional effectors. Here, we develop HT-recruit, a pooled assay where protein libraries are recruited to a reporter, and their transcriptional effects are measured by sequencing. Using this approach, we measure gene silencing and activation for thousands of domains. We find a relationship between repressor function and evolutionary age for the KRAB domains, discover that Homeodomain repressor strength is collinear with Hox genetic organization, and identify activities for several domains of unknown function. Deep mutational scanning of the CRISPRi KRAB maps the co-repressor binding surface and identifies substitutions that improve stability/silencing. By tiling 238 proteins, we find repressors as short as ten amino acids. Finally, we report new activator domains, including a divergent KRAB. These results provide a resource of 600 human proteins containing effectors and demonstrate a scalable strategy for assigning functions to protein domains.
Collapse
Affiliation(s)
- Josh Tycko
- Department of Genetics, Stanford University, Stanford, CA 94305, USA
| | - Nicole DelRosso
- Biophysics Program, Stanford University, Stanford, CA 94305, USA
| | - Gaelen T Hess
- Department of Genetics, Stanford University, Stanford, CA 94305, USA
| | - Aradhana
- Department of Genetics, Stanford University, Stanford, CA 94305, USA
| | | | - Aditya Mukund
- Biophysics Program, Stanford University, Stanford, CA 94305, USA
| | - Mike V Van
- Department of Biology, Stanford University, Stanford, CA 94305, USA
| | - Braeden K Ego
- Department of Genetics, Stanford University, Stanford, CA 94305, USA
| | - David Yao
- Department of Genetics, Stanford University, Stanford, CA 94305, USA
| | - Kaitlyn Spees
- Department of Genetics, Stanford University, Stanford, CA 94305, USA
| | - Peter Suzuki
- Department of Bioengineering, Stanford University, Stanford, CA 94305, USA
| | - Georgi K Marinov
- Department of Genetics, Stanford University, Stanford, CA 94305, USA
| | - Anshul Kundaje
- Department of Genetics, Stanford University, Stanford, CA 94305, USA; Department of Computer Science, Stanford University, Stanford, CA 94305, USA
| | - Michael C Bassik
- Department of Genetics, Stanford University, Stanford, CA 94305, USA.
| | - Lacramioara Bintu
- Department of Bioengineering, Stanford University, Stanford, CA 94305, USA.
| |
Collapse
|
22
|
Garrett ME, Galloway J, Chu HY, Itell HL, Stoddard CI, Wolf CR, Logue JK, McDonald D, Matsen FA, Overbaugh J. High resolution profiling of pathways of escape for SARS-CoV-2 spike-binding antibodies. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2020:2020.11.16.385278. [PMID: 33236010 PMCID: PMC7685320 DOI: 10.1101/2020.11.16.385278] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Defining long-term protective immunity to SARS-CoV-2 is one of the most pressing questions of our time and will require a detailed understanding of potential ways this virus can evolve to escape immune protection. Immune protection will most likely be mediated by antibodies that bind to the viral entry protein, Spike (S). Here we used Phage-DMS, an approach that comprehensively interrogates the effect of all possible mutations on binding to a protein of interest, to define the profile of antibody escape to the SARS-CoV-2 S protein using COVID-19 convalescent plasma. Antibody binding was common in two regions: the fusion peptide and linker region upstream of the heptad repeat region 2. However, escape mutations were variable within these immunodominant regions. There was also individual variation in less commonly targeted epitopes. This study provides a granular view of potential antibody escape pathways and suggests there will be individual variation in antibody-mediated virus evolution.
Collapse
Affiliation(s)
- Meghan E Garrett
- Division of Human Biology, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
- Molecular and Cellular Biology Graduate Program, University of Washington and Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Jared Galloway
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Helen Y Chu
- Department of Medicine, University of Washington, Seattle, WA, USA
| | - Hannah L Itell
- Division of Human Biology, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
- Molecular and Cellular Biology Graduate Program, University of Washington and Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Caitlin I Stoddard
- Division of Human Biology, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Caitlin R Wolf
- Department of Medicine, University of Washington, Seattle, WA, USA
| | - Jennifer K Logue
- Department of Medicine, University of Washington, Seattle, WA, USA
| | - Dylan McDonald
- Department of Medicine, University of Washington, Seattle, WA, USA
| | - Frederick A Matsen
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
- Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Julie Overbaugh
- Division of Human Biology, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| |
Collapse
|