1
|
Wen X, Lin J, Yang C, Li Y, Cheng H, Liu Y, Zhang Y, Ma H, Mao Y, Liao X, Wang M. Automated characterization and analysis of expression compatibility between regulatory sequences and metabolic genes in Escherichia coli. Synth Syst Biotechnol 2024; 9:647-657. [PMID: 38817827 PMCID: PMC11137365 DOI: 10.1016/j.synbio.2024.05.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2024] [Revised: 05/11/2024] [Accepted: 05/16/2024] [Indexed: 06/01/2024] Open
Abstract
Utilizing standardized artificial regulatory sequences to fine-tuning the expression of multiple metabolic pathways/genes is a key strategy in the creation of efficient microbial cell factories. However, when regulatory sequence expression strengths are characterized using only a few reporter genes, they may not be applicable across diverse genes. This introduces great uncertainty into the precise regulation of multiple genes at multiple expression levels. To address this, our study adopted a fluorescent protein fusion strategy for a more accurate assessment of target protein expression levels. We combined 41 commonly-used metabolic genes with 15 regulatory sequences, yielding an expression dataset encompassing 520 unique combinations. This dataset highlighted substantial variation in protein expression level under identical regulatory sequences, with relative expression levels ranging from 2.8 to 176-fold. It also demonstrated that improving the strength of regulatory sequences does not necessarily lead to significant improvements in the expression levels of target proteins. Utilizing this dataset, we have developed various machine learning models and discovered that the integration of promoter regions, ribosome binding sites, and coding sequences significantly improves the accuracy of predicting protein expression levels, with a Spearman correlation coefficient of 0.72, where the promoter sequence exerts a predominant influence. Our study aims not only to provide a detailed guide for fine-tuning gene expression in the metabolic engineering of Escherichia coli but also to deepen our understanding of the compatibility issues between regulatory sequences and target genes.
Collapse
Affiliation(s)
- Xiao Wen
- School of Life Sciences, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, 230026, China
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, 300308, China
- Key Laboratory of Engineering Biology for Low-Carbon Manufacturing, Tianjin, 300308, China
| | - Jiawei Lin
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, 300308, China
- School of Biological Engineering, Tianjin University of Science and Technology, Tianjin, 300457, China
| | - Chunhe Yang
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, 300308, China
- School of Biological Engineering, Tianjin University of Science and Technology, Tianjin, 300457, China
| | - Ying Li
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, 300308, China
- School of Biological Engineering, Tianjin University of Science and Technology, Tianjin, 300457, China
| | - Haijiao Cheng
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, 300308, China
- Key Laboratory of Engineering Biology for Low-Carbon Manufacturing, Tianjin, 300308, China
| | - Ye Liu
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, 300308, China
- Key Laboratory of Engineering Biology for Low-Carbon Manufacturing, Tianjin, 300308, China
| | - Yue Zhang
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, 300308, China
- Key Laboratory of Engineering Biology for Low-Carbon Manufacturing, Tianjin, 300308, China
| | - Hongwu Ma
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, 300308, China
- Key Laboratory of Engineering Biology for Low-Carbon Manufacturing, Tianjin, 300308, China
| | - Yufeng Mao
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, 300308, China
- Key Laboratory of Engineering Biology for Low-Carbon Manufacturing, Tianjin, 300308, China
| | - Xiaoping Liao
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, 300308, China
- Key Laboratory of Engineering Biology for Low-Carbon Manufacturing, Tianjin, 300308, China
| | - Meng Wang
- School of Life Sciences, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, 230026, China
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, 300308, China
- Key Laboratory of Engineering Biology for Low-Carbon Manufacturing, Tianjin, 300308, China
| |
Collapse
|
2
|
La Fleur A, Shi Y, Seelig G. Decoding biology with massively parallel reporter assays and machine learning. Genes Dev 2024; 38:843-865. [PMID: 39362779 DOI: 10.1101/gad.351800.124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/05/2024]
Abstract
Massively parallel reporter assays (MPRAs) are powerful tools for quantifying the impacts of sequence variation on gene expression. Reading out molecular phenotypes with sequencing enables interrogating the impact of sequence variation beyond genome scale. Machine learning models integrate and codify information learned from MPRAs and enable generalization by predicting sequences outside the training data set. Models can provide a quantitative understanding of cis-regulatory codes controlling gene expression, enable variant stratification, and guide the design of synthetic regulatory elements for applications from synthetic biology to mRNA and gene therapy. This review focuses on cis-regulatory MPRAs, particularly those that interrogate cotranscriptional and post-transcriptional processes: alternative splicing, cleavage and polyadenylation, translation, and mRNA decay.
Collapse
Affiliation(s)
- Alyssa La Fleur
- Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, Washington 98195, USA
| | - Yongsheng Shi
- Department of Microbiology and Molecular Genetics, School of Medicine, University of California, Irvine, Irvine, California 92697, USA;
| | - Georg Seelig
- Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, Washington 98195, USA;
- Department of Electrical & Computer Engineering, University of Washington, Seattle, Washington 98195, USA
| |
Collapse
|
3
|
Liu M, Jin Z, Xiang Q, He H, Huang Y, Long M, Wu J, Zhi Huang C, Mao C, Zuo H. Rational Design of Untranslated Regions to Enhance Gene Expression. J Mol Biol 2024; 436:168804. [PMID: 39326490 DOI: 10.1016/j.jmb.2024.168804] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2024] [Revised: 09/19/2024] [Accepted: 09/20/2024] [Indexed: 09/28/2024]
Abstract
How to improve gene expression by optimizing mRNA structures is a crucial question for various medical and biotechnological applications. Previous efforts focus largely on investigation of the 5' UTR hairpin structures. In this study, we present a rational strategy that enhances mRNA stability and translation by engineering both the 5' and 3' UTR sequences. We have successfully demonstrated this strategy using green fluorescent protein (GFP) as a model in Escherichia coli and across different expression vectors. We further validated it with luciferase and Plasmodium falciparum lactate dehydrogenase (PfLDH). To elucidate the underlying mechanism, we have quantitatively analyzed both protein, mRNA levels and half-life time. We have identified several key aspects of UTRs that significantly influence mRNA stability and protein expression in our system: (1) The optimal length of the single-stranded spacer between the stabilizer hairpin and ribosome binding site (RBS) in the 5' UTR is 25-30 nucleotide (nt) long. An optimal 32% GC content in the spacer yielded the highest levels of GFP protein production. (2) The insertion of a homodimerdizable, G-quadruplex structure containing RNA aptamer, "Corn", in the 3' UTR markedly increased the protein expression. Our findings indicated that the carefully engineered 5' UTRs and 3' UTRs significantly boosted gene expression. Specifically, the inclusion of 5 × Corn in the 3' UTR appeared to facilitate the local aggregation of mRNA, leading to the formation of mRNA condensates. Aside from shedding light on the regulation of mRNA stability and expression, this study is expected to substantially increase biological protein production.
Collapse
Affiliation(s)
- Mingchun Liu
- College of Pharmaceutical Sciences, Southwest University, Chongqing 400715, China
| | - Zhuoer Jin
- College of Pharmaceutical Sciences, Southwest University, Chongqing 400715, China
| | - Qing Xiang
- College of Pharmaceutical Sciences, Southwest University, Chongqing 400715, China
| | - Huawei He
- Biological Sciences Research Center, State Key Laboratory of Silkworm Genome Biology, Southwest University, Chongqing 400715, China
| | - Yuhan Huang
- College of Pharmaceutical Sciences, Southwest University, Chongqing 400715, China
| | - Mengfei Long
- College of Pharmaceutical Sciences, Southwest University, Chongqing 400715, China
| | - Jicheng Wu
- Chongqing Key Laboratory of Natural Product Synthesis and Drug Research, School of Pharmaceutical Sciences, Chongqing University, Chongqing 401331, China
| | - Cheng Zhi Huang
- College of Pharmaceutical Sciences, Southwest University, Chongqing 400715, China
| | - Chengde Mao
- Department of Chemistry, Purdue University, West Lafayette 47907, IN, USA
| | - Hua Zuo
- College of Pharmaceutical Sciences, Southwest University, Chongqing 400715, China.
| |
Collapse
|
4
|
Zhu P, Molina Resendiz M, von Ossowski I, Scheller S. A promoter-RBS library for fine-tuning gene expression in Methanosarcina acetivorans. Appl Environ Microbiol 2024; 90:e0109224. [PMID: 39132998 PMCID: PMC11409679 DOI: 10.1128/aem.01092-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2024] [Accepted: 07/15/2024] [Indexed: 08/13/2024] Open
Abstract
Methanogens are the main biological producers of methane on Earth. Methanosarcina acetivorans is one of the best characterized methanogens that has powerful genetic tools for genome editing. To study the physiology of this methanogen in further detail as well as to effectively balance the flux of their engineered metabolic pathways in expansive project undertakings, there is the need for controlled gene expression, which then requires the availability of well-characterized promoters and ribosome-binding sites (RBS). In this study, we constructed a library of 33 promoter-RBS combinations that includes 13 wild-type and 14 hybrid combinations, as well as six combination variants in which the 5'-untranslated region (5'UTR) was rationally engineered. The expression strength for each combination was calculated by inducing the expression of the β-glucuronidase reporter gene in M. acetivorans cells in the presence of the two most used growth substrates, either methanol (MeOH) or trimethyl amine (TMA). In this study, the constructed library covers a relatively wide range (140-fold) between the weakest and strongest promoter-RBS combination as well as shows a steady increase and allows different levels of gene expression. Effects on the gene expression strength were also assessed by making measurements at three distinct growth phases for all 33 promoter-RBS combinations. Our promoter-RBS library is effective in enabling the fine-tuning of gene expression in M. acetivorans for physiological studies and the design of metabolic engineering projects that, e.g., aim for the biotechnological valorization of one-carbon compounds. IMPORTANCE Methanogenic archaea are potent producers of the greenhouse gas methane and thus contribute substantially to global warming. Under controlled conditions, these microbes can catalyze the production of biogas, which is a renewable fuel, and might help counter global warming and its effects. Engineering the primary metabolism of Methanosarcina acetivorans to render it better and more useful requires controllable gene expression, yet only a few well-characterized promoters and RBSs are presently available. Our study rectifies this situation by providing a library of 33 different promoter-RBS combinations with a 140-fold dynamic range in expression strength. Future metabolic engineering projects can take advantage of this library by using these promoter-RBS combinations as an efficient and tunable gene expression system for M. acetivorans. Furthermore, the methodologies we developed in this study could also be utilized to construct promoter libraries for other types of methanogens.
Collapse
Affiliation(s)
- Ping Zhu
- Department of Bioproducts and Biosystems, School of Chemical Engineering, Aalto University, Espoo, Finland
| | - Mariana Molina Resendiz
- Department of Bioproducts and Biosystems, School of Chemical Engineering, Aalto University, Espoo, Finland
| | - Ingemar von Ossowski
- Department of Bioproducts and Biosystems, School of Chemical Engineering, Aalto University, Espoo, Finland
| | - Silvan Scheller
- Department of Bioproducts and Biosystems, School of Chemical Engineering, Aalto University, Espoo, Finland
| |
Collapse
|
5
|
Huss P, Kieft K, Meger A, Nishikawa K, Anantharaman K, Raman S. Engineering bacteriophages through deep mining of metagenomic motifs. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.02.07.527309. [PMID: 36798209 PMCID: PMC9934549 DOI: 10.1101/2023.02.07.527309] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2023]
Abstract
Bacteriophages can adapt to new hosts by altering sequence motifs through recombination or convergent evolution. Where such motifs exist and what fitness advantage they confer remains largely unknown. We report a new method, Metagenomic Sequence Informed Functional Scoring (Meta-SIFT), to discover sequence motifs in metagenomic datasets that can be used to engineer phage activity. Meta-SIFT uses experimental deep mutational scanning data to create sequence profiles to enable deep mining of metagenomes for functional motifs which are otherwise invisible to searches. We experimentally tested over 17,000 Meta-SIFT derived sequence motifs in the receptor-binding protein of the T7 phage. The screen revealed thousands of T7 variants with novel host specificity with functional motifs sourced from distant families. Position, substitution and location preferences dictated specificity across a panel of 20 hosts and conditions. To demonstrate therapeutic utility, we engineered active T7 variants against foodborne pathogen E. coli O121. Meta-SIFT is a powerful tool to unlock the functional potential encoded in phage metagenomes to engineer bacteriophages.
Collapse
|
6
|
Jodlbauer J, Schmal M, Waltl C, Rohr T, Mach-Aigner AR, Mihovilovic MD, Rudroff F. Unlocking the potential of cyanobacteria: a high-throughput strategy for enhancing biocatalytic performance through genetic optimization. Trends Biotechnol 2024:S0167-7799(24)00189-6. [PMID: 39214789 DOI: 10.1016/j.tibtech.2024.07.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2024] [Revised: 07/15/2024] [Accepted: 07/19/2024] [Indexed: 09/04/2024]
Abstract
Cyanobacteria show promise as hosts for whole-cell biocatalysis. Their photoautotrophic metabolism can be leveraged for a sustainable production process. Despite advancements, performance still lags behind heterotrophic hosts. A key challenge is the limited ability to overexpress recombinant enzymes, which also hinders their biocatalytic efficiency. To address this, we generated large-scale expression libraries and developed a high-throughput method combining fluorescence-activated cell sorting (FACS) and deep sequencing in Synechocystis sp. PCC 6803 (Syn. 6803) to screen and optimize its genetic background. We apply this approach to enhance expression and biocatalyst performance for three enzymes: the ketoreductase LfSDR1M50, enoate reductase YqjM, and Baeyer-Villiger monooxygenase (BVMO) CHMOmut. Diverse genetic combinations yielded significant improvements: optimizing LfSDR1M50 expression showed a 17-fold increase to 39.2 U gcell dry weight (CDW)-1. In vivo activity of Syn. YqjM was improved 16-fold to 58.7 U gCDW-1 and, for Syn. CHMOmut, a 1.5-fold increase to 7.3 U gCDW-1 was achieved by tailored genetic design. Thus, this strategy offers a pathway to optimize cyanobacteria as expression hosts, paving the way for broader applications in other cyanobacteria strains and larger libraries.
Collapse
Affiliation(s)
- Julia Jodlbauer
- Institute of Applied Synthetic Chemistry, TU Wien, Getreidemarkt 9, 1060, Vienna, Austria
| | - Matthias Schmal
- Institute of Chemical, Environmental, and Bioscience Engineering, TU Wien, Gumpendorfer Str. 1a, 1060, Vienna, Austria
| | - Christian Waltl
- Institute of Applied Synthetic Chemistry, TU Wien, Getreidemarkt 9, 1060, Vienna, Austria
| | - Thomas Rohr
- Institute of Applied Synthetic Chemistry, TU Wien, Getreidemarkt 9, 1060, Vienna, Austria
| | - Astrid R Mach-Aigner
- Institute of Chemical, Environmental, and Bioscience Engineering, TU Wien, Gumpendorfer Str. 1a, 1060, Vienna, Austria
| | - Marko D Mihovilovic
- Institute of Applied Synthetic Chemistry, TU Wien, Getreidemarkt 9, 1060, Vienna, Austria
| | - Florian Rudroff
- Institute of Applied Synthetic Chemistry, TU Wien, Getreidemarkt 9, 1060, Vienna, Austria.
| |
Collapse
|
7
|
Xiong T, Gao Q, Zhang J, Zhang J, Zhang C, Yue H, Liu J, Bai D, Li J. Engineering Escherichia coli with a symbiotic plasmid for the production of phenylpyruvic acid. RSC Adv 2024; 14:26580-26584. [PMID: 39175686 PMCID: PMC11339955 DOI: 10.1039/d4ra03707c] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2024] [Accepted: 07/29/2024] [Indexed: 08/24/2024] Open
Abstract
Plasmid-based microbial systems have become a major avenue for the production of pharmaceutical and chemical products; however, antibiotics are often required to maintain the stability of the plasmid. To eliminate the need for antibiotics, we developed a symbiotic system between plasmids and hosts by knocking out the essential gene of folP on the chromosome and placing it on the same plasmid as l-amino acid dehydrogenase (aadL); the resulting strain was named E. coli A06ΔfolP. To increase the copy number of aadL, different strengths of promoters were used for the expression of folP, resulting in the creation of a mutant E. coli A17ΔfolP. The yield of phenylpyruvic acid (PPA) from E. coli A17ΔfolP (4.1 ± 0.3 g L-1) was 1.9-fold that of E. coli A06ΔfolP (2.1 ± 0.2 g L-1). Next, the stability of plasmids was tested, and results showed that the plasmids could be maintained stably for 10 transfer numbers under antibiotic-free conditions. Finally, E. coli A17ΔfolP was used to produce PPA; the yield of PPA was 18.7 g L-1 within 14 h.
Collapse
Affiliation(s)
- Tianzhen Xiong
- College of Life Science, Xinyang Normal University 237 Nanhu Road Xinyang Henan 464000 China +86-13939748578
| | - Qiuyue Gao
- College of Social Science, Xinyang University 7th New Avenue West Xinyang Henan 464000 China
| | - Jiting Zhang
- College of Life Science, Xinyang Normal University 237 Nanhu Road Xinyang Henan 464000 China +86-13939748578
| | - Jiaguang Zhang
- College of Life Science, Xinyang Normal University 237 Nanhu Road Xinyang Henan 464000 China +86-13939748578
| | - Can Zhang
- College of Life Science, Xinyang Normal University 237 Nanhu Road Xinyang Henan 464000 China +86-13939748578
| | - Huidie Yue
- College of Life Science, Xinyang Normal University 237 Nanhu Road Xinyang Henan 464000 China +86-13939748578
| | - Junling Liu
- College of Life Science, Xinyang Normal University 237 Nanhu Road Xinyang Henan 464000 China +86-13939748578
| | - Dingyuan Bai
- College of Life Science, Xinyang Normal University 237 Nanhu Road Xinyang Henan 464000 China +86-13939748578
| | - Jinying Li
- College of Life Science, Xinyang Normal University 237 Nanhu Road Xinyang Henan 464000 China +86-13939748578
| |
Collapse
|
8
|
Fontana J, Sparkman-Yager D, Faulkner I, Cardiff R, Kiattisewee C, Walls A, Primo TG, Kinnunen PC, Garcia Martin H, Zalatan JG, Carothers JM. Guide RNA structure design enables combinatorial CRISPRa programs for biosynthetic profiling. Nat Commun 2024; 15:6341. [PMID: 39068154 PMCID: PMC11283517 DOI: 10.1038/s41467-024-50528-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2023] [Accepted: 07/12/2024] [Indexed: 07/30/2024] Open
Abstract
Engineering metabolism to efficiently produce chemicals from multi-step pathways requires optimizing multi-gene expression programs to achieve enzyme balance. CRISPR-Cas transcriptional control systems are emerging as important tools for programming multi-gene expression, but poor predictability of guide RNA folding can disrupt expression control. Here, we correlate efficacy of modified guide RNAs (scRNAs) for CRISPR activation (CRISPRa) in E. coli with a computational kinetic parameter describing scRNA folding rate into the active structure (rS = 0.8). This parameter also enables forward design of scRNAs, allowing us to design a system of three synthetic CRISPRa promoters that can orthogonally activate (>35-fold) expression of chosen outputs. Through combinatorial activation tuning, we profile a three-dimensional design space expressing two different biosynthetic pathways, demonstrating variable production of pteridine and human milk oligosaccharide products. This RNA design approach aids combinatorial optimization of metabolic pathways and may accelerate routine design of effective multi-gene regulation programs in bacterial hosts.
Collapse
Affiliation(s)
- Jason Fontana
- Molecular Engineering & Sciences Institute and Center for Synthetic Biology, University of Washington, Seattle, WA, USA
- Department of Chemistry, University of Washington, Seattle, WA, USA
- Department of Chemical Engineering, University of Washington, Seattle, WA, USA
| | - David Sparkman-Yager
- Molecular Engineering & Sciences Institute and Center for Synthetic Biology, University of Washington, Seattle, WA, USA
- Department of Chemical Engineering, University of Washington, Seattle, WA, USA
| | - Ian Faulkner
- Molecular Engineering & Sciences Institute and Center for Synthetic Biology, University of Washington, Seattle, WA, USA
- Department of Chemical Engineering, University of Washington, Seattle, WA, USA
| | - Ryan Cardiff
- Molecular Engineering & Sciences Institute and Center for Synthetic Biology, University of Washington, Seattle, WA, USA
- Department of Chemistry, University of Washington, Seattle, WA, USA
- Department of Chemical Engineering, University of Washington, Seattle, WA, USA
| | - Cholpisit Kiattisewee
- Molecular Engineering & Sciences Institute and Center for Synthetic Biology, University of Washington, Seattle, WA, USA
- Department of Chemistry, University of Washington, Seattle, WA, USA
- Department of Chemical Engineering, University of Washington, Seattle, WA, USA
| | - Aria Walls
- Molecular Engineering & Sciences Institute and Center for Synthetic Biology, University of Washington, Seattle, WA, USA
- Department of Chemical Engineering, University of Washington, Seattle, WA, USA
| | - Tommy G Primo
- Molecular Engineering & Sciences Institute and Center for Synthetic Biology, University of Washington, Seattle, WA, USA
- Department of Bioengineering, University of Washington, Seattle, WA, USA
| | - Patrick C Kinnunen
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- Biofuels and Bioproducts Division, DOE Joint BioEnergy Institute, Emeryville, CA, USA
- DOE Agile BioFoundry, Emeryville, CA, USA
| | - Hector Garcia Martin
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- Biofuels and Bioproducts Division, DOE Joint BioEnergy Institute, Emeryville, CA, USA
- DOE Agile BioFoundry, Emeryville, CA, USA
| | - Jesse G Zalatan
- Molecular Engineering & Sciences Institute and Center for Synthetic Biology, University of Washington, Seattle, WA, USA.
- Department of Chemistry, University of Washington, Seattle, WA, USA.
| | - James M Carothers
- Molecular Engineering & Sciences Institute and Center for Synthetic Biology, University of Washington, Seattle, WA, USA.
- Department of Chemical Engineering, University of Washington, Seattle, WA, USA.
| |
Collapse
|
9
|
Gilliot PA, Gorochowski TE. Transfer learning for cross-context prediction of protein expression from 5'UTR sequence. Nucleic Acids Res 2024; 52:e58. [PMID: 38864396 PMCID: PMC11260469 DOI: 10.1093/nar/gkae491] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2023] [Revised: 04/28/2024] [Accepted: 05/28/2024] [Indexed: 06/13/2024] Open
Abstract
Model-guided DNA sequence design can accelerate the reprogramming of living cells. It allows us to engineer more complex biological systems by removing the need to physically assemble and test each potential design. While mechanistic models of gene expression have seen some success in supporting this goal, data-centric, deep learning-based approaches often provide more accurate predictions. This accuracy, however, comes at a cost - a lack of generalization across genetic and experimental contexts that has limited their wider use outside the context in which they were trained. Here, we address this issue by demonstrating how a simple transfer learning procedure can effectively tune a pre-trained deep learning model to predict protein translation rate from 5' untranslated region (5'UTR) sequence for diverse contexts in Escherichia coli using a small number of new measurements. This allows for important model features learnt from expensive massively parallel reporter assays to be easily transferred to new settings. By releasing our trained deep learning model and complementary calibration procedure, this study acts as a starting point for continually refined model-based sequence design that builds on previous knowledge and future experimental efforts.
Collapse
Affiliation(s)
- Pierre-Aurélien Gilliot
- School of Biological Sciences, University of Bristol, 24 Tyndall Avenue, Bristol BS8 1TQ, UK
| | - Thomas E Gorochowski
- School of Biological Sciences, University of Bristol, 24 Tyndall Avenue, Bristol BS8 1TQ, UK
- BrisEngBio, School of Chemistry, University of Bristol, Cantock’s Close, Bristol BS8 1TS, UK
| |
Collapse
|
10
|
Stone A, Youssef A, Rijal S, Zhang R, Tian XJ. Context-dependent redesign of robust synthetic gene circuits. Trends Biotechnol 2024; 42:895-909. [PMID: 38320912 PMCID: PMC11223972 DOI: 10.1016/j.tibtech.2024.01.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2023] [Revised: 01/10/2024] [Accepted: 01/11/2024] [Indexed: 02/08/2024]
Abstract
Cells provide dynamic platforms for executing exogenous genetic programs in synthetic biology, resulting in highly context-dependent circuit performance. Recent years have seen an increasing interest in understanding the intricacies of circuit-host relationships, their influence on the synthetic bioengineering workflow, and in devising strategies to alleviate undesired effects. We provide an overview of how emerging circuit-host interactions, such as growth feedback and resource competition, impact both deterministic and stochastic circuit behaviors. We also emphasize control strategies for mitigating these unwanted effects. This review summarizes the latest advances and the current state of host-aware and resource-aware design of synthetic gene circuits.
Collapse
Affiliation(s)
- Austin Stone
- School of Biological and Health System Engineering, Arizona State University, Tempe, AZ 85281, USA
| | - Abdelrahaman Youssef
- School of Biological and Health System Engineering, Arizona State University, Tempe, AZ 85281, USA
| | - Sadikshya Rijal
- School of Biological and Health System Engineering, Arizona State University, Tempe, AZ 85281, USA
| | - Rong Zhang
- School of Biological and Health System Engineering, Arizona State University, Tempe, AZ 85281, USA
| | - Xiao-Jun Tian
- School of Biological and Health System Engineering, Arizona State University, Tempe, AZ 85281, USA.
| |
Collapse
|
11
|
Zelenka NR, Di Cara N, Sharma K, Sarvaharman S, Ghataora JS, Parmeggiani F, Nivala J, Abdallah ZS, Marucci L, Gorochowski TE. Data hazards in synthetic biology. Synth Biol (Oxf) 2024; 9:ysae010. [PMID: 38973982 PMCID: PMC11227101 DOI: 10.1093/synbio/ysae010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2024] [Revised: 05/17/2024] [Accepted: 06/19/2024] [Indexed: 07/09/2024] Open
Abstract
Data science is playing an increasingly important role in the design and analysis of engineered biology. This has been fueled by the development of high-throughput methods like massively parallel reporter assays, data-rich microscopy techniques, computational protein structure prediction and design, and the development of whole-cell models able to generate huge volumes of data. Although the ability to apply data-centric analyses in these contexts is appealing and increasingly simple to do, it comes with potential risks. For example, how might biases in the underlying data affect the validity of a result and what might the environmental impact of large-scale data analyses be? Here, we present a community-developed framework for assessing data hazards to help address these concerns and demonstrate its application to two synthetic biology case studies. We show the diversity of considerations that arise in common types of bioengineering projects and provide some guidelines and mitigating steps. Understanding potential issues and dangers when working with data and proactively addressing them will be essential for ensuring the appropriate use of emerging data-intensive AI methods and help increase the trustworthiness of their applications in synthetic biology.
Collapse
Affiliation(s)
- Natalie R Zelenka
- Jean Golding Institute, University of Bristol, Bristol, UK
- BrisEngBio, University of Bristol, Bristol, UK
| | - Nina Di Cara
- School of Psychological Science, University of Bristol, Bristol, UK
| | - Kieren Sharma
- School of Engineering Mathematics and Technology, University of Bristol, Bristol, UK
| | | | - Jasdeep S Ghataora
- BrisEngBio, University of Bristol, Bristol, UK
- School of Biological Sciences, University of Bristol, Bristol, UK
| | - Fabio Parmeggiani
- BrisEngBio, University of Bristol, Bristol, UK
- School of Biochemistry, University of Bristol, Bristol, UK
- School of Pharmacy and Pharmaceutical Sciences, Cardiff University, Cardiff, UK
| | - Jeff Nivala
- Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, WA, USA
| | - Zahraa S Abdallah
- School of Engineering Mathematics and Technology, University of Bristol, Bristol, UK
| | - Lucia Marucci
- BrisEngBio, University of Bristol, Bristol, UK
- School of Engineering Mathematics and Technology, University of Bristol, Bristol, UK
| | - Thomas E Gorochowski
- BrisEngBio, University of Bristol, Bristol, UK
- School of Biological Sciences, University of Bristol, Bristol, UK
| |
Collapse
|
12
|
Du F, Li Z, Li X, Zhang D, Zhang F, Zhang Z, Xu Y, Tang J, Li Y, Huang X, Gu Y, Sun X, Huang H. Optimizing multicopy chromosomal integration for stable high-performing strains. Nat Chem Biol 2024:10.1038/s41589-024-01650-0. [PMID: 38858530 DOI: 10.1038/s41589-024-01650-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Accepted: 05/15/2024] [Indexed: 06/12/2024]
Abstract
The copy number of genes in chromosomes can be modified by chromosomal integration to construct efficient microbial cell factories but the resulting genetic systems are prone to failure or instability from triggering homologous recombination in repetitive DNA sequences. Finding the optimal copy number of each gene in a pathway is also time and labor intensive. To overcome these challenges, we applied a multiple nonrepetitive coding sequence calculator that generates sets of coding DNA sequence (CDS) variants. A machine learning method was developed to calculate the optimal copy number combination of genes in a pathway. We obtained an engineered Yarrowia lipolytica strain for eicosapentaenoic acid biosynthesis in 6 months, producing the highest titer of 27.5 g l-1 in a 50-liter bioreactor. Moreover, the lycopene production in Escherichia coli was also greatly improved. Importantly, all engineered strains of Y. lipolytica, E. coli and Saccharomyces cerevisiae constructed with nonrepetitive CDSs maintained genetic stability.
Collapse
Affiliation(s)
- Fei Du
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, Nanjing, China
| | - Zijia Li
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, Nanjing, China
| | - Xin Li
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, Nanjing, China
| | - Duoduo Zhang
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, Nanjing, China
| | - Feng Zhang
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, Nanjing, China
| | - Zixu Zhang
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, Nanjing, China
| | - Yingshuang Xu
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, Nanjing, China
| | - Jin Tang
- Research Institute of Intelligent Computing, Zhejiang Lab, Hangzhou, China
| | - Yongqian Li
- Research Institute of Intelligent Computing, Zhejiang Lab, Hangzhou, China
| | - Xingxu Huang
- Research Institute of Intelligent Computing, Zhejiang Lab, Hangzhou, China
- School of Life Science and Technology, ShanghaiTech University, Shanghai, China
| | - Yang Gu
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, Nanjing, China
| | - Xiaoman Sun
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, Nanjing, China.
| | - He Huang
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, Nanjing, China.
| |
Collapse
|
13
|
Liu Z, Gillis TG, Raman S, Cui Q. A parameterized two-domain thermodynamic model explains diverse mutational effects on protein allostery. eLife 2024; 12:RP92262. [PMID: 38836839 PMCID: PMC11152574 DOI: 10.7554/elife.92262] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/06/2024] Open
Abstract
New experimental findings continue to challenge our understanding of protein allostery. Recent deep mutational scanning study showed that allosteric hotspots in the tetracycline repressor (TetR) and its homologous transcriptional factors are broadly distributed rather than spanning well-defined structural pathways as often assumed. Moreover, hotspot mutation-induced allostery loss was rescued by distributed additional mutations in a degenerate fashion. Here, we develop a two-domain thermodynamic model for TetR, which readily rationalizes these intriguing observations. The model accurately captures the in vivo activities of various mutants with changes in physically transparent parameters, allowing the data-based quantification of mutational effects using statistical inference. Our analysis reveals the intrinsic connection of intra- and inter-domain properties for allosteric regulation and illustrate epistatic interactions that are consistent with structural features of the protein. The insights gained from this study into the nature of two-domain allostery are expected to have broader implications for other multi-domain allosteric proteins.
Collapse
Affiliation(s)
- Zhuang Liu
- Department of Physics, Boston UniversityBostonUnited States
| | - Thomas G Gillis
- Department of Biochemistry, University of WisconsinMadisonUnited States
| | - Srivatsan Raman
- Department of Biochemistry, University of WisconsinMadisonUnited States
- Department of Chemistry, University of WisconsinMadisonUnited States
- Department of Bacteriology, University of WisconsinMadisonUnited States
| | - Qiang Cui
- Department of Physics, Boston UniversityBostonUnited States
- Department of Chemistry, Boston UniversityBostonUnited States
| |
Collapse
|
14
|
Ali MZ, Guharajan S, Parisutham V, Brewster RC. Regulatory properties of transcription factors with diverse mechanistic function. PLoS Comput Biol 2024; 20:e1012194. [PMID: 38857275 PMCID: PMC11192337 DOI: 10.1371/journal.pcbi.1012194] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2024] [Revised: 06/21/2024] [Accepted: 05/24/2024] [Indexed: 06/12/2024] Open
Abstract
Transcription factors (TFs) regulate the process of transcription through the modulation of different kinetic steps. Although models can often describe the observed transcriptional output of a measured gene, predicting a TFs role on a given promoter requires an understanding of how the TF alters each step of the transcription process. In this work, we use a simple model of transcription to assess the role of promoter identity, and the degree to which TFs alter binding of RNAP (stabilization) and initiation of transcription (acceleration) on three primary characteristics: the range of steady-state regulation, cell-to-cell variability in expression, and the dynamic response time of a regulated gene. We find that steady state regulation and the response time of a gene behave uniquely for TFs that regulate incoherently, i.e that speed up one step but slow the other. We also find that incoherent TFs have dynamic implications, with one type of incoherent mode configuring the promoter to respond more slowly at intermediate TF concentrations. We also demonstrate that the noise of gene expression for these TFs is sensitive to promoter strength, with a distinct non-monotonic profile that is apparent under stronger promoters. Taken together, our work uncovers the coupling between promoters and TF regulatory modes with implications for understanding natural promoters and engineering synthetic gene circuits with desired expression properties.
Collapse
Affiliation(s)
- Md Zulfikar Ali
- Department of Systems Biology, University of Massachusetts Medical School, Worcester, Massachusetts, United States of America
- Department of Microbiology and Physiological Systems, University of Massachusetts Medical School, Worcester, Massachusetts, United States of America
- Department of Geology, Physics and Environmental Science, University of Southern Indiana, Evansville, Indiana, United States of America
| | - Sunil Guharajan
- Department of Systems Biology, University of Massachusetts Medical School, Worcester, Massachusetts, United States of America
- Department of Microbiology and Physiological Systems, University of Massachusetts Medical School, Worcester, Massachusetts, United States of America
| | - Vinuselvi Parisutham
- Department of Systems Biology, University of Massachusetts Medical School, Worcester, Massachusetts, United States of America
- Department of Microbiology and Physiological Systems, University of Massachusetts Medical School, Worcester, Massachusetts, United States of America
| | - Robert C. Brewster
- Department of Systems Biology, University of Massachusetts Medical School, Worcester, Massachusetts, United States of America
- Department of Microbiology and Physiological Systems, University of Massachusetts Medical School, Worcester, Massachusetts, United States of America
| |
Collapse
|
15
|
Nishikawa KK, Chen J, Acheson JF, Harbaugh SV, Huss P, Frenkel M, Novy N, Sieren HR, Lodewyk EC, Lee DH, Chávez JL, Fox BG, Raman S. Highly multiplexed design of an allosteric transcription factor to sense novel ligands. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.07.583947. [PMID: 38496486 PMCID: PMC10942455 DOI: 10.1101/2024.03.07.583947] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/19/2024]
Abstract
Allosteric transcription factors (aTF), widely used as biosensors, have proven challenging to design for detecting novel molecules because mutation of ligand-binding residues often disrupts allostery. We developed Sensor-seq, a high-throughput platform to design and identify aTF biosensors that bind to non-native ligands. We screened a library of 17,737 variants of the aTF TtgR, a regulator of a multidrug exporter, against six non-native ligands of diverse chemical structures - four derivatives of the cancer therapeutic tamoxifen, the antimalarial drug quinine, and the opiate analog naltrexone - as well as two native flavonoid ligands, naringenin and phloretin. Sensor-seq identified novel biosensors for each of these ligands with high dynamic range and diverse specificity profiles. The structure of a naltrexone-bound design showed shape-complementary methionine-aromatic interactions driving ligand specificity. To demonstrate practical utility, we developed cell-free detection systems for naltrexone and quinine. Sensor-seq enables rapid, scalable design of new biosensors, overcoming constraints of natural biosensors.
Collapse
Affiliation(s)
- Kyle K Nishikawa
- Department of Biochemistry, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Jackie Chen
- Department of Biochemistry, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Justin F Acheson
- Department of Biochemistry, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Svetlana V Harbaugh
- 711th Human Performance Wing, Air Force Research Laboratory Wright Patterson Air Force Base, OH, USA
| | - Phil Huss
- Department of Biochemistry, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Max Frenkel
- Department of Biochemistry, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Nathan Novy
- Department of Biochemistry, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Hailey R Sieren
- Department of Biochemistry, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Ella C Lodewyk
- Department of Biochemistry, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Daniel H Lee
- Department of Biochemistry, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Jorge L Chávez
- 711th Human Performance Wing, Air Force Research Laboratory Wright Patterson Air Force Base, OH, USA
| | - Brian G Fox
- Department of Biochemistry, University of Wisconsin-Madison, Madison, Wisconsin, USA
- Great Lakes Bioenergy Research Center, University of Wisconsin-Madison, Madison, WI, USA
| | - Srivatsan Raman
- Department of Biochemistry, University of Wisconsin-Madison, Madison, Wisconsin, USA
- Department of Bacteriology, University of Wisconsin-Madison, Madison, WI, USA
- Department of Chemical and Biological Engineering, University of Wisconsin-Madison, Madison, WI, USA
- Great Lakes Bioenergy Research Center, University of Wisconsin-Madison, Madison, WI, USA
| |
Collapse
|
16
|
Bushhouse DZ, Fu J, Lucks JB. RNA folding kinetics control riboswitch sensitivity in vivo. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.29.587317. [PMID: 38585885 PMCID: PMC10996619 DOI: 10.1101/2024.03.29.587317] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/09/2024]
Abstract
Riboswitches are ligand-responsive gene-regulatory RNA elements that perform key roles in maintaining cellular homeostasis. Understanding how riboswitch sensitivity is controlled is critical to understanding how highly conserved aptamer domains are deployed in a variety of contexts with different sensitivity demands. Here we uncover new roles by which RNA folding dynamics control riboswitch sensitivity in cells. By investigating the Clostridium beijerinckii pfl ZTP riboswitch, we identify multiple mechanistic routes of altering expression platform sequence and structure to slow RNA folding, all of which enhance riboswitch sensitivity. Applying these methods to riboswitches with diverse aptamer architectures that regulate transcription and translation with ON and OFF logic demonstrates the generality of our findings, indicating that any riboswitch that operates in a kinetic regime can be sensitized by slowing expression platform folding. Comparison of the most sensitized versions of these switches to equilibrium aptamer:ligand dissociation constants suggests a limit to the sensitivities achievable by kinetic RNA switches. Our results add to the growing suite of knowledge and approaches that can be used to rationally program cotranscriptional RNA folding for biotechnology applications, and suggest general RNA folding principles for understanding dynamic RNA systems in other areas of biology.
Collapse
Affiliation(s)
- David Z. Bushhouse
- Interdisciplinary Biological Sciences Graduate Program, Northwestern University, Evanston, Illinois 60208, USA
- Center for Synthetic Biology, Northwestern University, Evanston, Illinois 60208, USA
| | - Jiayu Fu
- Interdisciplinary Biological Sciences Graduate Program, Northwestern University, Evanston, Illinois 60208, USA
- Center for Synthetic Biology, Northwestern University, Evanston, Illinois 60208, USA
| | - Julius B. Lucks
- Interdisciplinary Biological Sciences Graduate Program, Northwestern University, Evanston, Illinois 60208, USA
- Center for Synthetic Biology, Northwestern University, Evanston, Illinois 60208, USA
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, Illinois 60208, USA
- Center for Water Research, Northwestern University, Evanston, Illinois 60208, USA
- Center for Engineering Sustainability and Resilience, Northwestern University, Evanston, Illinois 60208, USA
- International Institute for Nanotechnology, Northwestern University, Evanston, Illinois 60208, USA
| |
Collapse
|
17
|
Pal U, Bachmann D, Pelzer C, Christiansen J, Blank LM, Tiso T. A genetic toolbox to empower Paracoccus pantotrophus DSM 2944 as a metabolically versatile SynBio chassis. Microb Cell Fact 2024; 23:53. [PMID: 38360576 PMCID: PMC10870620 DOI: 10.1186/s12934-024-02325-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2024] [Accepted: 02/05/2024] [Indexed: 02/17/2024] Open
Abstract
BACKGROUND To contribute to the discovery of new microbial strains with metabolic and physiological robustness and develop them into successful chasses, Paracoccus pantotrophus DSM 2944, a Gram-negative bacterium from the phylum Alphaproteobacteria and the family Rhodobacteraceae, was chosen. The strain possesses an innate ability to tolerate high salt concentrations. It utilizes diverse substrates, including cheap and renewable feedstocks, such as C1 and C2 compounds. Also, it can consume short-chain alkanes, predominately found in hydrocarbon-rich environments, making it a potential bioremediation agent. The demonstrated metabolic versatility, coupled with the synthesis of the biodegradable polymer polyhydroxyalkanoate, positions this microbial strain as a noteworthy candidate for advancing the principles of a circular bioeconomy. RESULTS The study aims to follow the chassis roadmap, as depicted by Calero and Nikel, and de Lorenzo, to transform wild-type P. pantotrophus DSM 2944 into a proficient SynBio (Synthetic Biology) chassis. The initial findings highlight the antibiotic resistance profile of this prospective SynBio chassis. Subsequently, the best origin of replication (ori) was identified as RK2. In contrast, the non-replicative ori R6K was selected for the development of a suicide plasmid necessary for genome integration or gene deletion. Moreover, when assessing the most effective method for gene transfer, it was observed that conjugation had superior efficiency compared to electroporation, while transformation by heat shock was ineffective. Robust host fitness was demonstrated by stable plasmid maintenance, while standardized gene expression using an array of synthetic promoters could be shown. pEMG-based scarless gene deletion was successfully adapted, allowing gene deletion and integration. The successful integration of a gene cassette for terephthalic acid degradation is showcased. The resulting strain can grow on both monomers of polyethylene terephthalate (PET), with an increased growth rate achieved through adaptive laboratory evolution. CONCLUSION The chassis roadmap for the development of P. pantotrophus DSM 2944 into a proficient SynBio chassis was implemented. The presented genetic toolkit allows genome editing and therewith the possibility to exploit Paracoccus for a myriad of applications.
Collapse
Affiliation(s)
- Upasana Pal
- Institute of Applied Microbiology, RWTH Aachen University, Aachen, Germany
| | - Denise Bachmann
- Institute of Applied Microbiology, RWTH Aachen University, Aachen, Germany
| | - Chiara Pelzer
- Institute of Applied Microbiology, RWTH Aachen University, Aachen, Germany
| | - Julia Christiansen
- Institute of Applied Microbiology, RWTH Aachen University, Aachen, Germany
- Chair of Microbiology, Technical University of Munich, Freising, Germany
| | - Lars M Blank
- Institute of Applied Microbiology, RWTH Aachen University, Aachen, Germany
| | - Till Tiso
- Institute of Applied Microbiology, RWTH Aachen University, Aachen, Germany.
| |
Collapse
|
18
|
Liu Z, Gillis T, Raman S, Cui Q. A parametrized two-domain thermodynamic model explains diverse mutational effects on protein allostery. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.08.06.552196. [PMID: 37662419 PMCID: PMC10473640 DOI: 10.1101/2023.08.06.552196] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/05/2023]
Abstract
New experimental findings continue to challenge our understanding of protein allostery. Recent deep mutational scanning study showed that allosteric hotspots in the tetracycline repressor (TetR) and its homologous transcriptional factors are broadly distributed rather than spanning well-defined structural pathways as often assumed. Moreover, hotspot mutation-induced allostery loss was rescued by distributed additional mutations in a degenerate fashion. Here, we develop a two-domain thermodynamic model for TetR, which readily rationalizes these intriguing observations. The model accurately captures the in vivo activities of various mutants with changes in physically transparent parameters, allowing the data-based quantification of mutational effects using statistical inference. Our analysis reveals the intrinsic connection of intra- and inter-domain properties for allosteric regulation and illustrate epistatic interactions that are consistent with structural features of the protein. The insights gained from this study into the nature of two-domain allostery are expected to have broader implications for other multidomain allosteric proteins.
Collapse
Affiliation(s)
- Zhuang Liu
- Department of Physics, Boston University, Boston, United States
| | - Thomas Gillis
- Department of Biochemistry, University of Wisconsin, Madison, United States
| | - Srivatsan Raman
- Department of Biochemistry, University of Wisconsin, Madison, United States
- Department of Chemistry, University of Wisconsin, Madison, United States
- Department of Bacteriology, University of Wisconsin, Madison, United States
| | - Qiang Cui
- Department of Physics, Boston University, Boston, United States
- Department of Chemistry, Boston University, Boston, United States
| |
Collapse
|
19
|
Rondthaler S, Sarker B, Howitz N, Shah I, Andrews LB. Toolbox of Characterized Genetic Parts for Staphylococcus aureus. ACS Synth Biol 2024; 13:103-118. [PMID: 38064657 PMCID: PMC10805105 DOI: 10.1021/acssynbio.3c00325] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2023] [Revised: 10/06/2023] [Accepted: 10/10/2023] [Indexed: 01/23/2024]
Abstract
Staphylococcus aureus is an important clinical bacterium prevalent in human-associated microbiomes and the cause of many diseases. However, S. aureus has been intractable to synthetic biology approaches due to limited characterized genetic parts for this nonmodel Gram-positive bacterium. Moreover, genetic manipulation of S. aureus has relied on cumbersome and inefficient cloning strategies. Here, we report the first standardized genetic parts toolbox for S. aureus, which includes characterized promoters, ribosome binding sites, terminators, and plasmid replicons from a variety of bacteria for precise control of gene expression. We established a standard relative expression unit (REU) for S. aureus using a plasmid reference and characterized genetic parts in standardized REUs using S. aureus ATCC 12600. We constructed promoter and terminator part plasmids that are compatible with an efficient Type IIS DNA assembly strategy to effectively build multipart DNA constructs. A library of 24 constitutive promoters was built and characterized in S. aureus, which showed a 380-fold activity range. This promoter library was also assayed in Bacillus subtilis (122-fold activity range) to demonstrate the transferability of the constitutive promoters between these Gram-positive bacteria. By applying an iterative design-build-test-learn cycle, we demonstrated the use of our toolbox for the rational design and engineering of a tetracycline sensor in S. aureus using the PXyl-TetO aTc-inducible promoter that achieved 25.8-fold induction. This toolbox greatly expands the growing number of genetic parts for Gram-positive bacteria and will allow researchers to leverage synthetic biology approaches to study and engineer cellular processes in S. aureus.
Collapse
Affiliation(s)
- Stephen
N. Rondthaler
- Department
of Chemical Engineering, University of Massachusetts
Amherst, Amherst, Massachusetts 01003, United States
| | - Biprodev Sarker
- Department
of Chemical Engineering, University of Massachusetts
Amherst, Amherst, Massachusetts 01003, United States
| | - Nathaniel Howitz
- Department
of Chemical Engineering, University of Massachusetts
Amherst, Amherst, Massachusetts 01003, United States
| | - Ishita Shah
- Department
of Chemical Engineering, University of Massachusetts
Amherst, Amherst, Massachusetts 01003, United States
| | - Lauren B. Andrews
- Department
of Chemical Engineering, University of Massachusetts
Amherst, Amherst, Massachusetts 01003, United States
- Molecular
and Cellular Biology Graduate Program, University
of Massachusetts Amherst, Amherst, Massachusetts 01003, United States
- Biotechnology
Training Program, University of Massachusetts
Amherst, Amherst, Massachusetts 01003, United States
| |
Collapse
|
20
|
Stock M, Gorochowski TE. Open-endedness in synthetic biology: A route to continual innovation for biological design. SCIENCE ADVANCES 2024; 10:eadi3621. [PMID: 38241375 DOI: 10.1126/sciadv.adi3621] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/20/2023] [Accepted: 12/20/2023] [Indexed: 01/21/2024]
Abstract
Design in synthetic biology is typically goal oriented, aiming to repurpose or optimize existing biological functions, augmenting biology with new-to-nature capabilities, or creating life-like systems from scratch. While the field has seen many advances, bottlenecks in the complexity of the systems built are emerging and designs that function in the lab often fail when used in real-world contexts. Here, we propose an open-ended approach to biological design, with the novelty of designed biology being at least as important as how well it fulfils its goal. Rather than solely focusing on optimization toward a single best design, designing with novelty in mind may allow us to move beyond the diminishing returns we see in performance for most engineered biology. Research from the artificial life community has demonstrated that embracing novelty can automatically generate innovative and unexpected solutions to challenging problems beyond local optima. Synthetic biology offers the ideal playground to explore more creative approaches to biological design.
Collapse
Affiliation(s)
- Michiel Stock
- KERMIT & Biobix, Department of Data Analysis and Mathematical Modelling, Ghent University, Ghent, Belgium
| | - Thomas E Gorochowski
- School of Biological Sciences, University of Bristol, Life Sciences Building, 24 Tyndall Avenue, Bristol BS8 1TQ, UK
- BrisEngBio, School of Chemistry, University of Bristol, Cantock's Close, Bristol BS8 1TS, UK
| |
Collapse
|
21
|
Feng H, Zhou Y, Zhang C. Encoding Genetic Circuits with DNA Barcodes Paves the Way for High-Throughput Profiling of Dose-Response Curves of Metabolite Biosensors. Methods Mol Biol 2024; 2760:309-318. [PMID: 38468096 DOI: 10.1007/978-1-0716-3658-9_18] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/13/2024]
Abstract
Metabolite biosensors, through which the intracellular metabolite concentrations could be converted to changes in gene expression, are widely used in a variety of applications according to the different output signals. However, it remains challenging to fine-tune the dose-response relationships of biosensors to meet the needs of various scenarios. On the other hand, the short read length of next-generation sequencing (NGS) has greatly limited the design capability of sequence libraries. To address these issues, we describe a DNA trackable assembly method, coupled with fluorescence-activated cell sorting and NGS (Sort-Seq), to achieve the characterization of dose-response curves in a massively parallel manner. As a proof of the concept, we constructed a malonyl-CoA biosensor library containing 5184 combinations with six levels of transcription factor dosage, four different operator positions, and 216 possible upstream enhancer sequence (UAS) designs in Saccharomyces cerevisiae BY4700. By using Sort-Seq and machine learning approach, we obtained comprehensive dose-response relationships of the combinatorial sequence space. Therefore, our pipeline provides a platform for the design, tuning, and profiling of biosensor response curves and shows great potential to facilitate the rational design of genetic circuits.
Collapse
Affiliation(s)
- Huibao Feng
- MOE Key Laboratory for Industrial Biocatalysis, Institute of Biochemical Engineering, Department of Chemical Engineering, Tsinghua University, Beijing, China
| | - Yikang Zhou
- MOE Key Laboratory for Industrial Biocatalysis, Institute of Biochemical Engineering, Department of Chemical Engineering, Tsinghua University, Beijing, China
| | - Chong Zhang
- MOE Key Laboratory for Industrial Biocatalysis, Institute of Biochemical Engineering, Department of Chemical Engineering, Tsinghua University, Beijing, China.
- Center for Synthetic and Systems Biology, Tsinghua University, Beijing, China.
| |
Collapse
|
22
|
Feng H, Li F, Wang T, Xing XH, Zeng AP, Zhang C. Deep-learning-assisted Sort-Seq enables high-throughput profiling of gene expression characteristics with high precision. SCIENCE ADVANCES 2023; 9:eadg5296. [PMID: 37939173 PMCID: PMC10631719 DOI: 10.1126/sciadv.adg5296] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/04/2023] [Accepted: 10/06/2023] [Indexed: 11/10/2023]
Abstract
Owing to the nondeterministic and nonlinear nature of gene expression, the steady-state intracellular protein abundance of a clonal population forms a distribution. The characteristics of this distribution, including expression strength and noise, are closely related to cellular behavior. However, quantitative description of these characteristics has so far relied on arrayed methods, which are time-consuming and labor-intensive. To address this issue, we propose a deep-learning-assisted Sort-Seq approach (dSort-Seq) in this work, enabling high-throughput profiling of expression properties with high precision. We demonstrated the validity of dSort-Seq for large-scale assaying of the dose-response relationships of biosensors. In addition, we comprehensively investigated the contribution of transcription and translation to noise production in Escherichia coli, from which we found that the expression noise is strongly coupled with the mean expression level. We also found that the transcriptional interference caused by overlapping RpoD-binding sites contributes to noise production, which suggested the existence of a simple and feasible noise control strategy in E. coli.
Collapse
Affiliation(s)
- Huibao Feng
- MOE Key Laboratory for Industrial Biocatalysis, Institute of Biochemical Engineering, Department of Chemical Engineering, Tsinghua University, Beijing 100084, China
| | - Fan Li
- MOE Key Laboratory for Industrial Biocatalysis, Institute of Biochemical Engineering, Department of Chemical Engineering, Tsinghua University, Beijing 100084, China
| | - Tianmin Wang
- Tsinghua-Peking Center for Life Sciences, School of Medicine, Tsinghua University, Beijing 100084, China
- School of Life Science and Technology, ShanghaiTech University, Shanghai 201210, China
| | - Xin-hui Xing
- MOE Key Laboratory for Industrial Biocatalysis, Institute of Biochemical Engineering, Department of Chemical Engineering, Tsinghua University, Beijing 100084, China
- Shenzhen International Graduate School, Tsinghua University, Shenzhen 518055, China
| | - An-ping Zeng
- Institute of Bioprocess and Biosystems Engineering, Hamburg University of Technology, Hamburg 21073, Germany
- Center of Synthetic Biology and Integrated Bioengineering, School of Engineering, Westlake University, Hangzhou 310024, China
| | - Chong Zhang
- MOE Key Laboratory for Industrial Biocatalysis, Institute of Biochemical Engineering, Department of Chemical Engineering, Tsinghua University, Beijing 100084, China
- Center for Synthetic and Systems Biology, Tsinghua University, Beijing 100084, China
| |
Collapse
|
23
|
Capponi S, Daniels KG. Harnessing the power of artificial intelligence to advance cell therapy. Immunol Rev 2023; 320:147-165. [PMID: 37415280 DOI: 10.1111/imr.13236] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2023] [Accepted: 06/17/2023] [Indexed: 07/08/2023]
Abstract
Cell therapies are powerful technologies in which human cells are reprogrammed for therapeutic applications such as killing cancer cells or replacing defective cells. The technologies underlying cell therapies are increasing in effectiveness and complexity, making rational engineering of cell therapies more difficult. Creating the next generation of cell therapies will require improved experimental approaches and predictive models. Artificial intelligence (AI) and machine learning (ML) methods have revolutionized several fields in biology including genome annotation, protein structure prediction, and enzyme design. In this review, we discuss the potential of combining experimental library screens and AI to build predictive models for the development of modular cell therapy technologies. Advances in DNA synthesis and high-throughput screening techniques enable the construction and screening of libraries of modular cell therapy constructs. AI and ML models trained on this screening data can accelerate the development of cell therapies by generating predictive models, design rules, and improved designs.
Collapse
Affiliation(s)
- Sara Capponi
- Department of Functional Genomics and Cellular Engineering, IBM Almaden Research Center, San Jose, California, USA
- Center for Cellular Construction, San Francisco, California, USA
| | - Kyle G Daniels
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, California, USA
- Department of Genetics, Stanford University School of Medicine, Stanford, California, USA
| |
Collapse
|
24
|
Boulas I, Bruno L, Rimsky S, Espeli O, Junier I, Rivoire O. Assessing in vivo the impact of gene context on transcription through DNA supercoiling. Nucleic Acids Res 2023; 51:9509-9521. [PMID: 37667073 PMCID: PMC10570042 DOI: 10.1093/nar/gkad688] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2022] [Revised: 07/24/2023] [Accepted: 08/09/2023] [Indexed: 09/06/2023] Open
Abstract
Gene context can have significant impact on gene expression but is currently not integrated in quantitative models of gene regulation despite known biophysical principles and quantitative in vitro measurements. Conceptually, the simplest gene context consists of a single gene framed by two topological barriers, known as the twin transcriptional-loop model, which illustrates the interplay between transcription and DNA supercoiling. In vivo, DNA supercoiling is additionally modulated by topoisomerases, whose modus operandi remains to be quantified. Here, we bridge the gap between theory and in vivo properties by realizing in Escherichia coli the twin transcriptional-loop model and by measuring how gene expression varies with promoters and distances to the topological barriers. We find that gene expression depends on the distance to the upstream barrier but not to the downstream barrier, with a promoter-dependent intensity. We rationalize these findings with a first-principle biophysical model of DNA transcription. Our results are explained if TopoI and gyrase both act specifically, respectively upstream and downstream of the gene, with antagonistic effects of TopoI, which can repress initiation while facilitating elongation. Altogether, our work sets the foundations for a systematic and quantitative description of the impact of gene context on gene regulation.
Collapse
Affiliation(s)
- Ihab Boulas
- Center for Interdisciplinary Research in Biology (CIRB), Collège de France, CNRS, INSERM, Université PSL, Paris, France
| | - Lisa Bruno
- Center for Interdisciplinary Research in Biology (CIRB), Collège de France, CNRS, INSERM, Université PSL, Paris, France
| | - Sylvie Rimsky
- Center for Interdisciplinary Research in Biology (CIRB), Collège de France, CNRS, INSERM, Université PSL, Paris, France
| | - Olivier Espeli
- Center for Interdisciplinary Research in Biology (CIRB), Collège de France, CNRS, INSERM, Université PSL, Paris, France
| | - Ivan Junier
- Univ. Grenoble Alpes, CNRS, UMR 5525, VetAgro Sup, Grenoble INP, TIMC, 38000 Grenoble, France
| | - Olivier Rivoire
- Center for Interdisciplinary Research in Biology (CIRB), Collège de France, CNRS, INSERM, Université PSL, Paris, France
- Gulliver, ESPCI, CNRS, Université PSL, Paris, France
| |
Collapse
|
25
|
Chen X, Kaiser CM. AP profiling resolves co-translational folding pathway and chaperone interactions in vivo. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.09.01.555749. [PMID: 37693575 PMCID: PMC10491307 DOI: 10.1101/2023.09.01.555749] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/12/2023]
Abstract
Natural proteins have evolved to fold robustly along specific pathways. Folding begins during synthesis, guided by interactions of the nascent protein with the ribosome and molecular chaperones. However, the timing and progression of co-translational folding remain largely elusive, in part because the process is difficult to measure in the natural environment of the cytosol. We developed a high-throughput method to quantify co-translational folding in live cells that we term Arrest Peptide profiling (AP profiling). We employed AP profiling to delineate co-translational folding for a set of GTPase domains with very similar structures, defining how topology shapes folding pathways. Genetic ablation of major nascent chain-binding chaperones resulted in localized folding changes that suggest how functional redundancies among chaperones are achieved by distinct interactions with the nascent protein. Collectively, our studies provide a window into cellular folding pathways of complex proteins and pave the way for systematic studies on nascent protein folding at unprecedented resolution and throughput.
Collapse
Affiliation(s)
- Xiuqi Chen
- CMDB Graduate Program, Johns Hopkins University, Baltimore, MD, United States
- Department of Biology, Johns Hopkins University, Baltimore, MD, United States
- Present address: Department of Bionanoscience, Kavli Institute of Nanoscience Delft, Delft University of Technology, Delft, the Netherlands
| | - Christian M. Kaiser
- Department of Biology, Johns Hopkins University, Baltimore, MD, United States
- Department of Biophysics, Johns Hopkins University, Baltimore, MD, United States
| |
Collapse
|
26
|
Liu W, Zuo S, Shao Y, Bi K, Zhao J, Huang L, Xu Z, Lian J. Retron-mediated multiplex genome editing and continuous evolution in Escherichia coli. Nucleic Acids Res 2023; 51:8293-8307. [PMID: 37471041 PMCID: PMC10450171 DOI: 10.1093/nar/gkad607] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Revised: 07/02/2023] [Accepted: 07/07/2023] [Indexed: 07/21/2023] Open
Abstract
While there are several genome editing techniques available, few are suitable for dynamic and simultaneous mutagenesis of arbitrary targeted sequences in prokaryotes. Here, to address these limitations, we present a versatile and multiplex retron-mediated genome editing system (REGES). First, through systematic optimization of REGES, we achieve efficiency of ∼100%, 85 ± 3%, 69 ± 14% and 25 ± 14% for single-, double-, triple- and quadruple-locus genome editing, respectively. In addition, we employ REGES to generate pooled and barcoded variant libraries with degenerate RBS sequences to fine-tune the expression level of endogenous and exogenous genes, such as transcriptional factors to improve ethanol tolerance and biotin biosynthesis. Finally, we demonstrate REGES-mediated continuous in vivo protein evolution, by combining retron, polymerase-mediated base editing and error-prone transcription. By these case studies, we demonstrate REGES as a powerful multiplex genome editing and continuous evolution tool with broad applications in synthetic biology and metabolic engineering.
Collapse
Affiliation(s)
- Wenqian Liu
- Key Laboratory of Biomass Chemical Engineering of Ministry of Education, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310027, China
- Institute of Bioengineering, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310027, China
| | - Siqi Zuo
- Key Laboratory of Biomass Chemical Engineering of Ministry of Education, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310027, China
- Institute of Bioengineering, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310027, China
| | - Youran Shao
- Key Laboratory of Biomass Chemical Engineering of Ministry of Education, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310027, China
- Institute of Bioengineering, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310027, China
| | - Ke Bi
- Key Laboratory of Biomass Chemical Engineering of Ministry of Education, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310027, China
- Institute of Bioengineering, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310027, China
| | - Jiarun Zhao
- Key Laboratory of Biomass Chemical Engineering of Ministry of Education, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310027, China
- Institute of Bioengineering, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310027, China
| | - Lei Huang
- Key Laboratory of Biomass Chemical Engineering of Ministry of Education, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310027, China
- Institute of Bioengineering, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310027, China
| | - Zhinan Xu
- Key Laboratory of Biomass Chemical Engineering of Ministry of Education, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310027, China
- Institute of Bioengineering, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310027, China
| | - Jiazhang Lian
- Key Laboratory of Biomass Chemical Engineering of Ministry of Education, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310027, China
- Institute of Bioengineering, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310027, China
- ZJU-Hangzhou Global Scientific and Technological Innovation Center, Zhejiang University, Hangzhou 311215, China
| |
Collapse
|
27
|
Dawes P, Murray LF, Olson MN, Barton NJ, Smullen M, Suresh M, Yan G, Zhang Y, Fernandez-Fontaine A, English J, Uddin M, Pak C, Church GM, Chan Y, Lim ET. oFlowSeq: a quantitative approach to identify protein coding mutations affecting cell type enrichment using mosaic CRISPR-Cas9 edited cerebral organoids. Hum Genet 2023; 142:1281-1291. [PMID: 36877372 PMCID: PMC10807401 DOI: 10.1007/s00439-023-02534-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2022] [Accepted: 02/19/2023] [Indexed: 03/07/2023]
Abstract
Cerebral organoids are comprised of diverse cell types found in the developing human brain, and can be leveraged in the identification of critical cell types perturbed by genetic risk variants in common, neuropsychiatric disorders. There is great interest in developing high-throughput technologies to associate genetic variants with cell types. Here, we describe a high-throughput, quantitative approach (oFlowSeq) by utilizing CRISPR-Cas9, FACS sorting, and next-generation sequencing. Using oFlowSeq, we found that deleterious mutations in autism-associated gene KCTD13 resulted in increased proportions of Nestin+ cells and decreased proportions of TRA-1-60+ cells within mosaic cerebral organoids. We further identified that a locus-wide CRISPR-Cas9 survey of another 18 genes in the 16p11.2 locus resulted in most genes with > 2% maximum editing efficiencies for short and long indels, suggesting a high feasibility for an unbiased, locus-wide experiment using oFlowSeq. Our approach presents a novel method to identify genotype-to-cell type imbalances in an unbiased, high-throughput, quantitative manner.
Collapse
Affiliation(s)
- Pepper Dawes
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA
- Department of Neurology, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA
- NeuroNexus Institute, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA
- Department of Molecular, Cell and Cancer Biology, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA
| | - Liam F Murray
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA
- Department of Neurology, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA
- NeuroNexus Institute, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA
- Department of Molecular, Cell and Cancer Biology, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA
| | - Meagan N Olson
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA
- Department of Neurology, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA
- NeuroNexus Institute, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA
- Department of Molecular, Cell and Cancer Biology, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA
| | - Nathaniel J Barton
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA
- Department of Neurology, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA
- NeuroNexus Institute, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA
- Department of Molecular, Cell and Cancer Biology, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA
| | - Molly Smullen
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA
- Department of Neurology, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA
- NeuroNexus Institute, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA
- Department of Molecular, Cell and Cancer Biology, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA
| | - Madhusoodhanan Suresh
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA
- Department of Neurology, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA
- NeuroNexus Institute, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA
- Department of Molecular, Cell and Cancer Biology, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA
| | - Guang Yan
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA
- Department of Neurology, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA
- NeuroNexus Institute, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA
- Department of Molecular, Cell and Cancer Biology, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA
| | - Yucheng Zhang
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA
- Department of Neurology, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA
- NeuroNexus Institute, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA
- Department of Molecular, Cell and Cancer Biology, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA
| | - Aria Fernandez-Fontaine
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA
- Department of Neurology, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA
- NeuroNexus Institute, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA
- Department of Molecular, Cell and Cancer Biology, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA
| | - Jay English
- Department of Biochemistry and Molecular Biology, University of Massachusetts Amherst, Amherst, MA, 01003, USA
| | - Mohammed Uddin
- College of Medicine, Mohammed Bin Rashid University of Medicine and Health Sciences, Dubai, UAE
- Cellular Intelligence (Ci) Lab, GenomeArc Inc., Toronto, ON, Canada
| | - ChangHui Pak
- Department of Biochemistry and Molecular Biology, University of Massachusetts Amherst, Amherst, MA, 01003, USA
| | - George M Church
- Department of Genetics, Harvard Medical School, Boston, MA, 02115, USA
- Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, MA, 02115, USA
| | - Yingleong Chan
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA
- Department of Neurology, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA
- NeuroNexus Institute, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA
| | - Elaine T Lim
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA.
- Department of Neurology, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA.
- NeuroNexus Institute, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA.
- Department of Molecular, Cell and Cancer Biology, University of Massachusetts Chan Medical School, Worcester, MA, 01605, USA.
| |
Collapse
|
28
|
Wang Y, Li S, Xue N, Wang L, Zhang X, Zhao L, Guo Y, Zhang Y, Wang M. Modulating Sensitivity of an Erythromycin Biosensor for Precise High-Throughput Screening of Strains with Different Characteristics. ACS Synth Biol 2023; 12:1761-1771. [PMID: 37198736 DOI: 10.1021/acssynbio.3c00059] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/19/2023]
Abstract
Genetically encoded biosensors are powerful tools for product-driven high-throughput screening in synthetic biology and metabolic engineering. However, most biosensors can only properly function in a limited concentration cutoff, and the incompatible performance characteristics of biosensors will lead to false positives or failure in screening. The transcription factor (TF)-based biosensors are usually organized in modular architecture and function in a regulator-depended manner, whose performance properties can be fine-tuned by modifying the expression level of the TF. In this study, we modulated the performance characteristics, including sensitivity and operating range, of an MphR-based erythromycin biosensor by fine-adjusting regulator expression levels via ribosome-binding site (RBS) engineering and obtained a panel of biosensors with varied sensitivities by iterative fluorescence-assisted cell sorting (FACS) in Escherichia coli to accommodate different screening purposes. To exemplify their application potential, two engineered biosensors with 10-fold different sensitivities were employed in the precise high-throughput screening by microfluidic-based fluorescence-activated droplet sorting (FADS) of Saccharopolyspora erythraea mutant libraries with different starting erythromycin productions, and mutants representing as high as 6.8 folds and over 100% of production improvements were obtained starting from the wild-type strain and the high-producing industrial strain, respectively. This work demonstrated a simple strategy to engineer biosensor performance properties, which was significant to stepwise strain engineering and production improvement.
Collapse
Affiliation(s)
- Yan Wang
- College of Biotechnology, Tianjin University of Science and Technology, Tianjin 300457, China
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China
- Haihe Laboratory of Synthetic Biology, Tianjin 300308, China
| | - Shixin Li
- College of Biotechnology, Tianjin University of Science and Technology, Tianjin 300457, China
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China
| | - Ning Xue
- College of Biotechnology, Tianjin University of Science and Technology, Tianjin 300457, China
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China
| | - Lixian Wang
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China
| | - Xuemei Zhang
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China
- School of Life Sciences, University of Science and Technology of China, Hefei 230026, China
| | - Longqian Zhao
- College of Biotechnology, Tianjin University of Science and Technology, Tianjin 300457, China
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China
| | - Yanmei Guo
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China
| | - Yue Zhang
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China
| | - Meng Wang
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China
| |
Collapse
|
29
|
Zhou A, Kirkpatrick LD, Ornelas IJ, Washington LJ, Hummel NFC, Gee CW, Tang SN, Barnum CR, Scheller HV, Shih PM. A Suite of Constitutive Promoters for Tuning Gene Expression in Plants. ACS Synth Biol 2023; 12:1533-1545. [PMID: 37083366 DOI: 10.1021/acssynbio.3c00075] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/22/2023]
Abstract
The need for convenient tools to express transgenes over a large dynamic range is pervasive throughout plant synthetic biology; however, current efforts are largely limited by the heavy reliance on a small set of strong promoters, precluding more nuanced and refined engineering endeavors in planta. To address this technical gap, we characterize a suite of constitutive promoters that span a wide range of transcriptional levels and develop a GoldenGate-based plasmid toolkit named PCONS, optimized for versatile cloning and rapid testing of transgene expression at varying strengths. We demonstrate how easy access to a stepwise gradient of expression levels can be used for optimizing synthetic transcriptional systems and the production of small molecules in planta. We also systematically investigate the potential of using PCONS as an internal standard in plant biology experimental design, establishing the best practices for signal normalization in experiments. Although our library has primarily been developed for optimizing expression in N. benthamiana, we demonstrate the translatability of our promoters across distantly related species using a multiplexed reporter assay with barcoded transcripts. Our findings showcase the advantages of the PCONS library as an invaluable toolkit for plant synthetic biology.
Collapse
Affiliation(s)
- Andy Zhou
- Department of Plant and Microbial Biology, University of California, Berkeley, Berkeley, California 94720, United States
- Feedstocks Division, Joint BioEnergy Institute, Emeryville, California 94608, United States
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, California 94705, United States
| | - Liam D Kirkpatrick
- Department of Plant and Microbial Biology, University of California, Berkeley, Berkeley, California 94720, United States
- Feedstocks Division, Joint BioEnergy Institute, Emeryville, California 94608, United States
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, California 94705, United States
| | - Izaiah J Ornelas
- Feedstocks Division, Joint BioEnergy Institute, Emeryville, California 94608, United States
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, California 94705, United States
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, California 94720, United States
| | - Lorenzo J Washington
- Department of Plant and Microbial Biology, University of California, Berkeley, Berkeley, California 94720, United States
- Feedstocks Division, Joint BioEnergy Institute, Emeryville, California 94608, United States
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, California 94705, United States
| | - Niklas F C Hummel
- Feedstocks Division, Joint BioEnergy Institute, Emeryville, California 94608, United States
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, California 94705, United States
| | - Christopher W Gee
- Feedstocks Division, Joint BioEnergy Institute, Emeryville, California 94608, United States
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, California 94705, United States
| | - Sophia N Tang
- Feedstocks Division, Joint BioEnergy Institute, Emeryville, California 94608, United States
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, California 94705, United States
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, California 94720, United States
| | - Collin R Barnum
- Biochemistry, Molecular, Cellular and Developmental Biology Graduate Group, University of California, Davis, California 95616, United States
| | - Henrik V Scheller
- Department of Plant and Microbial Biology, University of California, Berkeley, Berkeley, California 94720, United States
- Feedstocks Division, Joint BioEnergy Institute, Emeryville, California 94608, United States
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, California 94705, United States
| | - Patrick M Shih
- Department of Plant and Microbial Biology, University of California, Berkeley, Berkeley, California 94720, United States
- Feedstocks Division, Joint BioEnergy Institute, Emeryville, California 94608, United States
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, California 94705, United States
- Innovative Genomics Institute, University of California, Berkeley, California 94720, United States
| |
Collapse
|
30
|
Höllerer S, Jeschek M. Ultradeep characterisation of translational sequence determinants refutes rare-codon hypothesis and unveils quadruplet base pairing of initiator tRNA and transcript. Nucleic Acids Res 2023; 51:2377-2396. [PMID: 36727459 PMCID: PMC10018350 DOI: 10.1093/nar/gkad040] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2022] [Revised: 12/05/2022] [Accepted: 01/13/2023] [Indexed: 02/03/2023] Open
Abstract
Translation is a key determinant of gene expression and an important biotechnological engineering target. In bacteria, 5'-untranslated region (5'-UTR) and coding sequence (CDS) are well-known mRNA parts controlling translation and thus cellular protein levels. However, the complex interaction of 5'-UTR and CDS has so far only been studied for few sequences leading to non-generalisable and partly contradictory conclusions. Herein, we systematically assess the dynamic translation from over 1.2 million 5'-UTR-CDS pairs in Escherichia coli to investigate their collective effect using a new method for ultradeep sequence-function mapping. This allows us to disentangle and precisely quantify effects of various sequence determinants of translation. We find that 5'-UTR and CDS individually account for 53% and 20% of variance in translation, respectively, and show conclusively that, contrary to a common hypothesis, tRNA abundance does not explain expression changes between CDSs with different synonymous codons. Moreover, the obtained large-scale data provide clear experimental evidence for a base-pairing interaction between initiator tRNA and mRNA beyond the anticodon-codon interaction, an effect that is often masked for individual sequences and therefore inaccessible to low-throughput approaches. Our study highlights the indispensability of ultradeep sequence-function mapping to accurately determine the contribution of parts and phenomena involved in gene regulation.
Collapse
Affiliation(s)
- Simon Höllerer
- Department of Biosystems Science and Engineering, Swiss Federal Institute of Technology – ETH Zurich, Basel CH-4058, Switzerland
| | - Markus Jeschek
- Department of Biosystems Science and Engineering, Swiss Federal Institute of Technology – ETH Zurich, Basel CH-4058, Switzerland
- Institute of Microbiology, Synthetic Microbiology Group, University of Regensburg, Regensburg D-93053, Germany
| |
Collapse
|
31
|
O'Connell RW, Rai K, Piepergerdes TC, Samra KD, Wilson JA, Lin S, Zhang TH, Ramos EM, Sun A, Kille B, Curry KD, Rocks JW, Treangen TJ, Mehta P, Bashor CJ. Ultra-high throughput mapping of genetic design space. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.16.532704. [PMID: 36993481 PMCID: PMC10055055 DOI: 10.1101/2023.03.16.532704] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Massively parallel genetic screens have been used to map sequence-to-function relationships for a variety of genetic elements. However, because these approaches only interrogate short sequences, it remains challenging to perform high throughput (HT) assays on constructs containing combinations of sequence elements arranged across multi-kb length scales. Overcoming this barrier could accelerate synthetic biology; by screening diverse gene circuit designs, "composition-to-function" mappings could be created that reveal genetic part composability rules and enable rapid identification of behavior-optimized variants. Here, we introduce CLASSIC, a generalizable genetic screening platform that combines long- and short-read next-generation sequencing (NGS) modalities to quantitatively assess pooled libraries of DNA constructs of arbitrary length. We show that CLASSIC can measure expression profiles of >10 5 drug-inducible gene circuit designs (ranging from 6-9 kb) in a single experiment in human cells. Using statistical inference and machine learning (ML) approaches, we demonstrate that data obtained with CLASSIC enables predictive modeling of an entire circuit design landscape, offering critical insight into underlying design principles. Our work shows that by expanding the throughput and understanding gained with each design-build-test-learn (DBTL) cycle, CLASSIC dramatically augments the pace and scale of synthetic biology and establishes an experimental basis for data-driven design of complex genetic systems.
Collapse
|
32
|
Fristot E, Bessede T, Camacho Rufino M, Mayonove P, Chang HJ, Zuniga A, Michon AL, Godreuil S, Bonnet J, Cambray G. An optimized electrotransformation protocol for Lactobacillus jensenii. PLoS One 2023; 18:e0280935. [PMID: 36800374 PMCID: PMC9937494 DOI: 10.1371/journal.pone.0280935] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2021] [Accepted: 01/12/2023] [Indexed: 02/18/2023] Open
Abstract
Engineered bacteria are promising candidates for in situ detection and treatment of diseases. The female uro-genital tract presents several pathologies, such as sexually transmitted diseases or genital cancer, that could benefit from such technology. While bacteria from the gut microbiome are increasingly engineered, the use of chassis isolated from the female uro-genital resident flora has been limited. A major hurdle to implement the experimental throughput required for efficient engineering in these non-model bacteria is their low transformability. Here we report an optimized electrotransformation protocol for Lactobacillus jensenii, one the most widespread species across vaginal microflora. Starting from classical conditions, we optimized buffers, electric field parameters, cuvette type and DNA quantity to achieve an 80-fold improvement in transformation efficiency, with up to 3.5·103 CFUs/μg of DNA in L. jensenii ATCC 25258. We also identify several plasmids that are maintained and support reporter gene expression in L. jensenii. Finally, we demonstrate that our protocol provides increased transformability in three independent clinical isolates of L. jensenii. This work will facilitate the genetic engineering of L. jensenii and enable its use for addressing challenges in gynecological healthcare.
Collapse
Affiliation(s)
- Elsa Fristot
- Centre de Biologie Structurale (CBS), University of Montpellier, INSERM U 1054, CNRS UMR 5048, Montpellier, France
| | - Thomas Bessede
- Centre de Biologie Structurale (CBS), University of Montpellier, INSERM U 1054, CNRS UMR 5048, Montpellier, France
| | - Miguel Camacho Rufino
- Centre de Biologie Structurale (CBS), University of Montpellier, INSERM U 1054, CNRS UMR 5048, Montpellier, France
| | - Pauline Mayonove
- Centre de Biologie Structurale (CBS), University of Montpellier, INSERM U 1054, CNRS UMR 5048, Montpellier, France
| | - Hung-Ju Chang
- Centre de Biologie Structurale (CBS), University of Montpellier, INSERM U 1054, CNRS UMR 5048, Montpellier, France
| | - Ana Zuniga
- Centre de Biologie Structurale (CBS), University of Montpellier, INSERM U 1054, CNRS UMR 5048, Montpellier, France
| | - Anne-Laure Michon
- Diversité des Génomes et Interactions Microorganismes Insectes (DGIMI), University of Montpellier, INRAE UMR1333, Montpellier, France
| | - Sylvain Godreuil
- Service de Bactériologie, Hôpital Arnaud de Villeneuve—CHU de Montpellier, Montpellier, France
| | - Jérôme Bonnet
- Centre de Biologie Structurale (CBS), University of Montpellier, INSERM U 1054, CNRS UMR 5048, Montpellier, France
- * E-mail: (GC); (JB)
| | - Guillaume Cambray
- Centre de Biologie Structurale (CBS), University of Montpellier, INSERM U 1054, CNRS UMR 5048, Montpellier, France
- Diversité des Génomes et Interactions Microorganismes Insectes (DGIMI), University of Montpellier, INRAE UMR1333, Montpellier, France
- * E-mail: (GC); (JB)
| |
Collapse
|
33
|
Satta A, Esquirol L, Ebert BE. Current Metabolic Engineering Strategies for Photosynthetic Bioproduction in Cyanobacteria. Microorganisms 2023; 11:455. [PMID: 36838420 PMCID: PMC9964548 DOI: 10.3390/microorganisms11020455] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2022] [Revised: 02/04/2023] [Accepted: 02/09/2023] [Indexed: 02/16/2023] Open
Abstract
Cyanobacteria are photosynthetic microorganisms capable of using solar energy to convert CO2 and H2O into O2 and energy-rich organic compounds, thus enabling sustainable production of a wide range of bio-products. More and more strains of cyanobacteria are identified that show great promise as cell platforms for the generation of bioproducts. However, strain development is still required to optimize their biosynthesis and increase titers for industrial applications. This review describes the most well-known, newest and most promising strains available to the community and gives an overview of current cyanobacterial biotechnology and the latest innovative strategies used for engineering cyanobacteria. We summarize advanced synthetic biology tools for modulating gene expression and their use in metabolic pathway engineering to increase the production of value-added compounds, such as terpenoids, fatty acids and sugars, to provide a go-to source for scientists starting research in cyanobacterial metabolic engineering.
Collapse
Affiliation(s)
- Alessandro Satta
- Australian Institute for Bioengineering and Nanotechnology, The University of Queensland, St Lucia, QLD 4072, Australia
- Department of Biology, University of Padua, 35100 Padua, Italy
| | - Lygie Esquirol
- Centre for Cell Factories and Biopolymers, Griffith Institute for Drug Discovery, Griffith University, Natha, QLD 4111, Australia
| | - Birgitta E. Ebert
- Australian Institute for Bioengineering and Nanotechnology, The University of Queensland, St Lucia, QLD 4072, Australia
| |
Collapse
|
34
|
Zabolotskii AI, Kozlovskiy SV, Katrukha AG. The Influence of the Nucleotide Composition of Genes and Gene Regulatory Elements on the Efficiency of Protein Expression in Escherichia coli. BIOCHEMISTRY (MOSCOW) 2023; 88:S176-S191. [PMID: 37069120 DOI: 10.1134/s0006297923140109] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/22/2023]
Abstract
Recombinant proteins expressed in Escherichia coli are widely used in biochemical research and industrial processes. At the same time, achieving higher protein expression levels and correct protein folding still remains the key problem, since optimization of nutrient media, growth conditions, and methods for induction of protein synthesis do not always lead to the desired result. Often, low protein expression is determined by the sequences of the expressed genes and their regulatory regions. The genetic code is degenerated; 18 out of 20 amino acids are encoded by more than one codon. Choosing between synonymous codons in the coding sequence can significantly affect the level of protein expression and protein folding due to the influence of the gene nucleotide composition on the probability of formation of secondary mRNA structures that affect the ribosome binding at the translation initiation phase, as well as the ribosome movement along the mRNA during elongation, which, in turn, influences the mRNA degradation and the folding of the nascent protein. The nucleotide composition of the mRNA untranslated regions, in particular the promoter and Shine-Dalgarno sequences, also affects the efficiency of mRNA transcription, translation, and degradation. In this review, we describe the genetic principles that determine the efficiency of protein production in Escherichia coli.
Collapse
Affiliation(s)
- Artur I Zabolotskii
- Faculty of Biology, Lomonosov Moscow State University, Moscow, 119991, Russia.
| | | | - Alexey G Katrukha
- Faculty of Biology, Lomonosov Moscow State University, Moscow, 119991, Russia
| |
Collapse
|
35
|
Murray L, Olson MN, Barton N, Dawes P, Chan Y, Lim ET. FACS-Based Sequencing Approach to Evaluate Cell Type to Genotype Associations Using Cerebral Organoids. Methods Mol Biol 2023; 2683:193-199. [PMID: 37300776 DOI: 10.1007/978-1-0716-3287-1_15] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Recent technological developments have led to widespread applications of large-scale transcriptomics-based sequencing methods to identify genotype-to-cell type associations. Here we describe a fluorescence-activated cell sorting (FACS)-based sequencing method to utilize CRISPR/Cas9 edited mosaic cerebral organoids to identify or validate genotype-to-cell type associations. Our approach is high-throughput and quantitative and uses internal controls to enable comparisons of the results across different antibody markers and experiments.
Collapse
Affiliation(s)
- Liam Murray
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, MA, USA
- Department of Neurology, University of Massachusetts Chan Medical School, Worcester, MA, USA
- NeuroNexus Institute, University of Massachusetts Chan Medical School, Worcester, MA, USA
- Department of Molecular, Cell and Cancer Biology, University of Massachusetts Chan Medical School, Worcester, MA, USA
| | - Meagan N Olson
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, MA, USA
- Department of Neurology, University of Massachusetts Chan Medical School, Worcester, MA, USA
- NeuroNexus Institute, University of Massachusetts Chan Medical School, Worcester, MA, USA
- Department of Molecular, Cell and Cancer Biology, University of Massachusetts Chan Medical School, Worcester, MA, USA
| | - Nathaniel Barton
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, MA, USA
- Department of Neurology, University of Massachusetts Chan Medical School, Worcester, MA, USA
- NeuroNexus Institute, University of Massachusetts Chan Medical School, Worcester, MA, USA
- Department of Molecular, Cell and Cancer Biology, University of Massachusetts Chan Medical School, Worcester, MA, USA
| | - Pepper Dawes
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, MA, USA
- Department of Neurology, University of Massachusetts Chan Medical School, Worcester, MA, USA
- NeuroNexus Institute, University of Massachusetts Chan Medical School, Worcester, MA, USA
- Department of Molecular, Cell and Cancer Biology, University of Massachusetts Chan Medical School, Worcester, MA, USA
| | - Yingleong Chan
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, MA, USA
- Department of Neurology, University of Massachusetts Chan Medical School, Worcester, MA, USA
- NeuroNexus Institute, University of Massachusetts Chan Medical School, Worcester, MA, USA
| | - Elaine T Lim
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, MA, USA.
- Department of Neurology, University of Massachusetts Chan Medical School, Worcester, MA, USA.
- NeuroNexus Institute, University of Massachusetts Chan Medical School, Worcester, MA, USA.
- Department of Molecular, Cell and Cancer Biology, University of Massachusetts Chan Medical School, Worcester, MA, USA.
| |
Collapse
|
36
|
Gilliot PA, Gorochowski TE. Design and Analysis of Massively Parallel Reporter Assays Using FORECAST. Methods Mol Biol 2023; 2553:41-56. [PMID: 36227538 DOI: 10.1007/978-1-0716-2617-7_3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
Machine learning is revolutionizing molecular biology and bioengineering by providing powerful insights and predictions. Massively parallel reporter assays (MPRAs) have emerged as a particularly valuable class of high-throughput technique to support such algorithms. MPRAs enable the simultaneous characterization of thousands or even millions of genetic constructs and provide the large amounts of data needed to train models. However, while the scale of this approach is impressive, the design of effective MPRA experiments is challenging due to the many factors that can be varied and the difficulty in predicting how these will impact the quality and quantity of data obtained. Here, we present a computational tool called FORECAST, which can simulate MPRA experiments based on fluorescence-activated cell sorting and subsequent sequencing (commonly referred to as Flow-seq or Sort-seq experiments), as well as carry out rigorous statistical estimation of construct performance from this type of experimental data. FORECAST can be used to develop workflows to aid the design of MPRA experiments and reanalyze existing MPRA data sets.
Collapse
|
37
|
Xu K, Tong Y, Li Y, Tao J, Rao S, Li J, Zhou J, Liu S. Autoinduction Expression Modules for Regulating Gene Expression in Bacillus subtilis. ACS Synth Biol 2022; 11:4220-4225. [PMID: 36468943 DOI: 10.1021/acssynbio.2c00304] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Although quorum sensing (QS) promoters that can autonomously activate gene expression have been identified and engineered in Bacillus subtilis, researchers focus on quantifying individual promoters while ignoring the interaction between other genetic regulatory elements. Here, we constructed the autoinduction expression modules consisting of promoters responsive to QS ComQXPA, ribosome binding sites (RBSs), and terminators. Using superfolder green fluorescent protein (sfGFP) as a reporter gene, three individual element libraries were generated from 945 promoters, 12,000 RBSs, and 425 terminators by random mutation, de novo design, and database mining strategies, respectively. Then, the efficiency of three libraries in regulating gene expression was further enhanced by engineering the core region of each optimal element. After hybridizing the element libraries, the generated expression modules exhibited a 627-fold range in regulating gene expression without significantly affecting the autoinduction initiation. Here, the hybrid modules with broad expression strength may benefit the application of QS-based autoinduction systems in B. subtilis.
Collapse
Affiliation(s)
- Kuidong Xu
- National Engineering Research Center for Cereal Fermentation and Food Biomanufacturing, Jiangnan University, Wuxi 214122, China.,Science Center for Future Foods, Jiangnan University, Wuxi 214122, China.,Jiangsu Provisional Research Center for Bioactive Product Processing Technology, Jiangnan University, 1800 Lihu Road, Wuxi 214122, China
| | - Yi Tong
- National Engineering Research Center for Corn Deep Processing, Jilin COFCO Biochemical Co. Ltd, Changchun 130033, China
| | - Yi Li
- National Engineering Research Center for Corn Deep Processing, Jilin COFCO Biochemical Co. Ltd, Changchun 130033, China
| | - Jin Tao
- National Engineering Research Center for Corn Deep Processing, Jilin COFCO Biochemical Co. Ltd, Changchun 130033, China
| | - Shengqi Rao
- College of Food Science and Engineering, Yangzhou University, Yangzhou, Jiangsu 214122, China
| | - Jianghua Li
- Science Center for Future Foods, Jiangnan University, Wuxi 214122, China.,School of Biotechnology, Jiangnan University, 1800 Lihu Road, Wuxi 214122, China
| | - Jingwen Zhou
- National Engineering Research Center for Cereal Fermentation and Food Biomanufacturing, Jiangnan University, Wuxi 214122, China.,Science Center for Future Foods, Jiangnan University, Wuxi 214122, China.,Jiangsu Provisional Research Center for Bioactive Product Processing Technology, Jiangnan University, 1800 Lihu Road, Wuxi 214122, China
| | - Song Liu
- National Engineering Research Center for Cereal Fermentation and Food Biomanufacturing, Jiangnan University, Wuxi 214122, China.,Science Center for Future Foods, Jiangnan University, Wuxi 214122, China
| |
Collapse
|
38
|
Accuracy and data efficiency in deep learning models of protein expression. Nat Commun 2022; 13:7755. [PMID: 36517468 PMCID: PMC9751117 DOI: 10.1038/s41467-022-34902-5] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2022] [Accepted: 11/10/2022] [Indexed: 12/23/2022] Open
Abstract
Synthetic biology often involves engineering microbial strains to express high-value proteins. Thanks to progress in rapid DNA synthesis and sequencing, deep learning has emerged as a promising approach to build sequence-to-expression models for strain optimization. But such models need large and costly training data that create steep entry barriers for many laboratories. Here we study the relation between accuracy and data efficiency in an atlas of machine learning models trained on datasets of varied size and sequence diversity. We show that deep learning can achieve good prediction accuracy with much smaller datasets than previously thought. We demonstrate that controlled sequence diversity leads to substantial gains in data efficiency and employed Explainable AI to show that convolutional neural networks can finely discriminate between input DNA sequences. Our results provide guidelines for designing genotype-phenotype screens that balance cost and quality of training data, thus helping promote the wider adoption of deep learning in the biotechnology sector.
Collapse
|
39
|
Goodman DB, Azimi CS, Kearns K, Talbot A, Garakani K, Garcia J, Patel N, Hwang B, Lee D, Park E, Vykunta VS, Shy BR, Ye CJ, Eyquem J, Marson A, Bluestone JA, Roybal KT. Pooled screening of CAR T cells identifies diverse immune signaling domains for next-generation immunotherapies. Sci Transl Med 2022; 14:eabm1463. [PMID: 36350984 PMCID: PMC9939256 DOI: 10.1126/scitranslmed.abm1463] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Chimeric antigen receptors (CARs) repurpose natural signaling components to retarget T cells to refractory cancers but have shown limited efficacy in persistent, recurrent malignancies. Here, we introduce "CAR Pooling," a multiplexed approach to rapidly identify CAR designs with clinical potential. Forty CARs with signaling domains derived from a range of immune cell lineages were evaluated in pooled assays for their ability to stimulate critical T cell effector functions during repetitive stimulation that mimics long-term tumor antigen exposure. Several domains were identified from the tumor necrosis factor (TNF) receptor family that have been primarily associated with B cells. CD40 enhanced proliferation, whereas B cell-activating factor receptor (BAFF-R) and transmembrane activator and CAML interactor (TACI) promoted cytotoxicity. These functions were enhanced relative to clinical benchmarks after prolonged antigen stimulation, and CAR T cell signaling through these domains fell into distinct states of memory, cytotoxicity, and metabolism. BAFF-R CAR T cells were enriched for a highly cytotoxic transcriptional signature previously associated with positive clinical outcomes. We also observed that replacing the 4-1BB intracellular signaling domain with the BAFF-R signaling domain in a clinically validated B cell maturation antigen (BCMA)-specific CAR resulted in enhanced activity in a xenotransplant model of multiple myeloma. Together, these results show that CAR Pooling is a general approach for rapid exploration of CAR architecture and activity to improve the efficacy of CAR T cell therapies.
Collapse
Affiliation(s)
- Daniel B. Goodman
- Department of Microbiology and Immunology, University of California, San Francisco; San Francisco, California, 94143, USA
- Parker Institute for Cancer Immunotherapy; San Francisco, California, 94143, USA
- Gladstone UCSF Institute for Genetic Immunology; San Francisco, CA, 94107, USA
- School of Medicine, University of California, San Francisco; San Francisco, CA, USA
- Diabetes Center, University of California, San Francisco; San Francisco, CA 94143, USA
| | - Camillia S. Azimi
- Department of Microbiology and Immunology, University of California, San Francisco; San Francisco, California, 94143, USA
- Parker Institute for Cancer Immunotherapy; San Francisco, California, 94143, USA
| | - Kendall Kearns
- Department of Microbiology and Immunology, University of California, San Francisco; San Francisco, California, 94143, USA
- Parker Institute for Cancer Immunotherapy; San Francisco, California, 94143, USA
| | - Alexis Talbot
- Department of Microbiology and Immunology, University of California, San Francisco; San Francisco, California, 94143, USA
- Parker Institute for Cancer Immunotherapy; San Francisco, California, 94143, USA
- Gladstone UCSF Institute for Genetic Immunology; San Francisco, CA, 94107, USA
- INSERM U976, Saint Louis Research Institute, Paris City University, Paris, France
| | - Kiavash Garakani
- Department of Microbiology and Immunology, University of California, San Francisco; San Francisco, California, 94143, USA
- Parker Institute for Cancer Immunotherapy; San Francisco, California, 94143, USA
| | - Julie Garcia
- Department of Microbiology and Immunology, University of California, San Francisco; San Francisco, California, 94143, USA
- Parker Institute for Cancer Immunotherapy; San Francisco, California, 94143, USA
| | - Nisarg Patel
- Department of Oral and Maxillofacial Surgery, University of California, San Francisco; San Francisco, CA, USA
- Bakar Computational Health Sciences Institute, University of California, San Francisco; San Francisco, CA, USA
- School of Medicine, University of California, San Francisco; San Francisco, CA, USA
| | - Byungjin Hwang
- Institute for Human Genetics (IHG), University of California, San Francisco; San Francisco, California, USA
- Department of Medicine, University of California, San Francisco; San Francisco, California, 94143, USA
| | - David Lee
- Institute for Human Genetics (IHG), University of California, San Francisco; San Francisco, California, USA
- Department of Medicine, University of California, San Francisco; San Francisco, California, 94143, USA
| | - Emily Park
- Department of Microbiology and Immunology, University of California, San Francisco; San Francisco, California, 94143, USA
- Parker Institute for Cancer Immunotherapy; San Francisco, California, 94143, USA
| | - Vivasvan S. Vykunta
- Parker Institute for Cancer Immunotherapy; San Francisco, California, 94143, USA
- Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco; San Francisco, California, 94158, USA
- Gladstone UCSF Institute for Genetic Immunology; San Francisco, CA, 94107, USA
- School of Medicine, University of California, San Francisco; San Francisco, CA, USA
- Department of Medicine, University of California, San Francisco; San Francisco, California, 94143, USA
| | - Brian R. Shy
- Parker Institute for Cancer Immunotherapy; San Francisco, California, 94143, USA
- Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco; San Francisco, California, 94158, USA
- Gladstone UCSF Institute for Genetic Immunology; San Francisco, CA, 94107, USA
- School of Medicine, University of California, San Francisco; San Francisco, CA, USA
- Department of Medicine, University of California, San Francisco; San Francisco, California, 94143, USA
| | - Chun Jimmie Ye
- Parker Institute for Cancer Immunotherapy; San Francisco, California, 94143, USA
- Chan Zuckerberg Biohub; San Francisco, California, 94158, USA
- Bakar Computational Health Sciences Institute, University of California, San Francisco; San Francisco, CA, USA
- Institute for Human Genetics (IHG), University of California, San Francisco; San Francisco, California, USA
- Department of Epidemiology and Biostatistics, San Francisco; San Francisco, CA 94143, USA
- Department of Medicine, University of California, San Francisco; San Francisco, California, 94143, USA
| | - Justin Eyquem
- Department of Microbiology and Immunology, University of California, San Francisco; San Francisco, California, 94143, USA
- Parker Institute for Cancer Immunotherapy; San Francisco, California, 94143, USA
- Gladstone UCSF Institute for Genetic Immunology; San Francisco, CA, 94107, USA
| | - Alexander Marson
- Department of Microbiology and Immunology, University of California, San Francisco; San Francisco, California, 94143, USA
- Parker Institute for Cancer Immunotherapy; San Francisco, California, 94143, USA
- Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco; San Francisco, California, 94158, USA
- Chan Zuckerberg Biohub; San Francisco, California, 94158, USA
- Gladstone UCSF Institute for Genetic Immunology; San Francisco, CA, 94107, USA
- School of Medicine, University of California, San Francisco; San Francisco, CA, USA
- Institute for Human Genetics (IHG), University of California, San Francisco; San Francisco, California, USA
- Innovative Genomics Institute, University of California, Berkeley; Berkeley, CA 94720, USA
- Diabetes Center, University of California, San Francisco; San Francisco, CA 94143, USA
- Department of Medicine, University of California, San Francisco; San Francisco, California, 94143, USA
| | - Jeffrey A. Bluestone
- Diabetes Center, University of California, San Francisco; San Francisco, CA 94143, USA
- Sonoma Biotherapeutics; South San Francisco, CA, USA
- Department of Medicine, University of California, San Francisco; San Francisco, California, 94143, USA
| | - Kole T. Roybal
- Department of Microbiology and Immunology, University of California, San Francisco; San Francisco, California, 94143, USA
- Parker Institute for Cancer Immunotherapy; San Francisco, California, 94143, USA
- Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco; San Francisco, California, 94158, USA
- Chan Zuckerberg Biohub; San Francisco, California, 94158, USA
- Gladstone UCSF Institute for Genetic Immunology; San Francisco, CA, 94107, USA
- UCSF Cell Design Institute; San Francisco, California, 94158, USA
| |
Collapse
|
40
|
Scholz SA, Lindeboom CD, Freddolino PL. Genetic context effects can override canonical cis regulatory elements in Escherichia coli. Nucleic Acids Res 2022; 50:10360-10375. [PMID: 36134716 PMCID: PMC9561378 DOI: 10.1093/nar/gkac787] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2022] [Revised: 08/10/2022] [Accepted: 09/02/2022] [Indexed: 11/12/2022] Open
Abstract
Recent experiments have shown that in addition to control by cis regulatory elements, the local chromosomal context of a gene also has a profound impact on its transcription. Although this chromosome-position dependent expression variation has been empirically mapped at high-resolution, the underlying causes of the variation have not been elucidated. Here, we demonstrate that 1 kb of flanking, non-coding synthetic sequences with a low frequency of guanosine and cytosine (GC) can dramatically reduce reporter expression compared to neutral and high GC-content flanks in Escherichia coli. Natural and artificial genetic context can have a similarly strong effect on reporter expression, regardless of cell growth phase or medium. Despite the strong reduction in the maximal expression level from the fully-induced reporter, low GC synthetic flanks do not affect the time required to reach the maximal expression level after induction. Overall, we demonstrate key determinants of transcriptional propensity that appear to act as tunable modulators of transcription, independent of regulatory sequences such as the promoter. These findings provide insight into the regulation of naturally occurring genes and an independent control for optimizing expression of synthetic biology constructs.
Collapse
Affiliation(s)
- Scott A Scholz
- Department of Biological Chemistry, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Chase D Lindeboom
- Department of Biological Chemistry, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Peter L Freddolino
- Department of Biological Chemistry, University of Michigan Medical School, Ann Arbor, MI, USA
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, USA
| |
Collapse
|
41
|
Komarova ES, Slesarchuk AN, Rubtsova MP, Osterman IA, Tupikin AE, Pyshnyi DV, Dontsova OA, Kabilov MR, Sergiev PV. Flow-Seq Evaluation of Translation Driven by a Set of Natural Escherichia coli 5'-UTR of Variable Length. Int J Mol Sci 2022; 23:ijms232012293. [PMID: 36293163 PMCID: PMC9604319 DOI: 10.3390/ijms232012293] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2022] [Revised: 10/10/2022] [Accepted: 10/11/2022] [Indexed: 11/25/2022] Open
Abstract
Flow-seq is a method that combines fluorescently activated cell sorting and next-generation sequencing to deduce a large amount of data about translation efficiency from a single experiment. Here, we constructed a library of fluorescent protein-based reporters preceded by a set of 648 natural 5'-untranslated regions (5'-UTRs) of Escherichia coli genes. Usually, Flow-seq libraries are constructed using uniform-length sequence elements, in contrast to natural situations, where functional elements are of heterogenous lengths. Here, we demonstrated that a 5'-UTR library of variable length could be created and analyzed with Flow-seq. In line with previous Flow-seq experiments with randomized 5'-UTRs, we observed the influence of an RNA secondary structure and Shine-Dalgarno sequences on translation efficiency; however, the variability of these parameters for natural 5'-UTRs in our library was smaller in comparison with randomized libraries. In line with this, we only observed a 30-fold difference in translation efficiency between the best and worst bins sorted with this factor. The results correlated with those obtained with ribosome profiling.
Collapse
Affiliation(s)
- Ekaterina S. Komarova
- Institute of Functional Genomics, Lomonosov Moscow State University, 119992 Moscow, Russia
| | - Anna N. Slesarchuk
- Department of Chemistry, Lomonosov Moscow State University, 119991 Moscow, Russia
| | - Maria P. Rubtsova
- Department of Chemistry, Lomonosov Moscow State University, 119991 Moscow, Russia
| | - Ilya A. Osterman
- Center for Life Sciences, Skolkovo Institute of Science and Technology, Skolkovo, 143025 Moscow, Russia
| | - Alexey E. Tupikin
- Institute of Chemical Biology and Fundamental Medicine, Siberian Branch of the Russian Academy of Sciences, 630090 Novosibirsk, Russia
| | - Dmitry V. Pyshnyi
- Institute of Chemical Biology and Fundamental Medicine, Siberian Branch of the Russian Academy of Sciences, 630090 Novosibirsk, Russia
| | - Olga A. Dontsova
- Department of Chemistry, Lomonosov Moscow State University, 119991 Moscow, Russia
- Center for Life Sciences, Skolkovo Institute of Science and Technology, Skolkovo, 143025 Moscow, Russia
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, 119992 Moscow, Russia
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, 119992 Moscow, Russia
| | - Marsel R. Kabilov
- Institute of Chemical Biology and Fundamental Medicine, Siberian Branch of the Russian Academy of Sciences, 630090 Novosibirsk, Russia
- Correspondence: (M.R.K.); (P.V.S.)
| | - Petr V. Sergiev
- Institute of Functional Genomics, Lomonosov Moscow State University, 119992 Moscow, Russia
- Department of Chemistry, Lomonosov Moscow State University, 119991 Moscow, Russia
- Center for Life Sciences, Skolkovo Institute of Science and Technology, Skolkovo, 143025 Moscow, Russia
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, 119992 Moscow, Russia
- Correspondence: (M.R.K.); (P.V.S.)
| |
Collapse
|
42
|
Leander M, Liu Z, Cui Q, Raman S. Deep mutational scanning and machine learning reveal structural and molecular rules governing allosteric hotspots in homologous proteins. eLife 2022; 11:e79932. [PMID: 36226916 PMCID: PMC9662819 DOI: 10.7554/elife.79932] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2022] [Accepted: 10/13/2022] [Indexed: 01/29/2023] Open
Abstract
A fundamental question in protein science is where allosteric hotspots - residues critical for allosteric signaling - are located, and what properties differentiate them. We carried out deep mutational scanning (DMS) of four homologous bacterial allosteric transcription factors (aTFs) to identify hotspots and built a machine learning model with this data to glean the structural and molecular properties of allosteric hotspots. We found hotspots to be distributed protein-wide rather than being restricted to 'pathways' linking allosteric and active sites as is commonly assumed. Despite structural homology, the location of hotspots was not superimposable across the aTFs. However, common signatures emerged when comparing hotspots coincident with long-range interactions, suggesting that the allosteric mechanism is conserved among the homologs despite differences in molecular details. Machine learning with our large DMS datasets revealed global structural and dynamic properties to be a strong predictor of whether a residue is a hotspot than local and physicochemical properties. Furthermore, a model trained on one protein can predict hotspots in a homolog. In summary, the overall allosteric mechanism is embedded in the structural fold of the aTF family, but the finer, molecular details are sequence-specific.
Collapse
Affiliation(s)
- Megan Leander
- Department of Biochemistry, University of Wisconsin-MadisonMadisonUnited States
| | - Zhuang Liu
- Department of Physics, Boston UniversityBostonUnited States
| | - Qiang Cui
- Department of Physics, Boston UniversityBostonUnited States
- Department of Chemistry, Boston UniversityBostonUnited States
| | - Srivatsan Raman
- Department of Biochemistry, University of Wisconsin-MadisonMadisonUnited States
- Department of Bacteriology, University of Wisconsin-MadisonMadisonUnited States
- Department of Chemical and Biological Engineering, University of Wisconsin-MadisonMadisonUnited States
| |
Collapse
|
43
|
Komarova ES, Dontsova OA, Pyshnyi DV, Kabilov MR, Sergiev PV. Flow-Seq Method: Features and Application in Bacterial Translation Studies. Acta Naturae 2022; 14:20-37. [PMID: 36694903 PMCID: PMC9844084 DOI: 10.32607/actanaturae.11820] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2022] [Accepted: 11/11/2022] [Indexed: 01/22/2023] Open
Abstract
The Flow-seq method is based on using reporter construct libraries, where a certain element regulating the gene expression of fluorescent reporter proteins is represented in many thousands of variants. Reporter construct libraries are introduced into cells, sorted according to their fluorescence level, and then subjected to next-generation sequencing. Therefore, it turns out to be possible to identify patterns that determine the expression efficiency, based on tens and hundreds of thousands of reporter constructs in one experiment. This method has become common in evaluating the efficiency of protein synthesis simultaneously by multiple mRNA variants. However, its potential is not confined to this area. In the presented review, a comparative analysis of the Flow-seq method and other alternative approaches used for translation efficiency evaluation of mRNA was carried out; the features of its application and the results obtained by Flow-seq were also considered.
Collapse
Affiliation(s)
- E. S. Komarova
- Institute of Functional Genomics, Lomonosov Moscow State University, Moscow, 119234 Russia
| | - O. A. Dontsova
- Department of Chemistry, Lomonosov Moscow State University, Moscow, 119234 Russia
- Skolkovo Institute of Science and Technology, Moscow, 121205 Russia
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow, 119234 Russia
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow 117437 Russia
| | - D. V. Pyshnyi
- Institute of Chemical Biology and Fundamental Medicine, Siberian Branch of the Russian Academy of Sciences, Novosibirsk, 630090 Russia
| | - M. R. Kabilov
- Institute of Chemical Biology and Fundamental Medicine, Siberian Branch of the Russian Academy of Sciences, Novosibirsk, 630090 Russia
| | - P. V. Sergiev
- Institute of Functional Genomics, Lomonosov Moscow State University, Moscow, 119234 Russia
- Department of Chemistry, Lomonosov Moscow State University, Moscow, 119234 Russia
- Skolkovo Institute of Science and Technology, Moscow, 121205 Russia
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow, 119234 Russia
| |
Collapse
|
44
|
Santos SP, Garcés LFS, Silva FS, Santiago LF, Pinheiro CS, Alcantara-Neves NM, Pacheco LG. Engineering an optimized expression operating unit for improved recombinant protein production in Escherichia coli. Protein Expr Purif 2022; 199:106150. [DOI: 10.1016/j.pep.2022.106150] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2022] [Revised: 07/26/2022] [Accepted: 07/27/2022] [Indexed: 10/31/2022]
|
45
|
Esquirol L, McNeale D, Douglas T, Vickers CE, Sainsbury F. Rapid Assembly and Prototyping of Biocatalytic Virus-like Particle Nanoreactors. ACS Synth Biol 2022; 11:2709-2718. [PMID: 35880829 DOI: 10.1021/acssynbio.2c00117] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Abstract
Protein cages are attractive as molecular scaffolds for the fundamental study of enzymes and metabolons and for the creation of biocatalytic nanoreactors for in vitro and in vivo use. Virus-like particles (VLPs) such as those derived from the P22 bacteriophage capsid protein make versatile self-assembling protein cages and can be used to encapsulate a broad range of protein cargos. In vivo encapsulation of enzymes within VLPs requires fusion to the coat protein or a scaffold protein. However, the expression level, stability, and activity of cargo proteins can vary upon fusion. Moreover, it has been shown that molecular crowding of enzymes inside VLPs can affect their catalytic properties. Consequently, testing of numerous parameters is required for production of the most efficient nanoreactor for a given cargo enzyme. Here, we present a set of acceptor vectors that provide a quick and efficient way to build, test, and optimize cargo loading inside P22 VLPs. We prototyped the system using a yellow fluorescent protein and then applied it to mevalonate kinases (MKs), a key enzyme class in the industrially important terpene (isoprenoid) synthesis pathway. Different MKs required considerably different approaches to deliver maximal encapsulation as well as optimal kinetic parameters, demonstrating the value of being able to rapidly access a variety of encapsulation strategies. The vector system described here provides an approach to optimize cargo enzyme behavior in bespoke P22 nanoreactors. This will facilitate industrial applications as well as basic research on nanoreactor-cargo behavior.
Collapse
Affiliation(s)
- Lygie Esquirol
- Centre for Cell Factories and Biopolymers, Griffith Institute for Drug Discovery, Griffith University, Nathan, Queensland 4111, Australia.,Australian Institute for Bioengineering and Nanotechnology, The University of Queensland, St Lucia, Queensland 4072, Australia
| | - Donna McNeale
- Centre for Cell Factories and Biopolymers, Griffith Institute for Drug Discovery, Griffith University, Nathan, Queensland 4111, Australia.,Synthetic Biology Future Science Platform, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Brisbane, Queensland 4102, Australia
| | - Trevor Douglas
- Department of Chemistry, Indiana University, Bloomington, Indiana 47405, United States
| | - Claudia E Vickers
- Centre for Cell Factories and Biopolymers, Griffith Institute for Drug Discovery, Griffith University, Nathan, Queensland 4111, Australia.,Synthetic Biology Future Science Platform, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Brisbane, Queensland 4102, Australia.,ARC Centre of Excellence in Synthetic Biology, Queensland University of Technology, Brisbane 4000 Australia
| | - Frank Sainsbury
- Centre for Cell Factories and Biopolymers, Griffith Institute for Drug Discovery, Griffith University, Nathan, Queensland 4111, Australia.,Australian Institute for Bioengineering and Nanotechnology, The University of Queensland, St Lucia, Queensland 4072, Australia.,Synthetic Biology Future Science Platform, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Brisbane, Queensland 4102, Australia
| |
Collapse
|
46
|
Duan Y, Zhang X, Zhai W, Zhang J, Zhang X, Xu G, Li H, Deng Z, Shi J, Xu Z. Deciphering the Rules of Ribosome Binding Site Differentiation in Context Dependence. ACS Synth Biol 2022; 11:2726-2740. [PMID: 35877551 DOI: 10.1021/acssynbio.2c00139] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
The ribosome binding site (RBS) is a crucial element regulating translation. However, the activity of RBS is poorly predictable, because it is strongly affected by the local possible secondary structure, that is, context dependence. By the Flowseq technique, over 20 000 RBS variants were sorted and sequenced, and the translation of multiple genes under the same RBS was quantitatively characterized to evaluate the context dependence of each RBS variant in E. coli. Two regions, (-7 to -2) and (-17 to -12), of RBS were predicted with a higher possibility to pair with each other to slow down the translation initiation. Associations between phenotypes and the intrinsic factors suspected to affect translation efficiency and context dependence of the RBS, including nucleotide bias at each position, free energy, and conservation, were disentangled. The results showed that translation efficiency was influenced more significantly by conservation of the SD region (-16 to -8), while an AC-rich spacer region (-7 to -1) was associated with low context dependence. We confirmed these characteristics using a series of synthesized RBSs. The average correlation between multiple reporters was significantly higher for RBSs with an AC-rich spacer (0.714) compared with a GU-rich spacer (0.286). Overall, we proposed general design criteria to improve programmability and minimize context dependence of RBS. The characteristics unraveled here can be adapted to other bacteria for fine-tuning target-gene expression.
Collapse
Affiliation(s)
- Yanting Duan
- Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi 214122, China.,National Engineering Research Center for Cereal Fermentation and Food Biomanufacturing, Jiangnan University, Wuxi 214122, China
| | - Xiaojuan Zhang
- Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi 214122, China.,National Engineering Research Center for Cereal Fermentation and Food Biomanufacturing, Jiangnan University, Wuxi 214122, China
| | - Weiji Zhai
- Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi 214122, China.,National Engineering Research Center for Cereal Fermentation and Food Biomanufacturing, Jiangnan University, Wuxi 214122, China
| | - Jinpeng Zhang
- Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi 214122, China.,National Engineering Research Center for Cereal Fermentation and Food Biomanufacturing, Jiangnan University, Wuxi 214122, China
| | - Xiaomei Zhang
- School of Life Science and Health Engineering, Jiangnan University, Wuxi 214122, China.,Jiangsu Engineering Research Center for Bioactive Products Processing Technology, Jiangnan University, 1800 Lihu Avenue, Wuxi 214122, China
| | - Guoqiang Xu
- Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi 214122, China.,National Engineering Research Center for Cereal Fermentation and Food Biomanufacturing, Jiangnan University, Wuxi 214122, China
| | - Hui Li
- School of Artificial Intelligence and Computer Science, Jiangnan University, Wuxi 214122, China
| | - Zhaohong Deng
- School of Artificial Intelligence and Computer Science, Jiangnan University, Wuxi 214122, China
| | - Jinsong Shi
- School of Life Science and Health Engineering, Jiangnan University, Wuxi 214122, China.,Jiangsu Engineering Research Center for Bioactive Products Processing Technology, Jiangnan University, 1800 Lihu Avenue, Wuxi 214122, China
| | - Zhenghong Xu
- Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi 214122, China.,National Engineering Research Center for Cereal Fermentation and Food Biomanufacturing, Jiangnan University, Wuxi 214122, China
| |
Collapse
|
47
|
Simple transformation of the filamentous thermophilic cyanobacterium Leptolyngbya sp. KC45. ALGAL RES 2022. [DOI: 10.1016/j.algal.2022.102758] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
48
|
Shah SB, Hill AM, Wilke CO, Hockenberry AJ. Generating dynamic gene expression patterns without the need for regulatory circuits. PLoS One 2022; 17:e0268883. [PMID: 35617346 PMCID: PMC9135205 DOI: 10.1371/journal.pone.0268883] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2021] [Accepted: 05/10/2022] [Indexed: 11/18/2022] Open
Abstract
Synthetic biology has successfully advanced our ability to design and implement complex, time-varying genetic circuits to control the expression of recombinant proteins. However, these circuits typically require the production of regulatory genes whose only purpose is to coordinate expression of other genes. When designing very small genetic constructs, such as viral genomes, we may want to avoid introducing such auxiliary gene products while nevertheless encoding complex expression dynamics. To this end, here we demonstrate that varying only the placement and strengths of promoters, terminators, and RNase cleavage sites in a computational model of a bacteriophage genome is sufficient to achieve solutions to a variety of basic gene expression patterns. We discover these genetic solutions by computationally evolving genomes to reproduce desired gene expression time-course data. Our approach shows that non-trivial patterns can be evolved, including patterns where the relative ordering of genes by abundance changes over time. We find that some patterns are easier to evolve than others, and comparable expression patterns can be achieved via different genetic architectures. Our work opens up a novel avenue to genome engineering via fine-tuning the balance of gene expression and gene degradation rates.
Collapse
Affiliation(s)
- Sahil B. Shah
- Department of Integrative Biology, The University of Texas at Austin, Austin, TX, United States of America
| | - Alexis M. Hill
- Department of Integrative Biology, The University of Texas at Austin, Austin, TX, United States of America
| | - Claus O. Wilke
- Department of Integrative Biology, The University of Texas at Austin, Austin, TX, United States of America
- * E-mail: (COW); (AJH)
| | - Adam J. Hockenberry
- Department of Integrative Biology, The University of Texas at Austin, Austin, TX, United States of America
- * E-mail: (COW); (AJH)
| |
Collapse
|
49
|
Van Brempt M, Peeters AI, Duchi D, De Wannemaeker L, Maertens J, De Paepe B, De Mey M. Biosensor-driven, model-based optimization of the orthogonally expressed naringenin biosynthesis pathway. Microb Cell Fact 2022; 21:49. [PMID: 35346204 PMCID: PMC8962593 DOI: 10.1186/s12934-022-01775-8] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2021] [Accepted: 03/15/2022] [Indexed: 12/30/2022] Open
Abstract
Background The rapidly expanding synthetic biology toolbox allows engineers to develop smarter strategies to tackle the optimization of complex biosynthetic pathways. In such a strategy, multi-gene pathways are subdivided in several modules which are each dynamically controlled to fine-tune their expression in response to a changing cellular environment. To fine-tune separate modules without interference between modules or from the host regulatory machinery, a sigma factor (σ) toolbox was developed in previous work for tunable orthogonal gene expression. Here, this toolbox is implemented in E. coli to orthogonally express and fine-tune a pathway for the heterologous biosynthesis of the industrially relevant plant metabolite, naringenin. To optimize the production of this pathway, a practical workflow is still imperative to balance all steps of the pathway. This is tackled here by the biosensor-driven screening, subsequent genotyping of combinatorially engineered libraries and finally the training of three different computer models to predict the optimal pathway configuration. Results The efficiency and knowledge gained through this workflow is demonstrated here by improving the naringenin production titer by 32% with respect to a random pathway library screen. Our best strain was cultured in a batch bioreactor experiment and was able to produce 286 mg/L naringenin from glycerol in approximately 26 h. This is the highest reported naringenin production titer in E. coli without the supplementation of pathway precursors to the medium or any precursor pathway engineering. In addition, valuable pathway configuration preferences were identified in the statistical learning process, such as specific enzyme variant preferences and significant correlations between promoter strength at specific steps in the pathway and titer. Conclusions An efficient strategy, powered by orthogonal expression, was applied to successfully optimize a biosynthetic pathway for microbial production of flavonoids in E. coli up to high, competitive levels. Within this strategy, statistical learning techniques were combined with combinatorial pathway optimization techniques and an in vivo high-throughput screening method to efficiently determine the optimal operon configuration of the pathway. This “pathway architecture designer” workflow can be applied for the fast and efficient development of new microbial cell factories for different types of molecules of interest while also providing additional insights into the underlying pathway characteristics. Supplementary Information The online version contains supplementary material available at 10.1186/s12934-022-01775-8.
Collapse
Affiliation(s)
- Maarten Van Brempt
- Centre For Synthetic Biology, Ghent University, Coupure Links 653, B-9000, Ghent, Belgium
| | - Andries Ivo Peeters
- Centre For Synthetic Biology, Ghent University, Coupure Links 653, B-9000, Ghent, Belgium
| | - Dries Duchi
- Centre For Synthetic Biology, Ghent University, Coupure Links 653, B-9000, Ghent, Belgium
| | - Lien De Wannemaeker
- Centre For Synthetic Biology, Ghent University, Coupure Links 653, B-9000, Ghent, Belgium
| | - Jo Maertens
- Centre For Synthetic Biology, Ghent University, Coupure Links 653, B-9000, Ghent, Belgium
| | - Brecht De Paepe
- Centre For Synthetic Biology, Ghent University, Coupure Links 653, B-9000, Ghent, Belgium
| | - Marjan De Mey
- Centre For Synthetic Biology, Ghent University, Coupure Links 653, B-9000, Ghent, Belgium.
| |
Collapse
|
50
|
Zhang Y, Zou ZP, Chen SY, Wei WP, Zhou Y, Ye BC. Design and optimization of E. coli artificial genetic circuits for detection of explosive composition 2,4-dinitrotoluene. Biosens Bioelectron 2022; 207:114205. [PMID: 35339074 DOI: 10.1016/j.bios.2022.114205] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2021] [Revised: 03/07/2022] [Accepted: 03/18/2022] [Indexed: 11/26/2022]
Abstract
The detection of mine-based explosives poses a serious threat to the lives of deminers, and carcinogenic residues may cause severe environmental pollution. Whole-cell biosensors that can detect on-site in dangerous or inaccessible environments have great potential to replace conventional methods. Synthetic biology based on engineering modularity serves as a new tool that could be used to engineer microbes to acquire desired functions through artificial design and precise regulation. In this study, we designed artificial genetic circuits in Escherichia coli MG1655 by reconstructing the transcription factor YhaJ-based system to detect explosive composition 2,4-dinitrotoluene (2,4-DNT). These genetic circuits were optimized at the transcriptional, translational, and post-translational levels. The binding affinity of the transcription factor YhaJ with inducer 2,4-DNT metabolites was enhanced via directed evolution, and several activator binding sites were inserted in sensing yqjF promoter (PyqjF) to further improve the output level. The optimized biosensor PyqjF×2-TEV-(mYhaJ + GFP)-Ssr had a maximum induction ratio of 189 with green fluorescent signal output, and it could perceive at least 1 μg/mL 2,4-DNT. Its effective and robust performance was verified in different water samples. Our results demonstrate the use of synthetic biology tools to systematically optimize the performance of sensors for 2,4-DNT detection, that lay the foundation for practical applications.
Collapse
Affiliation(s)
- Yan Zhang
- Laboratory of Biosystems and Microanalysis, Institute of Engineering Biology and Health, State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology, Shanghai, 200237, China; School of Chemistry and Chemical Engineering/Key Laboratory of Environmental Monitoring and Pollutant Control of Xinjiang Bingtuan, Shihezi University, Shihezi, 832003, China
| | - Zhen-Ping Zou
- Laboratory of Biosystems and Microanalysis, Institute of Engineering Biology and Health, State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology, Shanghai, 200237, China
| | - Sheng-Yan Chen
- School of Chemistry and Chemical Engineering/Key Laboratory of Environmental Monitoring and Pollutant Control of Xinjiang Bingtuan, Shihezi University, Shihezi, 832003, China
| | - Wen-Ping Wei
- Institute of Engineering Biology and Health, Collaborative Innovation Center of Yangtze River Delta Region Green Pharmaceuticals, College of Pharmaceutical Sciences, Zhejiang University of Technology, Hangzhou, 310014, Zhejiang, China
| | - Ying Zhou
- Laboratory of Biosystems and Microanalysis, Institute of Engineering Biology and Health, State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology, Shanghai, 200237, China
| | - Bang-Ce Ye
- Laboratory of Biosystems and Microanalysis, Institute of Engineering Biology and Health, State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology, Shanghai, 200237, China; School of Chemistry and Chemical Engineering/Key Laboratory of Environmental Monitoring and Pollutant Control of Xinjiang Bingtuan, Shihezi University, Shihezi, 832003, China; Institute of Engineering Biology and Health, Collaborative Innovation Center of Yangtze River Delta Region Green Pharmaceuticals, College of Pharmaceutical Sciences, Zhejiang University of Technology, Hangzhou, 310014, Zhejiang, China.
| |
Collapse
|