1
|
Pan RW, Röschinger T, Faizi K, Garcia H, Phillips R. Deciphering regulatory architectures from synthetic single-cell expression patterns. ARXIV 2024:arXiv:2401.15880v2. [PMID: 38351929 PMCID: PMC10862939] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/19/2024]
Abstract
For the vast majority of genes in sequenced genomes, there is limited understanding of how they are regulated. Without such knowledge, it is not possible to perform a quantitative theory-experiment dialogue on how such genes give rise to physiological and evolutionary adaptation. One category of high-throughput experiments used to understand the sequence-phenotype relationship of the transcriptome is massively parallel reporter assays (MPRAs). However, to improve the versatility and scalability of MPRA pipelines, we need a "theory of the experiment" to help us better understand the impact of various biological and experimental parameters on the interpretation of experimental data. These parameters include binding site copy number, where a large number of specific binding sites may titrate away transcription factors, as well as the presence of overlapping binding sites, which may affect analysis of the degree of mutual dependence between mutations in the regulatory region and expression levels. To that end, in this paper we create tens of thousands of synthetic single-cell gene expression outputs using both equilibrium and out-of-equilibrium models. These models make it possible to imitate the summary statistics (information footprints and expression shift matrices) used to characterize the output of MPRAs and from this summary statistic to infer the underlying regulatory architecture. Specifically, we use a more refined implementation of the so-called thermodynamic models in which the binding energies of each sequence variant are derived from energy matrices. Our simulations reveal important effects of the parameters on MPRA data and we demonstrate our ability to optimize MPRA experimental designs with the goal of generating thermodynamic models of the transcriptome with base-pair specificity. Further, this approach makes it possible to carefully examine the mapping between mutations in binding sites and their corresponding expression profiles, a tool useful not only for better designing MPRAs, but also for exploring regulatory evolution.
Collapse
Affiliation(s)
- Rosalind Wenshan Pan
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA
| | - Tom Röschinger
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA
| | - Kian Faizi
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA
| | - Hernan Garcia
- Biophysics Graduate Group, University of California, Berkeley, CA
- Department of Physics, University of California, Berkeley, CA
- Department of Molecular and Cell Biology, University of California, Berkeley, CA
- Institute for Quantitative Biosciences-QB3, University of California, Berkeley, CA
| | - Rob Phillips
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA
- Department of Physics, California Institute of Technology, Pasadena, CA
| |
Collapse
|
2
|
Pan RW, Röschinger T, Faizi K, Garcia H, Phillips R. Deciphering regulatory architectures from synthetic single-cell expression patterns. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.01.28.577658. [PMID: 38352569 PMCID: PMC10862715 DOI: 10.1101/2024.01.28.577658] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/22/2024]
Abstract
For the vast majority of genes in sequenced genomes, there is limited understanding of how they are regulated. Without such knowledge, it is not possible to perform a quantitative theory-experiment dialogue on how such genes give rise to physiological and evolutionary adaptation. One category of high-throughput experiments used to understand the sequence-phenotype relationship of the transcriptome is massively parallel reporter assays (MPRAs). However, to improve the versatility and scalability of MPRA pipelines, we need a "theory of the experiment" to help us better understand the impact of various biological and experimental parameters on the interpretation of experimental data. These parameters include binding site copy number, where a large number of specific binding sites may titrate away transcription factors, as well as the presence of overlapping binding sites, which may affect analysis of the degree of mutual dependence between mutations in the regulatory region and expression levels. To that end, in this paper we create tens of thousands of synthetic single-cell gene expression outputs using both equilibrium and out-of-equilibrium models. These models make it possible to imitate the summary statistics (information footprints and expression shift matrices) used to characterize the output of MPRAs and from this summary statistic to infer the underlying regulatory architecture. Specifically, we use a more refined implementation of the so-called thermodynamic models in which the binding energies of each sequence variant are derived from energy matrices. Our simulations reveal important effects of the parameters on MPRA data and we demonstrate our ability to optimize MPRA experimental designs with the goal of generating thermodynamic models of the transcriptome with base-pair specificity. Further, this approach makes it possible to carefully examine the mapping between mutations in binding sites and their corresponding expression profiles, a tool useful not only for better designing MPRAs, but also for exploring regulatory evolution.
Collapse
Affiliation(s)
- Rosalind Wenshan Pan
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA
| | - Tom Röschinger
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA
| | - Kian Faizi
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA
| | - Hernan Garcia
- Biophysics Graduate Group, University of California, Berkeley, CA
- Department of Physics, University of California, Berkeley, CA
- Department of Molecular and Cell Biology, University of California, Berkeley, CA
- Institute for Quantitative Biosciences-QB3, University of California, Berkeley, CA
| | - Rob Phillips
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA
- Department of Physics, California Institute of Technology, Pasadena, CA
| |
Collapse
|
3
|
Saunders SH, Ahmed AM. ORBIT for E. coli: kilobase-scale oligonucleotide recombineering at high throughput and high efficiency. Nucleic Acids Res 2024; 52:e43. [PMID: 38587185 PMCID: PMC11077079 DOI: 10.1093/nar/gkae227] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2023] [Revised: 02/28/2024] [Accepted: 03/19/2024] [Indexed: 04/09/2024] Open
Abstract
Microbiology and synthetic biology depend on reverse genetic approaches to manipulate bacterial genomes; however, existing methods require molecular biology to generate genomic homology, suffer from low efficiency, and are not easily scaled to high throughput. To overcome these limitations, we developed a system for creating kilobase-scale genomic modifications that uses DNA oligonucleotides to direct the integration of a non-replicating plasmid. This method, Oligonucleotide Recombineering followed by Bxb-1 Integrase Targeting (ORBIT) was pioneered in Mycobacteria, and here we adapt and expand it for Escherichia coli. Our redesigned plasmid toolkit for oligonucleotide recombineering achieved significantly higher efficiency than λ Red double-stranded DNA recombineering and enabled precise, stable knockouts (≤134 kb) and integrations (≤11 kb) of various sizes. Additionally, we constructed multi-mutants in a single transformation, using orthogonal attachment sites. At high throughput, we used pools of targeting oligonucleotides to knock out nearly all known transcription factor and small RNA genes, yielding accurate, genome-wide, single mutant libraries. By counting genomic barcodes, we also show ORBIT libraries can scale to thousands of unique members (>30k). This work demonstrates that ORBIT for E. coli is a flexible reverse genetic system that facilitates rapid construction of complex strains and readily scales to create sophisticated mutant libraries.
Collapse
Affiliation(s)
- Scott H Saunders
- Green Center for Systems Biology - Lyda Hill Department of Bioinformatics, University of Texas Southwestern Medical Center, Dallas, TX 75320, USA
| | - Ayesha M Ahmed
- Green Center for Systems Biology - Lyda Hill Department of Bioinformatics, University of Texas Southwestern Medical Center, Dallas, TX 75320, USA
| |
Collapse
|
4
|
Kang CK, Kim AR. Deep molecular learning of transcriptional control of a synthetic CRE enhancer and its variants. iScience 2024; 27:108747. [PMID: 38222110 PMCID: PMC10784702 DOI: 10.1016/j.isci.2023.108747] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2023] [Revised: 08/29/2023] [Accepted: 12/12/2023] [Indexed: 01/16/2024] Open
Abstract
Massively parallel reporter assay measures transcriptional activities of various cis-regulatory modules (CRMs) in a single experiment. We developed a thermodynamic computational model framework that calculates quantitative levels of gene expression directly from regulatory DNA sequences. Using the framework, we investigated the molecular mechanisms of cis-regulatory mutations of a synthetic enhancer that cause abnormal gene expression. We found that, in a human cell line, competitive binding between family transcription factors (TFs) with slightly different binding preferences significantly increases the accuracy of recapitulating the transcriptional effects of thousands of single- or multi-mutations. We also discovered that even if various harmful mutations occurred in an activator binding site, CRM could stably maintain or even increase gene expression through a certain form of competitive binding between family TFs. These findings enhance understanding the effect of SNPs and indels on CRMs and would help building robust custom-designed CRMs for biologics production and gene therapy.
Collapse
Affiliation(s)
- Chan-Koo Kang
- School of Life Science, Handong Global University, Pohang, Gyeong-Buk 37554, South Korea
- Department of Advanced Convergence, Handong Global University, Pohang, Gyeong-Buk 37554, South Korea
| | - Ah-Ram Kim
- School of Life Science, Handong Global University, Pohang, Gyeong-Buk 37554, South Korea
- Department of Advanced Convergence, Handong Global University, Pohang, Gyeong-Buk 37554, South Korea
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
- School of Applied Artificial Intelligence, Handong Global University, Pohang, Gyeong-Buk 37554, South Korea
| |
Collapse
|
5
|
Loell KJ, Friedman RZ, Myers CA, Corbo JC, Cohen BA, White MA. Transcription factor interactions explain the context-dependent activity of CRX binding sites. PLoS Comput Biol 2024; 20:e1011802. [PMID: 38227575 PMCID: PMC10817189 DOI: 10.1371/journal.pcbi.1011802] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Revised: 01/26/2024] [Accepted: 01/06/2024] [Indexed: 01/18/2024] Open
Abstract
The effects of transcription factor binding sites (TFBSs) on the activity of a cis-regulatory element (CRE) depend on the local sequence context. In rod photoreceptors, binding sites for the transcription factor (TF) Cone-rod homeobox (CRX) occur in both enhancers and silencers, but the sequence context that determines whether CRX binding sites contribute to activation or repression of transcription is not understood. To investigate the context-dependent activity of CRX sites, we fit neural network-based models to the activities of synthetic CREs composed of photoreceptor TFBSs. The models revealed that CRX binding sites consistently make positive, independent contributions to CRE activity, while negative homotypic interactions between sites cause CREs composed of multiple CRX sites to function as silencers. The effects of negative homotypic interactions can be overcome by the presence of other TFBSs that either interact cooperatively with CRX sites or make independent positive contributions to activity. The context-dependent activity of CRX sites is thus determined by the balance between positive heterotypic interactions, independent contributions of TFBSs, and negative homotypic interactions. Our findings explain observed patterns of activity among genomic CRX-bound enhancers and silencers, and suggest that enhancers may require diverse TFBSs to overcome negative homotypic interactions between TFBSs.
Collapse
Affiliation(s)
- Kaiser J. Loell
- Department of Genetics, Washington University School of Medicine in St. Louis, St. Louis, Missouri, United States of America
- The Edison Family Center for Genome Sciences & Systems Biology, Washington University School of Medicine in St. Louis, St. Louis, Missouri, United States of America
| | - Ryan Z. Friedman
- Department of Genetics, Washington University School of Medicine in St. Louis, St. Louis, Missouri, United States of America
- The Edison Family Center for Genome Sciences & Systems Biology, Washington University School of Medicine in St. Louis, St. Louis, Missouri, United States of America
| | - Connie A. Myers
- Department of Pathology and Immunology, Washington University School of Medicine in St. Louis, St. Louis, Missouri, United States of America
| | - Joseph C. Corbo
- Department of Pathology and Immunology, Washington University School of Medicine in St. Louis, St. Louis, Missouri, United States of America
| | - Barak A. Cohen
- Department of Genetics, Washington University School of Medicine in St. Louis, St. Louis, Missouri, United States of America
- The Edison Family Center for Genome Sciences & Systems Biology, Washington University School of Medicine in St. Louis, St. Louis, Missouri, United States of America
| | - Michael A. White
- Department of Genetics, Washington University School of Medicine in St. Louis, St. Louis, Missouri, United States of America
- The Edison Family Center for Genome Sciences & Systems Biology, Washington University School of Medicine in St. Louis, St. Louis, Missouri, United States of America
| |
Collapse
|
6
|
Feng H, Li F, Wang T, Xing XH, Zeng AP, Zhang C. Deep-learning-assisted Sort-Seq enables high-throughput profiling of gene expression characteristics with high precision. SCIENCE ADVANCES 2023; 9:eadg5296. [PMID: 37939173 PMCID: PMC10631719 DOI: 10.1126/sciadv.adg5296] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/04/2023] [Accepted: 10/06/2023] [Indexed: 11/10/2023]
Abstract
Owing to the nondeterministic and nonlinear nature of gene expression, the steady-state intracellular protein abundance of a clonal population forms a distribution. The characteristics of this distribution, including expression strength and noise, are closely related to cellular behavior. However, quantitative description of these characteristics has so far relied on arrayed methods, which are time-consuming and labor-intensive. To address this issue, we propose a deep-learning-assisted Sort-Seq approach (dSort-Seq) in this work, enabling high-throughput profiling of expression properties with high precision. We demonstrated the validity of dSort-Seq for large-scale assaying of the dose-response relationships of biosensors. In addition, we comprehensively investigated the contribution of transcription and translation to noise production in Escherichia coli, from which we found that the expression noise is strongly coupled with the mean expression level. We also found that the transcriptional interference caused by overlapping RpoD-binding sites contributes to noise production, which suggested the existence of a simple and feasible noise control strategy in E. coli.
Collapse
Affiliation(s)
- Huibao Feng
- MOE Key Laboratory for Industrial Biocatalysis, Institute of Biochemical Engineering, Department of Chemical Engineering, Tsinghua University, Beijing 100084, China
| | - Fan Li
- MOE Key Laboratory for Industrial Biocatalysis, Institute of Biochemical Engineering, Department of Chemical Engineering, Tsinghua University, Beijing 100084, China
| | - Tianmin Wang
- Tsinghua-Peking Center for Life Sciences, School of Medicine, Tsinghua University, Beijing 100084, China
- School of Life Science and Technology, ShanghaiTech University, Shanghai 201210, China
| | - Xin-hui Xing
- MOE Key Laboratory for Industrial Biocatalysis, Institute of Biochemical Engineering, Department of Chemical Engineering, Tsinghua University, Beijing 100084, China
- Shenzhen International Graduate School, Tsinghua University, Shenzhen 518055, China
| | - An-ping Zeng
- Institute of Bioprocess and Biosystems Engineering, Hamburg University of Technology, Hamburg 21073, Germany
- Center of Synthetic Biology and Integrated Bioengineering, School of Engineering, Westlake University, Hangzhou 310024, China
| | - Chong Zhang
- MOE Key Laboratory for Industrial Biocatalysis, Institute of Biochemical Engineering, Department of Chemical Engineering, Tsinghua University, Beijing 100084, China
- Center for Synthetic and Systems Biology, Tsinghua University, Beijing 100084, China
| |
Collapse
|
7
|
Han Y, Li W, Filko A, Li J, Zhang F. Genome-wide promoter responses to CRISPR perturbations of regulators reveal regulatory networks in Escherichia coli. Nat Commun 2023; 14:5757. [PMID: 37717013 PMCID: PMC10505187 DOI: 10.1038/s41467-023-41572-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2022] [Accepted: 09/08/2023] [Indexed: 09/18/2023] Open
Abstract
Elucidating genome-scale regulatory networks requires a comprehensive collection of gene expression profiles, yet measuring gene expression responses for every transcription factor (TF)-gene pair in living prokaryotic cells remains challenging. Here, we develop pooled promoter responses to TF perturbation sequencing (PPTP-seq) via CRISPR interference to address this challenge. Using PPTP-seq, we systematically measure the activity of 1372 Escherichia coli promoters under single knockdown of 183 TF genes, illustrating more than 200,000 possible TF-gene responses in one experiment. We perform PPTP-seq for E. coli growing in three different media. The PPTP-seq data reveal robust steady-state promoter activities under most single TF knockdown conditions. PPTP-seq also enables identifications of, to the best of our knowledge, previously unknown TF autoregulatory responses and complex transcriptional control on one-carbon metabolism. We further find context-dependent promoter regulation by multiple TFs whose relative binding strengths determined promoter activities. Additionally, PPTP-seq reveals different promoter responses in different growth media, suggesting condition-specific gene regulation. Overall, PPTP-seq provides a powerful method to examine genome-wide transcriptional regulatory networks and can be potentially expanded to reveal gene expression responses to other genetic elements.
Collapse
Affiliation(s)
- Yichao Han
- Department of Energy, Environmental and Chemical Engineering, Washington University in St. Louis, Saint Louis, Missouri, USA
| | - Wanji Li
- Department of Energy, Environmental and Chemical Engineering, Washington University in St. Louis, Saint Louis, Missouri, USA
| | - Alden Filko
- Department of Energy, Environmental and Chemical Engineering, Washington University in St. Louis, Saint Louis, Missouri, USA
| | - Jingyao Li
- Department of Energy, Environmental and Chemical Engineering, Washington University in St. Louis, Saint Louis, Missouri, USA
| | - Fuzhong Zhang
- Department of Energy, Environmental and Chemical Engineering, Washington University in St. Louis, Saint Louis, Missouri, USA.
- Division of Biological and Biomedical Sciences, Washington University in St. Louis, Saint Louis, Missouri, USA.
- Institute of Materials Science and Engineering, Washington University in St. Louis, Saint Louis, Missouri, USA.
| |
Collapse
|
8
|
Takano S, Vila JCC, Miyazaki R, Sánchez Á, Bajić D. The Architecture of Metabolic Networks Constrains the Evolution of Microbial Resource Hierarchies. Mol Biol Evol 2023; 40:msad187. [PMID: 37619982 PMCID: PMC10476156 DOI: 10.1093/molbev/msad187] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2023] [Revised: 07/18/2023] [Accepted: 08/07/2023] [Indexed: 08/26/2023] Open
Abstract
Microbial strategies for resource use are an essential determinant of their fitness in complex habitats. When facing environments with multiple nutrients, microbes often use them sequentially according to a preference hierarchy, resulting in well-known patterns of diauxic growth. In theory, the evolutionary diversification of metabolic hierarchies could represent a mechanism supporting coexistence and biodiversity by enabling temporal segregation of niches. Despite this ecologically critical role, the extent to which substrate preference hierarchies can evolve and diversify remains largely unexplored. Here, we used genome-scale metabolic modeling to systematically explore the evolution of metabolic hierarchies across a vast space of metabolic network genotypes. We find that only a limited number of metabolic hierarchies can readily evolve, corresponding to the most commonly observed hierarchies in genome-derived models. We further show how the evolution of novel hierarchies is constrained by the architecture of central metabolism, which determines both the propensity to change ranks between pairs of substrates and the effect of specific reactions on hierarchy evolution. Our analysis sheds light on the genetic and mechanistic determinants of microbial metabolic hierarchies, opening new research avenues to understand their evolution, evolvability, and ecology.
Collapse
Affiliation(s)
- Sotaro Takano
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT, USA
- Microbial Sciences Institute, Yale University, New Haven, CT, USA
- Bioproduction Research Institute, National Institute of Advanced Industrial Science and Technology (AIST), Tsukuba, Japan
| | - Jean C C Vila
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT, USA
- Microbial Sciences Institute, Yale University, New Haven, CT, USA
- Department of Biology, Stanford University, Stanford, CA, USA
| | - Ryo Miyazaki
- Bioproduction Research Institute, National Institute of Advanced Industrial Science and Technology (AIST), Tsukuba, Japan
- Computational Bio Big Data Open Innovation Laboratory (CBBD-OIL), AIST, Tokyo, Japan
| | - Álvaro Sánchez
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT, USA
- Microbial Sciences Institute, Yale University, New Haven, CT, USA
- Department of Microbial Biotechnology, CNB-CSIC, Campus de Cantoblanco, Madrid, Spain
| | - Djordje Bajić
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT, USA
- Microbial Sciences Institute, Yale University, New Haven, CT, USA
- Section of Industrial Microbiology, Department of Biotechnology, Technical University Delft, Delft, The Netherlands
| |
Collapse
|
9
|
Schultz D, Stevanovic M, Tsimring LS. Optimal transcriptional regulation of dynamic bacterial responses to sudden drug exposures. Biophys J 2022; 121:4137-4152. [PMID: 36168291 PMCID: PMC9675034 DOI: 10.1016/j.bpj.2022.09.028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2022] [Revised: 08/22/2022] [Accepted: 09/23/2022] [Indexed: 11/17/2022] Open
Abstract
Cellular responses to the presence of toxic compounds in their environment require prompt expression of the correct levels of the appropriate enzymes, which are typically regulated by transcription factors that control gene expression for the duration of the response. The characteristics of each response dictate the choice of regulatory parameters such as the affinity of the transcription factor to its binding sites and the strength of the promoters it regulates. Although much is known about the dynamics of cellular responses, we still lack a framework to understand how different regulatory strategies evolved in natural systems relate to the selective pressures acting in each particular case. Here, we analyze a dynamical model of a typical antibiotic response in bacteria, where a transcriptionally repressed enzyme is induced by a sudden exposure to the drug that it processes. We identify strategies of gene regulation that optimize this response for different types of selective pressures, which we define as a set of costs associated with the drug, enzyme, and repressor concentrations during the response. We find that regulation happens in a limited region of the regulatory parameter space. While responses to more costly (toxic) drugs favor the usage of strongly self-regulated repressors, responses where expression of enzyme is more costly favor the usage of constitutively expressed repressors. Only a very narrow range of selective pressures favor weakly self-regulated repressors. We use this framework to determine which costs and benefits are most critical for the evolution of a variety of natural cellular responses that satisfy the approximations in our model and to analyze how regulation is optimized in new environments with different demands.
Collapse
Affiliation(s)
- Daniel Schultz
- Department of Microbiology and Immunology, Geisel School of Medicine at Dartmouth, Hanover, New Hampshire.
| | - Mirjana Stevanovic
- Department of Microbiology and Immunology, Geisel School of Medicine at Dartmouth, Hanover, New Hampshire
| | - Lev S Tsimring
- Synthetic Biology Institute, University of California, San Diego, La Jolla, California
| |
Collapse
|
10
|
Gene regulation in Escherichia coli is commonly selected for both high plasticity and low noise. Nat Ecol Evol 2022; 6:1165-1179. [PMID: 35726087 DOI: 10.1038/s41559-022-01783-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2021] [Accepted: 05/03/2022] [Indexed: 11/08/2022]
Abstract
Bacteria often respond to dynamically changing environments by regulating gene expression. Despite this regulation being critically important for growth and survival, little is known about how selection shapes gene regulation in natural populations. To better understand the role natural selection plays in shaping bacterial gene regulation, here we compare differences in the regulatory behaviour of naturally segregating promoter variants from Escherichia coli (which have been subject to natural selection) to randomly mutated promoter variants (which have never been exposed to natural selection). We quantify gene expression phenotypes (expression level, plasticity and noise) for hundreds of promoter variants across multiple environments and show that segregating promoter variants are enriched for mutations with minimal effects on expression level. In many promoters, we infer that there is strong selection to maintain high levels of plasticity, and direct selection to decrease or increase cell-to-cell variability in expression. Taken together, these results expand our knowledge of how gene regulation is affected by natural selection and highlight the power of comparing naturally segregating polymorphisms to de novo random mutations to quantify the action of selection.
Collapse
|
11
|
Vaknin I, Amit R. Molecular and experimental tools to design synthetic enhancers. Curr Opin Biotechnol 2022; 76:102728. [PMID: 35525178 DOI: 10.1016/j.copbio.2022.102728] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2021] [Revised: 03/16/2022] [Accepted: 04/03/2022] [Indexed: 11/03/2022]
Abstract
Understanding the grammar of enhancers and how they regulate gene expression is key for both basic research and for the pharma and biotech industries. The design and characterization of synthetic enhancers can expand the known regulatory space. This is achieved by the utilization of DNA Oligo Libraries (OLs), which facilitates screening of as many as millions of synthetic enhancer variants simultaneously. This review includes the latest commercial DNA OL synthesis technology and its capabilities, and a general 'know-how' guide for the design, construction, and analysis of OL-based synthetic enhancer characterization experiments. Specifically, we focus on synthetic-enhancer-based massively parallel reporter assay, Sort-seq methodologies (e.g. flow cytometry, deep sequencing), and a brief description of machine learning-based attempts for OL-analysis and follow-up validation experiments.
Collapse
Affiliation(s)
- Inbal Vaknin
- Department of Biotechnology and Food Engineering, Technion - Israel Institute of Technology, Haifa 3200000, Israel
| | - Roee Amit
- Department of Biotechnology and Food Engineering, Technion - Israel Institute of Technology, Haifa 3200000, Israel; The Russell Berrie Nanotechnology Institute, Technion - Israel Institute of Technology, Haifa 3200000, Israel.
| |
Collapse
|
12
|
Tareen A, Kooshkbaghi M, Posfai A, Ireland WT, McCandlish DM, Kinney JB. MAVE-NN: learning genotype-phenotype maps from multiplex assays of variant effect. Genome Biol 2022; 23:98. [PMID: 35428271 PMCID: PMC9011994 DOI: 10.1186/s13059-022-02661-7] [Citation(s) in RCA: 21] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2021] [Accepted: 03/24/2022] [Indexed: 12/17/2022] Open
Abstract
Multiplex assays of variant effect (MAVEs) are a family of methods that includes deep mutational scanning experiments on proteins and massively parallel reporter assays on gene regulatory sequences. Despite their increasing popularity, a general strategy for inferring quantitative models of genotype-phenotype maps from MAVE data is lacking. Here we introduce MAVE-NN, a neural-network-based Python package that implements a broadly applicable information-theoretic framework for learning genotype-phenotype maps—including biophysically interpretable models—from MAVE datasets. We demonstrate MAVE-NN in multiple biological contexts, and highlight the ability of our approach to deconvolve mutational effects from otherwise confounding experimental nonlinearities and noise.
Collapse
|
13
|
Kim NM, Sinnott RW, Rothschild LN, Sandoval NR. Elucidation of Sequence-Function Relationships for an Improved Biobutanol In Vivo Biosensor in E. coli. Front Bioeng Biotechnol 2022; 10:821152. [PMID: 35265600 PMCID: PMC8899819 DOI: 10.3389/fbioe.2022.821152] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2021] [Accepted: 01/17/2022] [Indexed: 11/30/2022] Open
Abstract
Transcription factor (TF)–promoter pairs have been repurposed from native hosts to provide tools to measure intracellular biochemical production titer and dynamically control gene expression. Most often, native TF–promoter systems require rigorous screening to obtain desirable characteristics optimized for biotechnological applications. High-throughput techniques may provide a rational and less labor-intensive strategy to engineer user-defined TF–promoter pairs using fluorescence-activated cell sorting and deep sequencing methods (sort-seq). Based on the designed promoter library’s distribution characteristics, we elucidate sequence–function interactions between the TF and DNA. In this work, we use the sort-seq method to study the sequence–function relationship of a σ54-dependent, butanol-responsive TF–promoter pair, BmoR-PBMO derived from Thauera butanivorans, at the nucleotide level to improve biosensor characteristics, specifically an improved dynamic range. Activities of promoters from a mutagenized PBMO library were sorted based on gfp expression and subsequently deep sequenced to correlate site-specific sequences with changes in dynamic range. We identified site-specific mutations that increase the sensor output. Double mutant and a single mutant, CA(129,130)TC and G(205)A, in PBMO promoter increased dynamic ranges of 4-fold and 1.65-fold compared with the native system, respectively. In addition, sort-seq identified essential sites required for the proper function of the σ54-dependent promoter biosensor in the context of the host. This work can enable high-throughput screening methods for strain development.
Collapse
Affiliation(s)
- Nancy M Kim
- Interdisciplinary Bioinnovation PhD Program, Tulane University, New Orleans, LA, United States
| | - Riley W Sinnott
- Department of Chemical & Biomolecular Engineering, Tulane University, New Orleans, LA, United States
| | - Lily N Rothschild
- Department of Chemical & Biomolecular Engineering, Tulane University, New Orleans, LA, United States
| | - Nicholas R Sandoval
- Department of Chemical & Biomolecular Engineering, Tulane University, New Orleans, LA, United States
| |
Collapse
|
14
|
Engineering eukaryote-like regulatory circuits to expand artificial control mechanisms for metabolic engineering in Saccharomyces cerevisiae. Commun Biol 2022; 5:135. [PMID: 35173283 PMCID: PMC8850539 DOI: 10.1038/s42003-022-03070-z] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2021] [Accepted: 01/20/2022] [Indexed: 12/22/2022] Open
Abstract
Temporal control of heterologous pathway expression is critical to achieve optimal efficiency in microbial metabolic engineering. The broadly-used GAL promoter system for engineered yeast (Saccharomyces cerevisiae) suffers from several drawbacks; specifically, unintended induction during laboratory development, and unintended repression in industrial production applications, which decreases overall production capacity. Eukaryotic synthetic circuits have not been well examined to address these problems. Here, we explore a modularised engineering method to deploy new genetic circuits applicable for expanding the control of GAL promoter-driven heterologous pathways in S. cerevisiae. Trans- and cis- modules, including eukaryotic trans-activating-and-repressing mechanisms, were characterised to provide new and better tools for circuit design. A eukaryote-like tetracycline-mediated circuit that delivers stringent repression was engineered to minimise metabolic burden during strain development and maintenance. This was combined with a novel 37 °C induction circuit to relief glucose-mediated repression on the GAL promoter during the bioprocess. This delivered a 44% increase in production of the terpenoid nerolidol, to 2.54 g L-1 in flask cultivation. These negative/positive transcriptional regulatory circuits expand global strategies of metabolic control to facilitate laboratory maintenance and for industry applications.
Collapse
|
15
|
Lagator M, Sarikas S, Steinrueck M, Toledo-Aparicio D, Bollback JP, Guet CC, Tkačik G. Predicting bacterial promoter function and evolution from random sequences. eLife 2022; 11:64543. [PMID: 35080492 PMCID: PMC8791639 DOI: 10.7554/elife.64543] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2020] [Accepted: 01/09/2022] [Indexed: 12/12/2022] Open
Abstract
Predicting function from sequence is a central problem of biology. Currently, this is possible only locally in a narrow mutational neighborhood around a wildtype sequence rather than globally from any sequence. Using random mutant libraries, we developed a biophysical model that accounts for multiple features of σ70 binding bacterial promoters to predict constitutive gene expression levels from any sequence. We experimentally and theoretically estimated that 10–20% of random sequences lead to expression and ~80% of non-expressing sequences are one mutation away from a functional promoter. The potential for generating expression from random sequences is so pervasive that selection acts against σ70-RNA polymerase binding sites even within inter-genic, promoter-containing regions. This pervasiveness of σ70-binding sites implies that emergence of promoters is not the limiting step in gene regulatory evolution. Ultimately, the inclusion of novel features of promoter function into a mechanistic model enabled not only more accurate predictions of gene expression levels, but also identified that promoters evolve more rapidly than previously thought.
Collapse
Affiliation(s)
- Mato Lagator
- School of Biological Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, United Kingdom.,Institute of Science and Technology Austria, Klosterneuburg, Austria
| | - Srdjan Sarikas
- Institute of Science and Technology Austria, Klosterneuburg, Austria.,Center for Physiology and Pharmacology, Medical University of Vienna, Klosterneuburg, Austria
| | | | | | - Jonathan P Bollback
- Institute of Integrative Biology, Functional and Comparative Genomics, University of Liverpool, Liverpool, United Kingdom
| | - Calin C Guet
- Institute of Science and Technology Austria, Klosterneuburg, Austria
| | - Gašper Tkačik
- Institute of Science and Technology Austria, Klosterneuburg, Austria
| |
Collapse
|
16
|
SpeedyGenesXL: an Automated, High-Throughput Platform for the Preparation of Bespoke Ultralarge Variant Libraries for Directed Evolution. Methods Mol Biol 2022; 2461:67-83. [PMID: 35727444 DOI: 10.1007/978-1-0716-2152-3_5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
Directed evolution of proteins is a highly effective strategy for tailoring biocatalysts to a particular application, and is capable of engineering improvements such as kcat, thermostability and organic solvent tolerance. It is recognized that large and systematic libraries are required to navigate a protein's vast and rugged sequence landscape effectively, yet their preparation is nontrivial and commercial libraries are extremely costly. To address this, we have developed SpeedyGenesXL, an automated, high-throughput platform for the production of wild-type genes, Boolean OR, combinatorial, or combinatorial-OR-type libraries based on the SpeedyGenes methodology. Together this offers a flexible platform for library synthesis, capable of generating many different bespoke, diverse libraries simultaneously.
Collapse
|
17
|
May MP, Munsky B. Exploiting Noise, Non-Linearity, and Feedback for Differential Control of Multiple Synthetic Cells with a Single Optogenetic Input. ACS Synth Biol 2021; 10:3396-3410. [PMID: 34793137 PMCID: PMC9875732 DOI: 10.1021/acssynbio.1c00341] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
Synthetic biology seeks to develop modular biocircuits that combine to produce complex, controllable behaviors. These designs are often subject to noisy fluctuations and uncertainties, and most modern synthetic biology design processes have focused to create robust components to mitigate the noise of gene expression and reduce the heterogeneity of single-cell responses. However, a deeper understanding of noise can achieve control goals that would otherwise be impossible. We explore how an "Optogenetic Maxwell Demon" could selectively amplify noise to control multiple cells using single-input-multiple-output (SIMO) feedback. Using data-constrained stochastic model simulations and theory, we show how an appropriately selected stochastic SIMO controller can drive multiple different cells to different user-specified configurations irrespective of initial conditions. We explore how controllability depends on cells' regulatory structures, the amount of information available to the controller, and the accuracy of the model used. Our results suggest that gene regulation noise, when combined with optogenetic feedback and non-linear biochemical auto-regulation, can achieve synergy to enable precise control of complex stochastic processes.
Collapse
Affiliation(s)
- Michael P May
- School of Biomedical Engineering, Colorado State University, Fort Collins, CO, USA, 80523
| | - Brian Munsky
- School of Biomedical Engineering, Colorado State University, Fort Collins, CO, USA, 80523,Department of Chemical and Biological Engineering, Colorado State University, Fort Collins, CO, USA, 80523
| |
Collapse
|
18
|
Lee HM, Ren J, Yu MS, Kim H, Kim WY, Shen J, Yoo SM, Eyun SI, Na D. Construction of a tunable promoter library to optimize gene expression in Methylomonas sp. DH-1, a methanotroph, and its application to cadaverine production. BIOTECHNOLOGY FOR BIOFUELS 2021; 14:228. [PMID: 34863247 PMCID: PMC8645107 DOI: 10.1186/s13068-021-02077-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/16/2021] [Accepted: 11/16/2021] [Indexed: 05/11/2023]
Abstract
BACKGROUND As methane is 84 times more potent than carbon dioxide in exacerbating the greenhouse effect, there is an increasing interest in the utilization of methanotrophic bacteria that can convert harmful methane into various value-added compounds. A recently isolated methanotroph, Methylomonas sp. DH-1, is a promising biofactory platform because of its relatively fast growth. However, the lack of genetic engineering tools hampers its wide use in the bioindustry. RESULTS Through three different approaches, we constructed a tunable promoter library comprising 33 promoters that can be used for the metabolic engineering of Methylomonas sp. DH-1. The library had an expression level of 0.24-410% when compared with the strength of the lac promoter. For practical application of the promoter library, we fine-tuned the expressions of cadA and cadB genes, required for cadaverine synthesis and export, respectively. The strain with PrpmB-cadA and PDnaA-cadB produced the highest cadaverine titre (18.12 ± 1.06 mg/L) in Methylomonas sp. DH-1, which was up to 2.8-fold higher than that obtained from a non-optimized strain. In addition, cell growth and lysine (a precursor of cadaverine) production assays suggested that gene expression optimization through transcription tuning can afford a balance between the growth and precursor supply. CONCLUSIONS The tunable promoter library provides standard and tunable components for gene expression, thereby facilitating the use of methanotrophs, specifically Methylomonas sp. DH-1, as a sustainable cell factory.
Collapse
Affiliation(s)
- Hyang-Mi Lee
- Department of Biomedical Engineering, Chung-Ang University, 84 Heukseok-ro Dongjak-gu, Seoul, 06974, Republic of Korea
| | - Jun Ren
- Department of Biomedical Engineering, Chung-Ang University, 84 Heukseok-ro Dongjak-gu, Seoul, 06974, Republic of Korea
| | - Myeong-Sang Yu
- Department of Biomedical Engineering, Chung-Ang University, 84 Heukseok-ro Dongjak-gu, Seoul, 06974, Republic of Korea
| | - Hyunjoo Kim
- Department of Biomedical Engineering, Chung-Ang University, 84 Heukseok-ro Dongjak-gu, Seoul, 06974, Republic of Korea
| | - Woo Young Kim
- Department of Biomedical Engineering, Chung-Ang University, 84 Heukseok-ro Dongjak-gu, Seoul, 06974, Republic of Korea
| | - Junhao Shen
- Department of Biomedical Engineering, Chung-Ang University, 84 Heukseok-ro Dongjak-gu, Seoul, 06974, Republic of Korea
| | - Seung Min Yoo
- Department of Biomedical Engineering, Chung-Ang University, 84 Heukseok-ro Dongjak-gu, Seoul, 06974, Republic of Korea
| | - Seong-Il Eyun
- Department of Life Science, Chung-Ang University, Seoul, 06974, Republic of Korea
| | - Dokyun Na
- Department of Biomedical Engineering, Chung-Ang University, 84 Heukseok-ro Dongjak-gu, Seoul, 06974, Republic of Korea.
| |
Collapse
|
19
|
Tietze L, Lale R. Importance of the 5' regulatory region to bacterial synthetic biology applications. Microb Biotechnol 2021; 14:2291-2315. [PMID: 34171170 PMCID: PMC8601185 DOI: 10.1111/1751-7915.13868] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2021] [Revised: 06/03/2021] [Accepted: 06/04/2021] [Indexed: 01/02/2023] Open
Abstract
The field of synthetic biology is evolving at a fast pace. It is advancing beyond single-gene alterations in single hosts to the logical design of complex circuits and the development of integrated synthetic genomes. Recent breakthroughs in deep learning, which is increasingly used in de novo assembly of DNA components with predictable effects, are also aiding the discipline. Despite advances in computing, the field is still reliant on the availability of pre-characterized DNA parts, whether natural or synthetic, to regulate gene expression in bacteria and make valuable compounds. In this review, we discuss the different bacterial synthetic biology methodologies employed in the creation of 5' regulatory regions - promoters, untranslated regions and 5'-end of coding sequences. We summarize methodologies and discuss their significance for each of the functional DNA components, and highlight the key advances made in bacterial engineering by concentrating on their flaws and strengths. We end the review by outlining the issues that the discipline may face in the near future.
Collapse
Affiliation(s)
- Lisa Tietze
- PhotoSynLabDepartment of BiotechnologyFaculty of Natural SciencesNorwegian University of Science and TechnologyTrondheimN‐7491Norway
| | - Rahmi Lale
- PhotoSynLabDepartment of BiotechnologyFaculty of Natural SciencesNorwegian University of Science and TechnologyTrondheimN‐7491Norway
| |
Collapse
|
20
|
Belliveau NM, Chure G, Hueschen CL, Garcia HG, Kondev J, Fisher DS, Theriot JA, Phillips R. Fundamental limits on the rate of bacterial growth and their influence on proteomic composition. Cell Syst 2021; 12:924-944.e2. [PMID: 34214468 PMCID: PMC8460600 DOI: 10.1016/j.cels.2021.06.002] [Citation(s) in RCA: 32] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2020] [Revised: 04/12/2021] [Accepted: 06/04/2021] [Indexed: 12/11/2022]
Abstract
Despite abundant measurements of bacterial growth rate, cell size, and protein content, we lack a rigorous understanding of what sets the scale of these quantities and when protein abundances should (or should not) depend on growth rate. Here, we estimate the basic requirements and physical constraints on steady-state growth by considering key processes in cellular physiology across a collection of Escherichia coli proteomic data covering ≈4,000 proteins and 36 growth rates. Our analysis suggests that cells are predominantly tuned for the task of cell doubling across a continuum of growth rates; specific processes do not limit growth rate or dictate cell size. We present a model of proteomic regulation as a function of nutrient supply that reconciles observed interdependences between protein synthesis, cell size, and growth rate and propose that a theoretical inability to parallelize ribosomal synthesis places a firm limit on the achievable growth rate. A record of this paper's transparent peer review process is included in the supplemental information.
Collapse
Affiliation(s)
- Nathan M Belliveau
- Department of Biology, Howard Hughes Medical Institute, University of Washington, Seattle, WA 98105, USA
| | - Griffin Chure
- Department of Applied Physics, California Institute of Technology, Pasadena, CA 91125, USA
| | - Christina L Hueschen
- Department of Chemical Engineering, Stanford University, Stanford, CA 94305, USA
| | - Hernan G Garcia
- Department of Molecular Cell Biology and Department of Physics, University of California Berkeley, Berkeley, CA 94720, USA
| | - Jane Kondev
- Department of Physics, Brandeis University, Waltham, MA 02453, USA
| | - Daniel S Fisher
- Department of Applied Physics, Stanford University, Stanford, CA 94305, USA
| | - Julie A Theriot
- Department of Biology, Howard Hughes Medical Institute, University of Washington, Seattle, WA 98105, USA.
| | - Rob Phillips
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125, USA; Department of Physics, California Institute of Technology, Pasadena, CA 91125, USA.
| |
Collapse
|
21
|
Abstract
Bacterial protein synthesis rates have evolved to maintain preferred stoichiometries at striking precision, from the components of protein complexes to constituents of entire pathways. Setting relative protein production rates to be well within a factor of two requires concerted tuning of transcription, RNA turnover, and translation, allowing many potential regulatory strategies to achieve the preferred output. The last decade has seen a greatly expanded capacity for precise interrogation of each step of the central dogma genome-wide. Here, we summarize how these technologies have shaped the current understanding of diverse bacterial regulatory architectures underpinning stoichiometric protein synthesis. We focus on the emerging expanded view of bacterial operons, which encode diverse primary and secondary mRNA structures for tuning protein stoichiometry. Emphasis is placed on how quantitative tuning is achieved. We discuss the challenges and open questions in the application of quantitative, genome-wide methodologies to the problem of precise protein production. Expected final online publication date for the Annual Review of Microbiology, Volume 75 is October 2021. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
Collapse
Affiliation(s)
- James C Taggart
- Department of Biology, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA; ,
| | - Jean-Benoît Lalanne
- Department of Biology, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA; , .,Department of Physics, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA.,Current affiliation: Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA;
| | - Gene-Wei Li
- Department of Biology, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA; ,
| |
Collapse
|
22
|
Lu Y, Gu X, Lin H, Melis A. Engineering microalgae: transition from empirical design to programmable cells. Crit Rev Biotechnol 2021; 41:1233-1256. [PMID: 34130561 DOI: 10.1080/07388551.2021.1917507] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Domesticated microalgae hold great promise for the sustainable provision of various bioresources for human domestic and industrial consumption. Efforts to exploit their potential are far from being fully realized due to limitations in the know-how of microalgal engineering. The associated technologies are not as well developed as those for heterotrophic microbes, cyanobacteria, and plants. However, recent studies on microalgal metabolic engineering, genome editing, and synthetic biology have immensely helped to enhance transformation efficiencies and are bringing new insights into this field. Therefore, this article, summarizes recent developments in microalgal biotechnology and examines the prospects for generating specialty and commodity products through the processes of metabolic engineering and synthetic biology. After a brief examination of empirical engineering methods and vector design, this article focuses on quantitative transformation cassette design, elaborates on target editing methods and emerging digital design of algal cellular metabolism to arrive at high yields of valuable products. These advances have enabled a transition of manners in microalgal engineering from single-gene and enzyme-based metabolic engineering to systems-level precision engineering, from cells created with genetically modified (GM) tags to that without GM tags, and ultimately from proof of concept to tangible industrial applications. Finally, future trends are proposed in microalgal engineering, aiming to establish individualized transformation systems in newly identified species for strain-specific specialty and commodity products, while developing sophisticated universal toolkits in model algal species.
Collapse
Affiliation(s)
- Yandu Lu
- State Key Laboratory of Marine Resource Utilization in the South China Sea, College of Oceanology, Hainan University, Haikou, China.,Department of Plant and Microbial Biology, University of California, Berkeley, CA, USA
| | - Xinping Gu
- State Key Laboratory of Marine Resource Utilization in the South China Sea, College of Oceanology, Hainan University, Haikou, China
| | - Hanzhi Lin
- Institute of Marine & Environmental Technology, Center for Environmental Science, University of Maryland, College Park, MD, USA
| | - Anastasios Melis
- Department of Plant and Microbial Biology, University of California, Berkeley, CA, USA
| |
Collapse
|
23
|
Freddolino PL, Amemiya HM, Goss TJ, Tavazoie S. Dynamic landscape of protein occupancy across the Escherichia coli chromosome. PLoS Biol 2021; 19:e3001306. [PMID: 34170902 PMCID: PMC8282354 DOI: 10.1371/journal.pbio.3001306] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2020] [Revised: 07/15/2021] [Accepted: 06/02/2021] [Indexed: 12/18/2022] Open
Abstract
Free-living bacteria adapt to environmental change by reprogramming gene expression through precise interactions of hundreds of DNA-binding proteins. A predictive understanding of bacterial physiology requires us to globally monitor all such protein-DNA interactions across a range of environmental and genetic perturbations. Here, we show that such global observations are possible using an optimized version of in vivo protein occupancy display technology (in vivo protein occupancy display-high resolution, IPOD-HR) and present a pilot application to Escherichia coli. We observe that the E. coli protein-DNA interactome organizes into 2 distinct prototypic features: (1) highly dynamic condition-dependent transcription factor (TF) occupancy; and (2) robust kilobase scale occupancy by nucleoid factors, forming silencing domains analogous to eukaryotic heterochromatin. We show that occupancy dynamics across a range of conditions can rapidly reveal the global transcriptional regulatory organization of a bacterium. Beyond discovery of previously hidden regulatory logic, we show that these observations can be utilized to computationally determine sequence specificity models for the majority of active TFs. Our study demonstrates that global observations of protein occupancy combined with statistical inference can rapidly and systematically reveal the transcriptional regulatory and structural features of a bacterial genome. This capacity is particularly crucial for non-model bacteria that are not amenable to routine genetic manipulation.
Collapse
Affiliation(s)
- Peter L. Freddolino
- Department of Biological Chemistry, University of Michigan Medical School, Ann Arbor, Michigan, United States of America
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, Michigan, United States of America
| | - Haley M. Amemiya
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, Michigan, United States of America
- Cellular and Molecular Biology Program, University of Michigan Medical School, Ann Arbor, Michigan, United States of America
| | - Thomas J. Goss
- Department of Biological Chemistry, University of Michigan Medical School, Ann Arbor, Michigan, United States of America
| | - Saeed Tavazoie
- Department of Biological Sciences, Columbia University, New York, New York, United States of America
- Department of Systems Biology, Columbia University, New York, New York, United States of America
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, New York, United States of America
| |
Collapse
|
24
|
A Major Facilitator Superfamily (MFS) Efflux Pump, SCO4121, from Streptomyces coelicolor with Roles in Multidrug Resistance and Oxidative Stress Tolerance and Its Regulation by a MarR Regulator. Appl Environ Microbiol 2021; 87:AEM.02238-20. [PMID: 33483304 DOI: 10.1128/aem.02238-20] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2020] [Accepted: 01/12/2021] [Indexed: 12/12/2022] Open
Abstract
Overexpression of efflux pumps is one of the major determinants of resistance in bacteria. Streptomyces species harbor a large array of efflux pumps that are transcriptionally silenced under laboratory conditions. However, their dissemination results in multidrug resistance in different clinical pathogens. In this study, we have identified an efflux pump from Streptomyces coelicolor, SCO4121, belonging to the major facilitator superfamily (MFS) family of transporters and characterized its role in antibiotic resistance. SCO4121 provided resistance to multiple dissimilar drugs upon overexpression in both native and heterologous hosts. Further, deletion of SCO4121 resulted in increased sensitivity toward ciprofloxacin and chloramphenicol, suggesting the pump to be a major transporter of these substrates. Apart from providing multidrug resistance, SCO4121 imparted increased tolerance against the strong oxidant HOCl. In wild-type Streptomyces coelicolor cells, these drugs were found to transcriptionally regulate the pump in a concentration-dependent manner. Additionally, we identified SCO4122, a MarR regulator that positively regulates SCO4121 in response to various drugs and the oxidant HOCl. Thus, through these studies we present the multiple roles of SCO4121 in S. coelicolor and highlight the intricate mechanisms via which it is regulated in response to antibiotics and oxidative stress.IMPORTANCE One of the key mechanisms of drug resistance in bacteria is overexpression of efflux pumps. Streptomyces species are a reservoir of a large number of efflux pumps, potentially to provide resistance to both endogenous and nonendogenous antibiotics. While many of these pumps are not expressed under standard laboratory conditions, they result in resistance to multiple drugs when spread to other bacterial pathogens through horizontal gene transfer. In this study, we have identified a widely conserved efflux pump SCO4121 from Streptomyces coelicolor with roles in both multidrug resistance and oxidative stress tolerance. We also report the presence of an adjacent MarR regulator, SCO4122, which positively regulates SCO4121 in the presence of diverse substrates in a redox-responsive manner. This study highlights that soil bacteria such as Streptomyces can reveal novel mechanisms of antibiotic resistance that may potentially emerge in clinically important bacteria.
Collapse
|
25
|
Gárate F, Dokas S, Lanfranco MF, Canavan C, Wang I, Correia JJ, Maillard RA. cAMP is an allosteric modulator of DNA-binding specificity in the cAMP receptor protein from Mycobacterium tuberculosis. J Biol Chem 2021; 296:100480. [PMID: 33640453 PMCID: PMC8026907 DOI: 10.1016/j.jbc.2021.100480] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2020] [Revised: 02/21/2021] [Accepted: 02/24/2021] [Indexed: 11/28/2022] Open
Abstract
Allosteric proteins with multiple subunits and ligand-binding sites are central in regulating biological signals. The cAMP receptor protein from Mycobacterium tuberculosis (CRPMTB) is a global regulator of transcription composed of two identical subunits, each one harboring structurally conserved cAMP- and DNA-binding sites. The mechanisms by which these four binding sites are allosterically coupled in CRPMTB remain unclear. Here, we investigate the binding mechanism between CRPMTB and cAMP, and the linkage between cAMP and DNA interactions. Using calorimetric and fluorescence-based assays, we find that cAMP binding is entropically driven and displays negative cooperativity. Fluorescence anisotropy experiments show that apo-CRPMTB forms high-order CRPMTB–DNA oligomers through interactions with nonspecific DNA sequences or preformed CRPMTB–DNA complexes. Moreover, we find that cAMP prevents and reverses the formation of CRPMTB–DNA oligomers, reduces the affinity of CRPMTB for nonspecific DNA sequences, and stabilizes a 1-to-1 CRPMTB–DNA complex, but does not increase the affinity for DNA like in the canonical CRP from Escherichia coli (CRPEcoli). DNA-binding assays as a function of cAMP concentration indicate that one cAMP molecule per homodimer dissociates high-order CRPMTB–DNA oligomers into 1-to-1 complexes. These cAMP-mediated allosteric effects are lost in the double-mutant L47P/E178K found in CRP from Mycobacterium bovis Bacille Calmette-Guérin (CRPBCG). The functional behavior, thermodynamic stability, and dimerization constant of CRPBCG are not due to additive effects of L47P and E178K, indicating long-range interactions between these two sites. Altogether, we provide a previously undescribed archetype of cAMP-mediated allosteric regulation that differs from CRPEcoli, illustrating that structural homology does not imply allosteric homology.
Collapse
Affiliation(s)
- Fernanda Gárate
- Department of Chemistry, Georgetown University, Washington, District of Columbia, USA
| | - Stephen Dokas
- Department of Chemistry, Georgetown University, Washington, District of Columbia, USA
| | - Maria Fe Lanfranco
- Department of Chemistry, Georgetown University, Washington, District of Columbia, USA
| | - Clare Canavan
- Department of Chemistry, Georgetown University, Washington, District of Columbia, USA
| | - Irina Wang
- Department of Chemistry, Georgetown University, Washington, District of Columbia, USA
| | - John J Correia
- Department of Cell and Molecular Biology, The University of Mississippi Medical Center, Jackson, Mississippi, USA
| | - Rodrigo A Maillard
- Department of Chemistry, Georgetown University, Washington, District of Columbia, USA.
| |
Collapse
|
26
|
Del Valle I, Fulk EM, Kalvapalle P, Silberg JJ, Masiello CA, Stadler LB. Translating New Synthetic Biology Advances for Biosensing Into the Earth and Environmental Sciences. Front Microbiol 2021; 11:618373. [PMID: 33633695 PMCID: PMC7901896 DOI: 10.3389/fmicb.2020.618373] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2020] [Accepted: 12/17/2020] [Indexed: 12/26/2022] Open
Abstract
The rapid diversification of synthetic biology tools holds promise in making some classically hard-to-solve environmental problems tractable. Here we review longstanding problems in the Earth and environmental sciences that could be addressed using engineered microbes as micron-scale sensors (biosensors). Biosensors can offer new perspectives on open questions, including understanding microbial behaviors in heterogeneous matrices like soils, sediments, and wastewater systems, tracking cryptic element cycling in the Earth system, and establishing the dynamics of microbe-microbe, microbe-plant, and microbe-material interactions. Before these new tools can reach their potential, however, a suite of biological parts and microbial chassis appropriate for environmental conditions must be developed by the synthetic biology community. This includes diversifying sensing modules to obtain information relevant to environmental questions, creating output signals that allow dynamic reporting from hard-to-image environmental materials, and tuning these sensors so that they reliably function long enough to be useful for environmental studies. Finally, ethical questions related to the use of synthetic biosensors in environmental applications are discussed.
Collapse
Affiliation(s)
- Ilenne Del Valle
- Systems, Synthetic, and Physical Biology Graduate Program, Rice University, Houston, TX, United States
| | - Emily M. Fulk
- Systems, Synthetic, and Physical Biology Graduate Program, Rice University, Houston, TX, United States
| | - Prashant Kalvapalle
- Systems, Synthetic, and Physical Biology Graduate Program, Rice University, Houston, TX, United States
| | - Jonathan J. Silberg
- Department of BioSciences, Rice University, Houston, TX, United States
- Department of Bioengineering, Rice University, Houston, TX, United States
- Department of Chemical and Biomolecular Engineering, Rice University, Houston, TX, United States
| | - Caroline A. Masiello
- Department of BioSciences, Rice University, Houston, TX, United States
- Department of Earth, Environmental and Planetary Sciences, Rice University, Houston, TX, United States
- Department of Chemistry, Rice University, Houston, TX, United States
| | - Lauren B. Stadler
- Department of Civil and Environmental Engineering, Rice University, Houston, TX, United States
| |
Collapse
|
27
|
Morrison M, Razo-Mejia M, Phillips R. Reconciling kinetic and thermodynamic models of bacterial transcription. PLoS Comput Biol 2021; 17:e1008572. [PMID: 33465069 PMCID: PMC7845990 DOI: 10.1371/journal.pcbi.1008572] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2020] [Revised: 01/29/2021] [Accepted: 11/28/2020] [Indexed: 11/18/2022] Open
Abstract
The study of transcription remains one of the centerpieces of modern biology with implications in settings from development to metabolism to evolution to disease. Precision measurements using a host of different techniques including fluorescence and sequencing readouts have raised the bar for what it means to quantitatively understand transcriptional regulation. In particular our understanding of the simplest genetic circuit is sufficiently refined both experimentally and theoretically that it has become possible to carefully discriminate between different conceptual pictures of how this regulatory system works. This regulatory motif, originally posited by Jacob and Monod in the 1960s, consists of a single transcriptional repressor binding to a promoter site and inhibiting transcription. In this paper, we show how seven distinct models of this so-called simple-repression motif, based both on thermodynamic and kinetic thinking, can be used to derive the predicted levels of gene expression and shed light on the often surprising past success of the thermodynamic models. These different models are then invoked to confront a variety of different data on mean, variance and full gene expression distributions, illustrating the extent to which such models can and cannot be distinguished, and suggesting a two-state model with a distribution of burst sizes as the most potent of the seven for describing the simple-repression motif.
Collapse
Affiliation(s)
- Muir Morrison
- Department of Physics, California Institute of Technology, Pasadena, California, USA
| | - Manuel Razo-Mejia
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California, USA
| | - Rob Phillips
- Department of Physics, California Institute of Technology, Pasadena, California, USA
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California, USA
- * E-mail:
| |
Collapse
|
28
|
Abstract
Simple biophysical models successfully describe bacterial regulatory code, by predicting gene expression from DNA sequences that bind specialized regulatory proteins. Analogous simple models fail in multicellular organisms, where regulatory proteins bind DNA very transiently, yet, nevertheless, effect precise control over gene expression. To date, the more general, “nonequilibrium” models have proven difficult to analyze and connect to data. Here, we reduce this complexity theoretically, by constructing simple nonequilibrium models which perform optimal gene regulation within known experimental constraints. In prokaryotes, thermodynamic models of gene regulation provide a highly quantitative mapping from promoter sequences to gene-expression levels that is compatible with in vivo and in vitro biophysical measurements. Such concordance has not been achieved for models of enhancer function in eukaryotes. In equilibrium models, it is difficult to reconcile the reported short transcription factor (TF) residence times on the DNA with the high specificity of regulation. In nonequilibrium models, progress is difficult due to an explosion in the number of parameters. Here, we navigate this complexity by looking for minimal nonequilibrium enhancer models that yield desired regulatory phenotypes: low TF residence time, high specificity, and tunable cooperativity. We find that a single extra parameter, interpretable as the “linking rate,” by which bound TFs interact with Mediator components, enables our models to escape equilibrium bounds and access optimal regulatory phenotypes, while remaining consistent with the reported phenomenology and simple enough to be inferred from upcoming experiments. We further find that high specificity in nonequilibrium models is in a trade-off with gene-expression noise, predicting bursty dynamics—an experimentally observed hallmark of eukaryotic transcription. By drastically reducing the vast parameter space of nonequilibrium enhancer models to a much smaller subspace that optimally realizes biological function, we deliver a rich class of models that could be tractably inferred from data in the near future.
Collapse
|
29
|
Lin Z, Sun Y, Liu Y, Tong S, Shang Z, Cai Y, Lin W. Structural and Functional Analyses of the Transcription Repressor DgoR From Escherichia coli Reveal a Divalent Metal-Containing D-Galactonate Binding Pocket. Front Microbiol 2020; 11:590330. [PMID: 33224125 PMCID: PMC7674646 DOI: 10.3389/fmicb.2020.590330] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2020] [Accepted: 10/20/2020] [Indexed: 11/21/2022] Open
Abstract
The transcription repressor of D-galactonate metabolism, DgoR, from Escherichia coli belongs to the FadR family of the GntR superfamily. In the presence of D-galactonate, DgoR binds to two inverted repeats overlapping the dgo cis-acting promoter repressing the expression of genes involved in D-galactonate metabolism. To further understand the structural and molecular details of ligand and effector interactions between D-galactonate and this FadR family member, herein we solved the crystal structure of C-terminal domain of DgoR (DgoR_C), which revealed a unique divalent metal-containing substrate binding pocket. The metal ion is required for D-galactonate binding, as evidenced by the dramatically decreased affinity between D-galactonate and DgoR in the presence of EDTA, which can be reverted by the addition of Zn2+, Mg2+, and Ca2+. The key amino acid residues involved in the interactions between D-galactonate and DgoR were revealed by molecular docking studies and further validated with biochemical studies by site-directed mutagenesis. It was found that changes to alanine in residues R102, W181, T191, and R224 resulted in significantly decreased binding affinities for D-galactonate, as determined by EMSA and MST assays. These results suggest that the molecular modifications induced by a D-galactonate and a metal binding in the DgoR are required for DNA binding activity and consequently, transcriptional inhibition.
Collapse
Affiliation(s)
- Zhaozhu Lin
- Department of Microbiology and Immunology, School of Medicine & Holistic Integrative Medicine, Nanjing University of Chinese Medicine, Nanjing, China
| | - Yi Sun
- Department of Microbiology and Immunology, School of Medicine & Holistic Integrative Medicine, Nanjing University of Chinese Medicine, Nanjing, China
| | - Yu Liu
- Department of Chemistry, Waksman Institute of Microbiology, Rutgers University, Piscataway, NJ, United States
| | - Shujuan Tong
- Department of Microbiology and Immunology, School of Medicine & Holistic Integrative Medicine, Nanjing University of Chinese Medicine, Nanjing, China
| | - Zhuo Shang
- Department of Microbiology and Immunology, School of Medicine & Holistic Integrative Medicine, Nanjing University of Chinese Medicine, Nanjing, China
| | - Yuanheng Cai
- Biochemistry and Cell Biology Department, Stony Brook University, Stony Brook, NY, United States
| | - Wei Lin
- Department of Microbiology and Immunology, School of Medicine & Holistic Integrative Medicine, Nanjing University of Chinese Medicine, Nanjing, China.,State Key Laboratory of Natural Medicines, China Pharmaceutical University, Nanjing, China.,Jiangsu Collaborative Innovation Center of Chinese Medicinal Resources Industrialization, Nanjing, China
| |
Collapse
|
30
|
Shi M, Tan S, Xie XP, Li A, Yang W, Zhu T, Wang HQ. Globally learning gene regulatory networks based on hidden atomic regulators from transcriptomic big data. BMC Genomics 2020; 21:711. [PMID: 33054712 PMCID: PMC7559338 DOI: 10.1186/s12864-020-07079-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2020] [Accepted: 09/18/2020] [Indexed: 12/02/2022] Open
Abstract
Background Genes are regulated by various types of regulators and most of them are still unknown or unobserved. Current gene regulatory networks (GRNs) reverse engineering methods often neglect the unknown regulators and infer regulatory relationships in a local and sub-optimal manner. Results This paper proposes a global GRNs inference framework based on dictionary learning, named dlGRN. The method intends to learn atomic regulators (ARs) from gene expression data using a modified dictionary learning (DL) algorithm, which reflects the whole gene regulatory system, and predicts the regulation between a known regulator and a target gene in a global regression way. The modified DL algorithm fits the scale-free property of biological network, rendering dlGRN intrinsically discern direct and indirect regulations. Conclusions Extensive experimental results on simulation and real-world data demonstrate the effectiveness and efficiency of dlGRN in reverse engineering GRNs. A novel predicted transcription regulation between a TF TFAP2C and an oncogene EGFR was experimentally verified in lung cancer cells. Furthermore, the real application reveals the prevalence of DNA methylation regulation in gene regulatory system. dlGRN can be a standalone tool for GRN inference for its globalization and robustness.
Collapse
Affiliation(s)
- Ming Shi
- MICB Laboratory, Institute of Intelligent Machines, Hefei Institutes of Physical Science, CAS, 350 Shushanghu Road, Hefei, Anhui, 230031, P. R. China.,Current Address: MOE Key Laboratory of Bioinformatics, Division of Bioinformatics and Center for Synthetic and Systems Biology, TNLIST, Department of Automation, Tsinghua University, Beijing, 100084, China
| | - Sheng Tan
- The CAS Key Laboratory of Innate Immunity and Chronic Disease, Division of Life Sciences and Medicine, University of Science and Technology of China, 96 Jinzhai Road, Hefei, Anhui, 230026, P. R. China
| | - Xin-Ping Xie
- School of Mathematics and Physics, Anhui Jianzhu University, 856 Jinzhai Road, Hefei, Anhui, 230022, P. R. China
| | - Ao Li
- School of Information Science and Technology, University of Science and Technology of China, 96 Jinzhai Road, Hefei, Anhui, 230026, P. R. China
| | - Wulin Yang
- Cancer hospital & Anhui Province Key Laboratory of Medical Physics and Technology, Center of Medical Physics and Technology, Hefei Institutes of Physical Science, CAS, 350 Shushanghu Road, Hefei, Anhui, 230031, P. R. China
| | - Tao Zhu
- Current Address: MOE Key Laboratory of Bioinformatics, Division of Bioinformatics and Center for Synthetic and Systems Biology, TNLIST, Department of Automation, Tsinghua University, Beijing, 100084, China.
| | - Hong-Qiang Wang
- MICB Laboratory, Institute of Intelligent Machines, Hefei Institutes of Physical Science, CAS, 350 Shushanghu Road, Hefei, Anhui, 230031, P. R. China. .,Cancer hospital & Anhui Province Key Laboratory of Medical Physics and Technology, Center of Medical Physics and Technology, Hefei Institutes of Physical Science, CAS, 350 Shushanghu Road, Hefei, Anhui, 230031, P. R. China.
| |
Collapse
|
31
|
Fuqua T, Jordan J, van Breugel ME, Halavatyi A, Tischer C, Polidoro P, Abe N, Tsai A, Mann RS, Stern DL, Crocker J. Dense and pleiotropic regulatory information in a developmental enhancer. Nature 2020; 587:235-239. [PMID: 33057197 DOI: 10.1038/s41586-020-2816-5] [Citation(s) in RCA: 43] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2020] [Accepted: 07/22/2020] [Indexed: 01/08/2023]
Abstract
Changes in gene regulation underlie much of phenotypic evolution1. However, our understanding of the potential for regulatory evolution is biased, because most evidence comes from either natural variation or limited experimental perturbations2. Using an automated robotics pipeline, we surveyed an unbiased mutation library for a developmental enhancer in Drosophila melanogaster. We found that almost all mutations altered gene expression and that parameters of gene expression-levels, location, and state-were convolved. The widespread pleiotropic effects of most mutations may constrain the evolvability of developmental enhancers. Consistent with these observations, comparisons of diverse Drosophila larvae revealed apparent biases in the phenotypes influenced by the enhancer. Developmental enhancers may encode a higher density of regulatory information than has been appreciated previously, imposing constraints on regulatory evolution.
Collapse
Affiliation(s)
- Timothy Fuqua
- European Molecular Biology Laboratory, Heidelberg, Germany.,Joint PhD Collaboration, EMBL and Faculty of Biosciences Heidelberg University, Heidelberg, Germany
| | | | | | | | | | | | - Namiko Abe
- Mortimer B. Zuckerman Mind Brain Behavior Institute, Columbia University, New York, NY, USA
| | - Albert Tsai
- European Molecular Biology Laboratory, Heidelberg, Germany
| | - Richard S Mann
- Mortimer B. Zuckerman Mind Brain Behavior Institute, Columbia University, New York, NY, USA
| | | | - Justin Crocker
- European Molecular Biology Laboratory, Heidelberg, Germany.
| |
Collapse
|
32
|
Galbusera L, Bellement-Theroue G, Urchueguia A, Julou T, van Nimwegen E. Using fluorescence flow cytometry data for single-cell gene expression analysis in bacteria. PLoS One 2020; 15:e0240233. [PMID: 33045012 PMCID: PMC7549788 DOI: 10.1371/journal.pone.0240233] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2020] [Accepted: 09/22/2020] [Indexed: 01/08/2023] Open
Abstract
Fluorescence flow cytometry is increasingly being used to quantify single-cell expression distributions in bacteria in high-throughput. However, there has been no systematic investigation into the best practices for quantitative analysis of such data, what systematic biases exist, and what accuracy and sensitivity can be obtained. We investigate these issues by measuring the same E. coli strains carrying fluorescent reporters using both flow cytometry and microscopic setups and systematically comparing the resulting single-cell expression distributions. Using these results, we develop methods for rigorous quantitative inference of single-cell expression distributions from fluorescence flow cytometry data. First, we present a Bayesian mixture model to separate debris from viable cells using all scattering signals. Second, we show that cytometry measurements of fluorescence are substantially affected by autofluorescence and shot noise, which can be mistaken for intrinsic noise in gene expression, and present methods to correct for these using calibration measurements. Finally, we show that because forward- and side-scatter signals scale non-linearly with cell size, and are also affected by a substantial shot noise component that cannot be easily calibrated unless independent measurements of cell size are available, it is not possible to accurately estimate the variability in the sizes of individual cells using flow cytometry measurements alone. To aid other researchers with quantitative analysis of flow cytometry expression data in bacteria, we distribute E-Flow, an open-source R package that implements our methods for filtering debris and for estimating true biological expression means and variances from the fluorescence signal. The package is available at https://github.com/vanNimwegenLab/E-Flow.
Collapse
Affiliation(s)
- Luca Galbusera
- Biozentrum, University of Basel, Basel, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | | | - Arantxa Urchueguia
- Biozentrum, University of Basel, Basel, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Thomas Julou
- Biozentrum, University of Basel, Basel, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Erik van Nimwegen
- Biozentrum, University of Basel, Basel, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| |
Collapse
|
33
|
Ireland WT, Beeler SM, Flores-Bautista E, McCarty NS, Röschinger T, Belliveau NM, Sweredoski MJ, Moradian A, Kinney JB, Phillips R. Deciphering the regulatory genome of Escherichia coli, one hundred promoters at a time. eLife 2020; 9:e55308. [PMID: 32955440 PMCID: PMC7567609 DOI: 10.7554/elife.55308] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2020] [Accepted: 09/18/2020] [Indexed: 01/28/2023] Open
Abstract
Advances in DNA sequencing have revolutionized our ability to read genomes. However, even in the most well-studied of organisms, the bacterium Escherichia coli, for ≈65% of promoters we remain ignorant of their regulation. Until we crack this regulatory Rosetta Stone, efforts to read and write genomes will remain haphazard. We introduce a new method, Reg-Seq, that links massively parallel reporter assays with mass spectrometry to produce a base pair resolution dissection of more than a E. coli promoters in 12 growth conditions. We demonstrate that the method recapitulates known regulatory information. Then, we examine regulatory architectures for more than 80 promoters which previously had no known regulatory information. In many cases, we also identify which transcription factors mediate their regulation. This method clears a path for highly multiplexed investigations of the regulatory genome of model organisms, with the potential of moving to an array of microbes of ecological and medical relevance.
Collapse
Affiliation(s)
- William T Ireland
- Department of Physics, California Institute of TechnologyPasadenaUnited States
| | - Suzannah M Beeler
- Division of Biology and Biological Engineering, California Institute of TechnologyPasadenaUnited States
| | - Emanuel Flores-Bautista
- Division of Biology and Biological Engineering, California Institute of TechnologyPasadenaUnited States
| | - Nicholas S McCarty
- Division of Biology and Biological Engineering, California Institute of TechnologyPasadenaUnited States
| | - Tom Röschinger
- Division of Chemistry and Chemical Engineering, California Institute of TechnologyPasadenaUnited States
| | - Nathan M Belliveau
- Division of Biology and Biological Engineering, California Institute of TechnologyPasadenaUnited States
| | - Michael J Sweredoski
- Proteome Exploration Laboratory, Division of Biology and Biological Engineering, Beckman Institute, California Institute of TechnologyPasadenaUnited States
| | - Annie Moradian
- Proteome Exploration Laboratory, Division of Biology and Biological Engineering, Beckman Institute, California Institute of TechnologyPasadenaUnited States
| | - Justin B Kinney
- Simons Center for Quantitative Biology, Cold Spring Harbor LaboratoryCold Spring HarborUnited States
| | - Rob Phillips
- Department of Physics, California Institute of TechnologyPasadenaUnited States
- Division of Biology and Biological Engineering, California Institute of TechnologyPasadenaUnited States
| |
Collapse
|
34
|
Tareen A, Kinney JB. Logomaker: beautiful sequence logos in Python. Bioinformatics 2020; 36:2272-2274. [PMID: 31821414 PMCID: PMC7141850 DOI: 10.1093/bioinformatics/btz921] [Citation(s) in RCA: 187] [Impact Index Per Article: 46.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2019] [Revised: 11/14/2019] [Accepted: 12/06/2019] [Indexed: 01/09/2023] Open
Abstract
Summary Sequence logos are visually compelling ways of illustrating the biological properties of DNA, RNA and protein sequences, yet it is currently difficult to generate and customize such logos within the Python programming environment. Here we introduce Logomaker, a Python API for creating publication-quality sequence logos. Logomaker can produce both standard and highly customized logos from either a matrix-like array of numbers or a multiple-sequence alignment. Logos are rendered as native matplotlib objects that are easy to stylize and incorporate into multi-panel figures. Availability and implementation Logomaker can be installed using the pip package manager and is compatible with both Python 2.7 and Python 3.6. Documentation is provided at http://logomaker.readthedocs.io; source code is available at http://github.com/jbkinney/logomaker.
Collapse
Affiliation(s)
- Ammar Tareen
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
| | - Justin B Kinney
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
| |
Collapse
|
35
|
Abstract
The correct mapping of promoter elements is a crucial step in microbial genomics. Also, when combining new DNA elements into synthetic sequences, predicting the potential generation of new promoter sequences is critical. Over the last years, many bioinformatics tools have been created to allow users to predict promoter elements in a sequence or genome of interest. Here, we assess the predictive power of some of the main prediction tools available using well-defined promoter data sets. Using Escherichia coli as a model organism, we demonstrated that while some tools are biased toward AT-rich sequences, others are very efficient in identifying real promoters with low false-negative rates. We hope the potentials and limitations presented here will help the microbiology community to choose promoter prediction tools among many available alternatives. The promoter region is a key element required for the production of RNA in bacteria. While new high-throughput technology allows massively parallel mapping of promoter elements, we still mainly rely on bioinformatics tools to predict such elements in bacterial genomes. Additionally, despite many different prediction tools having become popular to identify bacterial promoters, no systematic comparison of such tools has been performed. Here, we performed a systematic comparison between several widely used promoter prediction tools (BPROM, bTSSfinder, BacPP, CNNProm, IBBP, Virtual Footprint, iPro70-FMWin, 70ProPred, iPromoter-2L, and MULTiPly) using well-defined sequence data sets and standardized metrics to determine how well those tools performed related to each other. For this, we used data sets of experimentally validated promoters from Escherichia coli and a control data set composed of randomly generated sequences with similar nucleotide distributions. We compared the performance of the tools using metrics such as specificity, sensitivity, accuracy, and Matthews correlation coefficient (MCC). We show that the widely used BPROM presented the worse performance among the compared tools, while four tools (CNNProm, iPro70-FMWin, 70ProPred, and iPromoter-2L) offered high predictive power. Of these tools, iPro70-FMWin exhibited the best results for most of the metrics used. We present here some potentials and limitations of available tools, and we hope that future work can build upon our effort to systematically characterize this useful class of bioinformatics tools. IMPORTANCE The correct mapping of promoter elements is a crucial step in microbial genomics. Also, when combining new DNA elements into synthetic sequences, predicting the potential generation of new promoter sequences is critical. Over the last years, many bioinformatics tools have been created to allow users to predict promoter elements in a sequence or genome of interest. Here, we assess the predictive power of some of the main prediction tools available using well-defined promoter data sets. Using Escherichia coli as a model organism, we demonstrated that while some tools are biased toward AT-rich sequences, others are very efficient in identifying real promoters with low false-negative rates. We hope the potentials and limitations presented here will help the microbiology community to choose promoter prediction tools among many available alternatives.
Collapse
|
36
|
Daletos G, Katsimpouras C, Stephanopoulos G. Novel Strategies and Platforms for Industrial Isoprenoid Engineering. Trends Biotechnol 2020; 38:811-822. [DOI: 10.1016/j.tibtech.2020.03.009] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2019] [Revised: 03/16/2020] [Accepted: 03/17/2020] [Indexed: 12/13/2022]
|
37
|
Davis JE, Insigne KD, Jones EM, Hastings QA, Boldridge WC, Kosuri S. Dissection of c-AMP Response Element Architecture by Using Genomic and Episomal Massively Parallel Reporter Assays. Cell Syst 2020; 11:75-85.e7. [PMID: 32603702 DOI: 10.1016/j.cels.2020.05.011] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2019] [Revised: 02/16/2020] [Accepted: 05/26/2020] [Indexed: 11/15/2022]
Abstract
In eukaryotes, transcription factors (TFs) orchestrate gene expression by binding to TF-binding sites (TFBSs) and localizing transcriptional co-regulators and RNA polymerase II to cis-regulatory elements. However, we lack a basic understanding of the relationship between TFBS composition and their quantitative transcriptional responses. Here, we measured expression driven by 17,406 synthetic cis-regulatory elements with varied compositions of a model TFBS, the c-AMP response element (CRE) by using massively parallel reporter assays (MPRAs). We find CRE number, affinity, and promoter proximity largely determines expression. In addition, we observe expression modulation based on the spacing between CREs and CRE distance to the promoter, where expression follows a helical periodicity. Finally, we compare library expression between an episomal MPRA and a genomically integrated MPRA, where a single cis-regulatory element is assayed per cell at a defined locus. These assays largely recapitulate each other, although weaker, non-canonical CREs exhibit greater activity in a genomic context.
Collapse
Affiliation(s)
- Jessica E Davis
- Department of Chemistry and Biochemistry, UCLA-DOE Institute for Genomics and Proteomics, Molecular Biology Institute, Quantitative and Computational Biology Institute, Eli and Edythe Broad Center of Regenerative Medicine and Stem Cell Research, and Jonsson Comprehensive Cancer Center, University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Kimberly D Insigne
- Department of Chemistry and Biochemistry, UCLA-DOE Institute for Genomics and Proteomics, Molecular Biology Institute, Quantitative and Computational Biology Institute, Eli and Edythe Broad Center of Regenerative Medicine and Stem Cell Research, and Jonsson Comprehensive Cancer Center, University of California, Los Angeles, Los Angeles, CA 90095, USA; Bioinformatics Interdepartmental Graduate Program, University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Eric M Jones
- Department of Chemistry and Biochemistry, UCLA-DOE Institute for Genomics and Proteomics, Molecular Biology Institute, Quantitative and Computational Biology Institute, Eli and Edythe Broad Center of Regenerative Medicine and Stem Cell Research, and Jonsson Comprehensive Cancer Center, University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Quinn A Hastings
- Department of Chemistry and Biochemistry, UCLA-DOE Institute for Genomics and Proteomics, Molecular Biology Institute, Quantitative and Computational Biology Institute, Eli and Edythe Broad Center of Regenerative Medicine and Stem Cell Research, and Jonsson Comprehensive Cancer Center, University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - W Clifford Boldridge
- Department of Chemistry and Biochemistry, UCLA-DOE Institute for Genomics and Proteomics, Molecular Biology Institute, Quantitative and Computational Biology Institute, Eli and Edythe Broad Center of Regenerative Medicine and Stem Cell Research, and Jonsson Comprehensive Cancer Center, University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Sriram Kosuri
- Department of Chemistry and Biochemistry, UCLA-DOE Institute for Genomics and Proteomics, Molecular Biology Institute, Quantitative and Computational Biology Institute, Eli and Edythe Broad Center of Regenerative Medicine and Stem Cell Research, and Jonsson Comprehensive Cancer Center, University of California, Los Angeles, Los Angeles, CA 90095, USA.
| |
Collapse
|
38
|
Laxhuber KS, Morrison MJ, Chure G, Belliveau NM, Strandkvist C, Naughton KL, Phillips R. Theoretical investigation of a genetic switch for metabolic adaptation. PLoS One 2020; 15:e0226453. [PMID: 32379825 PMCID: PMC7205307 DOI: 10.1371/journal.pone.0226453] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2019] [Accepted: 04/01/2020] [Indexed: 12/02/2022] Open
Abstract
Membrane transporters carry key metabolites across the cell membrane and, from a resource standpoint, are hypothesized to be produced when necessary. The expression of membrane transporters in metabolic pathways is often upregulated by the transporter substrate. In E. coli, such systems include for example the lacY, araFGH, and xylFGH genes, which encode for lactose, arabinose, and xylose transporters, respectively. As a case study of a minimal system, we build a generalizable physical model of the xapABR genetic circuit, which features a regulatory feedback loop via membrane transport (positive feedback) and enzymatic degradation (negative feedback) of an inducer. Dynamical systems analysis and stochastic simulations show that the membrane transport makes the model system bistable in certain parameter regimes. Thus, it serves as a genetic “on-off” switch, enabling the cell to only produce a set of metabolic enzymes when the corresponding metabolite is present in large amounts. We find that the negative feedback from the degradation enzyme does not significantly disturb the positive feedback from the membrane transporter. We investigate hysteresis in the switching and discuss the role of cooperativity and multiple binding sites in the model circuit. Fundamentally, this work explores how a stable genetic switch for a set of enzymes is obtained from transcriptional auto-activation of a membrane transporter through its substrate.
Collapse
Affiliation(s)
- Kathrin S Laxhuber
- Department of Chemistry and Applied Biosciences, ETH Zurich, Zurich, Switzerland
| | - Muir J Morrison
- Department of Physics, California Institute of Technology, Pasadena, CA, United States of America
| | - Griffin Chure
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, United States of America
| | - Nathan M Belliveau
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, United States of America
| | - Charlotte Strandkvist
- Department of Systems Biology, Harvard Medical School, Boston, MA, United States of America
| | - Kyle L Naughton
- Department of Physics and Astronomy, University of Southern California, Los Angeles, CA, United States of America
| | - Rob Phillips
- Department of Physics, California Institute of Technology, Pasadena, CA, United States of America
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, United States of America
| |
Collapse
|
39
|
|
40
|
Phillips R, Belliveau NM, Chure G, Garcia HG, Razo-Mejia M, Scholes C. Figure 1 Theory Meets Figure 2 Experiments in the Study of Gene Expression. Annu Rev Biophys 2020; 48:121-163. [PMID: 31084583 DOI: 10.1146/annurev-biophys-052118-115525] [Citation(s) in RCA: 37] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
It is tempting to believe that we now own the genome. The ability to read and rewrite it at will has ushered in a stunning period in the history of science. Nonetheless, there is an Achilles' heel exposed by all of the genomic data that has accrued: We still do not know how to interpret them. Many genes are subject to sophisticated programs of transcriptional regulation, mediated by DNA sequences that harbor binding sites for transcription factors, which can up- or down-regulate gene expression depending upon environmental conditions. This gives rise to an input-output function describing how the level of expression depends upon the parameters of the regulated gene-for instance, on the number and type of binding sites in its regulatory sequence. In recent years, the ability to make precision measurements of expression, coupled with the ability to make increasingly sophisticated theoretical predictions, has enabled an explicit dialogue between theory and experiment that holds the promise of covering this genomic Achilles' heel. The goal is to reach a predictive understanding of transcriptional regulation that makes it possible to calculate gene expression levels from DNA regulatory sequence. This review focuses on the canonical simple repression motif to ask how well the models that have been used to characterize it actually work. We consider a hierarchy of increasingly sophisticated experiments in which the minimal parameter set learned at one level is applied to make quantitative predictions at the next. We show that these careful quantitative dissections provide a template for a predictive understanding of the many more complex regulatory arrangements found across all domains of life.
Collapse
Affiliation(s)
- Rob Phillips
- Department of Physics, California Institute of Technology, Pasadena, California, USA; .,Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California, USA
| | - Nathan M Belliveau
- Howard Hughes Medical Institute, Chevy Chase, Maryland 20815, USA.,Department of Biology, University of Washington, Seattle, Washington 98195, USA
| | - Griffin Chure
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California, USA
| | - Hernan G Garcia
- Department of Molecular & Cell Biology, Department of Physics, Biophysics Graduate Group, and Institute for Quantitative Biosciences-QB3, University of California, Berkeley, California, USA
| | - Manuel Razo-Mejia
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California, USA
| | - Clarissa Scholes
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, USA
| |
Collapse
|
41
|
Bajic D, Sanchez A. The ecology and evolution of microbial metabolic strategies. Curr Opin Biotechnol 2019; 62:123-128. [PMID: 31670179 DOI: 10.1016/j.copbio.2019.09.003] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2019] [Revised: 08/21/2019] [Accepted: 09/06/2019] [Indexed: 12/21/2022]
Abstract
Free-living microbes are generally capable of growing on multiple different nutrients. Some of those nutrients are used simultaneously, while others are used sequentially. The pattern of nutrient preferences and co-utilization defines the metabolic strategy of a microorganism. Metabolic strategies can substantially affect ecological interactions between species, but their evolution and distribution across the tree of life remain poorly characterized. We discuss how the confluence of better computational models of genotype-phenotype maps and high-throughput experimental tools can help us fill gaps in our knowledge and incorporate metabolic strategies into quantitative predictive models of microbial consortia.
Collapse
Affiliation(s)
- Djordje Bajic
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT 06511, United States; Microbial Sciences Institute, Yale University West Campus, West Haven, CT 06516, United States
| | - Alvaro Sanchez
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT 06511, United States; Microbial Sciences Institute, Yale University West Campus, West Haven, CT 06516, United States.
| |
Collapse
|
42
|
Empirical measures of mutational effects define neutral models of regulatory evolution in Saccharomyces cerevisiae. Proc Natl Acad Sci U S A 2019; 116:21085-21093. [PMID: 31570626 DOI: 10.1073/pnas.1902823116] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Understanding how phenotypes evolve requires disentangling the effects of mutation generating new variation from the effects of selection filtering it. Tests for selection frequently assume that mutation introduces phenotypic variation symmetrically around the population mean, yet few studies have tested this assumption by deeply sampling the distributions of mutational effects for particular traits. Here, we examine distributions of mutational effects for gene expression in the budding yeast Saccharomyces cerevisiae by measuring the effects of thousands of point mutations introduced randomly throughout the genome. We find that the distributions of mutational effects differ for the 10 genes surveyed and are inconsistent with normality. For example, all 10 distributions of mutational effects included more mutations with large effects than expected for normally distributed phenotypes. In addition, some genes also showed asymmetries in their distribution of mutational effects, with new mutations more likely to increase than decrease the gene's expression or vice versa. Neutral models of regulatory evolution that take these empirically determined distributions into account suggest that neutral processes may explain more expression variation within natural populations than currently appreciated.
Collapse
|
43
|
Yim SS, Johns NI, Park J, Gomes ALC, McBee RM, Richardson M, Ronda C, Chen SP, Garenne D, Noireaux V, Wang HH. Multiplex transcriptional characterizations across diverse bacterial species using cell-free systems. Mol Syst Biol 2019; 15:e8875. [PMID: 31464371 PMCID: PMC6692573 DOI: 10.15252/msb.20198875] [Citation(s) in RCA: 36] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2019] [Revised: 07/01/2019] [Accepted: 07/03/2019] [Indexed: 12/14/2022] Open
Abstract
Cell-free expression systems enable rapid prototyping of genetic programs in vitro. However, current throughput of cell-free measurements is limited by the use of channel-limited fluorescent readouts. Here, we describe DNA Regulatory element Analysis by cell-Free Transcription and Sequencing (DRAFTS), a rapid and robust in vitro approach for multiplexed measurement of transcriptional activities from thousands of regulatory sequences in a single reaction. We employ this method in active cell lysates developed from ten diverse bacterial species. Interspecies analysis of transcriptional profiles from > 1,000 diverse regulatory sequences reveals functional differences in promoter activity that can be quantitatively modeled, providing a rich resource for tuning gene expression in diverse bacterial species. Finally, we examine the transcriptional capacities of dual-species hybrid lysates that can simultaneously harness gene expression properties of multiple organisms. We expect that this cell-free multiplex transcriptional measurement approach will improve genetic part prototyping in new bacterial chassis for synthetic biology.
Collapse
Affiliation(s)
- Sung Sun Yim
- Department of Systems BiologyColumbia UniversityNew YorkNYUSA
| | - Nathan I Johns
- Department of Systems BiologyColumbia UniversityNew YorkNYUSA
- Integrated Program in Cellular, Molecular, and Biomedical StudiesColumbia UniversityNew YorkNYUSA
- Present address:
Department of BioengineeringStanford UniversityStanfordCAUSA
| | - Jimin Park
- Department of Systems BiologyColumbia UniversityNew YorkNYUSA
- Integrated Program in Cellular, Molecular, and Biomedical StudiesColumbia UniversityNew YorkNYUSA
| | - Antonio LC Gomes
- Department of ImmunologyMemorial Sloan Kettering Cancer CenterNew YorkNYUSA
| | - Ross M McBee
- Department of Systems BiologyColumbia UniversityNew YorkNYUSA
- Department of Biological SciencesColumbia UniversityNew YorkNYUSA
| | - Miles Richardson
- Department of Systems BiologyColumbia UniversityNew YorkNYUSA
- Integrated Program in Cellular, Molecular, and Biomedical StudiesColumbia UniversityNew YorkNYUSA
| | - Carlotta Ronda
- Department of Systems BiologyColumbia UniversityNew YorkNYUSA
| | - Sway P Chen
- Department of Systems BiologyColumbia UniversityNew YorkNYUSA
- Integrated Program in Cellular, Molecular, and Biomedical StudiesColumbia UniversityNew YorkNYUSA
| | - David Garenne
- School of Physics and AstronomyUniversity of MinnesotaMinneapolisMNUSA
| | - Vincent Noireaux
- School of Physics and AstronomyUniversity of MinnesotaMinneapolisMNUSA
| | - Harris H Wang
- Department of Systems BiologyColumbia UniversityNew YorkNYUSA
- Department of Pathology and Cell BiologyColumbia UniversityNew YorkNYUSA
| |
Collapse
|
44
|
Bervoets I, Charlier D. Diversity, versatility and complexity of bacterial gene regulation mechanisms: opportunities and drawbacks for applications in synthetic biology. FEMS Microbiol Rev 2019; 43:304-339. [PMID: 30721976 PMCID: PMC6524683 DOI: 10.1093/femsre/fuz001] [Citation(s) in RCA: 87] [Impact Index Per Article: 17.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2018] [Accepted: 01/21/2019] [Indexed: 12/15/2022] Open
Abstract
Gene expression occurs in two essential steps: transcription and translation. In bacteria, the two processes are tightly coupled in time and space, and highly regulated. Tight regulation of gene expression is crucial. It limits wasteful consumption of resources and energy, prevents accumulation of potentially growth inhibiting reaction intermediates, and sustains the fitness and potential virulence of the organism in a fluctuating, competitive and frequently stressful environment. Since the onset of studies on regulation of enzyme synthesis, numerous distinct regulatory mechanisms modulating transcription and/or translation have been discovered. Mostly, various regulatory mechanisms operating at different levels in the flow of genetic information are used in combination to control and modulate the expression of a single gene or operon. Here, we provide an extensive overview of the very diverse and versatile bacterial gene regulatory mechanisms with major emphasis on their combined occurrence, intricate intertwinement and versatility. Furthermore, we discuss the potential of well-characterized basal expression and regulatory elements in synthetic biology applications, where they may ensure orthogonal, predictable and tunable expression of (heterologous) target genes and pathways, aiming at a minimal burden for the host.
Collapse
Affiliation(s)
- Indra Bervoets
- Research Group of Microbiology, Department of Bioengineering Sciences, Vrije Universiteit Brussel, Pleinlaan 2, B-1050 Brussels, Belgium
| | - Daniel Charlier
- Research Group of Microbiology, Department of Bioengineering Sciences, Vrije Universiteit Brussel, Pleinlaan 2, B-1050 Brussels, Belgium
| |
Collapse
|
45
|
Park J, Estrada J, Johnson G, Vincent BJ, Ricci-Tam C, Bragdon MDJ, Shulgina Y, Cha A, Wunderlich Z, Gunawardena J, DePace AH. Dissecting the sharp response of a canonical developmental enhancer reveals multiple sources of cooperativity. eLife 2019; 8:e41266. [PMID: 31223115 PMCID: PMC6588347 DOI: 10.7554/elife.41266] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2018] [Accepted: 03/04/2019] [Indexed: 12/19/2022] Open
Abstract
Developmental enhancers integrate graded concentrations of transcription factors (TFs) to create sharp gene expression boundaries. Here we examine the hunchback P2 (HbP2) enhancer which drives a sharp expression pattern in the Drosophila blastoderm embryo in response to the transcriptional activator Bicoid (Bcd). We systematically interrogate cis and trans factors that influence the shape and position of expression driven by HbP2, and find that the prevailing model, based on pairwise cooperative binding of Bcd to HbP2 is not adequate. We demonstrate that other proteins, such as pioneer factors, Mediator and histone modifiers influence the shape and position of the HbP2 expression pattern. Comparing our results to theory reveals how higher-order cooperativity and energy expenditure impact boundary location and sharpness. Our results emphasize that the bacterial view of transcription regulation, where pairwise interactions between regulatory proteins dominate, must be reexamined in animals, where multiple molecular mechanisms collaborate to shape the gene regulatory function.
Collapse
Affiliation(s)
- Jeehae Park
- Department of Systems BiologyHarvard Medical SchoolBostonUnited States
| | - Javier Estrada
- Department of Systems BiologyHarvard Medical SchoolBostonUnited States
| | - Gemma Johnson
- Department of Systems BiologyHarvard Medical SchoolBostonUnited States
| | - Ben J Vincent
- Department of Systems BiologyHarvard Medical SchoolBostonUnited States
| | - Chiara Ricci-Tam
- Department of Systems BiologyHarvard Medical SchoolBostonUnited States
| | - Meghan DJ Bragdon
- Department of Systems BiologyHarvard Medical SchoolBostonUnited States
| | | | - Anna Cha
- Department of Systems BiologyHarvard Medical SchoolBostonUnited States
| | - Zeba Wunderlich
- Department of Systems BiologyHarvard Medical SchoolBostonUnited States
| | | | - Angela H DePace
- Department of Systems BiologyHarvard Medical SchoolBostonUnited States
| |
Collapse
|
46
|
How the avidity of polymerase binding to the -35/-10 promoter sites affects gene expression. Proc Natl Acad Sci U S A 2019; 116:13340-13345. [PMID: 31196959 PMCID: PMC6613100 DOI: 10.1073/pnas.1905615116] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Although the key promoter elements necessary to drive transcription in Escherichia coli have long been understood, we still cannot predict the behavior of arbitrary novel promoters, hampering our ability to characterize the myriad sequenced regulatory architectures as well as to design new synthetic circuits. This work builds upon a beautiful recent experiment by Urtecho et al. [G. Urtecho, et al, Biochemistry, 68, 1539-1551 (2019)] who measured the gene expression of over 10,000 promoters spanning all possible combinations of a small set of regulatory elements. Using these data, we demonstrate that a central claim in energy matrix models of gene expression-that each promoter element contributes independently and additively to gene expression-contradicts experimental measurements. We propose that a key missing ingredient from such models is the avidity between the -35 and -10 RNA polymerase binding sites and develop what we call a multivalent model that incorporates this effect and can successfully characterize the full suite of gene expression data. We explore several applications of this framework, namely, how multivalent binding at the -35 and -10 sites can buffer RNA polymerase (RNAP) kinetics against mutations and how promoters that bind overly tightly to RNA polymerase can inhibit gene expression. The success of our approach suggests that avidity represents a key physical principle governing the interaction of RNA polymerase to its promoter.
Collapse
|
47
|
Kinney JB, McCandlish DM. Massively Parallel Assays and Quantitative Sequence-Function Relationships. Annu Rev Genomics Hum Genet 2019; 20:99-127. [PMID: 31091417 DOI: 10.1146/annurev-genom-083118-014845] [Citation(s) in RCA: 71] [Impact Index Per Article: 14.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Over the last decade, a rich variety of massively parallel assays have revolutionized our understanding of how biological sequences encode quantitative molecular phenotypes. These assays include deep mutational scanning, high-throughput SELEX, and massively parallel reporter assays. Here, we review these experimental methods and how the data they produce can be used to quantitatively model sequence-function relationships. In doing so, we touch on a diverse range of topics, including the identification of clinically relevant genomic variants, the modeling of transcription factor binding to DNA, the functional and evolutionary landscapes of proteins, and cis-regulatory mechanisms in both transcription and mRNA splicing. We further describe a unified conceptual framework and a core set of mathematical modeling strategies that studies in these diverse areas can make use of. Finally, we highlight key aspects of experimental design and mathematical modeling that are important for the results of such studies to be interpretable and reproducible.
Collapse
Affiliation(s)
- Justin B Kinney
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA; ,
| | - David M McCandlish
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA; ,
| |
Collapse
|
48
|
Gorochowski TE, Chelysheva I, Eriksen M, Nair P, Pedersen S, Ignatova Z. Absolute quantification of translational regulation and burden using combined sequencing approaches. Mol Syst Biol 2019; 15:e8719. [PMID: 31053575 PMCID: PMC6498945 DOI: 10.15252/msb.20188719] [Citation(s) in RCA: 43] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2018] [Revised: 04/09/2019] [Accepted: 04/15/2019] [Indexed: 12/26/2022] Open
Abstract
Translation of mRNAs into proteins is a key cellular process. Ribosome binding sites and stop codons provide signals to initiate and terminate translation, while stable secondary mRNA structures can induce translational recoding events. Fluorescent proteins are commonly used to characterize such elements but require the modification of a part's natural context and allow only a few parameters to be monitored concurrently. Here, we combine Ribo-seq with quantitative RNA-seq to measure at nucleotide resolution and in absolute units the performance of elements controlling transcriptional and translational processes during protein synthesis. We simultaneously measure 779 translation initiation rates and 750 translation termination efficiencies across the Escherichia coli transcriptome, in addition to translational frameshifting induced at a stable RNA pseudoknot structure. By analyzing the transcriptional and translational response, we discover that sequestered ribosomes at the pseudoknot contribute to a σ32-mediated stress response, codon-specific pausing, and a drop in translation initiation rates across the cell. Our work demonstrates the power of integrating global approaches toward a comprehensive and quantitative understanding of gene regulation and burden in living cells.
Collapse
Affiliation(s)
- Thomas E Gorochowski
- BrisSynBio, University of Bristol, Bristol, UK
- School of Biological Sciences, University of Bristol, Bristol, UK
| | - Irina Chelysheva
- Biochemistry and Molecular Biology, Department of Chemistry, University of Hamburg, Hamburg, Germany
| | - Mette Eriksen
- Biomolecular Sciences, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Priyanka Nair
- Biochemistry and Molecular Biology, Department of Chemistry, University of Hamburg, Hamburg, Germany
| | - Steen Pedersen
- Biomolecular Sciences, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Zoya Ignatova
- Biochemistry and Molecular Biology, Department of Chemistry, University of Hamburg, Hamburg, Germany
| |
Collapse
|
49
|
Campos AI, Freyre-González JA. Evolutionary constraints on the complexity of genetic regulatory networks allow predictions of the total number of genetic interactions. Sci Rep 2019; 9:3618. [PMID: 30842463 PMCID: PMC6403251 DOI: 10.1038/s41598-019-39866-z] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2018] [Accepted: 02/04/2019] [Indexed: 11/26/2022] Open
Abstract
Genetic regulatory networks (GRNs) have been widely studied, yet there is a lack of understanding with regards to the final size and properties of these networks, mainly due to no network currently being complete. In this study, we analyzed the distribution of GRN structural properties across a large set of distinct prokaryotic organisms and found a set of constrained characteristics such as network density and number of regulators. Our results allowed us to estimate the number of interactions that complete networks would have, a valuable insight that could aid in the daunting task of network curation, prediction, and validation. Using state-of-the-art statistical approaches, we also provided new evidence to settle a previously stated controversy that raised the possibility of complete biological networks being random and therefore attributing the observed scale-free properties to an artifact emerging from the sampling process during network discovery. Furthermore, we identified a set of properties that enabled us to assess the consistency of the connectivity distribution for various GRNs against different alternative statistical distributions. Our results favor the hypothesis that highly connected nodes (hubs) are not a consequence of network incompleteness. Finally, an interaction coverage computed for the GRNs as a proxy for completeness revealed that high-throughput based reconstructions of GRNs could yield biased networks with a low average clustering coefficient, showing that classical targeted discovery of interactions is still needed.
Collapse
Affiliation(s)
- Adrian I Campos
- Regulatory Systems Biology Research Group, Laboratory of Systems and Synthetic Biology, Center for Genomic Sciences, Universidad Nacional Autónoma de México, Av. Universidad s/n, Col. Chamilpa, 62210, Cuernavaca, Morelos, Mexico.,Undergraduate Program in Genomic Sciences, Center for Genomics Sciences, Universidad Nacional Autónoma de México, Av. Universidad s/n, Col. Chamilpa, 62210, Cuernavaca, Morelos, Mexico
| | - Julio A Freyre-González
- Regulatory Systems Biology Research Group, Laboratory of Systems and Synthetic Biology, Center for Genomic Sciences, Universidad Nacional Autónoma de México, Av. Universidad s/n, Col. Chamilpa, 62210, Cuernavaca, Morelos, Mexico.
| |
Collapse
|
50
|
Barnes SL, Belliveau NM, Ireland WT, Kinney JB, Phillips R. Mapping DNA sequence to transcription factor binding energy in vivo. PLoS Comput Biol 2019; 15:e1006226. [PMID: 30716072 PMCID: PMC6375646 DOI: 10.1371/journal.pcbi.1006226] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2018] [Revised: 02/14/2019] [Accepted: 11/06/2018] [Indexed: 11/18/2022] Open
Abstract
Despite the central importance of transcriptional regulation in biology, it has proven difficult to determine the regulatory mechanisms of individual genes, let alone entire gene networks. It is particularly difficult to decipher the biophysical mechanisms of transcriptional regulation in living cells and determine the energetic properties of binding sites for transcription factors and RNA polymerase. In this work, we present a strategy for dissecting transcriptional regulatory sequences using in vivo methods (massively parallel reporter assays) to formulate quantitative models that map a transcription factor binding site’s DNA sequence to transcription factor-DNA binding energy. We use these models to predict the binding energies of transcription factor binding sites to within 1 kBT of their measured values. We further explore how such a sequence-energy mapping relates to the mechanisms of trancriptional regulation in various promoter contexts. Specifically, we show that our models can be used to design specific induction responses, analyze the effects of amino acid mutations on DNA sequence preference, and determine how regulatory context affects a transcription factor’s sequence specificity. It has been said that we live in the “genomic era,” a time where we can readily sequence full genomes at will. However, it remains difficult to interpret much of the information within a genome. This is especially true of non-coding sequences such as promoters, which contain a number of features such as transcription factor binding sites that determine how genes are regulated. There is no straightforward regulatory “code” that tells us how transcription factor binding sites are organized within a promoter. In this work we examine how DNA sequence determines one of the most important features of a promoter, the strength with which a transcription factor binds to its DNA binding site. We discuss an approach to modeling DNA sequence-specific transcription factor binding energies in vivo using a massively parellel reporter assay. We develop models that allow us to predict the binding energy between a transcription factor and a mutated version of its binding site. We then show that this modeling technique can be used to address a number of scientific and design questions, such as engineering the behavior of genetic circuit elements or examining how transcription factors and their binding sites co-evolve.
Collapse
Affiliation(s)
- Stephanie L. Barnes
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California, United States of America
| | - Nathan M. Belliveau
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California, United States of America
| | - William T. Ireland
- Department of Physics, California Institute of Technology, Pasadena, California, United States of America
| | - Justin B. Kinney
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, United States of America
| | - Rob Phillips
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California, United States of America
- Department of Physics, California Institute of Technology, Pasadena, California, United States of America
- * E-mail:
| |
Collapse
|