1
|
Myers KS, Noguera DR, Donohue TJ. Promoter Architecture Differences among Alphaproteobacteria and Other Bacterial Taxa. mSystems 2021; 6:e0052621. [PMID: 34254822 PMCID: PMC8407463 DOI: 10.1128/msystems.00526-21] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2021] [Accepted: 06/17/2021] [Indexed: 11/20/2022] Open
Abstract
Much of our knowledge of bacterial transcription initiation has been derived from studying the promoters of Escherichia coli and Bacillus subtilis. Given the expansive diversity across the bacterial phylogeny, it is unclear how much of this knowledge can be applied to other organisms. Here, we report on bioinformatic analyses of promoter sequences of the primary σ factor (σ70) by leveraging publicly available transcription start site (TSS) sequencing data sets for nine bacterial species spanning five phyla. This analysis identifies previously unreported differences in the -35 and -10 elements of σ70-dependent promoters in several groups of bacteria. We found that Actinobacteria and Betaproteobacteria σ70-dependent promoters lack the TTG triad in their -35 element, which is predicted to be conserved across the bacterial phyla. In addition, the majority of the Alphaproteobacteria σ70-dependent promoters analyzed lacked the thymine at position -7 that is highly conserved in other phyla. Bioinformatic examination of the Alphaproteobacteria σ70-dependent promoters identifies a significant overrepresentation of essential genes and ones encoding proteins with common cellular functions downstream of promoters containing an A, C, or G at position -7. We propose that transcription of many σ70-dependent promoters in Alphaproteobacteria depends on the transcription factor CarD, which is an essential protein in several members of this phylum. Our analysis expands the knowledge of promoter architecture across the bacterial phylogeny and provides new information that can be used to engineer bacteria for use in medical, environmental, agricultural, and biotechnological processes. IMPORTANCE Transcription of DNA to RNA by RNA polymerase is essential for cells to grow, develop, and respond to stress. Understanding the process and control of transcription is important for health, disease, the environment, and biotechnology. Decades of research on a few bacteria have identified promoter DNA sequences that are recognized by the σ subunit of RNA polymerase. We used bioinformatic analyses to reveal previously unreported differences in promoter DNA sequences across the bacterial phylogeny. We found that many Actinobacteria and Betaproteobacteria promoters lack a sequence in their -35 DNA recognition element that was previously assumed to be conserved and that Alphaproteobacteria lack a thymine residue at position -7, also previously assumed to be conserved. Our work reports important new information about bacterial transcription, illustrates the benefits of studying bacteria across the phylogenetic tree, and proposes new lines of future investigation.
Collapse
Affiliation(s)
- Kevin S. Myers
- Wisconsin Energy Institute and Great Lakes Bioenergy Research Center, University of Wisconsin—Madison, Madison, Wisconsin, USA
| | - Daniel R. Noguera
- Wisconsin Energy Institute and Great Lakes Bioenergy Research Center, University of Wisconsin—Madison, Madison, Wisconsin, USA
- Department of Civil & Environmental Engineering, University of Wisconsin—Madison, Madison, Wisconsin, USA
| | - Timothy J. Donohue
- Wisconsin Energy Institute and Great Lakes Bioenergy Research Center, University of Wisconsin—Madison, Madison, Wisconsin, USA
- Department of Bacteriology, University of Wisconsin—Madison, Madison, Wisconsin, USA
| |
Collapse
|
2
|
Barnes SL, Belliveau NM, Ireland WT, Kinney JB, Phillips R. Mapping DNA sequence to transcription factor binding energy in vivo. PLoS Comput Biol 2019; 15:e1006226. [PMID: 30716072 PMCID: PMC6375646 DOI: 10.1371/journal.pcbi.1006226] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2018] [Revised: 02/14/2019] [Accepted: 11/06/2018] [Indexed: 11/18/2022] Open
Abstract
Despite the central importance of transcriptional regulation in biology, it has proven difficult to determine the regulatory mechanisms of individual genes, let alone entire gene networks. It is particularly difficult to decipher the biophysical mechanisms of transcriptional regulation in living cells and determine the energetic properties of binding sites for transcription factors and RNA polymerase. In this work, we present a strategy for dissecting transcriptional regulatory sequences using in vivo methods (massively parallel reporter assays) to formulate quantitative models that map a transcription factor binding site’s DNA sequence to transcription factor-DNA binding energy. We use these models to predict the binding energies of transcription factor binding sites to within 1 kBT of their measured values. We further explore how such a sequence-energy mapping relates to the mechanisms of trancriptional regulation in various promoter contexts. Specifically, we show that our models can be used to design specific induction responses, analyze the effects of amino acid mutations on DNA sequence preference, and determine how regulatory context affects a transcription factor’s sequence specificity. It has been said that we live in the “genomic era,” a time where we can readily sequence full genomes at will. However, it remains difficult to interpret much of the information within a genome. This is especially true of non-coding sequences such as promoters, which contain a number of features such as transcription factor binding sites that determine how genes are regulated. There is no straightforward regulatory “code” that tells us how transcription factor binding sites are organized within a promoter. In this work we examine how DNA sequence determines one of the most important features of a promoter, the strength with which a transcription factor binds to its DNA binding site. We discuss an approach to modeling DNA sequence-specific transcription factor binding energies in vivo using a massively parellel reporter assay. We develop models that allow us to predict the binding energy between a transcription factor and a mutated version of its binding site. We then show that this modeling technique can be used to address a number of scientific and design questions, such as engineering the behavior of genetic circuit elements or examining how transcription factors and their binding sites co-evolve.
Collapse
Affiliation(s)
- Stephanie L. Barnes
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California, United States of America
| | - Nathan M. Belliveau
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California, United States of America
| | - William T. Ireland
- Department of Physics, California Institute of Technology, Pasadena, California, United States of America
| | - Justin B. Kinney
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, United States of America
| | - Rob Phillips
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California, United States of America
- Department of Physics, California Institute of Technology, Pasadena, California, United States of America
- * E-mail:
| |
Collapse
|
3
|
Gao Y, Yurkovich JT, Seo SW, Kabimoldayev I, Dräger A, Chen K, Sastry AV, Fang X, Mih N, Yang L, Eichner J, Cho BK, Kim D, Palsson BO. Systematic discovery of uncharacterized transcription factors in Escherichia coli K-12 MG1655. Nucleic Acids Res 2018; 46:10682-10696. [PMID: 30137486 PMCID: PMC6237786 DOI: 10.1093/nar/gky752] [Citation(s) in RCA: 43] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2018] [Revised: 07/11/2018] [Accepted: 08/08/2018] [Indexed: 02/03/2023] Open
Abstract
Transcriptional regulation enables cells to respond to environmental changes. Of the estimated 304 candidate transcription factors (TFs) in Escherichia coli K-12 MG1655, 185 have been experimentally identified, but ChIP methods have been used to fully characterize only a few dozen. Identifying these remaining TFs is key to improving our knowledge of the E. coli transcriptional regulatory network (TRN). Here, we developed an integrated workflow for the computational prediction and comprehensive experimental validation of TFs using a suite of genome-wide experiments. We applied this workflow to (i) identify 16 candidate TFs from over a hundred uncharacterized genes; (ii) capture a total of 255 DNA binding peaks for ten candidate TFs resulting in six high-confidence binding motifs; (iii) reconstruct the regulons of these ten TFs by determining gene expression changes upon deletion of each TF and (iv) identify the regulatory roles of three TFs (YiaJ, YdcI, and YeiE) as regulators of l-ascorbate utilization, proton transfer and acetate metabolism, and iron homeostasis under iron-limited conditions, respectively. Together, these results demonstrate how this workflow can be used to discover, characterize, and elucidate regulatory functions of uncharacterized TFs in parallel.
Collapse
Affiliation(s)
- Ye Gao
- Division of Biological Sciences, University of California, San Diego, La Jolla, CA 92093, USA
- Department of Bioengineering, University of California San Diego, La Jolla, CA 92093, USA
| | - James T Yurkovich
- Department of Bioengineering, University of California San Diego, La Jolla, CA 92093, USA
- Bioinformatics and Systems Biology Program, University of California San Diego, La Jolla, CA 92093, USA
| | - Sang Woo Seo
- School of Chemical and Biological Engineering, Seoul National University, Seoul, Republic of Korea
| | - Ilyas Kabimoldayev
- Department of Genetic Engineering and Graduate School of Biotechnology, College of Life Sciences, Kyung Hee University, Yongin, Republic of Korea
| | - Andreas Dräger
- Computational Systems Biology of Infection and Antimicrobial-Resistant Pathogens, Center for Bioinformatics Tübingen (ZBIT), 72076 Tübingen, Germany
- Department of Computer Science, University of Tübingen, 72076 Tübingen, Germany
| | - Ke Chen
- Department of Bioengineering, University of California San Diego, La Jolla, CA 92093, USA
| | - Anand V Sastry
- Department of Bioengineering, University of California San Diego, La Jolla, CA 92093, USA
| | - Xin Fang
- Department of Bioengineering, University of California San Diego, La Jolla, CA 92093, USA
| | - Nathan Mih
- Department of Bioengineering, University of California San Diego, La Jolla, CA 92093, USA
- Bioinformatics and Systems Biology Program, University of California San Diego, La Jolla, CA 92093, USA
| | - Laurence Yang
- Department of Bioengineering, University of California San Diego, La Jolla, CA 92093, USA
| | - Johannes Eichner
- Computational Systems Biology of Infection and Antimicrobial-Resistant Pathogens, Center for Bioinformatics Tübingen (ZBIT), 72076 Tübingen, Germany
| | - Byung-Kwan Cho
- Novo Nordisk Foundation Center for Biosustainability, 2800 Kongens Lyngby, Denmark
- Department of Biological Sciences, Korea Advanced Institute of Science and Technology, Daejeon 34141, Republic of Korea
| | - Donghyuk Kim
- Department of Genetic Engineering and Graduate School of Biotechnology, College of Life Sciences, Kyung Hee University, Yongin, Republic of Korea
- School of Energy and Chemical Engineering, Ulsan National Institute of Science and Technology (UNIST), Ulsan, Republic of Korea
- School of Biological Sciences, Ulsan National Institute of Science and Technology (UNIST), Ulsan, Republic of Korea
| | - Bernhard O Palsson
- Department of Bioengineering, University of California San Diego, La Jolla, CA 92093, USA
- Bioinformatics and Systems Biology Program, University of California San Diego, La Jolla, CA 92093, USA
- Novo Nordisk Foundation Center for Biosustainability, 2800 Kongens Lyngby, Denmark
- Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA
| |
Collapse
|
4
|
Systematic approach for dissecting the molecular mechanisms of transcriptional regulation in bacteria. Proc Natl Acad Sci U S A 2018; 115:E4796-E4805. [PMID: 29728462 PMCID: PMC6003448 DOI: 10.1073/pnas.1722055115] [Citation(s) in RCA: 52] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Organisms must constantly make regulatory decisions in response to a change in cellular state or environment. However, while the catalog of genomes expands rapidly, we remain ignorant about how the genes in these genomes are regulated. Here, we show how a massively parallel reporter assay, Sort-Seq, and information-theoretic modeling can be used to identify regulatory sequences. We then use chromatography and mass spectrometry to identify the regulatory proteins that bind these sequences. The approach results in quantitative base pair-resolution models of promoter mechanism and was shown in both well-characterized and unannotated promoters in Escherichia coli. Given the generality of the approach, it opens up the possibility of quantitatively dissecting the mechanisms of promoter function in a wide range of bacteria. Gene regulation is one of the most ubiquitous processes in biology. However, while the catalog of bacterial genomes continues to expand rapidly, we remain ignorant about how almost all of the genes in these genomes are regulated. At present, characterizing the molecular mechanisms by which individual regulatory sequences operate requires focused efforts using low-throughput methods. Here, we take a first step toward multipromoter dissection and show how a combination of massively parallel reporter assays, mass spectrometry, and information-theoretic modeling can be used to dissect multiple bacterial promoters in a systematic way. We show this approach on both well-studied and previously uncharacterized promoters in the enteric bacterium Escherichia coli. In all cases, we recover nucleotide-resolution models of promoter mechanism. For some promoters, including previously unannotated ones, the approach allowed us to further extract quantitative biophysical models describing input–output relationships. Given the generality of the approach presented here, it opens up the possibility of quantitatively dissecting the mechanisms of promoter function in E. coli and a wide range of other bacteria.
Collapse
|
5
|
Huang Z, Zhou Q, Sun P, Yang J, Guo M. Two Agrobacterium tumefaciens CheW Proteins Are Incorporated into One Chemosensory Pathway with Different Efficiencies. MOLECULAR PLANT-MICROBE INTERACTIONS : MPMI 2018; 31:460-470. [PMID: 29182466 DOI: 10.1094/mpmi-10-17-0255-r] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/14/2023]
Abstract
Agrobacterium tumefaciens is the agent that causes crown gall tumor disease on more than 140 species of dicotyledonous plants. Chemotaxis of A. tumefaciens toward the wound sites of the host plant is the first step to recognize the host. CheW is a coupling protein that bridges the histidine kinase CheA and the chemoreceptors to form the chemotaxis core signaling complex and plays a crucial role in the assembly and function of the large chemosensory array. Unlike all previously reported chemotaxis systems, A. tumefaciens has only one major che operon but two cheW homologs (atu2075 as cheW1 and atu2617 as cheW2) on unlinked loci. The in-frame deletion of either cheW gene significantly affects A. tumefaciens chemotaxis but does not abolish the chemotaxis, unless both cheW genes were deleted. The effect of cheW2 deletion on the chemotaxis is more severe than that of cheW1 deletion. Either CheW can interact with CheA and couple it to the cell poles. The promoter activity of cheW2 is always higher than that of cheW1 under all of the tested conditions. When two cheW genes were adjusted to the same expression level by using the identical promoter, the difference between the effects of two CheW proteins on the chemotaxis still existed. Therefore, we envision that both the different molecular ratio of two CheW proteins in cell and the different affinities of two CheW proteins with CheA and chemoreceptors result in the efficiency difference of two CheW proteins in functioning in the large chemosensory array.
Collapse
Affiliation(s)
- Zhiwei Huang
- College of Bioscience and Biotechnology, Yangzhou University, Yangzhou City, Jiangsu 225009, P R China
| | - Qingxuan Zhou
- College of Bioscience and Biotechnology, Yangzhou University, Yangzhou City, Jiangsu 225009, P R China
| | - Pan Sun
- College of Bioscience and Biotechnology, Yangzhou University, Yangzhou City, Jiangsu 225009, P R China
| | - Jing Yang
- College of Bioscience and Biotechnology, Yangzhou University, Yangzhou City, Jiangsu 225009, P R China
| | - Minliang Guo
- College of Bioscience and Biotechnology, Yangzhou University, Yangzhou City, Jiangsu 225009, P R China
| |
Collapse
|
6
|
Abstract
The H-NS family of DNA-binding proteins is the subject of intense study due to its important roles in the regulation of horizontally acquired genes critical for virulence, antibiotic resistance, and metabolism. Xenogeneic silencing proteins, typified by the H-NS protein of Escherichia coli, specifically target and downregulate expression from AT-rich genes by selectively recognizing specific structural features unique to the AT-rich minor groove. In doing so, these proteins facilitate bacterial evolution; enabling these cells to engage in horizontal gene transfer while buffering potential any detrimental fitness consequences that may result from it. Xenogeneic silencing and counter-silencing explain how bacterial cells can evolve effective gene regulatory strategies in the face of rampant gene gain and loss and it has extended our understanding of bacterial gene regulation beyond the classic operon model. Here we review the structures and mechanisms of xenogeneic silencers as well as their impact on bacterial evolution. Several H-NS-like proteins appear to play a role in facilitating gene transfer by other mechanisms including by regulating transposition, conjugation, and participating in the activation of virulence loci like the locus of enterocyte effacement pathogenicity island of pathogenic strains of E. coli. Evidence suggests that the critical determinants that dictate whether an H-NS-like protein will be a silencer or will perform a different function do not lie in the DNA-binding domain but, rather, in the domains that control oligomerization. This suggests that H-NS-like proteins are transcription factors that both recognize and alter the shape of DNA to exert specific effects that include but are not limited to gene silencing.
Collapse
|
7
|
Leyn SA, Suvorova IA, Kazakov AE, Ravcheev DA, Stepanova VV, Novichkov PS, Rodionov DA. Comparative genomics and evolution of transcriptional regulons in Proteobacteria. Microb Genom 2016; 2:e000061. [PMID: 28348857 PMCID: PMC5343134 DOI: 10.1099/mgen.0.000061] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2016] [Accepted: 04/14/2016] [Indexed: 12/16/2022] Open
Abstract
Comparative genomics approaches are broadly used for analysis of transcriptional regulation in bacterial genomes. In this work, we identified binding sites and reconstructed regulons for 33 orthologous groups of transcription factors (TFs) in 196 reference genomes from 21 taxonomic groups of Proteobacteria. Overall, we predict over 10 600 TF binding sites and identified more than 15 600 target genes for 1896 TFs constituting the studied orthologous groups of regulators. These include a set of orthologues for 21 metabolism-associated TFs from Escherichia coli and/or Shewanella that are conserved in five or more taxonomic groups and several additional TFs that represent non-orthologous substitutions of the metabolic regulators in some lineages of Proteobacteria. By comparing gene contents of the reconstructed regulons, we identified the core, taxonomy-specific and genome-specific TF regulon members and classified them by their metabolic functions. Detailed analysis of ArgR, TyrR, TrpR, HutC, HypR and other amino-acid-specific regulons demonstrated remarkable differences in regulatory strategies used by various lineages of Proteobacteria. The obtained genomic collection of in silico reconstructed TF regulons contains a large number of new regulatory interactions that await future experimental validation. The collection provides a framework for future evolutionary studies of transcriptional regulatory networks in Bacteria. It can be also used for functional annotation of putative metabolic transporters and enzymes that are abundant in the reconstructed regulons.
Collapse
Affiliation(s)
- Semen A Leyn
- 1A. A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, Russia
| | - Inna A Suvorova
- 1A. A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, Russia
| | - Alexey E Kazakov
- 2Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | | | - Vita V Stepanova
- 1A. A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, Russia
| | | | - Dmitry A Rodionov
- 4Sanford-Burnham-Prebys Medical Discovery Institute, La Jolla, CA 92037, USA.,1A. A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, Russia
| |
Collapse
|
8
|
Hobbs ET, Pereira T, O’Neill PK, Erill I. A Bayesian inference method for the analysis of transcriptional regulatory networks in metagenomic data. Algorithms Mol Biol 2016; 11:19. [PMID: 27398089 PMCID: PMC4938975 DOI: 10.1186/s13015-016-0082-8] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2016] [Accepted: 06/30/2016] [Indexed: 11/13/2022] Open
Abstract
Background Metagenomics enables the analysis of bacterial population composition and the study of emergent population features, such as shared metabolic pathways. Recently, we have shown that metagenomics datasets can be leveraged to characterize population-wide transcriptional regulatory networks, or meta-regulons, providing insights into how bacterial populations respond collectively to specific triggers. Here we formalize a Bayesian inference framework to analyze the composition of transcriptional regulatory networks in metagenomes by determining the probability of regulation of orthologous gene sequences. We assess the performance of this approach on synthetic datasets and we validate it by analyzing the copper-homeostasis network of Firmicutes species in the human gut microbiome. Results Assessment on synthetic datasets shows that our method provides a robust and interpretable metric for assessing putative regulation by a transcription factor on sets of promoter sequences mapping to an orthologous gene cluster. The inference framework integrates the regulatory contribution of secondary sites and can discern false positives arising from multiple instances of a clonal sequence. Posterior probabilities for orthologous gene clusters decline sharply when less than 20 % of mapped promoters have binding sites, but we introduce a sensitivity adjustment procedure to speed up computation that enhances regulation assessment in heterogeneous ortholog clusters. Analysis of the copper-homeostasis regulon governed by CsoR in the human gut microbiome Firmicutes reveals that CsoR controls itself and copper-translocating P-type ATPases, but not CopZ-type copper chaperones. Our analysis also indicates that CsoR frequently targets promoters with dual CsoR-binding sites, suggesting that it exploits higher-order binding conformations to fine-tune its activity. Conclusions We introduce and validate a method for the analysis of transcriptional regulatory networks from metagenomic data that enables inference of meta-regulons in a systematic and interpretable way. Validation of this method on the CsoR meta-regulon of gut microbiome Firmicutes illustrates the usefulness of the approach, revealing novel properties of the copper-homeostasis network in poorly characterized bacterial species and putting forward evidence of new mechanisms of DNA binding for this transcriptional regulator. Our approach will enable the comparative analysis of regulatory networks across metagenomes, yielding novel insights into the evolution of transcriptional regulatory networks. Electronic supplementary material The online version of this article (doi:10.1186/s13015-016-0082-8) contains supplementary material, which is available to authorized users.
Collapse
|
9
|
Transcriptional analysis of the MrpJ network: modulation of diverse virulence-associated genes and direct regulation of mrp fimbrial and flhDC flagellar operons in Proteus mirabilis. Infect Immun 2015; 83:2542-56. [PMID: 25847961 DOI: 10.1128/iai.02978-14] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2014] [Accepted: 03/29/2015] [Indexed: 01/14/2023] Open
Abstract
The enteric bacterium Proteus mirabilis is associated with a significant number of catheter-associated urinary tract infections (UTIs). Strict regulation of the antagonistic processes of adhesion and motility, mediated by fimbriae and flagella, respectively, is essential for disease progression. Previously, the transcriptional regulator MrpJ, which is encoded by the mrp fimbrial operon, has been shown to repress both swimming and swarming motility. Here we show that MrpJ affects an array of cellular processes beyond adherence and motility. Microarray analysis found that expression of mrpJ mimicking levels observed during UTIs leads to differential expression of 217 genes related to, among other functions, bacterial virulence, type VI secretion, and metabolism. We probed the molecular mechanism of transcriptional regulation by MrpJ using transcriptional reporters and chromatin immunoprecipitation (ChIP). Binding of MrpJ to two virulence-associated target gene promoters, the promoters of the flagellar master regulator flhDC and mrp itself, appears to be affected by the condensation state of the native chromosome, although both targets share a direct MrpJ binding site proximal to the transcriptional start. Furthermore, an mrpJ deletion mutant colonized the bladders of mice at significantly lower levels in a transurethral model of infection. Additionally, we observed that mrpJ is widely conserved in a collection of recent clinical isolates. Altogether, these findings support a role of MrpJ as a global regulator of P. mirabilis virulence.
Collapse
|
10
|
O'Neill PK, Forder R, Erill I. Informational requirements for transcriptional regulation. J Comput Biol 2014; 21:373-84. [PMID: 24689750 DOI: 10.1089/cmb.2014.0032] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Transcription factors (TFs) regulate transcription by binding to specific sites in promoter regions. Information theory provides a useful mathematical framework to analyze the binding motifs associated with TFs but imposes several assumptions that limit their applicability to specific regulatory scenarios. Explicit simulations of the co-evolution of TFs and their binding motifs allow the study of the evolution of regulatory networks with a high degree of realism. In this work we analyze the impact of differential regulatory demands on the information content of TF-binding motifs by means of evolutionary simulations. We generalize a predictive index based on information theory, and we validate its applicability to regulatory scenarios in which the TF binds significantly to the genomic background. Our results show a logarithmic dependence of the evolved information content on the occupancy of target sites and indicate that TFs may actively exploit pseudo-sites to modulate their occupancy of target sites. In regulatory networks with differentially regulated targets, we observe that information content in TF-binding motifs is dictated primarily by the fraction of total probability mass that the TF assigns to its target sites, and we provide a predictive index to estimate the amount of information associated with arbitrarily complex regulatory systems. We observe that complex regulatory patterns can exert additional demands on evolved information content, but, given a total occupancy for target sites, we do not find conclusive evidence that this effect is because of the range of required binding affinities.
Collapse
Affiliation(s)
- Patrick K O'Neill
- 1 Department of Biological Sciences, University of Maryland Baltimore County , Baltimore, Maryland
| | | | | |
Collapse
|
11
|
Leyn SA, Kazanov MD, Sernova NV, Ermakova EO, Novichkov PS, Rodionov DA. Genomic reconstruction of the transcriptional regulatory network in Bacillus subtilis. J Bacteriol 2013; 195:2463-73. [PMID: 23504016 PMCID: PMC3676070 DOI: 10.1128/jb.00140-13] [Citation(s) in RCA: 45] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2013] [Accepted: 03/11/2013] [Indexed: 12/26/2022] Open
Abstract
The adaptation of microorganisms to their environment is controlled by complex transcriptional regulatory networks (TRNs), which are still only partially understood even for model species. Genome scale annotation of regulatory features of genes and TRN reconstruction are challenging tasks of microbial genomics. We used the knowledge-driven comparative-genomics approach implemented in the RegPredict Web server to infer TRN in the model Gram-positive bacterium Bacillus subtilis and 10 related Bacillales species. For transcription factor (TF) regulons, we combined the available information from the DBTBS database and the literature with bioinformatics tools, allowing inference of TF binding sites (TFBSs), comparative analysis of the genomic context of predicted TFBSs, functional assignment of target genes, and effector prediction. For RNA regulons, we used known RNA regulatory motifs collected in the Rfam database to scan genomes and analyze the genomic context of new RNA sites. The inferred TRN in B. subtilis comprises regulons for 129 TFs and 24 regulatory RNA families. First, we analyzed 66 TF regulons with previously known TFBSs in B. subtilis and projected them to other Bacillales genomes, resulting in refinement of TFBS motifs and identification of novel regulon members. Second, we inferred motifs and described regulons for 28 experimentally studied TFs with previously unknown TFBSs. Third, we discovered novel motifs and reconstructed regulons for 36 previously uncharacterized TFs. The inferred collection of regulons is available in the RegPrecise database (http://regprecise.lbl.gov/) and can be used in genetic experiments, metabolic modeling, and evolutionary analysis.
Collapse
Affiliation(s)
- Semen A. Leyn
- Sanford-Burnham Medical Research Institute, La Jolla, California, USA
- A. A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, Russia
| | - Marat D. Kazanov
- A. A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, Russia
| | - Natalia V. Sernova
- A. A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, Russia
| | - Ekaterina O. Ermakova
- A. A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, Russia
| | | | - Dmitry A. Rodionov
- Sanford-Burnham Medical Research Institute, La Jolla, California, USA
- A. A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, Russia
| |
Collapse
|
12
|
Cornish JP, Matthews F, Thomas JR, Erill I. Inference of self-regulated transcriptional networks by comparative genomics. Evol Bioinform Online 2012; 8:449-61. [PMID: 23032607 PMCID: PMC3422134 DOI: 10.4137/ebo.s9205] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022] Open
Abstract
The assumption of basic properties, like self-regulation, in simple transcriptional regulatory networks can be exploited to infer regulatory motifs from the growing amounts of genomic and meta-genomic data. These motifs can in principle be used to elucidate the nature and scope of transcriptional networks through comparative genomics. Here we assess the feasibility of this approach using the SOS regulatory network of Gram-positive bacteria as a test case. Using experimentally validated data, we show that the known regulatory motif can be inferred through the assumption of self-regulation. Furthermore, the inferred motif provides a more robust search pattern for comparative genomics than the experimental motifs defined in reference organisms. We take advantage of this robustness to generate a functional map of the SOS response in Gram-positive bacteria. Our results reveal definite differences in the composition of the LexA regulon between Firmicutes and Actinobacteria, and confirm that regulation of cell-division inhibition is a widespread characteristic of this network among Gram-positive bacteria.
Collapse
Affiliation(s)
- Joseph P Cornish
- Department of Biological Sciences, University of Maryland Baltimore County
| | | | | | | |
Collapse
|
13
|
Gaballa A, MacLellan S, Helmann JD. Transcription activation by the siderophore sensor Btr is mediated by ligand-dependent stimulation of promoter clearance. Nucleic Acids Res 2011; 40:3585-95. [PMID: 22210890 PMCID: PMC3333878 DOI: 10.1093/nar/gkr1280] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Bacterial transcription factors often function as DNA-binding proteins that selectively activate or repress promoters, although the biochemical mechanisms vary. In most well-understood examples, activators function by either increasing the affinity of RNA polymerase (RNAP) for the target promoter, or by increasing the isomerization of the initial closed complex to the open complex. We report that Bacillus subtilis Btr, a member of the AraC family of activators, functions principally as a ligand-dependent activator of promoter clearance. In the presence of its co-activator, the siderophore bacillibactin (BB), the Btr:BB complex enhances productive transcription, while having only modest effects on either RNAP promoter association or the production of abortive transcripts. Btr binds to two direct repeat sequences adjacent to the −35 region; recognition of the downstream motif is most important for establishing a productive interaction between the Btr:BB complex and RNAP. The resulting Btr:BB dependent increase in transcription enables the production of the ferric-BB importer to be activated by the presence of its cognate substrate.
Collapse
Affiliation(s)
- Ahmed Gaballa
- Department of Microbiology, Cornell University, Ithaca, NY 14853-8101, USA
| | | | | |
Collapse
|
14
|
Rodionov DA, Novichkov PS, Stavrovskaya ED, Rodionova IA, Li X, Kazanov MD, Ravcheev DA, Gerasimova AV, Kazakov AE, Kovaleva GY, Permina EA, Laikova ON, Overbeek R, Romine MF, Fredrickson JK, Arkin AP, Dubchak I, Osterman AL, Gelfand MS. Comparative genomic reconstruction of transcriptional networks controlling central metabolism in the Shewanella genus. BMC Genomics 2011; 12 Suppl 1:S3. [PMID: 21810205 PMCID: PMC3223726 DOI: 10.1186/1471-2164-12-s1-s3] [Citation(s) in RCA: 52] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
Background Genome-scale prediction of gene regulation and reconstruction of transcriptional regulatory networks in bacteria is one of the critical tasks of modern genomics. The Shewanella genus is comprised of metabolically versatile gamma-proteobacteria, whose lifestyles and natural environments are substantially different from Escherichia coli and other model bacterial species. The comparative genomics approaches and computational identification of regulatory sites are useful for the in silico reconstruction of transcriptional regulatory networks in bacteria. Results To explore conservation and variations in the Shewanella transcriptional networks we analyzed the repertoire of transcription factors and performed genomics-based reconstruction and comparative analysis of regulons in 16 Shewanella genomes. The inferred regulatory network includes 82 transcription factors and their DNA binding sites, 8 riboswitches and 6 translational attenuators. Forty five regulons were newly inferred from the genome context analysis, whereas others were propagated from previously characterized regulons in the Enterobacteria and Pseudomonas spp.. Multiple variations in regulatory strategies between the Shewanella spp. and E. coli include regulon contraction and expansion (as in the case of PdhR, HexR, FadR), numerous cases of recruiting non-orthologous regulators to control equivalent pathways (e.g. PsrA for fatty acid degradation) and, conversely, orthologous regulators to control distinct pathways (e.g. TyrR, ArgR, Crp). Conclusions We tentatively defined the first reference collection of ~100 transcriptional regulons in 16 Shewanella genomes. The resulting regulatory network contains ~600 regulated genes per genome that are mostly involved in metabolism of carbohydrates, amino acids, fatty acids, vitamins, metals, and stress responses. Several reconstructed regulons including NagR for N-acetylglucosamine catabolism were experimentally validated in S. oneidensis MR-1. Analysis of correlations in gene expression patterns helps to interpret the reconstructed regulatory network. The inferred regulatory interactions will provide an additional regulatory constrains for an integrated model of metabolism and regulation in S. oneidensis MR-1.
Collapse
Affiliation(s)
- Dmitry A Rodionov
- Sanford-Burnham Medical Research Institute, La Jolla, California, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
15
|
Islam MS, Bingle LEH, Pallen MJ, Busby SJW. Organization of the LEE1 operon regulatory region of enterohaemorrhagic Escherichia coli O157:H7 and activation by GrlA. Mol Microbiol 2010; 79:468-83. [DOI: 10.1111/j.1365-2958.2010.07460.x] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
16
|
Novel plasmid-based genetic tools for the study of promoters and terminators in Streptococcus pneumoniae and Enterococcus faecalis. J Microbiol Methods 2010; 83:156-63. [PMID: 20801171 DOI: 10.1016/j.mimet.2010.08.004] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2010] [Revised: 08/05/2010] [Accepted: 08/12/2010] [Indexed: 11/21/2022]
Abstract
Promoter-probe and terminator-probe plasmid vectors make possible to rapidly examine whether particular sequences function as promoter or terminator signals in various genetic backgrounds and under diverse environmental stimuli. At present, such plasmid-based genetic tools are very scarce in the Gram-positive pathogenic bacteria Streptococcus pneumoniae and Enterococcus faecalis. Hence, we developed novel promoter-probe and terminator-probe vectors based on the Streptococcus agalactiae pMV158 plasmid, which replicates autonomously in numerous Gram-positive bacteria. As reporter gene, a gfp allele encoding a variant of the green fluorescent protein was used. These genetic tools were shown to be suitable to assess the activity of promoters and terminators (both homologous and heterologous) in S. pneumoniae and E. faecalis. In addition, the promoter-probe vector was shown to be a valuable tool for the analysis of regulated promoters in vivo, such as the promoter of the pneumococcal fuculose kinase gene. These new plasmid vectors will be very useful for the experimental verification of predicted promoter and terminator sequences, as well as for the construction of new inducible-expression vectors. Given the promiscuity exhibited by the pMV158 replicon, these vectors could be used in a variety of Gram-positive bacteria.
Collapse
|
17
|
Zafar MA, Shah IM, Wolf RE. Protein-protein interactions between sigma(70) region 4 of RNA polymerase and Escherichia coli SoxS, a transcription activator that functions by the prerecruitment mechanism: evidence for "off-DNA" and "on-DNA" interactions. J Mol Biol 2010; 401:13-32. [PMID: 20595001 DOI: 10.1016/j.jmb.2010.05.052] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2009] [Revised: 05/12/2010] [Accepted: 05/21/2010] [Indexed: 10/19/2022]
Abstract
According to the prerecruitment hypothesis, Escherichia coli SoxS activates the transcription of the genes of the SoxRS regulon by forming binary complexes with RNA polymerase (RNAP) that scan the chromosome for class I and class II SoxS-dependent promoters. We showed previously that the alpha subunit's C-terminal domain plays a role in activating both classes of promoter by making protein-protein contacts with SoxS; some of these contacts are made in solution in the absence of promoter DNA, a critical prediction of the prerecruitment hypothesis. Here, we identified seven single-alanine substitutions of the region 4 of sigma(70) (sigma(70) R4) of RNAP that reduce SoxS activation of class II promoters. With genetic epistasis tests between these sigma(70) R4 mutants and positive control mutants of SoxS, we identified 10 pairs of amino acids that interact with each other in E. coli. Using the yeast two-hybrid system and affinity immobilization assays, we showed that SoxS and sigma(70) R4 can interact in solution (i.e., "off-DNA"). The interaction requires amino acids of the class I/II (but not the class II) positive control surface of SoxS, and five amino acids of sigma(70) R4 that reduce activation in E. coli also reduce the SoxS-sigma(70) R4 interaction in yeast. One of the epistatic interactions that occur in E. coli also occurs in the yeast two-hybrid system (i.e., off-DNA). Importantly, we infer that the five epistatic interactions occurring in E. coli that require an amino acid of the class II surface occur "on-DNA" at class II promoters. Finding that SoxS contacts sigma(70) R4 both off-DNA and on-DNA is consistent with the prerecruitment hypothesis. Moreover, SoxS is now the first example of an E. coli transcriptional activator that uses a single positive control surface to make specific protein-protein contacts with two different subunits of RNAP.
Collapse
Affiliation(s)
- M Ammar Zafar
- Department of Biological Sciences, University of Maryland Baltimore County, 1000 Hilltop Circle, Baltimore, MD 21250, USA
| | | | | |
Collapse
|
18
|
How transcription factors can adjust the gene expression floodgates. PROGRESS IN BIOPHYSICS AND MOLECULAR BIOLOGY 2009; 102:16-37. [PMID: 20025898 DOI: 10.1016/j.pbiomolbio.2009.12.007] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/30/2009] [Revised: 11/17/2009] [Accepted: 12/07/2009] [Indexed: 12/18/2022]
Abstract
The rate of transcription initiation is the main level of quantitative control of gene expression, primarily responsible for the accumulation of mRNAs in the cell. Many, if not all, molecular actors involved in transcription initiation are known but the mechanisms underlying the frequency of initiations, remain elusive. To make the connection between transcription factors and the frequency of transcription initiation, intricated aspects of this complex activity are classified i) depending on whether or not the DNA-bound transcription factors directly activate the commitment to transcription and ii) on the destructive or non-destructive effect of transcription initiation on the stability of promoter complexes. Two possible sources of synergy allowing the combinatorial specificity of transcription factors action are compared, for binding to DNA and for recruiting transcription machineries. Tentative formulations are proposed to discriminate the different micro-reversible modes of DNA binding cooperativity modulating the specificity and dosage of transcription initiation.
Collapse
|
19
|
Butala M, Busby SJW, Lee DJ. DNA sampling: a method for probing protein binding at specific loci on bacterial chromosomes. Nucleic Acids Res 2009; 37:e37. [PMID: 19181705 PMCID: PMC2655658 DOI: 10.1093/nar/gkp043] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
We describe a protocol, DNA sampling, for the rapid isolation of specific segments of DNA, together with bound proteins, from Escherichia coli K-12. The DNA to be sampled is generated as a discrete fragment within cells by the yeast I-SceI meganuclease, and is purified using FLAG-tagged LacI repressor and beads carrying anti-FLAG antibody. We illustrate the method by investigating the proteins bound to the colicin K gene regulatory region, either before or after induction of the colicin K gene promoter.
Collapse
Affiliation(s)
- Matej Butala
- School of Biosciences, University of Birmingham, Birmingham B15 2TT, UK
| | | | | |
Collapse
|