1
|
Kornberg RD, Lorch Y. Primary Role of the Nucleosome. Mol Cell 2020; 79:371-375. [PMID: 32763226 DOI: 10.1016/j.molcel.2020.07.020] [Citation(s) in RCA: 81] [Impact Index Per Article: 20.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2019] [Revised: 12/16/2019] [Accepted: 03/10/2020] [Indexed: 11/18/2022]
Abstract
Whereas the core nucleosome is thought to serve as a packaging device for the coiling and contraction in length of genomic DNA, we suggest that it serves primarily in the regulation of transcription. A nucleosome on a promoter prevents the initiation of transcription. The association of nucleosomes with most genomic DNA prevents initiation from cryptic promoters. The nucleosome thus serves not only as a general gene repressor, but also as a repressor of all transcription (genic, intragenic, and intergenic). The core nucleosome performs a fundamental regulatory role, apart from the histone "tails," which modulate gene activity.
Collapse
Affiliation(s)
- Roger D Kornberg
- Department of Structural Biology, Stanford School of Medicine, Stanford, CA 94305, USA
| | - Yahli Lorch
- Department of Structural Biology, Stanford School of Medicine, Stanford, CA 94305, USA.
| |
Collapse
|
2
|
Flickinger R. Polymorphism of simple sequence repeats may quantitatively regulate gene transcription. Exp Cell Res 2020; 390:111969. [PMID: 32199920 DOI: 10.1016/j.yexcr.2020.111969] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2019] [Revised: 02/15/2020] [Accepted: 03/17/2020] [Indexed: 02/07/2023]
Abstract
The degree of polymorphism, i.e., DNA sequence divergence, of short AT-rich tandemly arranged simple sequence repeats at or near promoters and 5'- untranslated regions of mRNA may quantitatively regulate transcription of tissue-specific genes. Less polymorphic repeats allow greater gene expression. Preferential binding of hypophosphorylated H1 histone to these repeats may diminish binding of transcription factors. Preferential binding of hypophosphorylated high mobility group chromatin proteins would increase this binding. Shorter simple sequence repeats have undergone fewer point mutations than longer repeats, hence they are less polymorphic and more conserved. The role of transcribed simple sequence repeats in frog embryo germ layer determination is considered.
Collapse
Affiliation(s)
- Reed Flickinger
- Department of Biological Sciences, State University of New York, Buffalo, N.Y. 14260, Mailing Address:P.O. Box 741 Captain Cook, HI, 96704, USA.
| |
Collapse
|
3
|
Mechanistic Differences in Transcription Initiation at TATA-Less and TATA-Containing Promoters. Mol Cell Biol 2017; 38:MCB.00448-17. [PMID: 29038161 DOI: 10.1128/mcb.00448-17] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2017] [Accepted: 10/06/2017] [Indexed: 12/14/2022] Open
Abstract
A yeast in vitro system was developed that is active for transcription at both TATA-containing and TATA-less promoters. Transcription with extracts made from cells depleted of TFIID subunit Taf1 demonstrated that promoters of both classes are TFIID dependent, in agreement with recent in vivo findings. TFIID depletion can be complemented in vitro by additional recombinant TATA binding protein (TBP) at only the TATA-containing promoters. In contrast, high levels of TBP did not complement Taf1 depletion in vivo and instead repressed transcription from both promoter types. We also demonstrate the importance of the TATA-like sequence found at many TATA-less promoters and describe how the presence or absence of the TATA element is likely not the only feature that distinguishes these two types of promoters.
Collapse
|
4
|
Watanabe K, Kokubo T. SAGA mediates transcription from the TATA-like element independently of Taf1p/TFIID but dependent on core promoter structures in Saccharomyces cerevisiae. PLoS One 2017; 12:e0188435. [PMID: 29176831 PMCID: PMC5703507 DOI: 10.1371/journal.pone.0188435] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2017] [Accepted: 11/07/2017] [Indexed: 11/21/2022] Open
Abstract
In Saccharomyces cerevisiae, core promoters of class II genes contain a TATA element, either a TATA box (TATA[A/T]A[A/T][A/G]) or TATA-like element (1 or 2 bp mismatched version of the TATA box). The TATA element directs the assembly of the preinitiation complex (PIC) to ensure accurate transcriptional initiation. It has been proposed the PIC is assembled by two distinct pathways in which TBP is delivered by TFIID or SAGA, leading to the widely accepted model that these complexes mediate transcription mainly from TATA-like element- or TATA box-containing promoters, respectively. Although both complexes are involved in transcription of nearly all class II genes, it remains unclear how efficiently SAGA mediates transcription from TATA-like element-containing promoters independently of TFIID. We found that transcription from the TATA box-containing AGP1 promoter was greatly stimulated in a Spt3p-dependent manner after inactivation of Taf1p/TFIID. Thus, this promoter provides a novel experimental system in which to evaluate SAGA-mediated transcription from TATA-like element(s). We quantitatively measured transcription from various TATA-like elements in the Taf1p-dependent CYC1 promoter and Taf1p-independent AGP1 promoter. The results revealed that SAGA could mediate transcription from at least some TATA-like elements independently of Taf1p/TFIID, and that Taf1p-dependence or -independence is highly robust with respect to variation of the TATA sequence. Furthermore, chimeric promoter mapping revealed that Taf1p-dependence or independence was conferred by the upstream activating sequence (UAS), whereas Spt3p-dependent transcriptional stimulation after inactivation of Taf1p/TFIID was specific to the AGP1 promoter and dependent on core promoter regions other than the TATA box. These results suggest that TFIID and/or SAGA are regulated in two steps: the UAS first specifies TFIID or SAGA as the predominant factor on a given promoter, and then the core promoter structure guides the pertinent factor to conduct transcription in an appropriate manner.
Collapse
Affiliation(s)
- Kiyoshi Watanabe
- Molecular and Cellular Biology Laboratory, Graduate School of Medical Life Science, Yokohama City University, Yokohama, Kanagawa, Japan
| | - Tetsuro Kokubo
- Molecular and Cellular Biology Laboratory, Graduate School of Medical Life Science, Yokohama City University, Yokohama, Kanagawa, Japan
| |
Collapse
|
5
|
Dai Z, Xiong Y, Dai X. DNA signals at isoform promoters. Sci Rep 2016; 6:28977. [PMID: 27353836 PMCID: PMC4926256 DOI: 10.1038/srep28977] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2015] [Accepted: 06/08/2016] [Indexed: 11/14/2022] Open
Abstract
Transcriptional heterogeneity is extensive in the genome, and most genes express variable transcript isoforms. However, whether variable transcript isoforms of one gene are regulated by common promoter elements remain to be elucidated. Here, we investigated whether isoform promoters of one gene have separated DNA signals for transcription and translation initiation. We found that TATA box and nucleosome-disfavored DNA sequences are prevalent in distinct transcript isoform promoters of one gene. These DNA signals are conserved among species. Transcript isoform has a RNA-determined unstructured region around its start site. We found that these DNA/RNA features facilitate isoform transcription and translation. These results suggest a DNA-encoded mechanism by which transcript isoform is generated.
Collapse
Affiliation(s)
- Zhiming Dai
- School of Data and Computer Science, Sun Yat-Sen University, Guangzhou 510006, China.,Guangdong Province Key Laboratory of Big Data Analysis and Processing, Sun Yat-Sen University, Guangzhou 510006, China
| | - Yuanyan Xiong
- Key Laboratory of Gene Engineering of the Ministry of Education and State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, Guangzhou 510006, China.,SYSU-CMU Shunde International Joint Research Institute, Shunde, China
| | - Xianhua Dai
- School of Electronics and Information Technology, Sun Yat-Sen University, Guangzhou 510006, China
| |
Collapse
|
6
|
Liu C, Liu W, Fan L, Liu J, Li P, Zhang W, Gao J, Li Z, Zhang Q, Wang X. Sequences analyses and expression profiles in tissues and embryos of Japanese flounder (Paralichthys olivaceus) PRDM1. FISH PHYSIOLOGY AND BIOCHEMISTRY 2016; 42:467-482. [PMID: 26508172 DOI: 10.1007/s10695-015-0152-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/27/2015] [Accepted: 10/20/2015] [Indexed: 06/05/2023]
Abstract
PRDM1 (PRDI-BF1-RIZ1 homologous domain containing 1) appears to be a pleiotropic regulatory factor in various processes. It contains a PR (PRDI-BF1-RIZ1 homologous) domain protein and five zinc fingers. In the present study, a gene coding the homolog of prdm1 and the 5' regulatory region of prdm1 was identified from the Paralichthys olivaceus (denoted Po-prdm1). Results of real-time quantitative polymerase chain reaction amplification (RT-qPCR) and in situ hybridization (ISH) in embryos revealed that Po-prdm1 was highly expressed between the early gastrula and tail bud stages, with its expression peaking in the mid-gastrula stage, whereas the results of RT-qPCR and ISH in tissues demonstrated that Po-prdm1 transcripts were ubiquitously detected in all tissues, which indicates its pleiotropic function in multiple processes. ISH of gonadal tissues revealed that the transcripts were located in the nucleus and cytoplasm of the oocytes in the ovaries but only in the spermatogonia and not in the spermatocytes in the testes. The Po-prdm1 transcription factor binding sites and their conserved binding region among vertebrates were analyzed in this study. The combined results suggest that Po-PRDM1 has a conserved function in teleosts and mammals.
Collapse
Affiliation(s)
- Conghui Liu
- Key Laboratory of Marine Genetics and Breeding (Ocean University of China), Ministry of Education, Qingdao, 266003, China
| | - Wei Liu
- Key Laboratory of Marine Genetics and Breeding (Ocean University of China), Ministry of Education, Qingdao, 266003, China
| | - Lin Fan
- Key Laboratory of Marine Genetics and Breeding (Ocean University of China), Ministry of Education, Qingdao, 266003, China
| | - Jinxiang Liu
- Key Laboratory of Marine Genetics and Breeding (Ocean University of China), Ministry of Education, Qingdao, 266003, China
| | - Peizhen Li
- Key Laboratory of Marine Genetics and Breeding (Ocean University of China), Ministry of Education, Qingdao, 266003, China
| | - Wei Zhang
- Key Laboratory of Marine Genetics and Breeding (Ocean University of China), Ministry of Education, Qingdao, 266003, China
| | - Jinning Gao
- Key Laboratory of Marine Genetics and Breeding (Ocean University of China), Ministry of Education, Qingdao, 266003, China
| | - Zan Li
- Key Laboratory of Marine Genetics and Breeding (Ocean University of China), Ministry of Education, Qingdao, 266003, China
| | - Quanqi Zhang
- Key Laboratory of Marine Genetics and Breeding (Ocean University of China), Ministry of Education, Qingdao, 266003, China
| | - Xubo Wang
- Key Laboratory of Marine Genetics and Breeding (Ocean University of China), Ministry of Education, Qingdao, 266003, China.
- College of Marine Life Science, Ocean University of China, No. 5 Yushan Road, Qingdao, 266003, China.
| |
Collapse
|
7
|
Shabbir Hussain M, Gambill L, Smith S, Blenner MA. Engineering Promoter Architecture in Oleaginous Yeast Yarrowia lipolytica. ACS Synth Biol 2016; 5:213-23. [PMID: 26635071 DOI: 10.1021/acssynbio.5b00100] [Citation(s) in RCA: 67] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
Eukaryotic promoters have a complex architecture to control both the strength and timing of gene transcription spanning up to thousands of bases from the initiation site. This complexity makes rational fine-tuning of promoters in fungi difficult to predict; however, this very same complexity enables multiple possible strategies for engineering promoter strength. Here, we studied promoter architecture in the oleaginous yeast, Yarrowia lipolytica. While recent studies have focused on upstream activating sequences, we systematically examined various components common in fungal promoters. Here, we examine several promoter components including upstream activating sequences, proximal promoter sequences, core promoters, and the TATA box in autonomously replicating expression plasmids and integrated into the genome. Our findings show that promoter strength can be fine-tuned through the engineering of the TATA box sequence, core promoter, and upstream activating sequences. Additionally, we identified a previously unreported oleic acid responsive transcription enhancement in the XPR2 upstream activating sequences, which illustrates the complexity of fungal promoters. The promoters engineered here provide new genetic tools for metabolic engineering in Y. lipolytica and provide promoter engineering strategies that may be useful in engineering other non-model fungal systems.
Collapse
Affiliation(s)
- Murtaza Shabbir Hussain
- Department of Chemical and Biomolecular Engineering and ‡Department of Genetics and Biochemistry, Clemson University, Clemson, South Carolina 29634, United States
| | - Lauren Gambill
- Department of Chemical and Biomolecular Engineering and ‡Department of Genetics and Biochemistry, Clemson University, Clemson, South Carolina 29634, United States
| | - Spencer Smith
- Department of Chemical and Biomolecular Engineering and ‡Department of Genetics and Biochemistry, Clemson University, Clemson, South Carolina 29634, United States
| | - Mark A. Blenner
- Department of Chemical and Biomolecular Engineering and ‡Department of Genetics and Biochemistry, Clemson University, Clemson, South Carolina 29634, United States
| |
Collapse
|
8
|
Itokawa K, Komagata O, Kasai S, Tomita T. A single nucleotide change in a core promoter is involved in the progressive overexpression of the duplicated CYP9M10 haplotype lineage in Culex quinquefasciatus. INSECT BIOCHEMISTRY AND MOLECULAR BIOLOGY 2015; 66:96-102. [PMID: 26494013 DOI: 10.1016/j.ibmb.2015.10.006] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/13/2015] [Revised: 10/08/2015] [Accepted: 10/13/2015] [Indexed: 06/05/2023]
Abstract
Although the importance of cis-acting mutations on detoxification enzyme genes for insecticide resistance is widely accepted, only a few of them have been determined as concrete mutations present in genomic DNA till date. The overexpression of a cytochrome P450 gene, CYP9M10, is associated with pyrethroid resistance in the southern house mosquito Culex quinquefasciatus. The haplotypes of CYP9M10 exhibiting overexpression (resistant haplotypes) belong to one specific phylogenetic lineage that shares high nucleotide sequence homology and the same insertion of a transposable element. Among the resistant haplotypes, allelic progression involving an additional cis-acting mutation and gene duplication evolved a CYP9M10 haplotype associated with extremely high transcription and strong pyrethroid resistance. Here we show that a single nucleotide substitution G-27A, which is located near the transcription start site of CYP9M10, is involved in the progression of the duplicated haplotype lineage. The deletion of a 7-bp AT-rich sequence that includes nucleotide -27 inhibited the initiation of transcription from the original transcriptional initiation site. The mutation was suspected to reside within a core promoter, TATA-box, of CYP9M10.
Collapse
Affiliation(s)
- Kentaro Itokawa
- Department of Medical Entomology, National Institute of Infectious Diseases, 1-23-1 Toyama, Shinjuku-ku, Tokyo, Japan; Japan Agency for Medical Research and Development, 20F Yomiuri Shimbun Bldg., 1-7-1 Otemachi, Chiyoda-ku, Tokyo, Japan
| | - Osamu Komagata
- Department of Medical Entomology, National Institute of Infectious Diseases, 1-23-1 Toyama, Shinjuku-ku, Tokyo, Japan
| | - Shinji Kasai
- Department of Medical Entomology, National Institute of Infectious Diseases, 1-23-1 Toyama, Shinjuku-ku, Tokyo, Japan
| | - Takashi Tomita
- Department of Medical Entomology, National Institute of Infectious Diseases, 1-23-1 Toyama, Shinjuku-ku, Tokyo, Japan.
| |
Collapse
|
9
|
Watanabe K, Yabe M, Kasahara K, Kokubo T. A Random Screen Using a Novel Reporter Assay System Reveals a Set of Sequences That Are Preferred as the TATA or TATA-Like Elements in the CYC1 Promoter of Saccharomyces cerevisiae. PLoS One 2015; 10:e0129357. [PMID: 26046838 PMCID: PMC4457894 DOI: 10.1371/journal.pone.0129357] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2015] [Accepted: 05/08/2015] [Indexed: 12/11/2022] Open
Abstract
In Saccharomyces cerevisiae, the core promoters of class II genes contain either TATA or TATA-like elements to direct accurate transcriptional initiation. Genome-wide analyses show that the consensus sequence of the TATA element is TATAWAWR (8 bp), whereas TATA-like elements carry one or two mismatches to this consensus. The fact that several functionally distinct TATA sequences have been identified indicates that these elements may function, at least to some extent, in a gene-specific manner. The purpose of the present study was to identify functional TATA sequences enriched in one particular core promoter and compare them with the TATA or TATA-like elements that serve as the pre-initiation complex (PIC) assembly sites on the yeast genome. For this purpose, we conducted a randomized screen of the TATA element in the CYC1 promoter by using a novel reporter assay system and identified several hundreds of unique sequences that were tentatively classified into nine groups. The results indicated that the 7 bp TATA element (i.e., TATAWAD) and several sets of TATA-like sequences are preferred specifically by this promoter. Furthermore, we find that the most frequently isolated TATA-like sequence, i.e., TATTTAAA, is actually utilized as a functional core promoter element for the endogenous genes, e.g., ADE5,7 and ADE6. Collectively, these results indicate that the sequence requirements for the functional TATA or TATA-like elements in one particular core promoter are not as stringent. However, the variation of these sequences differs significantly from that of the PIC assembly sites on the genome, presumably depending on promoter structures and reflecting the gene-specific function of these sequences.
Collapse
Affiliation(s)
- Kiyoshi Watanabe
- Molecular and Cellular Biology Laboratory, Graduate School of Medical Life Science, Yokohama City University, Yokohama, Kanagawa, Japan
| | - Makoto Yabe
- Molecular and Cellular Biology Laboratory, Graduate School of Medical Life Science, Yokohama City University, Yokohama, Kanagawa, Japan
| | - Koji Kasahara
- Molecular and Cellular Biology Laboratory, Graduate School of Medical Life Science, Yokohama City University, Yokohama, Kanagawa, Japan
| | - Tetsuro Kokubo
- Molecular and Cellular Biology Laboratory, Graduate School of Medical Life Science, Yokohama City University, Yokohama, Kanagawa, Japan
- * E-mail:
| |
Collapse
|
10
|
Lubliner S, Regev I, Lotan-Pompan M, Edelheit S, Weinberger A, Segal E. Core promoter sequence in yeast is a major determinant of expression level. Genome Res 2015; 25:1008-17. [PMID: 25969468 PMCID: PMC4484384 DOI: 10.1101/gr.188193.114] [Citation(s) in RCA: 72] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2014] [Accepted: 05/11/2015] [Indexed: 11/24/2022]
Abstract
The core promoter is the regulatory sequence to which RNA polymerase is recruited and where it acts to initiate transcription. Here, we present the first comprehensive study of yeast core promoters, providing massively parallel measurements of core promoter activity and of TSS locations and relative usage for thousands of native and designed sequences. We found core promoter activity to be highly correlated to the activity of the entire promoter and that sequence variation in different core promoter regions substantially tunes its activity in a predictable way. We also show that location, orientation, and flanking bases critically affect TATA element function, that transcription initiation in highly active core promoters is focused within a narrow region, that poly(dA:dT) orientation has a functional consequence at the 3' end of promoters, and that orthologous core promoters across yeast species have conserved activities. Our results demonstrate the importance of core promoters in the quantitative study of gene regulation.
Collapse
Affiliation(s)
- Shai Lubliner
- Department of Computer Science and Applied Mathematics, Weizmann Institute of Science, 7610001 Rehovot, Israel
| | - Ifat Regev
- Department of Computer Science and Applied Mathematics, Weizmann Institute of Science, 7610001 Rehovot, Israel; Department of Molecular Cell Biology, Weizmann Institute of Science, 7610001 Rehovot, Israel
| | - Maya Lotan-Pompan
- Department of Computer Science and Applied Mathematics, Weizmann Institute of Science, 7610001 Rehovot, Israel; Department of Molecular Cell Biology, Weizmann Institute of Science, 7610001 Rehovot, Israel
| | - Sarit Edelheit
- Department of Molecular Genetics, Weizmann Institute of Science, 7610001 Rehovot, Israel
| | - Adina Weinberger
- Department of Computer Science and Applied Mathematics, Weizmann Institute of Science, 7610001 Rehovot, Israel; Department of Molecular Cell Biology, Weizmann Institute of Science, 7610001 Rehovot, Israel
| | - Eran Segal
- Department of Computer Science and Applied Mathematics, Weizmann Institute of Science, 7610001 Rehovot, Israel; Department of Molecular Cell Biology, Weizmann Institute of Science, 7610001 Rehovot, Israel
| |
Collapse
|
11
|
Danino YM, Even D, Ideses D, Juven-Gershon T. The core promoter: At the heart of gene expression. BIOCHIMICA ET BIOPHYSICA ACTA-GENE REGULATORY MECHANISMS 2015; 1849:1116-31. [PMID: 25934543 DOI: 10.1016/j.bbagrm.2015.04.003] [Citation(s) in RCA: 102] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/17/2015] [Revised: 04/19/2015] [Accepted: 04/23/2015] [Indexed: 12/17/2022]
Abstract
The identities of different cells and tissues in multicellular organisms are determined by tightly controlled transcriptional programs that enable accurate gene expression. The mechanisms that regulate gene expression comprise diverse multiplayer molecular circuits of multiple dedicated components. The RNA polymerase II (Pol II) core promoter establishes the center of this spatiotemporally orchestrated molecular machine. Here, we discuss transcription initiation, diversity in core promoter composition, interactions of the basal transcription machinery with the core promoter, enhancer-promoter specificity, core promoter-preferential activation, enhancer RNAs, Pol II pausing, transcription termination, Pol II recycling and translation. We further discuss recent findings indicating that promoters and enhancers share similar features and may not substantially differ from each other, as previously assumed. Taken together, we review a broad spectrum of studies that highlight the importance of the core promoter and its pivotal role in the regulation of metazoan gene expression and suggest future research directions and challenges.
Collapse
Affiliation(s)
- Yehuda M Danino
- The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan 5290002, Israel
| | - Dan Even
- The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan 5290002, Israel
| | - Diana Ideses
- The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan 5290002, Israel
| | - Tamar Juven-Gershon
- The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan 5290002, Israel.
| |
Collapse
|
12
|
Role of indirect readout mechanism in TATA box binding protein-DNA interaction. J Comput Aided Mol Des 2015; 29:283-95. [PMID: 25575717 DOI: 10.1007/s10822-014-9828-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2014] [Accepted: 12/18/2014] [Indexed: 12/11/2022]
Abstract
Gene expression generally initiates from recognition of TATA-box binding protein (TBP) to the minor groove of DNA of TATA box sequence where the DNA structure is significantly different from B-DNA. We have carried out molecular dynamics simulation studies of TBP-DNA system to understand how the DNA structure alters for efficient binding. We observed rigid nature of the protein while the DNA of TATA box sequence has an inherent flexibility in terms of bending and minor groove widening. The bending analysis of the free DNA and the TBP bound DNA systems indicate presence of some similar structures. Principal coordinate ordination analysis also indicates some structural features of the protein bound and free DNA are similar. Thus we suggest that the DNA of TATA box sequence regularly oscillates between several alternate structures and the one suitable for TBP binding is induced further by the protein for proper complex formation.
Collapse
|
13
|
Unconventional sequence requirement for viral late gene core promoters of murine gammaherpesvirus 68. J Virol 2014; 88:3411-22. [PMID: 24403583 DOI: 10.1128/jvi.01374-13] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Infection with the human gammaherpesviruses, Epstein-Barr virus (EBV) and Kaposi's sarcoma-associated herpesvirus (KSHV), is associated with several cancers. During lytic replication of herpesviruses, viral genes are expressed in an ordered cascade. However, the mechanism by which late gene expression is regulated has not been well characterized in gammaherpesviruses. In this study, we have investigated the cis element that mediates late gene expression during de novo lytic infection with murine gammaherpesvirus 68 (MHV-68). A reporter system was established and used to assess the activity of viral late gene promoters upon infection with MHV-68. It was found that the viral origin of lytic replication, orilyt, must be on the reporter plasmid to support activation of the late gene promoter. Furthermore, the DNA sequence required for the activation of late gene promoters was mapped to a core element containing a distinct TATT box and its neighboring sequences. The critical nucleotides of the TATT box region were determined by systematic mutagenesis in the reporter system, and the significance of these nucleotides was confirmed in the context of the viral genome. In addition, EBV and KSHV late gene core promoters could be activated by MHV-68 lytic replication, indicating that the mechanisms controlling late gene expression are conserved among gammaherpesviruses. Therefore, our results on MHV-68 establish a solid foundation for mechanistic studies of late gene regulation.
Collapse
|
14
|
Hamamoto G, Suzuki T, Suzuki MG, Aoki F. Regulation of transketolase like 1 gene expression in the murine one-cell stage embryos. PLoS One 2014; 9:e82087. [PMID: 24392079 PMCID: PMC3879240 DOI: 10.1371/journal.pone.0082087] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2013] [Accepted: 10/18/2013] [Indexed: 12/03/2022] Open
Abstract
In mice, transcription from the zygotic genome starts at the mid-one-cell stage after fertilization. Previous studies showed that an enhancer is not required for transcription at this stage, and that the enhancer-dependent mechanism of transcription is established during the two-cell stage. However, these results were obtained using reporter gene assays with promoters derived from viruses, rather than from endogenous genes. We conducted a reporter-gene assay using the promoter of Tktl1, which is transcribed after fertilization, to investigate the mechanism regulating gene expression at the one-cell stage. When a plasmid containing the 2467 bp upstream and 25 bp downstream of the Tktl1 transcription start site (TSS) was microinjected into the nuclei of growing oocytes, and one-cell stage and early and late two-cell-stage embryos, transcriptional activity was detected in the one-cell- and two-cell-stage embryos, but not in the oocytes. It was highest at the early two-cell stage and was reduced at the late two-cell stage. The decrease in activity at the late two-cell stage was prevented by inhibiting the second round of DNA replication, suggesting that the transcriptionally repressive state is established during the two-cell stage by a mechanism coupled to DNA replication. When the Tktl1 promoter was deleted to leave 56 bp upstream of the TSS which includes GC and TATA boxes, transcriptional activity was still detected in one-cell-stage embryos, but not early or late two-cell-stage embryos. The core promoter of Tktl1 alone seems to be able to induce basal transcription at the one-cell stage. These results suggest that repressive chromatin is established after fertilization in two steps, which occur during the transition from the one- to two-cell stage and during DNA replication at the two-cell stage.
Collapse
Affiliation(s)
- Go Hamamoto
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, University of Tokyo, Kashiwa, Chiba, Japan
| | - Tsukasa Suzuki
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, University of Tokyo, Kashiwa, Chiba, Japan
| | - Masataka G. Suzuki
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, University of Tokyo, Kashiwa, Chiba, Japan
| | - Fugaku Aoki
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, University of Tokyo, Kashiwa, Chiba, Japan
- * E-mail:
| |
Collapse
|
15
|
Tanifuji G, Onodera NT, Moore CE, Archibald JM. Reduced Nuclear Genomes Maintain High Gene Transcription Levels. Mol Biol Evol 2013; 31:625-35. [DOI: 10.1093/molbev/mst254] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023] Open
|
16
|
Venters BJ, Pugh BF. Genomic organization of human transcription initiation complexes. Nature 2013; 502:53-8. [PMID: 24048476 PMCID: PMC4018585 DOI: 10.1038/nature12535] [Citation(s) in RCA: 47] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2013] [Accepted: 08/09/2013] [Indexed: 11/16/2022]
Abstract
The human genome is pervasively transcribed, yet only a small fraction is coding. Here we address whether this noncoding transcription arises at promoters, and detail the interactions of initiation factors TBP, TFIIB, and RNA polymerase (Pol) II. Using ChIP-exo, we identify ~160,000 transcription initiation complexes across the human K562 genome, and more in other cancer genomes. Only ~5% associate with mRNA genes. The remaining associate with non-polyadenylated noncoding transcription. Regardless, Pol II moves into a transcriptionally paused state, and TBP/TFIIB remain at the promoter. Remarkably, the vast majority of locations contain the four core promoter elements: BREu, TATA, BREd, and INR, in constrained positions. All but the INR also reside at Pol III promoters, where TBP makes similar contacts. This comprehensive and high resolution genome-wide detection of the initiation machinery produces a consolidated view of transcription initiation events from yeast to humans at Pol II/III TATA-containing/TATA-less coding and noncoding genes.
Collapse
Affiliation(s)
- Bryan J Venters
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | | |
Collapse
|
17
|
Major histocompatibility complex class I core promoter elements are not essential for transcription in vivo. Mol Cell Biol 2013; 33:4395-407. [PMID: 24019072 DOI: 10.1128/mcb.00553-13] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The role of core promoter elements in regulating transcription initiation is largely unknown for genes subject to complex regulation. Major histocompatibility complex class I genes are ubiquitously expressed and governed by tissue-specific and hormonal signals. Transcription initiates at multiple sites within the core promoter, which contains elements homologous to the canonical elements CCAAT, TATAA, Sp1 binding site (Sp1BS), and Initiator (Inr). To determine their functions, expression of class I transgenes with individually mutated elements was assessed. Surprisingly, all mutant promoters supported transcription. However, each mutated core promoter element had a distinct effect on expression: CAAT box mutations modulated constitutive expression in nonlymphoid tissues, whereas TATAA-like element mutations dysregulated transcription in lymphoid tissues. Inr mutations aberrantly elevated expression. Sp1BS element mutations resulted in variegated transgene expression. RNA polymerase II binding and histone H3K4me3 patterns correlated with transgene expression; H3K9me3 marks partially correlated. Whereas the wild-type, TATAA-like, and CAAT mutant promoters were activated by gamma interferon, the Sp1 and Inr mutants were repressed, implicating these elements in regulation of hormonal responses. These results lead to the surprising conclusion that no single element is required for promoter activity. Rather, each plays a distinct role in promoter activity, chromatin structure, tissue-specific expression, and extracellular signaling.
Collapse
|
18
|
Lubliner S, Keren L, Segal E. Sequence features of yeast and human core promoters that are predictive of maximal promoter activity. Nucleic Acids Res 2013; 41:5569-81. [PMID: 23599004 PMCID: PMC3675475 DOI: 10.1093/nar/gkt256] [Citation(s) in RCA: 70] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
The core promoter is the region in which RNA polymerase II is recruited to the DNA and acts to initiate transcription, but the extent to which the core promoter sequence determines promoter activity levels is largely unknown. Here, we identified several base content and k-mer sequence features of the yeast core promoter sequence that are highly predictive of maximal promoter activity. These features are mainly located in the region 75 bp upstream and 50 bp downstream of the main transcription start site, and their associations hold for both constitutively active promoters and promoters that are induced or repressed in specific conditions. Our results unravel several architectural features of yeast core promoters and suggest that the yeast core promoter sequence downstream of the TATA box (or of similar sequences involved in recruitment of the pre-initiation complex) is a major determinant of maximal promoter activity. We further show that human core promoters also contain features that are indicative of maximal promoter activity; thus, our results emphasize the important role of the core promoter sequence in transcriptional regulation.
Collapse
Affiliation(s)
- Shai Lubliner
- Department of Computer Science and Applied Mathematics, Weizmann Institute of Science, Rehovot 76100, Israel
| | | | | |
Collapse
|
19
|
Seidl MF, Wang RP, Van den Ackerveken G, Govers F, Snel B. Bioinformatic inference of specific and general transcription factor binding sites in the plant pathogen Phytophthora infestans. PLoS One 2012; 7:e51295. [PMID: 23251489 PMCID: PMC3520976 DOI: 10.1371/journal.pone.0051295] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2012] [Accepted: 11/01/2012] [Indexed: 11/19/2022] Open
Abstract
Plant infection by oomycete pathogens is a complex process. It requires precise expression of a plethora of genes in the pathogen that contribute to a successful interaction with the host. Whereas much effort has been made to uncover the molecular systems underlying this infection process, mechanisms of transcriptional regulation of the genes involved remain largely unknown. We performed the first systematic de-novo DNA motif discovery analysis in Phytophthora. To this end, we utilized the genome sequence of the late blight pathogen Phytophthora infestans and two related Phytophthora species (P. ramorum and P. sojae), as well as genome-wide in planta gene expression data to systematically predict 19 conserved DNA motifs. This catalog describes common eukaryotic promoter elements whose functionality is supported by the presence of orthologs of known general transcription factors. Together with strong functional enrichment of the common promoter elements towards effector genes involved in pathogenicity, we obtained a new and expanded picture of the promoter structure in P. infestans. More intriguingly, we identified specific DNA motifs that are either highly abundant or whose presence is significantly correlated with gene expression levels during infection. Several of these motifs are observed upstream of genes encoding transporters, RXLR effectors, but also transcriptional regulators. Motifs that are observed upstream of known pathogenicity-related genes are potentially important binding sites for transcription factors. Our analyses add substantial knowledge to the as of yet virtually unexplored question regarding general and specific gene regulation in this important class of pathogens. We propose hypotheses on the effects of cis-regulatory motifs on the gene regulation of pathogenicity-related genes and pinpoint motifs that are prime targets for further experimental validation.
Collapse
Affiliation(s)
- Michael F Seidl
- Theoretical Biology and Bioinformatics, Department of Biology, Utrecht University, Utrecht, The Netherlands.
| | | | | | | | | |
Collapse
|
20
|
Tagwerker C, Dupont CL, Karas BJ, Ma L, Chuang RY, Benders GA, Ramon A, Novotny M, Montague MG, Venepally P, Brami D, Schwartz A, Andrews-Pfannkoch C, Gibson DG, Glass JI, Smith HO, Venter JC, Hutchison CA. Sequence analysis of a complete 1.66 Mb Prochlorococcus marinus MED4 genome cloned in yeast. Nucleic Acids Res 2012; 40:10375-83. [PMID: 22941652 PMCID: PMC3488255 DOI: 10.1093/nar/gks823] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Marine cyanobacteria of the genus Prochlorococcus represent numerically dominant photoautotrophs residing throughout the euphotic zones in the open oceans and are major contributors to the global carbon cycle. Prochlorococcus has remained a genetically intractable bacterium due to slow growth rates and low transformation efficiencies using standard techniques. Our recent successes in cloning and genetically engineering the AT-rich, 1.1 Mb Mycoplasma mycoides genome in yeast encouraged us to explore similar methods with Prochlorococcus. Prochlorococcus MED4 has an AT-rich genome, with a GC content of 30.8%, similar to that of Saccharomyces cerevisiae (38%), and contains abundant yeast replication origin consensus sites (ACS) evenly distributed around its 1.66 Mb genome. Unlike Mycoplasma cells, which use the UGA codon for tryptophane, Prochlorococcus uses the standard genetic code. Despite this, we observed no toxic effects of several partial and 15 whole Prochlorococcus MED4 genome clones in S. cerevisiae. Sequencing of a Prochlorococcus genome purified from yeast identified 14 single base pair missense mutations, one frameshift, one single base substitution to a stop codon and one dinucleotide transversion compared to the donor genomic DNA. We thus provide evidence of transformation, replication and maintenance of this 1.66 Mb intact bacterial genome in S. cerevisiae.
Collapse
Affiliation(s)
- Christian Tagwerker
- Department of Synthetic Biology and Bioenergy, J. Craig Venter Institute, 10355 Science Center Drive, San Diego, CA 92121, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
21
|
A conserved GA element in TATA-less RNA polymerase II promoters. PLoS One 2011; 6:e27595. [PMID: 22110682 PMCID: PMC3217976 DOI: 10.1371/journal.pone.0027595] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2011] [Accepted: 10/20/2011] [Indexed: 11/19/2022] Open
Abstract
Initiation of RNA polymerase (Pol) II transcription requires assembly of the pre-initiation complex (PIC) at the promoter. In the classical view, PIC assembly starts with binding of the TATA box-binding protein (TBP) to the TATA box. However, a TATA box occurs in only 15% of promoters in the yeast Saccharomyces cerevisiae, posing the question how most yeast promoters nucleate PIC assembly. Here we show that one third of all yeast promoters contain a novel conserved DNA element, the GA element (GAE), that generally does not co-occur with the TATA box. The distance of the GAE to the transcription start site (TSS) resembles the distance of the TATA box to the TSS. The TATA-less TMT1 core promoter contains a GAE, recruits TBP, and supports formation of a TBP-TFIIB-DNA-complex. Mutation of the promoter region surrounding the GAE abolishes transcription in vivo and in vitro. A 32-nucleotide promoter region containing the GAE can functionally substitute for the TATA box in a TATA-containing promoter. This identifies the GAE as a conserved promoter element in TATA-less promoters.
Collapse
|
22
|
Bernard V, Lecharny A, Brunaud V. Improved detection of motifs with preferential location in promoters. Genome 2011; 53:739-52. [PMID: 20924423 DOI: 10.1139/g10-042] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
Many transcription factor binding sites (TFBSs) involved in gene expression regulation are preferentially located relative to the transcription start site. This property is exploited in in silico prediction approaches, one of which involves studying the local overrepresentation of motifs using a sliding window to scan promoters with considerable accuracy. Nevertheless, the consequences of the choice of the sliding window size have never before been analysed. We propose an automatic adaptation of this size to each motif distribution profile. This approach allows a better characterization of the topological constraints of the motifs and the lists of genes containing them. Moreover, our approach allowed us to highlight a nonconstant frequency of occurrence of spurious motifs that could be counter-selected close to their functional area. Therefore, to improve the accuracy of in silico prediction of TFBSs and the sensitivity of the promoter cartography, we propose, in addition to automatic adaptation of window size, consideration of the nonconstant frequency of motifs in promoters.
Collapse
Affiliation(s)
- Virginie Bernard
- Unité de Recherche en Génomique Végétale (URGV), UMR INRA 1165 - CNRS 8114 - UEVE, 91057 Evry CEDEX, France
| | | | | |
Collapse
|
23
|
Sugihara F, Kasahara K, Kokubo T. Highly redundant function of multiple AT-rich sequences as core promoter elements in the TATA-less RPS5 promoter of Saccharomyces cerevisiae. Nucleic Acids Res 2010; 39:59-75. [PMID: 20805245 PMCID: PMC3017598 DOI: 10.1093/nar/gkq741] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
In eukaryotes, protein-coding genes are transcribed by RNA polymerase II (pol II) together with general transcription factors (GTFs). TFIID, the largest GTF composed of TATA element-binding protein (TBP) and 14 TBP-associated factors (TAFs), plays a critical role in transcription from TATA-less promoters. In metazoans, several core promoter elements other than the TATA element are thought to be recognition sites for TFIID. However, it is unclear whether functionally homologous elements also exist in TATA-less promoters in Saccharomyces cerevisiae. Here, we identify the cis-elements required to support normal levels of transcription and accurate initiation from sites within the TATA-less and TFIID-dependent RPS5 core promoter. Systematic mutational analyses show that multiple AT-rich sequences are required for these activities and appear to function as recognition sites for TFIID. A single copy of these sequences can support accurate initiation from the endogenous promoter, indicating that they carry highly redundant functions. These results show a novel architecture of yeast TATA-less promoters and support a model in which pol II scans DNA downstream from a recruited site, while searching for appropriate initiation site(s).
Collapse
Affiliation(s)
- Fuminori Sugihara
- Division of Molecular and Cellular Biology, Graduate School of Nanobioscience, Yokohama City University, Yokohama, Kanagawa, Japan
| | | | | |
Collapse
|
24
|
Ponomarenko PM, Suslov VV, Savinkova LK, Ponomarenko MP, Kolchanov NA. A precise equation of equilibrium of four steps of TBP binding with the TATA box for prognosis of phenotypic manifestation of mutations. Biophysics (Nagoya-shi) 2010. [DOI: 10.1134/s0006350910030036] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
|
25
|
Bernard V, Brunaud V, Lecharny A. TC-motifs at the TATA-box expected position in plant genes: a novel class of motifs involved in the transcription regulation. BMC Genomics 2010; 11:166. [PMID: 20222994 PMCID: PMC2842252 DOI: 10.1186/1471-2164-11-166] [Citation(s) in RCA: 54] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2009] [Accepted: 03/12/2010] [Indexed: 01/14/2023] Open
Abstract
BACKGROUND The TATA-box and TATA-variants are regulatory elements involved in the formation of a transcription initiation complex. Both have been conserved throughout evolution in a restricted region close to the Transcription Start Site (TSS). However, less than half of the genes in model organisms studied so far have been found to contain either one of these elements. Indeed different core-promoter elements are involved in the recruitment of the TATA-box-binding protein. Here we assessed the possibility of identifying novel functional motifs in plant genes, sharing the TATA-box topological constraints. RESULTS We developed an ab-initio approach considering the preferential location of motifs relative to the TSS. We identified motifs observed at the TATA-box expected location and conserved in both Arabidopsis thaliana and Oryza sativa promoters. We identified TC-elements within non-TA-rich promoters 30 bases upstream of the TSS. As with the TATA-box and TATA-variant sequences, it was possible to construct a unique distance graph with the TC-element sequences. The structural and functional features of TC-element-containing genes were distinct from those of TATA-box- or TATA-variant-containing genes. Arabidopsis thaliana transcriptome analysis revealed that TATA-box-containing genes were generally those showing relatively high levels of expression and that TC-element-containing genes were generally those expressed in specific conditions. CONCLUSIONS Our observations suggest that the TC-elements might constitute a class of novel regulatory elements participating towards the complex modulation of gene expression in plants.
Collapse
Affiliation(s)
- Virginie Bernard
- Unité de Recherche en Génomique Végétale (URGV), UMR INRA 1165-CNRS 8114-UEVE, 2 Rue Gaston Crémieux, 91057 Evry Cedex, France
| | | | | |
Collapse
|
26
|
Sachadyn P, Stanisławska-Sachadyn A, Kabat EM, Zielińska A, Kur J. A cryptic ribosome binding site, false signals in reporter systems and avoidance of protein translation chaos. J Biotechnol 2009; 143:169-72. [PMID: 19607865 DOI: 10.1016/j.jbiotec.2009.07.001] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2008] [Revised: 06/30/2009] [Accepted: 07/02/2009] [Indexed: 11/27/2022]
Abstract
The expression of reporter gene may be induced by activation of cryptic signalling sequences, as we found while constructing the mutS-lacZ fusion gene. We cloned the Escherichia coli lacZ gene encoding beta-galactosidase into a plasmid vector carrying the Thermus thermophilus mutS gene. The clones expected to produce beta-galactosidase as the C-terminal fusion were selected for the complementation of beta-galactosidase activity in a lacZ deficient E. coli strain. Surprisingly, one of the clones, though displaying beta-galactosidase activity, did not produce the fusion protein. As shown by DNA sequencing a 92 bp fragment in the 3' part of mutS gene was substituted by a 19 bp sequence. As the consequence of the resultant frameshift, a truncated MutS peptide was translated instead of beta-galactosidase fusion. The cloned lacZ gene lacked its ribosome binding site, so lacZ expression could be explained by activation of a cryptic ribosome binding site in the 3' end of mutS gene. This observation shows that fusion domains in reporter systems are possible to produce accidentally misleading signals. This observation also suggests that some triplets like AGG and AGA, present in the canonical ribosome binding sequence, are rarely used codons to prevent chaotic protein translation.
Collapse
Affiliation(s)
- Paweł Sachadyn
- Gdańsk University of Technology, Department of Microbiology, ul. Narutowicza 11/12, 80-233 Gdańsk, Poland.
| | | | | | | | | |
Collapse
|
27
|
Functional characterization of the murine Tnk1 promoter. Gene 2009; 444:1-9. [PMID: 19481140 DOI: 10.1016/j.gene.2009.05.010] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2009] [Revised: 05/05/2009] [Accepted: 05/12/2009] [Indexed: 12/31/2022]
Abstract
Tnk1/Kos1 is a non-receptor protein tyrosine kinase found to be a tumor suppressor. It negatively regulates cell growth by indirectly suppressing Ras activity. We identified and characterized the critical cis-elements required for Tnk1/Kos1's promoter activity. Results indicate that the murine Tnk1 promoter lacks a conventional TATA, CAAT or initiator element (Inr) but contains multiple transcription start sites. Transcription is initiated by a TATA-like element composed of an AT rich sequence at -30 (30 bp upstream) from the major transcription start site and an Inr-like element that overlaps the multiple start sites. Deletion analysis of the m-Tnk1 promoter reveals the presence of both positive (-25 to -151) and negative (-151 to -1201) regulatory regions. The three GC boxes which bind Sp1 and Sp3 with high affinity, an AP2 site (that overlaps with an AML1 site) and a MED1 site comprise the necessary cis-elements of the proximal promoter required for both constitutive and inducible Tnk1/Kos1 expression. Importantly, results reveal that cellular stress reverses the repression of Tnk1/Kos1 and induces its expression through increased high affinity interactions between nuclear proteins Sp1, Sp3, AP2 and MED1 for the m-Tnk1 promoter. These findings provide a mechanism by which the m-Tnk1 promoter can be dynamically regulated during normal growth.
Collapse
|
28
|
Armata IA, Ananthanarayanan M, Balasubramaniyan N, Shashidharan P. Regulation of DYT1 gene expression by the Ets family of transcription factors. J Neurochem 2008; 106:1052-65. [PMID: 18466338 DOI: 10.1111/j.1471-4159.2008.05465.x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
The DYT1 gene encodes for torsinA, a protein with widespread tissue distribution, involved in early onset dystonia (EOD). Numerous studies have focused on torsinA function but no information is available on its transcriptional regulation. We cloned mouse and human 5'-upstream DYT1 DNA fragments, exhibiting high transcriptional activity, as well as tissue specificity. We identified a proximal minimal DYT1 promoter within -141 bp for mouse and -191 bp for human with respect to the ATG codon. Primer extension analysis indicated multiple transcription start sites. In silico analysis of approximately 500 bp 5'-upstream DYT1 fragment demonstrated lack of a classical TATA or CAAT box and the presence of a highly conserved direct repeat of two Ets binding cores within -86 bp to -77 bp and -78 bp to -69 bp of the mouse and human DYT1 gene, respectively. A single or a two base nucleotide alteration within the downstream Ets core resulted in approximately 90% (mouse) or 45-60% (human) drop in activity. Interestingly, a 3-bp distance increase between the two Ets cores dramatically decreased transcriptional activity which was partially restored when the distance was increased up to 10 bp. Ets-like dominant negatives confirmed the Ets factors as DYT1 transcriptional activators.
Collapse
Affiliation(s)
- Ioanna A Armata
- Department of Neurology, Laboratory of Developmental and Molecular Hepatology, Mount Sinai School of Medicine, New York 10029, USA
| | | | | | | |
Collapse
|
29
|
Bjornsdottir G, Myers LC. Minimal components of the RNA polymerase II transcription apparatus determine the consensus TATA box. Nucleic Acids Res 2008; 36:2906-16. [PMID: 18385157 PMCID: PMC2396422 DOI: 10.1093/nar/gkn130] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
In Saccharomyces cerevisiae, multiple approaches have arrived at a consensus TATA box sequence of TATA(T/A)A(A/T)(A/G). TATA-binding protein (TBP) affinity alone does not determine TATA box function. To discover how a minimal set of factors required for basal and activated transcription contributed to the sequence requirements for a functional TATA box, we performed transcription reactions using highly purified proteins and CYC1 promoter TATA box mutants. The TATA box consensus sequence is a good predictor of promoter activity. However, several nonconsensus sequences are almost fully functional, indicating that mechanistic requirements are not the only selective pressure on the TATA box. We also found that the effect of a mutation at a certain position is often dependent on other bases within a particular TATA box. Although activators and coactivators strongly influence TBP recruitment and stability at promoters, neither Mediator, the activator Gal4-V16, nor TFIID specifically compensate for the low transcription levels of the weak TATA boxes. The addition of Mediator to purified transcription reactions did, however, increase the functional selectivity for certain consensus TATA sequences. Transcription in whole-cell extracts or in vivo with these TATA box mutants indicated that factors, other than those in our purified system, may help initiate transcription from weak TATA boxes.
Collapse
Affiliation(s)
- Gudrun Bjornsdottir
- Department of Biochemistry, Dartmouth Medical School, Hanover, NH 03755, USA
| | | |
Collapse
|
30
|
An analysis of the positional distribution of DNA motifs in promoter regions and its biological relevance. BMC Bioinformatics 2008; 9:89. [PMID: 18257925 PMCID: PMC2375121 DOI: 10.1186/1471-2105-9-89] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2007] [Accepted: 02/07/2008] [Indexed: 11/13/2022] Open
Abstract
Background Motif finding algorithms have developed in their ability to use computationally efficient methods to detect patterns in biological sequences. However the posterior classification of the output still suffers from some limitations, which makes it difficult to assess the biological significance of the motifs found. Previous work has highlighted the existence of positional bias of motifs in the DNA sequences, which might indicate not only that the pattern is important, but also provide hints of the positions where these patterns occur preferentially. Results We propose to integrate position uniformity tests and over-representation tests to improve the accuracy of the classification of motifs. Using artificial data, we have compared three different statistical tests (Chi-Square, Kolmogorov-Smirnov and a Chi-Square bootstrap) to assess whether a given motif occurs uniformly in the promoter region of a gene. Using the test that performed better in this dataset, we proceeded to study the positional distribution of several well known cis-regulatory elements, in the promoter sequences of different organisms (S. cerevisiae, H. sapiens, D. melanogaster, E. coli and several Dicotyledons plants). The results show that position conservation is relevant for the transcriptional machinery. Conclusion We conclude that many biologically relevant motifs appear heterogeneously distributed in the promoter region of genes, and therefore, that non-uniformity is a good indicator of biological relevance and can be used to complement over-representation tests commonly used. In this article we present the results obtained for the S. cerevisiae data sets.
Collapse
|
31
|
DNA sequence and structural properties as predictors of human and mouse promoters. Gene 2007; 410:165-76. [PMID: 18234453 PMCID: PMC2672154 DOI: 10.1016/j.gene.2007.12.011] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2007] [Revised: 11/30/2007] [Accepted: 12/05/2007] [Indexed: 11/21/2022]
Abstract
Promoters play a central role in gene regulation, yet our power to discriminate them from non-promoter sequences in higher eukaryotes is mainly restricted to those associated with CpG islands. Here, we examined in silico the promoters of 30,954 human and 18,083 mouse transcripts in the DBTSS database, to assess the impact of particular sequence and structural features (propeller twist, bendability and nucleosome positioning preference) on promoter classification and prediction. Our analysis showed that a stricter-than-traditional definition of CpG islands captures low and high CpG count promoter classes more accurately than the traditional one. We observed that both human and mouse promoter sequences are flexible with the exception of the TATA box and TSS, which are rigid regions irrespective of association with a CpG island. Therefore varying levels of structural flexibility in promoters may affect their accessibility to proteins, and hence their specificity. For all features investigated, averaged values across core promoters discriminated CpG island associated promoters from background, whereas the same did not hold for promoters without a CpG island. However, local changes around - 34 to - 23 (expected position of TATA box) and the TSS were informative in discriminating promoters (both classes) from non-promoter sequences. Additionally, we investigated ATG deserts and observed that they occur in all promoter sets except those with a TATA-box and without a CpG island in human. Interestingly, all mouse promoter sets showed ATG codon depletion irrespective of the presence of a TATA-box, possibly reflecting a weaker contribution to TSS specificity in mouse.
Collapse
|
32
|
Skovorodkin I, Pimenov A, Raykhel I, Schimanski B, Ammermann D, Günzl A. alpha-tubulin minichromosome promoters in the stichotrichous ciliate Stylonychia lemnae. EUKARYOTIC CELL 2006; 6:28-36. [PMID: 17085637 PMCID: PMC1800363 DOI: 10.1128/ec.00003-06] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
Ciliated protists are model organisms for a number of molecular phenomena including telomerase function, self-splicing introns, and an RNA interference-related mechanism in programmed DNA elimination. Despite this relevance, our knowledge about promoters and transcriptional regulation in these organisms is very limited. The macronuclear genome of stichotrichous ciliates consists of minichromosomes which typically encode a single gene. The 5' nontranscribed spacers are usually no longer than 400 bp and highly suitable for promoter characterizations. We used microinjection of two artificial and differently tagged alpha1 tubulin minichromosomes into the macronucleus of Stylonychia lemnae as a means to characterize in detail the corresponding promoter. Clonal cell lines that stably maintained both minichromosomes were generated, enabling comparative expression analysis by primer extension assays. Deletion and block substitution mutations of one of the minichromosomes revealed a TATA-like element, a putative initiator element, and two distinct upstream sequence elements (USEs). Determination of transcription initiation sites and a sequence alignment indicated that both TATA-like and initiator elements are conserved components of S. lemnae minichromosomes, whereas the USEs appear to be specific for the alpha1 tubulin minichromosome. The alpha2 tubulin minichromosome promoter is very short, comprising the two proximal elements but not the USEs. Despite the latter finding, up-regulation of alpha-tubulin expression in cells treated with concanavalin A activated the alpha2 but not the alpha1 tubulin promoter. These results therefore show that gene expression regulation in S. lemnae occurs at the level of transcription initiation on the basis of structurally different promoters.
Collapse
Affiliation(s)
- Ilya Skovorodkin
- Zoologisches Institut der Universität Tübingen, Abteilung Zellbiologie, Auf der Morgenstelle 28, 72076 Tübingen, Germany
| | | | | | | | | | | |
Collapse
|
33
|
Yang C, Bolotin E, Jiang T, Sladek FM, Martinez E. Prevalence of the initiator over the TATA box in human and yeast genes and identification of DNA motifs enriched in human TATA-less core promoters. Gene 2006; 389:52-65. [PMID: 17123746 PMCID: PMC1955227 DOI: 10.1016/j.gene.2006.09.029] [Citation(s) in RCA: 255] [Impact Index Per Article: 14.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2006] [Revised: 09/12/2006] [Accepted: 09/22/2006] [Indexed: 10/24/2022]
Abstract
The core promoter of eukaryotic genes is the minimal DNA region that recruits the basal transcription machinery to direct efficient and accurate transcription initiation. The fraction of human and yeast genes that contain specific core promoter elements such as the TATA box and the initiator (INR) remains unclear and core promoter motifs specific for TATA-less genes remain to be identified. Here, we present genome-scale computational analyses indicating that approximately 76% of human core promoters lack TATA-like elements, have a high GC content, and are enriched in Sp1-binding sites. We further identify two motifs - M3 (SCGGAAGY) and M22 (TGCGCANK) - that occur preferentially in human TATA-less core promoters. About 24% of human genes have a TATA-like element and their promoters are generally AT-rich; however, only approximately 10% of these TATA-containing promoters have the canonical TATA box (TATAWAWR). In contrast, approximately 46% of human core promoters contain the consensus INR (YYANWYY) and approximately 30% are INR-containing TATA-less genes. Significantly, approximately 46% of human promoters lack both TATA-like and consensus INR elements. Surprisingly, mammalian-type INR sequences are present - and tend to cluster - in the transcription start site (TSS) region of approximately 40% of yeast core promoters and the frequency of specific core promoter types appears to be conserved in yeast and human genomes. Gene Ontology analyses reveal that TATA-less genes in humans, as in yeast, are frequently involved in basic "housekeeping" processes, while TATA-containing genes are more often highly regulated, such as by biotic or stress stimuli. These results reveal unexpected similarities in the occurrence of specific core promoter types and in their associated biological processes in yeast and humans and point to novel vertebrate-specific DNA motifs that might play a selective role in TATA-independent transcription.
Collapse
Affiliation(s)
- Chuhu Yang
- Genetics Genomics and Bioinformatics Graduate Program, University of California, Riverside, CA 92521, USA
| | | | | | | | | |
Collapse
|
34
|
Stewart JJ, Fischbeck JA, Chen X, Stargell LA. Non-optimal TATA Elements Exhibit Diverse Mechanistic Consequences. J Biol Chem 2006; 281:22665-73. [PMID: 16772290 DOI: 10.1074/jbc.m603237200] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
To reveal mechanistic differences in transcription initiation between variant TATA elements, in vivo and in vitro assays of the functional activity of 14 different sequences were compared. Variant elements exhibited particular degrees of activation in vivo but universally were unable to support the -fold activation observed for an element consisting of TATAAA. Each element was classified by its functional activity for in vitro interaction with TATA-binding protein (TBP), TFIIA, and TFIIB. Certain off-consensus TATA elements form poor binding sites for TBP and this compromised interaction interferes with higher order complex formation with TFIIA and/or TFIIB. Other elements are only modestly decreased for TBP binding but dramatically affected for higher order complex formation. Another distinct category is comprised of two elements (CATAAA and TATAAG), which are not affected in the initial formation of the TBP, TFIIA-TBP, or TFIIB-TBP complexes. However, CATAAA and TATAAG are unable to form a stable TFIIA-TBP-DNA complex in vitro. Moreover, fusion of TFIIA to TBP specifically restores activity from these two elements in vivo. Taken together, these results indicate that the interplay between the sequence of the TATA element and the components of the general transcription machinery can lead to variations in the formation of functional complexes and/or the stability of these complexes. These differences offer distinct opportunities for an organism to exploit diverse steps in the regulation of gene expression depending on the precise TATA element sequence at a given gene.
Collapse
Affiliation(s)
- Jennifer J Stewart
- Department of Biochemistry and Molecular Biology, Colorado State University, Fort Collins, Colorado 80523-1870, USA
| | | | | | | |
Collapse
|
35
|
Johnston CM, Wood AL, Bolland DJ, Corcoran AE. Complete Sequence Assembly and Characterization of the C57BL/6 Mouse Ig Heavy Chain V Region. THE JOURNAL OF IMMUNOLOGY 2006; 176:4221-34. [PMID: 16547259 DOI: 10.4049/jimmunol.176.7.4221] [Citation(s) in RCA: 125] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
The mechanisms that regulate variable (V) gene selection during the development of the mouse IgH repertoire are not fully understood, due in part to the absence of the complete locus sequence. To better understand these processes, we have assembled the entire 2.5-Mb mouse IgH (Igh) V region sequence of the C57BL/6 strain from public sequences and present the first complete annotated map of the region, including V genes, pseudogenes, repeats, and nonrepetitive intergenic sequences. In so doing, we have discovered a new V gene family, VH16. We have identified clusters of conserved region-specific intergenic sequences and have verified our assembly by genic and intergenic Southern blotting. We have observed that V pseudogenes are not evenly spread throughout the V region, but rather cluster together. The largest J558 family, which spans more than half of the locus, has two strikingly different domains, which suggest points of evolutionary divergence or duplication. The 5' end contains widely spaced J558 genes interspersed with 3609 genes and is pseudogene poor. The 3' end contains closely spaced J558 genes, no 3609 genes, and is pseudogene rich. Each occupies a different branch of the phylogenetic tree. Detailed analysis of 500-bp upstream of all functional genes has revealed several conserved binding sites, general and B cell-specific, as well as key differences between families. This complete and definitive assembly of the mouse Igh V region will facilitate detailed study of promoter function and large-scale mechanisms associated with V(D)J recombination including locus contraction and antisense intergenic transcription.
Collapse
Affiliation(s)
- Colette M Johnston
- Laboratory of Chromatin and Gene Expression, Babraham Institute, Cambridge, UK
| | | | | | | |
Collapse
|
36
|
Tolić-Nørrelykke SF, Rasmussen MB, Pavone FS, Berg-Sørensen K, Oddershede LB. Stepwise bending of DNA by a single TATA-box binding protein. Biophys J 2006; 90:3694-703. [PMID: 16500964 PMCID: PMC1440750 DOI: 10.1529/biophysj.105.074856] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The TATA-box binding protein (TBP) is required by all three eukaryotic RNA polymerases for the initiation of transcription from most promoters. TBP recognizes, binds to, and bends promoter sequences called "TATA-boxes" in the DNA. We present results from the study of individual Saccharomyces cerevisiae TBPs interacting with single DNA molecules containing a TATA-box. Using video microscopy, we observed the Brownian motion of beads tethered by short surface-bound DNA. When TBP binds to and bends the DNA, the conformation of the DNA changes and the amplitude of Brownian motion of the tethered bead is reduced compared to that of unbent DNA. We detected individual binding and dissociation events and derived kinetic parameters for the process. Dissociation was induced by increasing the salt concentration or by directly pulling on the tethered bead using optical tweezers. In addition to the well-defined free and bound classes of Brownian motion, we observed another two classes of motion. These extra classes were identified with intermediate states on a three-step, linear-binding pathway. Biological implications of the intermediate states are discussed.
Collapse
|
37
|
Lee MP, Howcroft K, Kotekar A, Yang HH, Buetow KH, Singer DS. ATG deserts define a novel core promoter subclass. Genome Res 2005; 15:1189-97. [PMID: 16109972 PMCID: PMC1199533 DOI: 10.1101/gr.3873705] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
Abstract
The MHC class I gene, PD1, has neither functional TATAA nor Initiator (Inr) elements in its core promoter and initiates transcription at multiple, dispersed sites over an extended region in vitro. Here, we define a novel core promoter feature that supports regulated transcription through selective transcription start site (TSS) usage. We demonstrate that TSS selection is actively regulated and context dependent. Basal and activated transcriptions initiate from largely nonoverlapping TSS regions. Transcripts derived from multiple TSS encode a single protein, due to the absence of any ATG triplets within approximately 430 bp upstream of the major transcription start site. Thus, the PD1 core promoter is embedded within an "ATG desert". Remarkably, extending this analysis genome-wide, we find that ATG deserts define a novel promoter subclass. They occur nonrandomly, are significantly associated with non-TATAA promoters that use multiple TSS, independent of the presence of CpG islands (CGI). We speculate that ATG deserts may provide a core promoter platform upon which complex upstream regulatory signals can be integrated, targeting multiple TSS whose products encode a single protein.
Collapse
Affiliation(s)
- Maxwell P Lee
- Laboratory of Population Genetics, National Cancer Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | | | | | | | | | | |
Collapse
|
38
|
Larsen E, Kwon K, Coin F, Egly JM, Klungland A. Transcription activities at 8-oxoG lesions in DNA. DNA Repair (Amst) 2005; 3:1457-68. [PMID: 15380101 DOI: 10.1016/j.dnarep.2004.06.008] [Citation(s) in RCA: 67] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2004] [Indexed: 02/07/2023]
Abstract
7,8-Dihydro-8-oxoguanine (8-oxoG) is the most frequent mutagenic lesion caused by oxidative stress. Eukaryotic cells use a specific DNA glycosylase, OGG1, to excise 8-oxoG from DNA. The mild phenotype of OGG1 null mice has been attributed to the existence of alternative pathways, including Cockayne syndrome B (CSB)-dependent transcription coupled repair (TCR), for removal of 8-oxoG. We have studied repair and transcription activities at 8-oxoG lesions with a reconstituted transcription system (RTS; RNA polymerase II, TBP, TFIIA, TFIIB, TFIIE, TFIIF and TFIIH), as well as in cellular extracts and in vivo. All measurable repair activity at 8-oxoG lesions takes place in the 3'-direction from the lesion, indicating base excision repair (BER) activity and negligible role of nucleotide excision repair (NER). Although 8-oxoG has been shown to be preferentially removed from the transcribed strand, in vitro experiments with purified transcription factors failed to identify a definite block for RNA polymerase II at the lesion. However, a weak block was observed at the lesion during transcription carried out with RTS as well as with cellular extracts. RNA polymerase II was identified at the site of the lesion on obstructed templates. Wild-type cells, as well as cells carrying targeted mutations of genes required for removal of 8-oxoG, were transfected with a luciferase expression vector containing an 8-oxoG lesion. No significant obstruction at 8-oxoG lesions was observed by this in vivo approach. In control experiments transcription elongation was completely blocked by cisplatin.
Collapse
Affiliation(s)
- Elisabeth Larsen
- Centre for Molecular Biology and Neuroscience, Institute of Medical Microbiology, National Hospital, University of Oslo, 0027 Oslo, Norway
| | | | | | | | | |
Collapse
|
39
|
Abstract
MOTIVATION Localizing protein binding sites within genomic DNA is of considerable importance, but remains difficult for protein families, such as transcription factors, which have loosely defined target sequences. It is generally assumed that protein affinity for DNA involves additive contributions from successive nucleotide pairs within the target sequence. This is not necessarily true, and non-additive effects have already been experimentally demonstrated in a small number of cases. The principal origin of non-additivity involves the so-called indirect component of protein-DNA recognition which is related to the sequence dependence of DNA deformation induced during complex formation. Non-additive effects are difficult to study because they require the identification of many more binding sequences than are normally necessary for describing additive specificity (typically via the construction of weight matrices). RESULTS In the present work we will use theoretically estimated binding energies as a basis for overcoming this problem. Our approach enables us to study the full combinatorial set of sequences for a variety of DNA-binding proteins, make a detailed analysis of non-additive effects and exploit this information to improve binding site predictions using either weight matrices or support vector machines. The results underline the fact that, even in the presence of significant deformation, non-additive effects may involve only a limited number of dinucleotide steps. This information helps to reduce the number of binding sites which need to be identified for successful predictions and to avoid problems of over-fitting. AVAILABILITY The SVM software is available upon request from the authors.
Collapse
Affiliation(s)
- R A O'Flanagan
- Department of Physics and Astronomy, BioMaps Institute, Rutgers, The State University of New Jersey, 136 Frelinghuysen Road, Piscataway, NJ 08854-8019, USA
| | | | | | | |
Collapse
|
40
|
Schulmeister A, Heyers O, Morales ME, Brindley PJ, Lucius R, Meusel G, Kalinna BH. Organization and functional analysis of the Schistosoma mansoni cathepsin D-like aspartic protease gene promoter. ACTA ACUST UNITED AC 2005; 1727:27-34. [PMID: 15652155 DOI: 10.1016/j.bbaexp.2004.11.004] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2004] [Revised: 11/04/2004] [Accepted: 11/10/2004] [Indexed: 10/26/2022]
Abstract
We have cloned a 969-bp fragment of genomic DNA that spans 821 bp of the 5' untranslated region, exon 1, a short intron, and part of exon 2 of the Schistosoma mansoni cathepsin D gene by inverse PCR. Inspection of this sequence revealed the presence of two TATA-box motifs, two inverted CCAAT-box (inverted NF-Y) motifs and sequences with homology to binding sites for the transcription factors, AP-1 and NF-Y. This sequence and deletion variants were cloned into reporter gene constructs, in order to examine the ability of these putative regulatory sequences to drive heterologous reporter gene activity. PCR products were cloned into the luciferase reporter vector pXP2. These reporter gene constructs were used to transform HeLa cells which were cultured and examined for luciferase activity. Additionally, HeLa cells transiently transfected with an EGFP reporter plasmid driven by the putative promoter from the S. mansoni cathepsin D gene were examined for EGFP transcripts and fluorescence. The 5' untranslated region of the S. mansoni cathepsin D gene, from position -772 to +40 (translation start ATG), included functional regulatory sequences capable of driving luciferase and EGFP expression, whereas shorter fragments from position -264 or -185 to +40 were insufficient to drive reporter activities.
Collapse
Affiliation(s)
- Alexandra Schulmeister
- Department of Molecular Parasitology, Institute of Biology, Humboldt University Berlin, Philippstrasse 13, 10115 Berlin, Germany
| | | | | | | | | | | | | |
Collapse
|
41
|
Komarnytsky S, Borisjuk N. Functional analysis of promoter elements in plants. GENETIC ENGINEERING 2004; 25:113-41. [PMID: 15260236 DOI: 10.1007/978-1-4615-0073-5_6] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/11/2023]
Affiliation(s)
- Slavko Komarnytsky
- Biotech Center, Cook College, Rutgers University, 59 Dudley Rd., New Brunswick, NJ 08901-8520, USA
| | | |
Collapse
|
42
|
Basehoar AD, Zanton SJ, Pugh BF. Identification and distinct regulation of yeast TATA box-containing genes. Cell 2004; 116:699-709. [PMID: 15006352 DOI: 10.1016/s0092-8674(04)00205-3] [Citation(s) in RCA: 495] [Impact Index Per Article: 24.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2003] [Revised: 11/26/2003] [Accepted: 01/08/2004] [Indexed: 11/21/2022]
Abstract
Despite being one of the first eukaryotic transcriptional regulatory elements identified, the sequence of a native TATA box and its significance remain elusive. Applying criteria associated with TATA boxes we queried several Saccharomyces genomes and arrived at the consensus TATA(A/T)A(A/T)(A/G). Approximately 20% of yeast genes contain a TATA box. Strikingly, TATA box-containing genes are associated with responses to stress, are highly regulated, and preferentially utilize SAGA rather than TFIID when compared to TATA-less promoters. Transcriptional regulation in yeast appears to be mechanistically bipolar, possibly reflecting a need to balance inducible stress-related responses with constitutive housekeeping functions.
Collapse
Affiliation(s)
- Andrew D Basehoar
- Graduate Program in Statistics, The Pennsylvania State University, University Park, PA 16802, USA
| | | | | |
Collapse
|
43
|
Grace ML, Chandrasekharan MB, Hall TC, Crowe AJ. Sequence and Spacing of TATA Box Elements Are Critical for Accurate Initiation from the β-Phaseolin Promoter. J Biol Chem 2004; 279:8102-10. [PMID: 14660650 DOI: 10.1074/jbc.m309376200] [Citation(s) in RCA: 62] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
The beta-phaseolin (phas) gene, which encodes one of the major seed storage proteins of P. vulgaris, is tightly regulated at the transcription level resulting in strict tissue-specific and spatial expression during embryonic development. The phas proximal promoter contains a complex arrangement of core promoter elements including three TATA boxes as well as several putative initiator elements. To delineate the respective contributions of the core promoter elements to transcription initiation we have performed site-directed mutagenesis of the phas promoter. In vivo expression studies were performed on transgenic Arabidopsis harboring phas promoter mutants driving expression of the beta-glucuronidase (gus) reporter gene. Quantitative assessment of GUS activity in seeds bearing the promoter mutants indicated that both sequence and spacing of the TATA elements influenced the efficiency of transcription. Substitution, insertion or deletion mutations had no effect on histochemical staining patterns indicating that strict spacing requirements are not essential for correct spatial expression of phas during embryogenesis. Further evaluation of the phas promoter by in vitro transcription analysis revealed the presence of multiple TATA-dependent transcription initiation start sites. The distance between TATA elements and transcription start sites was maintained in insertion and deletion mutants through the creation of novel initiation sites, indicating that positioning of the TATA elements rather than DNA sequence was the primary determinant of start site location. We conclude that, while dispensable for proper spatial distribution, the complex architecture of the phas promoter is required to ensure high levels of accurate phas transcription initiation in the developing embryo.
Collapse
Affiliation(s)
- Margaret L Grace
- Department of Biology, University of Washington, Seattle, Washington 98195, USA
| | | | | | | |
Collapse
|
44
|
Abstract
The events leading to transcription of eukaryotic protein-coding genes culminate in the positioning of RNA polymerase II at the correct initiation site. The core promoter, which can extend ~35 bp upstream and/or downstream of this site, plays a central role in regulating initiation. Specific DNA elements within the core promoter bind the factors that nucleate the assembly of a functional preinitiation complex and integrate stimulatory and repressive signals from factors bound at distal sites. Although core promoter structure was originally thought to be invariant, a remarkable degree of diversity has become apparent. This article reviews the structural and functional diversity of the RNA polymerase II core promoter.
Collapse
Affiliation(s)
- Stephen T Smale
- Howard Hughes Medical Institute and Department of Microbiology, Immunology, and Molecular Genetics, University of California, Los Angeles, California 90095-1662, USA.
| | | |
Collapse
|
45
|
Angermayr M, Schwerdffeger K, Bandlow W. A Nucleosome-Free dG-dC-Rich Sequence Element Promotes Constitutive Transcription of the Essential Yeast RIO1 Gene. Biol Chem 2003; 384:1287-92. [PMID: 14515990 DOI: 10.1515/bc.2003.143] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
AbstractRIO1 is an essential gene that encodes a protein serine kinase and is transcribed constitutively at a very low level. Transcriptional activation of RIO1 dispenses with a canonical TATA box as well as with classical transactivators or specific DNA-binding factors. Instead, a dG-dC-rich sequence element, that is located 40 to 48 bp upstream the single site of mRNA initiation, is essential and presumably constitutes the basal promoter. In addition, we demonstrate here that this promoter element comprises a nucleosomefree gap which is centered at the dG-dC tract and flanked by two positioned nucleosomes. This element is both, necessary and sufficient, for basal transcription initiation at the RIO1 promoter and, thus, constitutes a novel type of core promoter element.
Collapse
Affiliation(s)
- Michaela Angermayr
- Department Biologie I, Bereich Genetik der Ludwig-Maximilians-Universität München, Maria-Ward-Strasse 1a, D-80638 München, Germany
| | | | | |
Collapse
|
46
|
Cliften P, Sudarsanam P, Desikan A, Fulton L, Fulton B, Majors J, Waterston R, Cohen BA, Johnston M. Finding functional features in Saccharomyces genomes by phylogenetic footprinting. Science 2003; 301:71-6. [PMID: 12775844 DOI: 10.1126/science.1084337] [Citation(s) in RCA: 634] [Impact Index Per Article: 30.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
The sifting and winnowing of DNA sequence that occur during evolution cause nonfunctional sequences to diverge, leaving phylogenetic footprints of functional sequence elements in comparisons of genome sequences. We searched for such footprints among the genome sequences of six Saccharomyces species and identified potentially functional sequences. Comparison of these sequences allowed us to revise the catalog of yeast genes and identify sequence motifs that may be targets of transcriptional regulatory proteins. Some of these conserved sequence motifs reside upstream of genes with similar functional annotations or similar expression patterns or those bound by the same transcription factor and are thus good candidates for functional regulatory sequences.
Collapse
Affiliation(s)
- Paul Cliften
- Department of Genetics, Washington University School of Medicine, 660 South Euclid Avenue, St. Louis, MO 63110, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
47
|
Gumbs OH, Campbell AM, Weil PA. High-affinity DNA binding by a Mot1p-TBP complex: implications for TAF-independent transcription. EMBO J 2003; 22:3131-41. [PMID: 12805227 PMCID: PMC162156 DOI: 10.1093/emboj/cdg304] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Yeast Mot1p, an abundant conserved member of the Snf2p-ATPase family of proteins, both dissociates TBP from DNA in vitro using the energy of ATP and represses gene transcription in vivo, yet paradoxically, loss of Mot1p function also leads to decreased transcription of certain genes. We conducted experiments utilizing fluorescently labeled DNA, TBP, fluorescence anisotropy spectroscopy and native gel electrophoresis to study Mot1p action. We have made a number of observations, the most intriguing being that a stable Mot1p-TBP complex has the ability to bind TATA DNA with high affinity, albeit with dramatically altered specificity. We propose that this altered TBP-DNA recognition is integral to Mot1p's ability to regulate transcription, and further postulate that the Mot1p-TBP complex delivers TBP to TAF-independent mRNA encoding genes.
Collapse
Affiliation(s)
- Orlando H Gumbs
- Department of Molecular Physiology and Biophysics, Vanderbilt University, School of Medicine, Nashville, TN 37232-0615, USA
| | | | | |
Collapse
|
48
|
Stewart CA, Van Bergen J, Trowsdale J. Different and divergent regulation of the KIR2DL4 and KIR3DL1 promoters. JOURNAL OF IMMUNOLOGY (BALTIMORE, MD. : 1950) 2003; 170:6073-81. [PMID: 12794136 DOI: 10.4049/jimmunol.170.12.6073] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
The killer Ig-like receptors (KIR) are a family of highly related MHC class I receptors that show extreme genetic polymorphism both within the human population and between closely related primate species, suggestive of rapid evolutionary diversification. Most KIR are expressed in a variegated fashion by the NK population, giving rise to an NK repertoire of specificities for MHC class I. We compared the promoter for KIR3DL1, which exhibits variegated gene expression, with that for KIR2DL4, which is expressed by all NK cell clones. Maximum transcriptional activity of each was encoded within approximately 270 bp upstream of the translation initiation codon. The KIR2DL4 promoter drove reporter gene expression only in NK cells, while the KIR3DL1 promoter was active in a range of cell types, suggesting that the latter requires other regulatory elements for physiological expression. In NK cells, reporter gene expression driven by the KIR2DL4 promoter was greater than that driven by the KIR3DL1 promoter. DNase I footprinting revealed that transcription factor binding sites differ between the two promoters. The data indicate that while the promoters of these two KIR genes share 67% nucleotide identity, they have evolved distinct properties consistent with different roles in regulating the generation of NK repertoire.
Collapse
MESH Headings
- Base Composition
- Base Sequence
- Binding Sites/genetics
- Binding Sites/immunology
- Cell Line
- Codon, Initiator/genetics
- Codon, Initiator/metabolism
- Humans
- Jurkat Cells
- K562 Cells
- Killer Cells, Natural/immunology
- Killer Cells, Natural/metabolism
- Molecular Sequence Data
- Multigene Family/immunology
- Organ Specificity/genetics
- Organ Specificity/immunology
- Promoter Regions, Genetic/immunology
- Receptors, Immunologic/biosynthesis
- Receptors, Immunologic/genetics
- Receptors, Immunologic/metabolism
- Receptors, KIR
- Receptors, KIR2DL4
- Receptors, KIR3DL1
- Transcription Factors/genetics
- Transcription Factors/metabolism
- Transcription Initiation Site
- Tumor Cells, Cultured
Collapse
Affiliation(s)
- C Andrew Stewart
- Department of Pathology, University of Cambridge, Cambridge, United Kingdom
| | | | | |
Collapse
|
49
|
Howcroft TK, Raval A, Weissman JD, Gegonne A, Singer DS. Distinct transcriptional pathways regulate basal and activated major histocompatibility complex class I expression. Mol Cell Biol 2003; 23:3377-91. [PMID: 12724398 PMCID: PMC154244 DOI: 10.1128/mcb.23.10.3377-3391.2003] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Transcription of major histocompatibility complex (MHC) class I genes is regulated by both tissue-specific (basal) and hormone/cytokine (activated) mechanisms. Although promoter-proximal regulatory elements have been characterized extensively, the role of the core promoter in mediating regulation has been largely undefined. We report here that the class I core promoter consists of distinct elements that are differentially utilized in basal and activated transcription pathways. These pathways recruit distinct transcription factor complexes to the core promoter elements and target distinct transcription initiation sites. Class I transcription initiates at four major sites within the core promoter and is clustered in two distinct regions: "upstream" (-14 and -18) and "downstream" (+12 and +1). Basal transcription initiates predominantly from the upstream start site region and is completely dependent upon the general transcription factor TAF1 (TAF(II)250). Activated transcription initiates predominantly from the downstream region and is TAF1 (TAF(II)250) independent. USF1 augments transcription initiating through the upstream start sites and is dependent on TAF1 (TAF(II)250), a finding consistent with its role in regulating basal class I transcription. In contrast, transcription activated by the interferon mediator CIITA is independent of TAF1 (TAF(II)250) and focuses initiation on the downstream start sites. Thus, basal and activated transcriptions of an MHC class I gene target distinct core promoter domains, nucleate distinct transcription initiation complexes and initiate at distinct sites within the promoter. We propose that transcription initiation at the core promoter is a dynamic process in which the mechanisms of core promoter function differ depending on the cellular environment.
Collapse
Affiliation(s)
- T Kevin Howcroft
- Experimental Immunology Branch, National Cancer Institute/NIH, Building 10, Room 4B-17, 10 Center Drive, MSC 1360, Bethesda, MD 20892-1360, USA.
| | | | | | | | | |
Collapse
|
50
|
Chen D, McKearin DM. A discrete transcriptional silencer in the bam gene determines asymmetric division of the Drosophila germline stem cell. Development 2003; 130:1159-70. [PMID: 12571107 DOI: 10.1242/dev.00325] [Citation(s) in RCA: 327] [Impact Index Per Article: 15.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
The Drosophila germline lineage depends on a complex microenvironment of extrinsic and intrinsic factors that regulate the self-renewing and asymmetric divisions of dedicated stem cells. Germline stem cells (GSCs) must express components of the Dpp cassette and the translational repressors Nanos and Pumilio, whereas cystoblasts require the bam and bgcn genes. Bam is especially attractive as a target of GSC differentiation factors because current evidence indicates that bam is both necessary and sufficient for cystoblast differentiation. In this paper, we have sought to distinguish between mutually exclusive transcriptional or post-transcriptional mechanisms as the primary regulators of bam expression in GSCs and cystoblasts. We find that bam transcription is active in young germ cells but is repressed specifically in GSCs. Activation depends on a 50 bp fragment that carries at least one germ cell-specific enhancer element. A non-overlapping 18 bp sequence carries a transcriptional silencer that prevents bam expression in the GSC. Promoters lacking this silencer cause bam expression in the GSC and concomitant GSC loss. Thus, asymmetry of the GSC division can be reduced to identifying the mechanism that selectively activates the silencer element in GSCs.
Collapse
Affiliation(s)
- Dahua Chen
- Department of Molecular Biology, University of Texas-Southwestern Medical Center, Dallas, TX 75390-9148, USA
| | | |
Collapse
|