1
|
Savinkova LK, Sharypova EB, Kolchanov NA. On the Role of TATA Boxes and TATA-Binding Protein in Arabidopsis thaliana. PLANTS (BASEL, SWITZERLAND) 2023; 12:1000. [PMID: 36903861 PMCID: PMC10005294 DOI: 10.3390/plants12051000] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/10/2022] [Revised: 01/13/2023] [Accepted: 02/20/2023] [Indexed: 06/18/2023]
Abstract
For transcription initiation by RNA polymerase II (Pol II), all eukaryotes require assembly of basal transcription machinery on the core promoter, a region located approximately in the locus spanning a transcription start site (-50; +50 bp). Although Pol II is a complex multi-subunit enzyme conserved among all eukaryotes, it cannot initiate transcription without the participation of many other proteins. Transcription initiation on TATA-containing promoters requires the assembly of the preinitiation complex; this process is triggered by an interaction of TATA-binding protein (TBP, a component of the general transcription factor TFIID (transcription factor II D)) with a TATA box. The interaction of TBP with various TATA boxes in plants, in particular Arabidopsis thaliana, has hardly been investigated, except for a few early studies that addressed the role of a TATA box and substitutions in it in plant transcription systems. This is despite the fact that the interaction of TBP with TATA boxes and their variants can be used to regulate transcription. In this review, we examine the roles of some general transcription factors in the assembly of the basal transcription complex, as well as functions of TATA boxes of the model plant A. thaliana. We review examples showing not only the involvement of TATA boxes in the initiation of transcription machinery assembly but also their indirect participation in plant adaptation to environmental conditions in responses to light and other phenomena. Examples of an influence of the expression levels of A. thaliana TBP1 and TBP2 on morphological traits of the plants are also examined. We summarize available functional data on these two early players that trigger the assembly of transcription machinery. This information will deepen the understanding of the mechanisms underlying transcription by Pol II in plants and will help to utilize the functions of the interaction of TBP with TATA boxes in practice.
Collapse
|
2
|
Senitzki A, Safieh J, Sharma V, Golovenko D, Danin-Poleg Y, Inga A, Haran TE. The complex architecture of p53 binding sites. Nucleic Acids Res 2021; 49:1364-1382. [PMID: 33444431 PMCID: PMC7897521 DOI: 10.1093/nar/gkaa1283] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2020] [Revised: 12/22/2020] [Accepted: 12/24/2020] [Indexed: 12/13/2022] Open
Abstract
Sequence-specific protein-DNA interactions are at the heart of the response of the tumor-suppressor p53 to numerous physiological and stress-related signals. Large variability has been previously reported in p53 binding to and transactivating from p53 response elements (REs) due, at least in part, to changes in direct (base) and indirect (shape) readouts of p53 REs. Here, we dissect p53 REs to decipher the mechanism by which p53 optimizes this highly regulated variable level of interaction with its DNA binding sites. We show that hemi-specific binding is more prevalent in p53 REs than previously envisioned. We reveal that sequences flanking the REs modulate p53 binding and activity and show that these effects extend to 4–5 bp from the REs. Moreover, we show here that the arrangement of p53 half-sites within its REs, relative to transcription direction, has been fine-tuned by selection pressure to optimize and regulate the response levels from p53 REs. This directionality in the REs arrangement is at least partly encoded in the structural properties of the REs. Furthermore, we show here that in the p21-5′ RE the orientation of the half-sites is such that the effect of the flanking sequences is minimized and we discuss its advantages.
Collapse
Affiliation(s)
- Alon Senitzki
- Department of Biology, Technion - Israel Institute of Technology, Technion City, Haifa 3200003, Israel
| | - Jessy Safieh
- Department of Biology, Technion - Israel Institute of Technology, Technion City, Haifa 3200003, Israel
| | - Vasundhara Sharma
- Department of Cellular, Computational and Integrative Biology (CIBIO), University of Trento, via Sommarive 9, 38123 Trento, TN, Italy
| | - Dmitrij Golovenko
- Department of Biology, Technion - Israel Institute of Technology, Technion City, Haifa 3200003, Israel
| | - Yael Danin-Poleg
- Department of Biology, Technion - Israel Institute of Technology, Technion City, Haifa 3200003, Israel
| | - Alberto Inga
- Department of Cellular, Computational and Integrative Biology (CIBIO), University of Trento, via Sommarive 9, 38123 Trento, TN, Italy
| | - Tali E Haran
- Department of Biology, Technion - Israel Institute of Technology, Technion City, Haifa 3200003, Israel
| |
Collapse
|
3
|
Ponomarenko M, Sharypova E, Drachkova I, Chadaeva I, Arkova O, Podkolodnaya O, Ponomarenko P, Kolchanov N, Savinkova L. Unannotated single nucleotide polymorphisms in the TATA box of erythropoiesis genes show in vitro positive involvements in cognitive and mental disorders. BMC MEDICAL GENETICS 2020; 21:165. [PMID: 33092544 PMCID: PMC7579878 DOI: 10.1186/s12881-020-01106-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/28/2020] [Accepted: 08/11/2020] [Indexed: 12/20/2022]
Abstract
BACKGROUND Hemoglobin is a tetramer consisting of two α-chains and two β-chains of globin. Hereditary aberrations in the synthesis of one of the globin chains are at the root of thalassemia, one of the most prevalent monogenic diseases worldwide. In humans, in addition to α- and β-globins, embryonic zeta-globin and fetal γ-globin are expressed. Immediately after birth, the expression of fetal Aγ- and Gγ-globin ceases, and then adult β-globin is mostly expressed. It has been shown that in addition to erythroid cells, hemoglobin is widely expressed in nonerythroid cells including neurons of the cortex, hippocampus, and cerebellum in rodents; embryonic and adult brain neurons in mice; and mesencephalic dopaminergic brain cells in humans, mice, and rats. Lately, there is growing evidence that different forms of anemia (changes in the number and quality of blood cells) may be involved in (or may accompany) the pathogenesis of various cognitive and mental disorders, such as Alzheimer's and Parkinson's diseases, depression of various severity levels, bipolar disorders, and schizophrenia. Higher hemoglobin concentrations in the blood may lead to hyperviscosity, hypovolemia, and lung diseases, which may cause brain hypoxia and anomalies of brain function, which may also result in cognitive deficits. METHODS In this study, a search for unannotated single-nucleotide polymorphisms (SNPs) of erythroid genes was initially performed using our previously created and published SNP-TATA_Z-tester, which is a Web service for computational analysis of a given SNP for in silico estimation of its influence on the affinity of TATA-binding protein (TBP) for TATA and TATA-like sequences. The obtained predictions were finally verified in vitro by an electrophoretic mobility shift assay (EMSA). RESULTS On the basis of these experimental in vitro results and literature data, we studied TATA box SNPs influencing both human erythropoiesis and cognitive abilities. For instance, TBP-TATA affinity in the HbZ promoter decreases 6.6-fold as a result of a substitution in the TATA box (rs113180943), thereby possibly disrupting stage-dependent events of "switching" of hemoglobin genes and thus causing erythroblastosis. Therefore, rs113180943 may be a candidate marker of severe hemoglobinopathies with comorbid cognitive and mental disorders associated with cerebral blood flow disturbances. CONCLUSIONS The literature data and experimental and computations results suggest that the uncovered candidate SNP markers of erythropoiesis anomalies may also be studied in cohorts of patients with cognitive and/or mental disorders with comorbid erythropoiesis diseases in comparison to conventionally healthy volunteers. Research into the regulatory mechanisms by which the identified SNP markers contribute to the development of hemoglobinopathies and of the associated cognitive deficits will allow physicians not only to take timely and adequate measures against hemoglobinopathies but also to implement strategies preventing cognitive and mental disorders.
Collapse
Affiliation(s)
- Mikhail Ponomarenko
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, 10 Lavrentyev Ave, Novosibirsk, 630090, Russia. .,Novosibirsk State University, 1 Pirogova Street, Novosibirsk, 630090, Russia.
| | - Ekaterina Sharypova
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, 10 Lavrentyev Ave, Novosibirsk, 630090, Russia
| | - Irina Drachkova
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, 10 Lavrentyev Ave, Novosibirsk, 630090, Russia
| | - Irina Chadaeva
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, 10 Lavrentyev Ave, Novosibirsk, 630090, Russia
| | - Olga Arkova
- Institute of Gene Biology Russian Academy of Sciences, 34/5 Vavilova Street, Moscow, 119334, Russia
| | - Olga Podkolodnaya
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, 10 Lavrentyev Ave, Novosibirsk, 630090, Russia
| | - Petr Ponomarenko
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, 10 Lavrentyev Ave, Novosibirsk, 630090, Russia
| | - Nikolay Kolchanov
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, 10 Lavrentyev Ave, Novosibirsk, 630090, Russia
| | - Ludmila Savinkova
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, 10 Lavrentyev Ave, Novosibirsk, 630090, Russia
| |
Collapse
|
4
|
Long L, Li X, Wei H, Li W. Features of the Influence of a DNA Sequence on Its Adjacent Sequence. ACS OMEGA 2020; 5:23631-23644. [PMID: 32984683 PMCID: PMC7512436 DOI: 10.1021/acsomega.0c02264] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/14/2020] [Accepted: 08/14/2020] [Indexed: 06/11/2023]
Abstract
To explore the features of the influence of a DNA sequence (here called sequence A) on its adjacent sequence (here called sequence B), we linked some DNA repeated sequences to the 5'-end of the T7 promoter in the plasmid pET-42a (+) or the 5'- and/or 3'-end(s) of the EcoRI site in some DNA fragments using PCR and other molecular cloning methods. As a result, we found that the efficiency of the T7 promoter and EcoRI could be impacted by some flanking sequences, indicating that sequence B could be impacted by sequence A. The features of such influence include the following: (i) sequence A can directly impact sequence B without changing/modifying the base composition of sequence B or destroying the inherent connection between sequence B and its function-related sequences; (ii) such influence does not need the participation of trans-acting factors or products of sequence A (if any); (iii) such an influence might be undetectable when the activities of trans-acting factors of sequence B are normal but might become detectable when those are lower than the normal one; (iv) such an influence might be enhancive, inhibitory, or unobvious; (v) the influence of sequence A linked to the 5'-end of sequence B might be the same as or opposite to that of sequence A linked to the 3'-end; and (vi) the influences of sequence A linked to different ends of sequence B could enhance or partially offset each other when sequence A is linked to both 5'- and 3'-ends of sequence B. These findings might give us a further understanding of the interaction of two adjacent DNA sequences.
Collapse
Affiliation(s)
- Lijuan Long
- Department
of Pediatrics, First Affiliated Hospital
of Guangxi Medical University, #6, Shuangyong Road, Nanning, 530021 Guangxi, China
| | - Xinxin Li
- Department
of Nuclear Medicine, First Affiliated Hospital
of Guangxi Medical University, #6, Shuangyong Road, Nanning, 530021 Guangxi, China
| | - Hailang Wei
- Medical
Scientific Research Center, Guangxi Medical
University, #22, Shuangyong Road, Nanning, 530021 Guangxi, China
| | - Wei Li
- Medical
Scientific Research Center, Guangxi Medical
University, #22, Shuangyong Road, Nanning, 530021 Guangxi, China
| |
Collapse
|
5
|
Etheve L, Martin J, Lavery R. Protein-DNA interfaces: a molecular dynamics analysis of time-dependent recognition processes for three transcription factors. Nucleic Acids Res 2016; 44:9990-10002. [PMID: 27658967 PMCID: PMC5175364 DOI: 10.1093/nar/gkw841] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2016] [Revised: 09/12/2016] [Accepted: 09/13/2016] [Indexed: 12/14/2022] Open
Abstract
We have studied the dynamics of three transcription factor-DNA complexes using all-atom, microsecond-scale MD simulations. In each case, the salt bridges and hydrogen bond interactions formed at the protein-DNA interface are found to be dynamic, with lifetimes typically in the range of tens to hundreds of picoseconds, although some interactions, notably those involving specific binding to DNA bases, can be a hundred times longer lived. Depending on the complex studied, this dynamics may or may not lead to the existence of distinct conformational substates. Using a sequence threading technique, it has been possible to determine whether DNA sequence recognition is sensitive or not to such conformational changes, and, in one case, to show that recognition appears to be locally dependent on protein-mediated cation distributions.
Collapse
Affiliation(s)
- Loïc Etheve
- MMSB UMR 5086 CNRS/University of Lyon I, Institut de Biologie et Chimie des Protéines, 7 passage du Vercors, Lyon 69367, France
| | - Juliette Martin
- MMSB UMR 5086 CNRS/University of Lyon I, Institut de Biologie et Chimie des Protéines, 7 passage du Vercors, Lyon 69367, France
| | - Richard Lavery
- MMSB UMR 5086 CNRS/University of Lyon I, Institut de Biologie et Chimie des Protéines, 7 passage du Vercors, Lyon 69367, France
| |
Collapse
|
6
|
Samanta S, Raghunathan D, Mukherjee S. Effect of temperature on the structure and hydration layer of TATA-box DNA: A molecular dynamics simulation study. J Mol Graph Model 2016; 66:9-19. [PMID: 27017424 DOI: 10.1016/j.jmgm.2016.03.005] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2015] [Revised: 03/17/2016] [Accepted: 03/21/2016] [Indexed: 01/25/2023]
Abstract
DNA within the living cells experiences a diverse range of temperature, ranging from freezing condition to hot spring water. How the structure, the mechanical properties of DNA, and the solvation dynamics around DNA changes with the temperature is important to understand the functionality of DNA under those acute temperature conditions. In that notion, we have carried out molecular dynamics simulations of a DNA oligomer, containing TATA-box sequence for three different temperatures (250K, 300K and 350K). We observed that the structure of the DNA, in terms of backbone torsion angles, sugar pucker, base pair parameters, and base pair step parameters, did not show any unusual properties within the studied range of temperatures, but significant structural alteration was noticed between BI and BII forms at higher temperature. As expected, the flexibility of the DNA, in terms of the torsional rigidity and the bending rigidity is highly temperature dependent, confirming that flexibility increases with increase in temperature. Additionally, the groove widths of the studied DNA showed temperature sensitivity, specifically, the major groove width decreases and the minor groove width increases, respectively, with the increase in temperature. We observed that at higher temperature, water around both the major and the minor groove of the DNA is less structured. However, the water dynamics around the minor groove of the DNA is more restricted as compared to the water around the major groove throughout the studied range of temperatures, without any anomalous behavior.
Collapse
Affiliation(s)
- Sudipta Samanta
- BioSystems and Micromechanics IRG (BioSyM), Singapore-MIT Alliance for Research and Technology (SMART), 1 Create Way, 117543, Republic of Singapore; Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.
| | - Devanathan Raghunathan
- Prochem Solutions Pte. Ltd., 89C Science Park Drive, The Rutherford, # 04-13, Singapore Science Park 1, 118261, Singapore
| | - Sanchita Mukherjee
- Indian Institute of Science Education and Research, Kolkata, Mohanpur, West Bengal, 741246, India
| |
Collapse
|
7
|
Singh V, Kumari B, Maity B, Seth D, Das P. Direct observation of preferential processing of clustered abasic DNA damages with APE1 in TATA box and CpG island by reaction kinetics and fluorescence dynamics. Mutat Res 2014; 766-767:56-65. [PMID: 25847273 DOI: 10.1016/j.mrfmmm.2014.06.002] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2014] [Revised: 05/16/2014] [Accepted: 06/16/2014] [Indexed: 06/04/2023]
Abstract
Sequences like the core element of TATA box and CpG island are frequently encountered in the genome and related to transcription. The fate of repair of clustered abasic sites in such sequences of genomic importance is largely unknown. This prompted us to investigate the sequence dependence of cleavage efficiency of APE1 enzyme at abasic sites within the core sequences of TATA box and CpG island using fluorescence dynamics and reaction kinetics. Simultaneous molecular dynamics study through steady state and time resolved fluorescence spectroscopy using unique ethidium bromide dye release assay confirmed an elevated amount of abasic site cleavage of the TATA box sequence as compared to the core CpG island. Reaction kinetics showed that catalytic efficiency of APE1 for abasic site cleavage of core CpG island sequence was ∼4 times lower as compared to that of the TATA box. Higher value of Km was obtained from the core CpG island sequence than the TATA box sequence. This suggests a greater binding effect of APE1 enzyme on TATA sequence that signifies a prominent role of the sequence context of the DNA substrate. Evidently, a faster response from APE1 was obtained for clustered abasic damage repair of TATA box core sequences than CpG island consensus sequences. The neighboring bases of the abasic sites in the complementary DNA strand were found to have significant contribution in addition to the flanking bases in modulating APE1 activity. The repair refractivity of the bistranded clustered abasic sites arise from the slow processing of the second abasic site, consequently resulting in decreased overall production of potentially lethal double strand breaks.
Collapse
Affiliation(s)
- Vandana Singh
- Department of Chemistry, Indian Institute of Technology Patna, Govt. Polytechnic Campus, Patliputra Colony, Patna 800013, Bihar, India
| | - Bhavini Kumari
- Department of Chemistry, Indian Institute of Technology Patna, Govt. Polytechnic Campus, Patliputra Colony, Patna 800013, Bihar, India
| | - Banibrata Maity
- Department of Chemistry, Indian Institute of Technology Patna, Govt. Polytechnic Campus, Patliputra Colony, Patna 800013, Bihar, India
| | - Debabrata Seth
- Department of Chemistry, Indian Institute of Technology Patna, Govt. Polytechnic Campus, Patliputra Colony, Patna 800013, Bihar, India
| | - Prolay Das
- Department of Chemistry, Indian Institute of Technology Patna, Govt. Polytechnic Campus, Patliputra Colony, Patna 800013, Bihar, India.
| |
Collapse
|
8
|
Hieb AR, Gansen A, Böhm V, Langowski J. The conformational state of the nucleosome entry-exit site modulates TATA box-specific TBP binding. Nucleic Acids Res 2014; 42:7561-76. [PMID: 24829456 PMCID: PMC4081063 DOI: 10.1093/nar/gku423] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
The TATA binding protein (TBP) is a critical transcription factor used for nucleating assembly of the RNA polymerase II machinery. TBP binds TATA box elements with high affinity and kinetic stability and in vivo is correlated with high levels of transcription activation. However, since most promoters use less stable TATA-less or TATA-like elements, while also competing with nucleosome occupancy, further mechanistic insight into TBP's DNA binding properties and ability to access chromatin is needed. Using bulk and single-molecule FRET, we find that TBP binds a minimal consensus TATA box as a two-state equilibrium process, showing no evidence for intermediate states. However, upon addition of flanking DNA sequence, we observe non-specific cooperative binding to multiple DNA sites that compete for TATA-box specificity. Thus, we conclude that TBP binding is defined by a branched pathway, wherein TBP initially binds with little sequence specificity and is thermodynamically positioned by its kinetic stability to the TATA box. Furthermore, we observed the real-time access of TBP binding to TATA box DNA located within the DNA entry–exit site of the nucleosome. From these data, we determined salt-dependent changes in the nucleosome conformation regulate TBP's access to the TATA box, where access is highly constrained under physiological conditions, but is alleviated by histone acetylation and TFIIA.
Collapse
Affiliation(s)
- Aaron R Hieb
- Division Biophysics of Macromolecules, German Cancer Research Center, D-69120 Heidelberg, Germany
| | - Alexander Gansen
- Division Biophysics of Macromolecules, German Cancer Research Center, D-69120 Heidelberg, Germany
| | - Vera Böhm
- Division Biophysics of Macromolecules, German Cancer Research Center, D-69120 Heidelberg, Germany
| | - Jörg Langowski
- Division Biophysics of Macromolecules, German Cancer Research Center, D-69120 Heidelberg, Germany
| |
Collapse
|
9
|
Real-Time Interaction between TBP and the TATA Box of the Human Triosephosphate Isomerase Gene Promoter in the Norm and Pathology. Acta Naturae 2014; 6:36-40. [PMID: 25093109 PMCID: PMC4115224] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022] Open
Abstract
The TATA-binding protein (TBP) is a key part of the transcription complex of RNA polymerase II. Alone or as a part of the basal transcription factor TFIID, TBP binds the TATA box located in the core region of the TATA-containing promoters of class II genes. Previously, we studied the effects of single nucleotide polymorphisms (SNPs) on TBP/TATA-box interactions using gel retardation assay. It was demonstrated that most SNPs in the TATA boxes of some human gene promoters cause a 2- to 4-fold decrease in TBP/TATA affinity, which is associated with an increased risk of hereditary diseases, such as β thalassemias of diverse severity, hemophilia B Leyden, myocardial infarction, thrombophlebitis, lung cancer, etc. In this work, the process of TBP/TATA complex formation has been studied in real time by a stopped-flow technique using recombinant human TBP and duplexes, which were identical to the TATA box of the wild-type and a SNP-containing triosephosphate isomerase gene promoter and were fluorescently labeled by the Cy3/Cy5 FRET pair. It has been demonstrated for the first time that real-time binding of TBP to the TATA box of the TPI gene promoter is complete within 10 s and is described by a single-stage kinetic model. The complex formation of TBP with the wild-type TATA box occurs 5.5 times faster and the complex dissociation occurs 31 times slower compared with the SNPcontaining TATA box. Within the first seconds of the interaction, TBP binds to and simultaneously bends the TATA box. Importantly, the TATA box of the wild-type TPI gene promoter requires lower TBP concentrations compared to the TATA box containing the -24T → G SNP, which is associated with neurological and muscular disorders, cardiomyopathy, and other diseases.
Collapse
|
10
|
Savinkova L, Drachkova I, Arshinova T, Ponomarenko P, Ponomarenko M, Kolchanov N. An experimental verification of the predicted effects of promoter TATA-box polymorphisms associated with human diseases on interactions between the TATA boxes and TATA-binding protein. PLoS One 2013; 8:e54626. [PMID: 23424617 PMCID: PMC3570547 DOI: 10.1371/journal.pone.0054626] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2012] [Accepted: 12/13/2012] [Indexed: 11/18/2022] Open
Abstract
Human genome sequencing has resulted in a great body of data, including a stunningly large number of single nucleotide polymorphisms (SNPs) with unknown phenotypic manifestations. Identification and comprehensive analysis of regulatory SNPs in human gene promoters will help quantify the effects of these SNPs on human health. Based on our experimental and computer-aided study of SNPs in TATA boxes and the use of literature data, we have derived an equation for TBP/TATA equilibrium binding in three successive steps: TATA-binding protein (TBP) sliding along DNA due to their nonspecific affinity for each other ↔ recognition of the TATA box ↔ stabilization of the TBP/TATA complex. Using this equation, we have analyzed TATA boxes containing SNPs associated with human diseases and made in silico predictions of changes in TBP/TATA affinity. An electrophoretic mobility shift assay (EMSA)-based experimental study performed under the most standardized conditions demonstrates that the experimentally measured values are highly correlated with the predicted values: the coefficient of linear correlation, r, was 0.822 at a significance level of α<10⁻⁷ for equilibrium K(D) values, (-ln K(D)), and 0.785 at a significance level of α<10⁻³ for changes in equilibrium K(D) (δ) due to SNPs in the TATA boxes (δ= -ln[K(D,TATAMut)]-(-ln[K(D,TATAMut)])). It has been demonstrated that the SNPs associated with increased risk of human diseases such as α-, β- and δ-thalassemia, myocardial infarction and thrombophlebitis, changes in immune response, amyotrophic lateral sclerosis, lung cancer and hemophilia B Leyden cause 2-4-fold changes in TBP/TATA affinity in most cases. The results obtained strongly suggest that the TBP/TATA equilibrium binding equation derived can be used for analysis of TATA-box sequences and identification of SNPs with a potential of being functionally important.
Collapse
Affiliation(s)
- Ludmila Savinkova
- Institute of Cytology and Genetics, Siberian Division, Russian Academy of Sciences, Novosibirsk, Russia.
| | | | | | | | | | | |
Collapse
|
11
|
Low-level p53 expression changes transactivation rules and reveals superactivating sequences. Proc Natl Acad Sci U S A 2012; 109:14387-92. [PMID: 22908277 DOI: 10.1073/pnas.1205971109] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023] Open
Abstract
Transcriptional activation by the tumor suppressor p53 is considered to depend on cellular level, although there are few systems where this dependence on cellular level of p53 has been directly addressed. Previously, we reported that transactivation from p53 targets was sensitive to both p53 amount and DNA sequence, with some sequences being responsive to much lower p53 levels than others when examined in yeast model systems or human cells. Because p53 is normally present at low levels and perturbations might lead to small increases, we examined transactivation under limiting p53. Unlike the positive relationship between transactivation and binding affinity from target sequences at high cellular levels of human p53 in yeast, no such relationship was found at low levels. However, transactivation in the yeast system and the torsional flexibility of target sequences were highly correlated, revealing a unique structural relationship between transcriptional function and sequence. Surprisingly, a few sequences supported high transactivation at low p53 levels in yeast or when transfected into human cells. On the basis of kinetic and flexibility analyses the "supertransactivation" property was due to low binding off rates of flexible target sites. Interestingly, a supertransactivation response element can differentiate transcriptional capacities of many breast cancer-associated p53 mutants. Overall, these studies, which are relevant to other transcription factors, address the extent to which transactivation properties of p53 target sequences are determined by their intrinsic physical properties and reveal unique rules of engagement of target sequences at low p53 levels.
Collapse
|
12
|
Gan Y, Guan J, Zhou S, Zhang W. Structural features based genome-wide characterization and prediction of nucleosome organization. BMC Bioinformatics 2012; 13:49. [PMID: 22449207 PMCID: PMC3378464 DOI: 10.1186/1471-2105-13-49] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2011] [Accepted: 03/26/2012] [Indexed: 11/24/2022] Open
Abstract
Background Nucleosome distribution along chromatin dictates genomic DNA accessibility and thus profoundly influences gene expression. However, the underlying mechanism of nucleosome formation remains elusive. Here, taking a structural perspective, we systematically explored nucleosome formation potential of genomic sequences and the effect on chromatin organization and gene expression in S. cerevisiae. Results We analyzed twelve structural features related to flexibility, curvature and energy of DNA sequences. The results showed that some structural features such as DNA denaturation, DNA-bending stiffness, Stacking energy, Z-DNA, Propeller twist and free energy, were highly correlated with in vitro and in vivo nucleosome occupancy. Specifically, they can be classified into two classes, one positively and the other negatively correlated with nucleosome occupancy. These two kinds of structural features facilitated nucleosome binding in centromere regions and repressed nucleosome formation in the promoter regions of protein-coding genes to mediate transcriptional regulation. Based on these analyses, we integrated all twelve structural features in a model to predict more accurately nucleosome occupancy in vivo than the existing methods that mainly depend on sequence compositional features. Furthermore, we developed a novel approach, named DLaNe, that located nucleosomes by detecting peaks of structural profiles, and built a meta predictor to integrate information from different structural features. As a comparison, we also constructed a hidden Markov model (HMM) to locate nucleosomes based on the profiles of these structural features. The result showed that the meta DLaNe and HMM-based method performed better than the existing methods, demonstrating the power of these structural features in predicting nucleosome positions. Conclusions Our analysis revealed that DNA structures significantly contribute to nucleosome organization and influence chromatin structure and gene expression regulation. The results indicated that our proposed methods are effective in predicting nucleosome occupancy and positions and that these structural features are highly predictive of nucleosome organization. The implementation of our DLaNe method based on structural features is available online.
Collapse
Affiliation(s)
- Yanglan Gan
- Department of Computer Science and Technology, Tongji University, Shanghai, China
| | | | | | | |
Collapse
|
13
|
A conserved GA element in TATA-less RNA polymerase II promoters. PLoS One 2011; 6:e27595. [PMID: 22110682 PMCID: PMC3217976 DOI: 10.1371/journal.pone.0027595] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2011] [Accepted: 10/20/2011] [Indexed: 11/19/2022] Open
Abstract
Initiation of RNA polymerase (Pol) II transcription requires assembly of the pre-initiation complex (PIC) at the promoter. In the classical view, PIC assembly starts with binding of the TATA box-binding protein (TBP) to the TATA box. However, a TATA box occurs in only 15% of promoters in the yeast Saccharomyces cerevisiae, posing the question how most yeast promoters nucleate PIC assembly. Here we show that one third of all yeast promoters contain a novel conserved DNA element, the GA element (GAE), that generally does not co-occur with the TATA box. The distance of the GAE to the transcription start site (TSS) resembles the distance of the TATA box to the TSS. The TATA-less TMT1 core promoter contains a GAE, recruits TBP, and supports formation of a TBP-TFIIB-DNA-complex. Mutation of the promoter region surrounding the GAE abolishes transcription in vivo and in vitro. A 32-nucleotide promoter region containing the GAE can functionally substitute for the TATA box in a TATA-containing promoter. This identifies the GAE as a conserved promoter element in TATA-less promoters.
Collapse
|
14
|
Drachkova IA, Ponomarenko PM, Arshinova TV, Ponomarenko МP, Suslov VV, Savinkova LK, Kolchanov NА. In vitro examining the existing prognoses how TBP binds to TATA with SNP associated with human diseases. Health (London) 2011. [DOI: 10.4236/health.2011.39099] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
15
|
Beno I, Rosenthal K, Levitine M, Shaulov L, Haran TE. Sequence-dependent cooperative binding of p53 to DNA targets and its relationship to the structural properties of the DNA targets. Nucleic Acids Res 2010; 39:1919-32. [PMID: 21071400 PMCID: PMC3061056 DOI: 10.1093/nar/gkq1044] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open
Abstract
The prime mechanism by which p53 acts as a tumor suppressor is as a transcription factor regulating the expression of diverse downstream genes. The DNA-binding domain of p53 (p53DBD) interacts with defined DNA sites and is the main target for mutations in human primary tumors. Here, we show that the CWWG motif, found in the center of each consensus p53 half-site, is a key player in p53/DNA interactions. Gel-mobility-shift assays provide a unique opportunity to directly observe the various oligomeric complexes formed between p53DBD and its target sites. We demonstrate that p53DBD binds to p53 consensus sites containing CATG with relatively low cooperativity, as both dimers and tetramers, and with even lower cooperativity to such sites containing spacer sequences. p53DBD binds to sites containing CAAG and CTAG with measurable affinity only when imbedded in two contiguous p53 half-sites and only as tetramers (with very high cooperativity). There are three orders-of-magnitude difference in the cooperativity of interaction between sites differing in their non-contacted step, and further two orders-of-magnitude difference as a function of spacer sequences. By experimentally measuring the global structural properties of these sites, by cyclization kinetics of DNA minicircles, we correlate these differences with the torsional flexibility of the binding sites.
Collapse
Affiliation(s)
- Itai Beno
- Department of Biology, Technion, Technion City, Haifa 32000, Israel
| | | | | | | | | |
Collapse
|
16
|
Samanta S, Chakrabarti J, Bhattacharyya D. Changes in Thermodynamic Properties of DNA Base Pairs in Protein-DNA Recognition. J Biomol Struct Dyn 2010; 27:429-42. [DOI: 10.1080/07391102.2010.10507328] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
|
17
|
Ponomarenko PM, Ponomarenko MP, Drachkova IA, Lysova MV, Arshinova TV, Savinkova LK, Kolchanov NA. Prediction of the affinity of the TATA-binding protein to TATA boxes with single nucleotide polymorphisms. Mol Biol 2009. [DOI: 10.1134/s0026893309030157] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
18
|
Abstract
AbstractShort runs of adenines are a ubiquitous DNA element in regulatory regions of many organisms. When runs of 4–6 adenine base pairs (‘A-tracts’) are repeated with the helical periodicity, they give rise to global curvature of the DNA double helix, which can be macroscopically characterized by anomalously slow migration on polyacrylamide gels. The molecular structure of these DNA tracts is unusual and distinct from that of canonical B-DNA. We review here our current knowledge about the molecular details of A-tract structure and its interaction with sequences flanking them of either side and with the environment. Various molecular models were proposed to describe A-tract structure and how it causes global deflection of the DNA helical axis. We review old and recent findings that enable us to amalgamate the various findings to one model that conforms to the experimental data. Sequences containing phased repeats of A-tracts have from the very beginning been synonymous with global intrinsic DNA bending. In this review, we show that very often it is the unique structure of A-tracts that is at the basis of their widespread occurrence in regulatory regions of many organisms. Thus, the biological importance of A-tracts may often be residing in their distinct structure rather than in the global curvature that they induce on sequences containing them.
Collapse
|
19
|
González-Bulnes L, Gallego J. Indirect Effects Modulating the Interaction between DNA and a Cytotoxic Bisnaphthalimide Reveal a Two-Step Binding Process. J Am Chem Soc 2009; 131:7781-91. [DOI: 10.1021/ja901505p] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]
Affiliation(s)
- Luis González-Bulnes
- Centro de Investigación Príncipe Felipe, Avda. Autopista del Saler 16, 46012 Valencia, Spain
| | - José Gallego
- Centro de Investigación Príncipe Felipe, Avda. Autopista del Saler 16, 46012 Valencia, Spain
| |
Collapse
|
20
|
Savinkova LK, Ponomarenko MP, Ponomarenko PM, Drachkova IA, Lysova MV, Arshinova TV, Kolchanov NA. TATA box polymorphisms in human gene promoters and associated hereditary pathologies. BIOCHEMISTRY (MOSCOW) 2009; 74:117-29. [PMID: 19267666 DOI: 10.1134/s0006297909020011] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
TATA-binding protein (TBP) is the first basal factor that recognizes and binds a TATA box on TATA-containing gene promoters transcribed by RNA polymerase II. Data available in the literature are indicative of admissible variability of the TATA box. The TATA box flanking sequences can influence TBP affinity as well as the level of basal and activated transcription. The possibility of mediated involvement in in vivo gene expression regulation of the TBP interactions with variant TATA boxes is supported by data on TATA box polymorphisms and associated human hereditary pathologies. A table containing data on TATA element polymorphisms in human gene promoters (about 40 mutations have been described), associated with particular pathologies, their short functional characteristics, and manifestation mechanisms of TATA-box SNPs is presented. Four classes of polymorphisms are considered: TATA box polymorphisms that weaken and enhance promoter, polymorphisms causing TATA box emergence and disappearance, and human virus TATA box polymorphisms. The described examples are indicative of the polymorphism-associated severe pathologies like thalassemia, the increased risk of hepatocellular carcinoma, sensitivity to H. pylori infection, oral cavity and lung cancers, arterial hypertension, etc.
Collapse
Affiliation(s)
- L K Savinkova
- Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia.
| | | | | | | | | | | | | |
Collapse
|
21
|
Regulation of CART peptide expression by CREB in the rat nucleus accumbens in vivo. Brain Res 2008; 1251:42-52. [PMID: 19046951 DOI: 10.1016/j.brainres.2008.11.011] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2008] [Revised: 10/22/2008] [Accepted: 11/01/2008] [Indexed: 11/20/2022]
Abstract
Production of mRNA from the cocaine- and amphetamine-regulated transcript (CART) gene is regulated by cocaine and other drugs of abuse in the nucleus accumbens (NAc), a brain reward region. Current hypotheses postulate that CART peptides there oppose the rewarding actions of cocaine by opposing the effects of dopaminergic transmission. Since over expression of CREB was shown to decrease cocaine-mediated reward, we hypothesized that CART could be a target gene for CREB in the NAc and that over expression of CREB would increase CART peptide levels. Transcription factor (TF) binding to DNA is influenced by sequences adjacent to consensus TF binding sites and other factors. We thus examined CREB binding to a 27mer oligonucleotide containing the CRE sequence from the CART gene proximal promoter. Using electrophoretic mobility shift assays and TF-antibody super shift assays, CREB was found to bind to the CRE sequence from the CART promoter. To test if over expression of CREB in the NAc affected CART peptide levels, Herpes simplex virus-1 vectors over expressing CREB (HSV-CREB), or a vector that expressed LacZ (HSV-LacZ) as a control, were injected into the NAc of rats. Western blotting and in situ hybridization showed that HSV-CREB injections increased CART mRNA and peptide levels. Injections of a dominant negative CREB mutant (HSV-mCREB) did not alter either CART mRNA or peptide levels. The finding that CREB can regulate the levels of CART mRNA and peptides in vivo in the NAc supports a role for CART peptides in psychostimulant-induced reward and reinforcement.
Collapse
|
22
|
Abrahem A, Pelchat M. Formation of an RNA polymerase II preinitiation complex on an RNA promoter derived from the hepatitis delta virus RNA genome. Nucleic Acids Res 2008; 36:5201-11. [PMID: 18682525 PMCID: PMC2532721 DOI: 10.1093/nar/gkn501] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023] Open
Abstract
Although RNA polymerases (RNAPs) are able to use RNA as template, it is unknown how they recognize RNA promoters. In this study, we used an RNA fragment derived from the hepatitis delta virus (HDV) genome as a model to investigate the recognition of RNA promoters by RNAP II. Inhibition of the transcription reaction using an antibody specific to the largest subunit of RNAP II and the direct binding of purified RNAP II to the RNA promoter confirmed the involvement of RNAP II in the reaction. RNA affinity chromatography established that an active RNAP II preinitiation complex forms on the RNA promoter and indicated that this complex contains the core RNAP II subunit and the general transcription factors TFIIA, TFIIB, TFIID, TFIIE, TFIIF, TFIIH and TFIIS. Binding assays demonstrated the direct binding of the TATA-binding protein and suggested that this protein is required to nucleate the RNAP II complex on the RNA promoter. Our findings provide a better understanding of the events leading to RNA promoter recognition by RNAP II.
Collapse
Affiliation(s)
- Abrahem Abrahem
- Department of Biochemistry, Microbiology and Immunology, Faculty of Medicine, University of Ottawa, Ottawa, Ontario, K1H 8M5, Canada
| | | |
Collapse
|
23
|
Sprouse RO, Shcherbakova I, Cheng H, Jamison E, Brenowitz M, Auble DT. Function and structural organization of Mot1 bound to a natural target promoter. J Biol Chem 2008; 283:24935-48. [PMID: 18606810 DOI: 10.1074/jbc.m803749200] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Mot1 is an essential, conserved TATA-binding protein (TBP)-associated factor in Saccharomyces cerevisiae and a member of the Snf2/Swi2 ATPase family. Mot1 uses ATP hydrolysis to displace TBP from DNA, an activity that can be readily reconciled with its global role in gene repression. Less well understood is how Mot1 directly activates gene expression. It has been suggested that Mot1-mediated activation can occur by displacement of inactive TBP-containing complexes from promoters, thereby permitting assembly of functional transcription complexes. Mot1 may also activate transcription by other mechanisms that have not yet been defined. A gap in our understanding has been the absence of biochemical information related to the activity of Mot1 on natural target genes. Using URA1 as a model Mot1-activated promoter, we show striking differences in the way that both TBP and Mot1 interact with DNA compared with other model DNA substrates analyzed previously. These differences are due at least in part to the propensity of TBP alone to bind to the URA1 promoter in the wrong orientation to direct appropriate assembly of the URA1 preinitiation complex. The results suggest that Mot1-mediated activation of URA1 transcription involves at least two steps, one of which is the removal of TBP bound to the promoter in the opposite orientation required for URA1 transcription.
Collapse
Affiliation(s)
- Rebekka O Sprouse
- Department of Biochemistry and Molecular Genetics, University of Virginia Health System, Charlottesville, Virginia 22908, USA
| | | | | | | | | | | |
Collapse
|
24
|
Coffin SR, Reich NO. Modulation of Escherichia coli DNA methyltransferase activity by biologically derived GATC-flanking sequences. J Biol Chem 2008; 283:20106-16. [PMID: 18502761 DOI: 10.1074/jbc.m802502200] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Escherichia coli DNA adenine methyltransferase (EcoDam) methylates the N-6 position of the adenine in the sequence 5'-GATC-3' and plays vital roles in gene regulation, mismatch repair, and DNA replication. It remains unclear how the small number of critical GATC sites involved in the regulation of replication and gene expression are differentially methylated, whereas the approximately 20,000 GATCs important for mismatch repair and dispersed throughout the genome are extensively methylated. Our prior work, limited to the pap regulon, showed that methylation efficiency is controlled by sequences immediately flanking the GATC sites. We extend these studies to include GATC sites involved in diverse gene regulatory and DNA replication pathways as well as sites previously shown to undergo differential in vivo methylation but whose function remains to be assigned. EcoDam shows no change in affinity with variations in flanking sequences derived from these sources, but methylation kinetics varied 12-fold. A-tracts immediately adjacent to the GATC site contribute significantly to these differences in methylation kinetics. Interestingly, only when the poly(A) is located 5' of the GATC are the changes in methylation kinetics revealed. Preferential methylation is obscured when two GATC sites are positioned on the same DNA molecule, unless both sites are surrounded by large amounts of nonspecific DNA. Thus, facilitated diffusion and sequences immediately flanking target sites contribute to higher order specificity for EcoDam; we suggest that the diverse biological roles of the enzyme are in part regulated by these two factors, which may be important for other enzymes that sequence-specifically modify DNA.
Collapse
Affiliation(s)
- Stephanie R Coffin
- Department of Chemistry and Biochemistry, University of California, Santa Barbara, CA 93106-9510, USA
| | | |
Collapse
|
25
|
Bjornsdottir G, Myers LC. Minimal components of the RNA polymerase II transcription apparatus determine the consensus TATA box. Nucleic Acids Res 2008; 36:2906-16. [PMID: 18385157 PMCID: PMC2396422 DOI: 10.1093/nar/gkn130] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
In Saccharomyces cerevisiae, multiple approaches have arrived at a consensus TATA box sequence of TATA(T/A)A(A/T)(A/G). TATA-binding protein (TBP) affinity alone does not determine TATA box function. To discover how a minimal set of factors required for basal and activated transcription contributed to the sequence requirements for a functional TATA box, we performed transcription reactions using highly purified proteins and CYC1 promoter TATA box mutants. The TATA box consensus sequence is a good predictor of promoter activity. However, several nonconsensus sequences are almost fully functional, indicating that mechanistic requirements are not the only selective pressure on the TATA box. We also found that the effect of a mutation at a certain position is often dependent on other bases within a particular TATA box. Although activators and coactivators strongly influence TBP recruitment and stability at promoters, neither Mediator, the activator Gal4-V16, nor TFIID specifically compensate for the low transcription levels of the weak TATA boxes. The addition of Mediator to purified transcription reactions did, however, increase the functional selectivity for certain consensus TATA sequences. Transcription in whole-cell extracts or in vivo with these TATA box mutants indicated that factors, other than those in our purified system, may help initiate transcription from weak TATA boxes.
Collapse
Affiliation(s)
- Gudrun Bjornsdottir
- Department of Biochemistry, Dartmouth Medical School, Hanover, NH 03755, USA
| | | |
Collapse
|
26
|
Abeel T, Saeys Y, Bonnet E, Rouzé P, Van de Peer Y. Generic eukaryotic core promoter prediction using structural features of DNA. Genes Dev 2008; 18:310-23. [PMID: 18096745 PMCID: PMC2203629 DOI: 10.1101/gr.6991408] [Citation(s) in RCA: 133] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2007] [Accepted: 11/14/2007] [Indexed: 11/24/2022]
Abstract
Despite many recent efforts, in silico identification of promoter regions is still in its infancy. However, the accurate identification and delineation of promoter regions is important for several reasons, such as improving genome annotation and devising experiments to study and understand transcriptional regulation. Current methods to identify the core region of promoters require large amounts of high-quality training data and often behave like black box models that output predictions that are difficult to interpret. Here, we present a novel approach for predicting promoters in whole-genome sequences by using large-scale structural properties of DNA. Our technique requires no training, is applicable to many eukaryotic genomes, and performs extremely well in comparison with the best available promoter prediction programs. Moreover, it is fast, simple in design, and has no size constraints, and the results are easily interpretable. We compared our approach with 14 current state-of-the-art implementations using human gene and transcription start site data and analyzed the ENCODE region in more detail. We also validated our method on 12 additional eukaryotic genomes, including vertebrates, invertebrates, plants, fungi, and protists.
Collapse
Affiliation(s)
- Thomas Abeel
- Department of Plant Systems Biology, Flanders Institute for Biotechnology (VIB), 9052 Gent, Belgium
- Department of Molecular Genetics, Ghent University, 9052 Gent, Belgium
| | - Yvan Saeys
- Department of Plant Systems Biology, Flanders Institute for Biotechnology (VIB), 9052 Gent, Belgium
- Department of Molecular Genetics, Ghent University, 9052 Gent, Belgium
| | - Eric Bonnet
- Department of Plant Systems Biology, Flanders Institute for Biotechnology (VIB), 9052 Gent, Belgium
- Department of Molecular Genetics, Ghent University, 9052 Gent, Belgium
| | - Pierre Rouzé
- Department of Plant Systems Biology, Flanders Institute for Biotechnology (VIB), 9052 Gent, Belgium
- Department of Molecular Genetics, Ghent University, 9052 Gent, Belgium
- Laboratoire Associé de l’INRA (France), Ghent University, 9052 Gent, Belgium
| | - Yves Van de Peer
- Department of Plant Systems Biology, Flanders Institute for Biotechnology (VIB), 9052 Gent, Belgium
- Department of Molecular Genetics, Ghent University, 9052 Gent, Belgium
| |
Collapse
|
27
|
Monovalent cation binding by curved DNA molecules containing variable numbers of a-tracts. Biophys J 2007; 94:1719-25. [PMID: 17993492 DOI: 10.1529/biophysj.107.121236] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Monovalent cation binding by DNA A-tracts, runs of four or more contiguous adenine or thymine residues, has been determined for two curved approximately 200 basepair (bp) restriction fragments, one taken from the M13 origin of replication and the other from the VP1 gene of SV40. These two fragments have previously been shown to contain stable, centrally located bends of 44 degrees and 46 degrees , respectively, located within approximately 60 bp "curvature modules" containing four or five irregularly spaced A-tracts. Transient electric birefringence measurements of these two fragments, sequence variants containing reduced numbers of A-tracts in the SV40 curvature module or changes in the residues flanking the A-tracts in the M13 curvature module, have been combined with the free solution electrophoretic mobilities of the same fragments using known equations to estimate the effective charge of each fragment. The effective charge is reduced, on average, by one-third charge for each A-tract in the curvature module, suggesting that each A-tract binds a monovalent cation approximately one-third of the time. Monovalent cation binding to two or more A-tracts is required to observe significant curvature of the DNA helix axis.
Collapse
|
28
|
Coker JA, DasSarma S. Genetic and transcriptomic analysis of transcription factor genes in the model halophilic Archaeon: coordinate action of TbpD and TfbA. BMC Genet 2007; 8:61. [PMID: 17892563 PMCID: PMC2121645 DOI: 10.1186/1471-2156-8-61] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2007] [Accepted: 09/24/2007] [Indexed: 11/10/2022] Open
Abstract
Background Archaea are prokaryotic organisms with simplified versions of eukaryotic transcription systems. Genes coding for the general transcription factors TBP and TFB are present in multiple copies in several Archaea, including Halobacterium sp. NRC-1. Multiple TBP and TFBs have been proposed to participate in transcription of genes via recognition and recruitment of RNA polymerase to different classes of promoters. Results We attempted to knock out all six TBP and seven TFB genes in Halobacterium sp. NRC-1 using the ura3-based gene deletion system. Knockouts were obtained for six out of thirteen genes, tbpCDF and tfbACG, indicating that they are not essential for cell viability under standard conditions. Screening of a population of 1,000 candidate mutants showed that genes which did not yield mutants contained less that 0.1% knockouts, strongly suggesting that they are essential. The transcriptomes of two mutants, ΔtbpD and ΔtfbA, were compared to the parental strain and showed coordinate down regulation of many genes. Over 500 out of 2,677 total genes were regulated in the ΔtbpD and ΔtfbA mutants with 363 regulated in both, indicating that over 10% of genes in both strains require the action of both TbpD and TfbA for normal transcription. Culturing studies on the ΔtbpD and ΔtfbA mutant strains showed them to grow more slowly than the wild-type at an elevated temperature, 49°C, and they showed reduced viability at 56°C, suggesting TbpD and TfbA are involved in the heat shock response. Alignment of TBP and TFB protein sequences suggested the expansion of the TBP gene family, especially in Halobacterium sp. NRC-1, and TFB gene family in representatives of five different genera of haloarchaea in which genome sequences are available. Conclusion Six of thirteen TBP and TFB genes of Halobacterium sp. NRC-1 are non-essential under standard growth conditions. TbpD and TfbA coordinate the expression of over 10% of the genes in the NRC-1 genome. The ΔtbpD and ΔtfbA mutant strains are temperature sensitive, possibly as a result of down regulation of heat shock genes. Sequence alignments suggest the existence of several families of TBP and TFB transcription factors in Halobacterium which may function in transcription of different classes of genes.
Collapse
Affiliation(s)
- James A Coker
- University of Maryland Biotechnology Institute, Center of Marine Biotechnology, 701 East Pratt Street, Baltimore, MD 21202, USA
| | - Shiladitya DasSarma
- University of Maryland Biotechnology Institute, Center of Marine Biotechnology, 701 East Pratt Street, Baltimore, MD 21202, USA
| |
Collapse
|
29
|
Faiger H, Ivanchenko M, Haran TE. Nearest-neighbor non-additivity versus long-range non-additivity in TATA-box structure and its implications for TBP-binding mechanism. Nucleic Acids Res 2007; 35:4409-19. [PMID: 17576671 PMCID: PMC1935006 DOI: 10.1093/nar/gkm451] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
TBP recognizes its target sites, TATA boxes, by recognizing their sequence-dependent structure and flexibility. Studying this mode of TATA-box recognition, termed ‘indirect readout’, is important for elucidating the binding mechanism in this system, as well as for developing methods to locate new binding sites in genomic DNA. We determined the binding stability and TBP-induced TATA-box bending for consensus-like TATA boxes. In addition, we calculated the individual information score of all studied sequences. We show that various non-additive effects exist in TATA boxes, dependent on their structural properties. By several criterions, we divide TATA boxes to two main groups. The first group contains sequences with 3–4 consecutive adenines. Sequences in this group have a rigid context-independent cooperative structure, best described by a nearest-neighbor non-additive model. Sequences in the second group have a flexible, context-dependent conformation, which cannot be described by an additive model or by a nearest-neighbor non-additive model. Classifying TATA boxes by these and other structural rules clarifies the different recognition pathways and binding mechanisms used by TBP upon binding to different TATA boxes. We discuss the structural and evolutionary sources of the difficulties in predicting new binding sites by probabilistic weight-matrix methods for proteins in which indirect readout is dominant.
Collapse
Affiliation(s)
| | | | - Tali E. Haran
- *To whom correspondence should be addressed. 972 4 8293767972 4 8225153
| |
Collapse
|
30
|
Hsieh TY, Shiu TY, Huang SM, Lin HH, Lee TC, Chen PJ, Chu HC, Chang WK, Jeng KS, Lai MMC, Chao YC. Molecular pathogenesis of Gilbert's syndrome: decreased TATA-binding protein binding affinity of UGT1A1 gene promoter. Pharmacogenet Genomics 2007; 17:229-36. [PMID: 17496722 DOI: 10.1097/fpc.0b013e328012d0da] [Citation(s) in RCA: 55] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
OBJECTIVES Gilbert's syndrome is a congenital, nonhemolytic, unconjugated hyperbilirubinemia. The most common genotype of Gilbert's syndrome is the homozygous polymorphism, A(TA)7TAA, in the promoter of the gene for UDP-glucuronosyltransferase 1A1 (UGT1A1), with a thymine adenine insertion in the TATA-box-like sequence, which results in a decrease in UGT1A1 activity. The mechanism responsible for this decrease in UGT1A1 activity, however, has not been elucidated. To clarify the mechanism underlying this deficiency in UGT1A1 activity in patients with Gilbert's syndrome. METHODS The promoter activity assay using the wild-type A(TA)6TAA or the mutant A(TA)7TAA promoter and a luciferase reporter was performed in two different hepatoma cell lines. The binding affinity for a nuclear protein complex or for TATA-binding protein was evaluated by a competitive electophoretic mobility shift assay using wild-type or mutant TATA-box-like oligonucleotide probes and nuclear extract or TATA-binding protein. The formation of complexes between TATA-binding protein and wild-type or mutant oligonucleotide probes was also studied by a quantitive electophoretic mobility shift assay. RESULTS A TA insertion in the TATA-box-like sequence of the promoter activity of UGT1A1 gene. A competitive electrophoretic mobility shift assay showed a decrease in nuclear protein complex binding affinity and TATA-binding protein binding affinity of the mutant TATA-box-like sequence A(TA)7TAA. When the mutants A(TA)5TAA and A(TA)8TAA were also compared, quantitative electrophoretic mobility shift assay demonstrated that the TATA-binding protein binding affinity progressively decreased as the number of TA repeats in the TATA-box-like sequence increased. CONCLUSION TA insertion in the TATA-box-like sequence of the UGT1A1 promoter affected its binding affinity for TATA-binding protein, causing a decrease in its activity. This explains the pathogenesis of Gilbert's syndrome.
Collapse
Affiliation(s)
- Tsai-Yuan Hsieh
- Department of Internal Medicine, Tri-Service General Hospital, Taipei, Taiwan, ROC
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
31
|
Tsihlis ND, Grove A. The Saccharomyces cerevisiae RNA polymerase III recruitment factor subunits Brf1 and Bdp1 impose a strict sequence preference for the downstream half of the TATA box. Nucleic Acids Res 2006; 34:5585-93. [PMID: 17028095 PMCID: PMC1636458 DOI: 10.1093/nar/gkl534] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open
Abstract
Association of the TATA-binding protein (TBP) with its cognate site within eukaryotic promoters is key to accurate and efficient transcriptional initiation. To achieve recruitment of Saccharomyces cerevisiae RNA polymerase III, TBP is associated with two additional factors, Brf1 and Bdp1, to form the initiation factor TFIIIB. Previous data have suggested that the structure or dynamics of the TBP–DNA complex may be altered upon entry of Brf1 and Bdp1 into the complex. We show here, using the altered specificity TBP mutant TBPm3 and an iterative in vitro selection assay, that entry of Brf1 and Bdp1 into the complex imposes a strict sequence preference for the downstream half of the TATA box. Notably, the selected sequence (TGTAAATA) is a perfect match to the TATA box of the RNA polymerase III-transcribed U6 small nuclear RNA (SNR6) gene. We suggest that the selected T•A base pair step at the downstream end of the 8 bp TBP site may provide a DNA flexure that promotes TFIIIB-DNA complex formation.
Collapse
Affiliation(s)
| | - Anne Grove
- To whom correspondence should be addressed. Tel: +1 225 578 5148; Fax: +1 225 578 8790;
| |
Collapse
|
32
|
Abstract
The conformational deformability of nucleic acids can influence their function and recognition by proteins. A class of DNA binding proteins including the TATA box binding protein binds to the DNA minor groove, resulting in an opening of the minor groove and DNA bending toward the major groove. Explicit solvent molecular dynamics simulations in combination with the umbrella sampling approach have been performed to investigate the molecular mechanism of DNA minor groove deformations and the indirect energetic contribution to protein binding. As a reaction coordinate, the distance between backbone segments on opposite strands was used. The resulting deformed structures showed close agreement with experimental DNA structures in complex with minor groove-binding proteins. The calculated free energy of minor groove deformation was approximately 4-6 kcal mol(-1) in the case of a central TATATA sequence. A smaller equilibrium minor groove width and more restricted minor groove mobility was found for the central AAATTT and also a significantly ( approximately 2 times) larger free energy change for opening the minor groove. The helical parameter analysis of trajectories indicates that an easier partial unstacking of a central TA versus AT basepair step is a likely reason for the larger groove flexibility of the central TATATA case.
Collapse
Affiliation(s)
- Martin Zacharias
- School of Engineering and Science, International University Bremen, D-28759 Bremen, Germany.
| |
Collapse
|