1
|
Sato K, Knipscheer P. G-quadruplex resolution: From molecular mechanisms to physiological relevance. DNA Repair (Amst) 2023; 130:103552. [PMID: 37572578 DOI: 10.1016/j.dnarep.2023.103552] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2023] [Revised: 07/29/2023] [Accepted: 08/01/2023] [Indexed: 08/14/2023]
Abstract
Guanine-rich DNA sequences can fold into stable four-stranded structures called G-quadruplexes or G4s. Research in the past decade demonstrated that G4 structures are widespread in the genome and prevalent in regulatory regions of actively transcribed genes. The formation of G4s has been tightly linked to important biological processes including regulation of gene expression and genome maintenance. However, they can also pose a serious threat to genome integrity especially by impeding DNA replication, and G4-associated somatic mutations have been found accumulated in the cancer genomes. Specialised DNA helicases and single stranded DNA binding proteins that can resolve G4 structures play a crucial role in preventing genome instability. The large variety of G4 unfolding proteins suggest the presence of multiple G4 resolution mechanisms in cells. Recently, there has been considerable progress in our detailed understanding of how G4s are resolved, especially during DNA replication. In this review, we first discuss the current knowledge of the genomic G4 landscapes and the impact of G4 structures on DNA replication and genome integrity. We then describe the recent progress on the mechanisms that resolve G4 structures and their physiological relevance. Finally, we discuss therapeutic opportunities to target G4 structures.
Collapse
Affiliation(s)
- Koichi Sato
- Oncode Institute, Hubrecht Institute-KNAW & University Medical Center Utrecht, Utrecht, the Netherlands.
| | - Puck Knipscheer
- Oncode Institute, Hubrecht Institute-KNAW & University Medical Center Utrecht, Utrecht, the Netherlands; Department of Human Genetics, Leiden University Medical Center, Leiden, the Netherlands.
| |
Collapse
|
2
|
Deng Z, Zhang Y, Gao C, Shen W, Wang S, Ni X, Liu S, Li X. A transposon-introduced G-quadruplex motif is selectively retained and constrained to downregulate CYP321A1. INSECT SCIENCE 2022; 29:1629-1642. [PMID: 35226400 DOI: 10.1111/1744-7917.13021] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/13/2021] [Revised: 01/18/2022] [Accepted: 01/28/2022] [Indexed: 06/14/2023]
Abstract
Insects utilize xenobiotic compounds to up- and downregulate cytochrome P450 monooxygenases (P450s) involved in detoxification of toxic xenobiotics including phytochemicals and pesticides. G-quadruplexes (G4)-forming DNA motifs are enriched in the promoter regions of transcription factors and function as cis-acting elements to regulate these genes. Whether and how P450s gain and keep G4 DNA motifs to regulate their expression still remain unexplored. Here, we show that CYP321A1, a xenobiotic-metabolizing P450 from Helicoverpa zea, a polyphagous insect of economic importance, has acquired and preserved a G4 DNA motif by selectively retaining a transposon known as HzIS1-3 that carries this G4 DNA motif in its promoter region. The HzIS1-3 G4 DNA motif acts as a silencer to suppress the constitutive and induced expression of CYP321A1 by plant allelochemicals flavone and xanthotoxin through folding into an intramolecular parallel or hybrid-1 conformation in the absence or presence of K+ . The G4 ligand N-methylmesoporphyrin IX (NMM) strengthens the silencing effect of HzIS1-3 G4 DNA motif by switching its structure from hybrid-1 to hybrid-2. The enrichment of transposons in P450s and other environment-adaptation genes implies that selective retention of G4 DNA motif-carrying transposons may be the main evolutionary route for these genes to obtain G4 DNA motifs.
Collapse
Affiliation(s)
- Zhongyuan Deng
- School of Agricultural Sciences, Zhengzhou University, Zhengzhou, China
- Department of Entomology and BIO5 Institute, University of Arizona, Tucson, AZ, USA
| | - Yuting Zhang
- School of Agricultural Sciences, Zhengzhou University, Zhengzhou, China
| | - Chao Gao
- State Key Laboratory of Agricultural Microbiology, Huazhong Agricultural University, Wuhan, China
| | - Wei Shen
- College of Science, Huazhong Agricultural University, Wuhan, China
| | - Shan Wang
- School of Agricultural Sciences, Zhengzhou University, Zhengzhou, China
| | - Xinzhi Ni
- USDA-ARS, Crop Genetics and Breeding Research Unit, University of Georgia, Tifton Campus, Tifton, GA, USA
| | - Sisi Liu
- State Key Laboratory of Agricultural Microbiology, Huazhong Agricultural University, Wuhan, China
- College of Science, Huazhong Agricultural University, Wuhan, China
| | - Xianchun Li
- Department of Entomology and BIO5 Institute, University of Arizona, Tucson, AZ, USA
| |
Collapse
|
3
|
Genome-wide analysis of DNA G-quadruplex motifs across 37 species provides insights into G4 evolution. Commun Biol 2021; 4:98. [PMID: 33483610 PMCID: PMC7822830 DOI: 10.1038/s42003-020-01643-4] [Citation(s) in RCA: 39] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2019] [Accepted: 12/29/2020] [Indexed: 01/30/2023] Open
Abstract
G-quadruplex (G4) structures have been predicted in the genomes of many organisms and proven to play regulatory roles in diverse cellular activities. However, there is little information on the evolutionary history and distribution characteristics of G4s. Here, whole-genome characteristics of potential G4s were studied in 37 evolutionarily representative species. During evolution, the number, length, and density of G4s generally increased. Immunofluorescence in seven species confirmed G4s' presence and evolutionary pattern. G4s tended to cluster in chromosomes and were enriched in genetic regions. Short-loop G4s were conserved in most species, while loop-length diversity also existed, especially in mammals. The proportion of G4-bearing genes and orthologue genes, which appeared to be increasingly enriched in transcription factors, gradually increased. The antagonistic relationship between G4s and DNA methylation sites was detected. These findings imply that organisms may have evolutionarily developed G4 into a novel reversible and elaborate transcriptional regulatory mechanism benefiting multiple physiological activities of higher organisms.
Collapse
|
4
|
Stefos GC, Theodorou G, Politis I. DNA G-quadruplexes: functional significance in plant and farm animal science. Anim Biotechnol 2019; 32:262-271. [PMID: 31642375 DOI: 10.1080/10495398.2019.1679823] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Abstract
G-quadruplexes (G4s) are non-canonical structures that can be formed in DNA and RNA sequences which carry four short runs of guanines. They are distributed in the whole genome but are enriched in gene promoter regions, gene UTRs and chromosome telomeres. The whole array of their functional roles is not fully explored yet but there is solid evidence supporting their implication in a number of processes like regulation of transcription, replication and telomere organization, among others. During the last decade, there is an increased research interest for G4s that has resulted in a better understanding of their role in several physiological and pathological conditions. On the other hand, these structures are poorly studied in plant species and animals of agricultural interest. Here, we summarize the current methods that are used for studying G4s, we review the studies concerning plants and farm animals and we discuss the advantages of a more thorough inclusion of G4s research in the agricultural sciences.
Collapse
Affiliation(s)
- Georgios C Stefos
- Independent researcher, Agricultural University of Athens, Athens, Greece
| | - Georgios Theodorou
- Department of Animal Science and Aquaculture, Agricultural University of Athens, Athens, Greece
| | - Ioannis Politis
- Department of Animal Science and Aquaculture, Agricultural University of Athens, Athens, Greece
| |
Collapse
|
5
|
Nieuwenhuis M, van de Peppel LJJ, Bakker FT, Zwaan BJ, Aanen DK. Enrichment of G4DNA and a Large Inverted Repeat Coincide in the Mitochondrial Genomes of Termitomyces. Genome Biol Evol 2019; 11:1857-1869. [PMID: 31209489 PMCID: PMC6609731 DOI: 10.1093/gbe/evz122] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/11/2019] [Indexed: 12/20/2022] Open
Abstract
Mitochondria retain their own genome, a hallmark of their bacterial ancestry. Mitochondrial genomes (mtDNA) are highly diverse in size, shape, and structure, despite their conserved function across most eukaryotes. Exploring extreme cases of mtDNA architecture can yield important information on fundamental aspects of genome biology. We discovered that the mitochondrial genomes of a basidiomycete fungus (Termitomyces spp.) contain an inverted repeat (IR), a duplicated region half the size of the complete genome. In addition, we found an abundance of sequences capable of forming G-quadruplexes (G4DNA); structures that can disrupt the double helical formation of DNA. G4DNA is implicated in replication fork stalling, double-stranded breaks, altered gene expression, recombination, and other effects. To determine whether this occurrence of IR and G4DNA was correlated within the genus Termitomyces, we reconstructed the mitochondrial genomes of 11 additional species including representatives of several closely related genera. We show that the mtDNA of all sampled species of Termitomyces and its sister group, represented by the species Tephrocybe rancida and Blastosporella zonata, are characterized by a large IR and enrichment of G4DNA. To determine whether high mitochondrial G4DNA content is common in fungi, we conducted the first broad survey of G4DNA content in fungal mtDNA, revealing it to be a highly variable trait. The results of this study provide important direction for future research on the function and evolution of G4DNA and organellar IRs.
Collapse
Affiliation(s)
| | | | - Freek T Bakker
- Biosystematics Group, Wageningen University & Research, The Netherlands
| | - Bas J Zwaan
- Laboratory of Genetics, Wageningen University & Research, The Netherlands
| | - Duur K Aanen
- Laboratory of Genetics, Wageningen University & Research, The Netherlands
| |
Collapse
|
6
|
Zorzan E, Da Ros S, Giantin M, Shahidian LZ, Guerra G, Palumbo M, Sissi C, Dacasto M. Targeting Canine KIT Promoter by Candidate DNA G-Quadruplex Ligands. J Pharmacol Exp Ther 2018; 367:461-472. [PMID: 30275152 DOI: 10.1124/jpet.118.248997] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2018] [Accepted: 09/26/2018] [Indexed: 12/16/2022] Open
Abstract
G-quadruplexes (G4) are nucleic acid secondary structures frequently assumed by G-rich sequences located mostly at telomeres and proto-oncogenes promoters. Recently, we identified, in canine KIT (v-kit Hardy-Zuckerman 4 feline sarcoma viral oncogene homolog) promoter, two G-rich sequences able to fold into G4: d_kit1 and d_kit2_A16. In this study, an anthraquinone (AQ1) and an anthracene derivative (AN6), known to stabilize the G4 structures of the corresponding human h_kit1 and h_kit2, were tested on the canine G4 and in two canine mast cell tumor (MCT) cell lines (C2 and NI-1) to verify their capability to down-regulate KIT expression. The cytotoxicity of AQ1 and AN6 was determined using the Alamar Blue test; also the constitutive expression of KIT and other proto-oncogenes containing G4 structures in their promoter (BCL2, VEGFα, VEGFR2, KRAS, and TERT) was assessed by quantitative real-time polymerase chain reaction (qRT-PCR). Then the time- and dose-dependent effects of both ligands on target gene expression were assessed by qRT-PCR. All target genes were constitutively expressed up to 96 hours of culture. Both ligands decreased KIT mRNA levels and c-kit protein amount, and AN6 was comparatively fairly more effective. DNA interaction studies and a dual-luciferase gene reporter assay performed on a noncancerous canine cell line (Madin-Darby Canine Kidney cells) proved that this down-regulation was the result of the interaction of AN6 with KIT proximal promoter. Interestingly, our results only partially overlap with those previously obtained in human cell lines, where AQ1 was found as the most effective compound. These preliminary data might suggest AN6 as a promising candidate for the selective targeting of canine KIT-dependent tumors.
Collapse
Affiliation(s)
- Eleonora Zorzan
- Department of Comparative Biomedicine and Food Science, University of Padua, Agripolis Legnaro, Padua, Italy (E.Z., M.G., L.Z.S., G.G., M.D.), and Department of Pharmaceutical and Pharmacological Sciences, University of Padua, Padua, Italy (S.D.R., M.P., C.S.)
| | - Silvia Da Ros
- Department of Comparative Biomedicine and Food Science, University of Padua, Agripolis Legnaro, Padua, Italy (E.Z., M.G., L.Z.S., G.G., M.D.), and Department of Pharmaceutical and Pharmacological Sciences, University of Padua, Padua, Italy (S.D.R., M.P., C.S.)
| | - Mery Giantin
- Department of Comparative Biomedicine and Food Science, University of Padua, Agripolis Legnaro, Padua, Italy (E.Z., M.G., L.Z.S., G.G., M.D.), and Department of Pharmaceutical and Pharmacological Sciences, University of Padua, Padua, Italy (S.D.R., M.P., C.S.)
| | - Lara Zorro Shahidian
- Department of Comparative Biomedicine and Food Science, University of Padua, Agripolis Legnaro, Padua, Italy (E.Z., M.G., L.Z.S., G.G., M.D.), and Department of Pharmaceutical and Pharmacological Sciences, University of Padua, Padua, Italy (S.D.R., M.P., C.S.)
| | - Giorgia Guerra
- Department of Comparative Biomedicine and Food Science, University of Padua, Agripolis Legnaro, Padua, Italy (E.Z., M.G., L.Z.S., G.G., M.D.), and Department of Pharmaceutical and Pharmacological Sciences, University of Padua, Padua, Italy (S.D.R., M.P., C.S.)
| | - Manlio Palumbo
- Department of Comparative Biomedicine and Food Science, University of Padua, Agripolis Legnaro, Padua, Italy (E.Z., M.G., L.Z.S., G.G., M.D.), and Department of Pharmaceutical and Pharmacological Sciences, University of Padua, Padua, Italy (S.D.R., M.P., C.S.)
| | - Claudia Sissi
- Department of Comparative Biomedicine and Food Science, University of Padua, Agripolis Legnaro, Padua, Italy (E.Z., M.G., L.Z.S., G.G., M.D.), and Department of Pharmaceutical and Pharmacological Sciences, University of Padua, Padua, Italy (S.D.R., M.P., C.S.)
| | - Mauro Dacasto
- Department of Comparative Biomedicine and Food Science, University of Padua, Agripolis Legnaro, Padua, Italy (E.Z., M.G., L.Z.S., G.G., M.D.), and Department of Pharmaceutical and Pharmacological Sciences, University of Padua, Padua, Italy (S.D.R., M.P., C.S.)
| |
Collapse
|
7
|
Kerkour A, Marquevielle J, Ivashchenko S, Yatsunyk LA, Mergny JL, Salgado GF. High-resolution three-dimensional NMR structure of the KRAS proto-oncogene promoter reveals key features of a G-quadruplex involved in transcriptional regulation. J Biol Chem 2017; 292:8082-8091. [PMID: 28330874 DOI: 10.1074/jbc.m117.781906] [Citation(s) in RCA: 50] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2017] [Revised: 03/15/2017] [Indexed: 12/13/2022] Open
Abstract
Non-canonical base pairing within guanine-rich DNA and RNA sequences can produce G-quartets, whose stacking leads to the formation of a G-quadruplex (G4). G4s can coexist with canonical duplex DNA in the human genome and have been suggested to suppress gene transcription, and much attention has therefore focused on studying G4s in promotor regions of disease-related genes. For example, the human KRAS proto-oncogene contains a nuclease-hypersensitive element located upstream of the major transcription start site. The KRAS nuclease-hypersensitive element (NHE) region contains a G-rich element (22RT; 5'-AGGGCGGTGTGGGAATAGGGAA-3') and encompasses a Myc-associated zinc finger-binding site that regulates KRAS transcription. The NEH region therefore has been proposed as a target for new drugs that control KRAS transcription, which requires detailed knowledge of the NHE structure. In this study, we report a high-resolution NMR structure of the G-rich element within the KRAS NHE. We found that the G-rich element forms a parallel structure with three G-quartets connected by a four-nucleotide loop and two short one-nucleotide double-chain reversal loops. In addition, a thymine bulge is found between G8 and G9. The loops of different lengths and the presence of a bulge between the G-quartets are structural elements that potentially can be targeted by small chemical ligands that would further stabilize the structure and interfere or block transcriptional regulators such as Myc-associated zinc finger from accessing their binding sites on the KRAS promoter. In conclusion, our work suggests a possible new route for the development of anticancer agents that could suppress KRAS expression.
Collapse
Affiliation(s)
- Abdelaziz Kerkour
- From the Université Bordeaux, INSERM, CNRS, ARNA laboratory, European Institute of Chemistry and Biology, U1212, UMR 5320, 2 Rue Robert Escarpit, 33000 Pessac, France and
| | - Julien Marquevielle
- From the Université Bordeaux, INSERM, CNRS, ARNA laboratory, European Institute of Chemistry and Biology, U1212, UMR 5320, 2 Rue Robert Escarpit, 33000 Pessac, France and
| | - Stefaniia Ivashchenko
- From the Université Bordeaux, INSERM, CNRS, ARNA laboratory, European Institute of Chemistry and Biology, U1212, UMR 5320, 2 Rue Robert Escarpit, 33000 Pessac, France and
| | - Liliya A Yatsunyk
- From the Université Bordeaux, INSERM, CNRS, ARNA laboratory, European Institute of Chemistry and Biology, U1212, UMR 5320, 2 Rue Robert Escarpit, 33000 Pessac, France and.,Department of Chemistry and Biochemistry, Swarthmore College, Swarthmore, Pennsylvania 19081
| | - Jean-Louis Mergny
- From the Université Bordeaux, INSERM, CNRS, ARNA laboratory, European Institute of Chemistry and Biology, U1212, UMR 5320, 2 Rue Robert Escarpit, 33000 Pessac, France and
| | - Gilmar F Salgado
- From the Université Bordeaux, INSERM, CNRS, ARNA laboratory, European Institute of Chemistry and Biology, U1212, UMR 5320, 2 Rue Robert Escarpit, 33000 Pessac, France and
| |
Collapse
|
8
|
Abe H, Gemmell NJ. Abundance, arrangement, and function of sequence motifs in the chicken promoters. BMC Genomics 2014; 15:900. [PMID: 25318583 PMCID: PMC4203960 DOI: 10.1186/1471-2164-15-900] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2014] [Accepted: 10/08/2014] [Indexed: 01/01/2023] Open
Abstract
Background Eukaryotic promoters are regions containing various sequence motifs necessary to control gene transcription. Much evidence has emerged showing that structural and/or contextual changes in regulatory elements can critically affect cis-regulatory activity. As sequence motifs can be key factors in maintaining complex promoter architectures, one effective approach to further understand the evolution of promoter regions in vertebrates is to compare the abundance and distribution patterns of sequence motifs in these regions between divergent species. When compared with mammals, the chicken (Gallus gallus) has a very different genome composition and sufficient genomic information to make it a good model for the exploration of promoter structure and evolution. Results More than 10% of chicken genes contained short tandem repeat (STR) in the region 2 kb upstream of promoters, but the total number of STRs observed in chicken is approximately half of that detected in human promoters. In terms of the STR motif frequencies, chicken promoter regions were more similar to other avian and mammalian promoters than these were to the entire chicken genome. Unlike other STRs, nearly half of the trinucleotide repeats found in promoters partly or entirely overlapped with CpG islands, indicating potential association with nucleosome positions. Moreover, the chicken promoters are abundant with sequence motifs such as poly-A, poly-G and G-quadruplexes, especially in the core region, that are otherwise rare in the genome. Most of sequence motifs showed strong functional enrichment for particular gene ontology (GO) categories, indicating roles in regulation of transcription and gene expression, as well as immune response and cognition. Conclusions Chicken promoter regions share some, but not all, of the structural features observed in mammalian promoters. The findings presented here provide empirical evidence suggesting that the frequencies and locations of STR motifs have been conserved through promoter evolution in a lineage-specific manner. Correlation analysis between GO categories and sequence motifs suggests motif-specific constraints acting on gene function. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-15-900) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Hideaki Abe
- Department of Anatomy, University of Otago, Dunedin, New Zealand.
| | | |
Collapse
|
9
|
Zhang JY, Zheng KW, Xiao S, Hao YH, Tan Z. Mechanism and manipulation of DNA:RNA hybrid G-quadruplex formation in transcription of G-rich DNA. J Am Chem Soc 2014; 136:1381-90. [PMID: 24392825 DOI: 10.1021/ja4085572] [Citation(s) in RCA: 58] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
We recently reported that a DNA:RNA hybrid G-quadruplex (HQ) forms during transcription of DNA that bears two or more tandem guanine tracts (G-tract) on the nontemplate strand. Putative HQ-forming sequences are enriched in the nearby 1000 nt region right downstream of transcription start sites in the nontemplate strand of warm-blooded animals, and HQ regulates transcription under both in vitro and in vivo conditions. Therefore, knowledge of the mechanism of HQ formation is important for understanding the biological function of HQ as well as for manipulating gene expression by targeting HQ. In this work, we studied the mechanism of HQ formation using an in vitro T7 transcription model. We show that RNA synthesis initially produces an R-loop, a DNA:RNA heteroduplex formed by a nascent RNA transcript and the template DNA strand. In the following round of transcription, the RNA in the R-loop is displaced, releasing the RNA in single-stranded form (ssRNA). Then the G-tracts in the RNA can jointly form HQ with those in the nontemplate DNA strand. We demonstrate that the structural cascade R-loop → ssRNA → HQ offers opportunities to intercept HQ formation, which may provide a potential method to manipulate gene expression.
Collapse
Affiliation(s)
- Jia-yu Zhang
- State Key Laboratory of Biomembrane and Membrane Biotechnology, Institute of Zoology, Chinese Academy of Sciences , Beijing 100101, People's Republic of China
| | | | | | | | | |
Collapse
|
10
|
Xiao S, Zhang JY, Zheng KW, Hao YH, Tan Z. Bioinformatic analysis reveals an evolutional selection for DNA:RNA hybrid G-quadruplex structures as putative transcription regulatory elements in warm-blooded animals. Nucleic Acids Res 2013; 41:10379-90. [PMID: 23999096 PMCID: PMC3905843 DOI: 10.1093/nar/gkt781] [Citation(s) in RCA: 51] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Abstract
Recently, we reported the co-transcriptional formation of DNA:RNA hybrid G-quadruplex (HQ) structure by the non-template DNA strand and nascent RNA transcript, which in turn modulates transcription under both in vitro and in vivo conditions. Here we present bioinformatic analysis on putative HQ-forming sequences (PHQS) in the genomes of eukaryotic organisms. Starting from amphibian, PHQS motifs are concentrated in the immediate 1000-nt region downstream of transcription start sites, implying their potential role in transcription regulation. Moreover, their occurrence shows a strong bias toward the non-template versus the template strand. PHQS has become constitutional in genes in warm-blooded animals, and the magnitude of the strand bias correlates with the ability of PHQS to form HQ, suggesting a selection based on HQ formation. This strand bias is reversed in lower species, implying that the selection of PHQS/HQ depended on the living temperature of the organisms. In comparison with the putative intramolecular G-quadruplex-forming sequences (PQS), PHQS motifs are far more prevalent and abundant in the transcribed regions, making them the dominant candidates in the formation of G-quadruplexes in transcription. Collectively, these results suggest that the HQ structures are evolutionally selected to function in transcription and other transcription-mediated processes that involve guanine-rich non-template strand.
Collapse
Affiliation(s)
- Shan Xiao
- State Key Laboratory of Biomembrane and Membrane Biotechnology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, P. R. China
| | | | | | | | | |
Collapse
|
11
|
Du X, Wojtowicz D, Bowers AA, Levens D, Benham CJ, Przytycka TM. The genome-wide distribution of non-B DNA motifs is shaped by operon structure and suggests the transcriptional importance of non-B DNA structures in Escherichia coli. Nucleic Acids Res 2013; 41:5965-77. [PMID: 23620297 PMCID: PMC3695496 DOI: 10.1093/nar/gkt308] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Although the right-handed double helical B-form DNA is most common under physiological conditions, DNA is dynamic and can adopt a number of alternative structures, such as the four-stranded G-quadruplex, left-handed Z-DNA, cruciform and others. Active transcription necessitates strand separation and can induce such non-canonical forms at susceptible genomic sequences. Therefore, it has been speculated that these non-B DNA motifs can play regulatory roles in gene transcription. Such conjecture has been supported in higher eukaryotes by direct studies of several individual genes, as well as a number of large-scale analyses. However, the role of non-B DNA structures in many lower organisms, in particular proteobacteria, remains poorly understood and incompletely documented. In this study, we performed the first comprehensive study of the occurrence of B DNA-non-B DNA transition-susceptible sites (non-B DNA motifs) within the context of the operon structure of the Escherichia coli genome. We compared the distributions of non-B DNA motifs in the regulatory regions of operons with those from internal regions. We found an enrichment of some non-B DNA motifs in regulatory regions, and we show that this enrichment cannot be simply explained by base composition bias in these regions. We also showed that the distribution of several non-B DNA motifs within intergenic regions separating divergently oriented operons differs from the distribution found between convergent ones. In particular, we found a strong enrichment of cruciforms in the termination region of operons; this enrichment was observed for operons with Rho-dependent, as well as Rho-independent terminators. Finally, a preference for some non-B DNA motifs was observed near transcription factor-binding sites. Overall, the conspicuous enrichment of transition-susceptible sites in these specific regulatory regions suggests that non-B DNA structures may have roles in the transcriptional regulation of specific operons within the E. coli genome.
Collapse
Affiliation(s)
- Xiangjun Du
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health 8600 Rockville Pike, Bethesda, MD 20894, USA
| | | | | | | | | | | |
Collapse
|
12
|
Zhou W, Suntharalingam K, Brand NJ, Barton PJR, Vilar R, Ying L. Possible regulatory roles of promoter g-quadruplexes in cardiac function-related genes - human TnIc as a model. PLoS One 2013; 8:e53137. [PMID: 23326389 PMCID: PMC3541360 DOI: 10.1371/journal.pone.0053137] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2012] [Accepted: 11/23/2012] [Indexed: 12/15/2022] Open
Abstract
G-quadruplexes (G4s) are four-stranded DNA secondary structures, which are involved in a diverse range of biological processes. Although the anti-cancer potential of G4s in oncogene promoters has been thoroughly investigated, the functions of promoter G4s in non-cancer-related genes are not well understood. We have explored the possible regulatory roles of promoter G4s in cardiac function-related genes using both computational and a wide range of experimental approaches. According to our bioinformatics results, it was found that potential G4-forming sequences are particularly enriched in the transcription regulatory regions (TRRs) of cardiac function-related genes. Subsequently, the promoter of human cardiac troponin I (TnIc) was chosen as a model, and G4s found in this region were subjected to biophysical characterisations. The chromosome 19 specific minisatellite G4 sequence (MNSG4) and near transcription start site (TSS) G4 sequence (−80 G4) adopt anti-parallel and parallel structures respectively in 100 mM KCl, with stabilities comparable to those of oncogene G4s. It was also found that TnIc G4s act cooperatively as enhancers in gene expression regulation in HEK293 cells, when stabilised by a synthetic G4-binding ligand. This study provides the first evidence of the biological significance of promoter G4s in cardiac function-related genes. The feasibility of using a single ligand to target multiple G4s in a particular gene has also been discussed.
Collapse
Affiliation(s)
- Wenhua Zhou
- Molecular Medicine, National Heart and Lung Institute, Imperial College London, London, United Kingdom
| | | | - Nigel J. Brand
- Harefield Heart Science Centre, National Heart and Lung Institute, Imperial College London, Middlesex, United Kingdom
| | - Paul J. R. Barton
- Harefield Heart Science Centre, National Heart and Lung Institute, Imperial College London, Middlesex, United Kingdom
- NIHR Cardiovascular Biomedical Research Unit, Royal Brompton and Harefield NHS Trust, London, United Kingdom
| | - Ramon Vilar
- Department of Chemistry, Imperial College London, London, United Kingdom
| | - Liming Ying
- Molecular Medicine, National Heart and Lung Institute, Imperial College London, London, United Kingdom
- * E-mail:
| |
Collapse
|
13
|
Baral A, Kumar P, Pathak R, Chowdhury S. Emerging trends in G-quadruplex biology – role in epigenetic and evolutionary events. MOLECULAR BIOSYSTEMS 2013; 9:1568-75. [DOI: 10.1039/c3mb25492e] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
|
14
|
Bugaut A, Balasubramanian S. 5'-UTR RNA G-quadruplexes: translation regulation and targeting. Nucleic Acids Res 2012; 40:4727-41. [PMID: 22351747 PMCID: PMC3367173 DOI: 10.1093/nar/gks068] [Citation(s) in RCA: 469] [Impact Index Per Article: 39.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
RNA structures in the untranslated regions (UTRs) of mRNAs influence post-transcriptional regulation of gene expression. Much of the knowledge in this area depends on canonical double-stranded RNA elements. There has been considerable recent advancement of our understanding of guanine(G)-rich nucleic acids sequences that form four-stranded structures, called G-quadruplexes. While much of the research has been focused on DNA G-quadruplexes, there has recently been a rapid emergence of interest in RNA G-quadruplexes, particularly in the 5′-UTRs of mRNAs. Collectively, these studies suggest that RNA G-quadruplexes exist in the 5′-UTRs of many genes, including genes of clinical interest, and that such structural elements can influence translation. This review features the progresses in the study of 5′-UTR RNA G-quadruplex-mediated translational control. It covers computational analysis, cell-free, cell-based and chemical biology studies that have sought to elucidate the roles of RNA G-quadruplexes in both cap-dependent and -independent regulation of mRNA translation. We also discuss protein trans-acting factors that have been implicated and the evidence that such RNA motifs have potential as small molecule target. Finally, we close the review with a perspective on the future challenges in the field of 5′-UTR RNA G-quadruplex-mediated translation regulation.
Collapse
Affiliation(s)
- Anthony Bugaut
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, UK.
| | | |
Collapse
|
15
|
Baral A, Kumar P, Halder R, Mani P, Yadav VK, Singh A, Das SK, Chowdhury S. Quadruplex-single nucleotide polymorphisms (Quad-SNP) influence gene expression difference among individuals. Nucleic Acids Res 2012; 40:3800-11. [PMID: 22238381 PMCID: PMC3351168 DOI: 10.1093/nar/gkr1258] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
Non-canonical guanine quadruplex structures are not only predominant but also conserved among bacterial and mammalian promoters. Moreover recent findings directly implicate quadruplex structures in transcription. These argue for an intrinsic role of the structural motif and thereby posit that single nucleotide polymorphisms (SNP) that compromise the quadruplex architecture could influence function. To test this, we analysed SNPs within quadruplex motifs (Quad-SNP) and gene expression in 270 individuals across four populations (HapMap) representing more than 14,500 genotypes. Findings reveal significant association between quadruplex-SNPs and expression of the corresponding gene in individuals (P < 0.0001). Furthermore, analysis of Quad-SNPs obtained from population-scale sequencing of 1000 human genomes showed relative selection bias against alteration of the structural motif. To directly test the quadruplex-SNP-transcription connection, we constructed a reporter system using the RPS3 promoter-remarkable difference in promoter activity in the 'quadruplex-destabilized' versus 'quadruplex-intact' promoter was noticed. As a further test, we incorporated a quadruplex motif or its disrupted counterpart within a synthetic promoter reporter construct. The quadruplex motif, and not the disrupted-motif, enhanced transcription in human cell lines of different origin. Together, these findings build direct support for quadruplex-mediated transcription and suggest quadruplex-SNPs may play significant role in mechanistically understanding variations in gene expression among individuals.
Collapse
Affiliation(s)
- Aradhita Baral
- Proteomics and Structural Biology Unit, Institute of Genomics and Integrative Biology, CSIR, Mall Road, Delhi 110 007, India
| | | | | | | | | | | | | | | |
Collapse
|
16
|
Kumar P, Yadav VK, Baral A, Kumar P, Saha D, Chowdhury S. Zinc-finger transcription factors are associated with guanine quadruplex motifs in human, chimpanzee, mouse and rat promoters genome-wide. Nucleic Acids Res 2011; 39:8005-16. [PMID: 21729868 PMCID: PMC3185432 DOI: 10.1093/nar/gkr536] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023] Open
Abstract
Function of non-B DNA structures are poorly understood though several bioinformatics studies predict role of the G-quadruplex DNA structure in transcription. Earlier, using transcriptome profiling we found evidence of widespread G-quadruplex-mediated gene regulation. Herein, we asked whether potential G-quadruplex (PG4) motifs associate with transcription factors (TF). This was analyzed using 220 position weight matrices [designated as transcription factor binding sites (TFBS)], representing 187 unique TF, in >75 000 genes in human, chimpanzee, mouse and rat. Results show binding sites of nine TFs, including that of AP-2, SP1, MAZ and VDR, occurred significantly within 100 bases of the PG4 motif (P < 1.24E-10). PG4–TFBS combinations were conserved in ‘orthologously’ related promoters across all four organisms and were associated with >850 genes in each genome. Remarkably, seven of the nine TFs were zinc-finger binding proteins indicating a novel characteristic of PG4 motifs. To test these findings, transcriptome profiles from human cell lines treated with G-quadruplex-specific molecules were used; 66 genes were significantly differentially expressed across both cell-types, which also harbored conserved PG4 motifs along with one/more of the nine TFBS. In addition, genes regulated by PG4–TFBS combinations were found to be co-regulated in human tissues, further emphasizing the regulatory significance of the associations.
Collapse
Affiliation(s)
- Pankaj Kumar
- GNR Knowledge Centre for Genome Informatics, Institute of Genomics and Integrative Biology, CSIR, Mall Road, Delhi 110 007, India
| | | | | | | | | | | |
Collapse
|
17
|
Abstract
The knowledge that potential guanine quadruplex sequences (PQs) are non-randomly distributed in relation to genomic features is now well established. However, this is for a general potential quadruplex motif which is characterized by short runs of guanine separated by loop regions, regardless of the nature of the loop sequence. There have been no studies to date which map the distribution of PQs in terms of primary sequence or which categorize PQs. To this end, we have generated clusters of PQ sequence groups of various sizes and various degrees of similarity for the non-template strand of introns in the human genome. We started with 86 697 sequences, and successively merged them into groups based on sequence similarity, carrying out 66 clustering cycles before convergence. We have demonstrated here that by using complete linkage hierarchical agglomerative clustering such PQ sequence categorization can be achieved. Our results give an insight into sequence diversity and categories of PQ sequences which occur in human intronic regions. We also highlight a number of clusters for which interesting relationships among their members were immediately evident and other clusters whose members seem unrelated, illustrating, we believe, a distinct role for different sequence types.
Collapse
Affiliation(s)
- Alan K Todd
- CRUK Biomolecular Structure Group, The School of Pharmacy, University of London, 29-39 Brunswick Square, London WC1N 1AX, UK
| | | |
Collapse
|
18
|
Sarkies P, Reams C, Simpson LJ, Sale JE. Epigenetic instability due to defective replication of structured DNA. Mol Cell 2011; 40:703-13. [PMID: 21145480 PMCID: PMC3145961 DOI: 10.1016/j.molcel.2010.11.009] [Citation(s) in RCA: 221] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2010] [Revised: 07/30/2010] [Accepted: 09/10/2010] [Indexed: 01/22/2023]
Abstract
The accurate propagation of histone marks during chromosomal replication is proposed to rely on the tight coupling of replication with the recycling of parental histones to the daughter strands. Here, we show in the avian cell line DT40 that REV1, a key regulator of DNA translesion synthesis at the replication fork, is required for the maintenance of repressive chromatin marks and gene silencing in the vicinity of DNA capable of forming G-quadruplex (G4) structures. We demonstrate a previously unappreciated requirement for REV1 in replication of G4 forming sequences and show that transplanting a G4 forming sequence into a silent locus leads to its derepression in REV1-deficient cells. Together, our observations support a model in which failure to maintain processive DNA replication at G4 DNA in REV1-deficient cells leads to uncoupling of DNA synthesis from histone recycling, resulting in localized loss of repressive chromatin through biased incorporation of newly synthesized histones.
Collapse
Affiliation(s)
- Peter Sarkies
- Medical Research Council Laboratory of Molecular Biology, Hills Road, Cambridge CB2 0QH, UK
| | - Charlie Reams
- University of Cambridge Computer Laboratory, William Gates Building, 15, J.J. Thomson Avenue, Cambridge CB3 0FD, UK
| | - Laura J. Simpson
- Medical Research Council Laboratory of Molecular Biology, Hills Road, Cambridge CB2 0QH, UK
| | - Julian E. Sale
- Medical Research Council Laboratory of Molecular Biology, Hills Road, Cambridge CB2 0QH, UK
- Corresponding author
| |
Collapse
|
19
|
Zheng KW, Zhang D, Zhang LX, Hao YH, Zhou X, Tan Z. Dissecting the strand folding orientation and formation of G-quadruplexes in single- and double-stranded nucleic acids by ligand-induced photocleavage footprinting. J Am Chem Soc 2011; 133:1475-83. [PMID: 21207997 DOI: 10.1021/ja108972e] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
The widespread of G-quadruplex-forming sequences in genomic DNA and their role in regulating gene expression has made G-quadruplex structures attractive therapeutic targets against a variety of diseases, such as cancer. Information on the structure of G-quadruplexes is crucial for understanding their physiological roles and designing effective drugs against them. Resolving the structures of G-quadruplexes, however, remains a challenge especially for those in double-stranded DNA. In this work, we developed a photocleavage footprinting technique to determine the folding orientation of each individual G-tract in intramolecular G-quadruplex formed in both single- and double-stranded nucleic acids. Based on the differential photocleavage induced by a ligand tetrakis(2-trimethylaminoethylethanol) phthalocyaninato zinc tetraiodine (Zn-TTAPc) to the guanines between the two terminal G-quartets in a G-quadruplex, this method identifies the guanines hosted in each terminal G-quartets to reveal G-tract orientation. The method is extremely intuitive, straightforward, and requires little expertise. Besides, it also detects G-quadruplex formation in long single- and double-stranded nucleic acids.
Collapse
Affiliation(s)
- Ke-wei Zheng
- State Key Laboratory of Biomembrane and Membrane Biotechnology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, People's Republic of China
| | | | | | | | | | | |
Collapse
|
20
|
Basundra R, Kumar A, Amrane S, Verma A, Phan AT, Chowdhury S. A novel G-quadruplex motif modulates promoter activity of human thymidine kinase 1. FEBS J 2010; 277:4254-64. [DOI: 10.1111/j.1742-4658.2010.07814.x] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
21
|
Liu JQ, Chen CY, Xue Y, Hao YH, Tan Z. G-Quadruplex Hinders Translocation of BLM Helicase on DNA: A Real-Time Fluorescence Spectroscopic Unwinding Study and Comparison with Duplex Substrates. J Am Chem Soc 2010; 132:10521-7. [DOI: 10.1021/ja1038165] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Affiliation(s)
- Jia-quan Liu
- Laboratory of Biochemistry and Biophysics, College of Life Sciences, Wuhan University, Wuhan 430072, P. R. China, and State Key Laboratory of Biomembrane and Membrane Biotechnology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, P. R. China
| | - Chang-yue Chen
- Laboratory of Biochemistry and Biophysics, College of Life Sciences, Wuhan University, Wuhan 430072, P. R. China, and State Key Laboratory of Biomembrane and Membrane Biotechnology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, P. R. China
| | - Yong Xue
- Laboratory of Biochemistry and Biophysics, College of Life Sciences, Wuhan University, Wuhan 430072, P. R. China, and State Key Laboratory of Biomembrane and Membrane Biotechnology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, P. R. China
| | - Yu-hua Hao
- Laboratory of Biochemistry and Biophysics, College of Life Sciences, Wuhan University, Wuhan 430072, P. R. China, and State Key Laboratory of Biomembrane and Membrane Biotechnology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, P. R. China
| | - Zheng Tan
- Laboratory of Biochemistry and Biophysics, College of Life Sciences, Wuhan University, Wuhan 430072, P. R. China, and State Key Laboratory of Biomembrane and Membrane Biotechnology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, P. R. China
| |
Collapse
|
22
|
Zhao J, Bacolla A, Wang G, Vasquez KM. Non-B DNA structure-induced genetic instability and evolution. Cell Mol Life Sci 2010; 67:43-62. [PMID: 19727556 PMCID: PMC3017512 DOI: 10.1007/s00018-009-0131-2] [Citation(s) in RCA: 310] [Impact Index Per Article: 22.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2009] [Revised: 07/22/2009] [Accepted: 08/11/2009] [Indexed: 11/26/2022]
Abstract
Repetitive DNA motifs are abundant in the genomes of various species and have the capacity to adopt non-canonical (i.e., non-B) DNA structures. Several non-B DNA structures, including cruciforms, slipped structures, triplexes, G-quadruplexes, and Z-DNA, have been shown to cause mutations, such as deletions, expansions, and translocations in both prokaryotes and eukaryotes. Their distributions in genomes are not random and often co-localize with sites of chromosomal breakage associated with genetic diseases. Current genome-wide sequence analyses suggest that the genomic instabilities induced by non-B DNA structure-forming sequences not only result in predisposition to disease, but also contribute to rapid evolutionary changes, particularly in genes associated with development and regulatory functions. In this review, we describe the occurrence of non-B DNA-forming sequences in various species, the classes of genes enriched in non-B DNA-forming sequences, and recent mechanistic studies on DNA structure-induced genomic instability to highlight their importance in genomes.
Collapse
Affiliation(s)
- Junhua Zhao
- Department of Carcinogenesis, Science Park-Research Division, The University of Texas M.D. Anderson Cancer Center, 1808 Park Road 1-C, P.O. Box 389, Smithville, TX 78957 USA
| | - Albino Bacolla
- Department of Carcinogenesis, Science Park-Research Division, The University of Texas M.D. Anderson Cancer Center, 1808 Park Road 1-C, P.O. Box 389, Smithville, TX 78957 USA
| | - Guliang Wang
- Department of Carcinogenesis, Science Park-Research Division, The University of Texas M.D. Anderson Cancer Center, 1808 Park Road 1-C, P.O. Box 389, Smithville, TX 78957 USA
| | - Karen M. Vasquez
- Department of Carcinogenesis, Science Park-Research Division, The University of Texas M.D. Anderson Cancer Center, 1808 Park Road 1-C, P.O. Box 389, Smithville, TX 78957 USA
| |
Collapse
|
23
|
Abstract
DNA can adopt a variety of non-standard conformations, including structures known as G-quadruplexes (G4-DNA), which consist of stacked tetrads of guanines. There are growing indications that G4-DNA is of biological importance, including evidence that it plays roles in telomere function, DNA recombination and the regulation of transcription and translation. However, it has been difficult to obtain direct, physical evidence for the presence of G-quadruplex DNA in vivo due, in part, to a lack of tools for G4-DNA identification. Here, we describe a method for coupling the G4-DNA binding ligand N-methyl mesoporphyrin IX (NMM) to a Sepharose resin, and demonstrate the ability of the resin to bind tightly and selectively to DNA oligonucleotides with the capacity to form G4-DNA. This technique might also be extended to examine genomic distributions of G4-DNA isolated from in vivo sources.
Collapse
|
24
|
Zheng KW, Chen Z, Hao YH, Tan Z. Molecular crowding creates an essential environment for the formation of stable G-quadruplexes in long double-stranded DNA. Nucleic Acids Res 2009; 38:327-38. [PMID: 19858105 PMCID: PMC2800236 DOI: 10.1093/nar/gkp898] [Citation(s) in RCA: 112] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open
Abstract
Large numbers of guanine-rich sequences with potential to form G-quadruplexes have been identified in genomes of various organisms. Such sequences are constrained at both ends by long DNA duplex with a complementary strand in close proximity to compete for duplex formation. G-quadruplex/duplex competition in long double-stranded DNA has rarely been studied. In this work, we used DMS footprinting and gel electrophoresis to study G-quadruplex formation in long double-stranded DNA derived from human genome under both dilute and molecular crowding condition created by PEG. G-quadruplex formation was observed in the process of RNA transcription and after heat denaturation/renaturation under molecular crowding condition. Our results showed that the heat denaturation/renaturation treatment followed by gel electrophoresis could provide a simple method to quantitatively access the ability of G-quadruplex formation in long double-stranded DNA. The effect of K+ and PEG concentration was investigated and we found that stable G-quadruplexes could only form under the crowding condition with PEG at concentrations near the physiological concentration of biomass in living cells. This observation reveals a physical basis for the formation of stable G-quadruplexes in genome and supports its presence under the in vivo molecular crowding condition.
Collapse
Affiliation(s)
- Ke-wei Zheng
- Laboratory of Biochemistry and Biophysics, College of Life Sciences, Wuhan University, Wuhan, PR China
| | | | | | | |
Collapse
|
25
|
Du Z, Zhao Y, Li N. Genome-wide colonization of gene regulatory elements by G4 DNA motifs. Nucleic Acids Res 2009; 37:6784-98. [PMID: 19759215 PMCID: PMC2777415 DOI: 10.1093/nar/gkp710] [Citation(s) in RCA: 68] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open
Abstract
G-quadruplex (or G4 DNA), a stable four-stranded structure found in guanine-rich regions, is implicated in the transcriptional regulation of genes involved in growth and development. Previous studies on the role of G4 DNA in gene regulation mostly focused on genomic regions proximal to transcription start sites (TSSs). To gain a more comprehensive understanding of the regulatory role of G4 DNA, we examined the landscape of potential G4 DNA (PG4Ms) motifs in the human genome and found that G4 motifs, not restricted to those found in the TSS-proximal regions, are bias toward gene-associated regions. Significantly, analyses of G4 motifs in seven types of well-known gene regulatory elements revealed a constitutive enrichment pattern and the clusters of G4 motifs tend to be colocalized with regulatory elements. Considering our analysis from a genome evolutionary perspective, we found evidence that the occurrence and accumulation of certain progenitors and canonical G4 DNA motifs within regulatory regions were progressively favored by natural selection. Our results suggest that G4 DNA motifs are ‘colonized’ in regulatory regions, supporting a likely genome-wide role of G4 DNA in gene regulation. We hypothesize that G4 DNA is a regulatory apparatus situated in regulatory elements, acting as a molecular switch that can modulate the role of the host functional regions, by transition in DNA structure.
Collapse
Affiliation(s)
- Zhuo Du
- State Key Laboratory of Agrobiotechnology, College of Biological Science, China Agricultural University, Beijing 100193, PR China
| | | | | |
Collapse
|
26
|
Eddy J, Maizels N. Selection for the G4 DNA motif at the 5' end of human genes. Mol Carcinog 2009; 48:319-25. [PMID: 19306310 DOI: 10.1002/mc.20496] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Formation of G4 DNA may occur in the course of replication and transcription, and contribute to genomic instability. We have quantitated abundance of G4 motifs and potential for G4 DNA formation of the nontemplate strand of 5' exons and introns of transcripts of human genes. We find that, for all human genes, G4 motifs are enriched in 5' regions of transcripts relative to downstream regions; and in 5' regulatory regions relative to coding regions. Notably, although tumor suppressor genes are depleted and proto-oncogenes enriched in G4 motifs, abundance of G4 motifs in the 5' regions of transcripts of genes in these categories does not differ. These results support the hypothesis that G4 motifs are under selection in the human genome. They further show that for tumor suppressor genes and proto-oncogenes, independent selection determines potential for G4 DNA formation of 5' regulatory regions of transcripts and downstream coding regions.
Collapse
Affiliation(s)
- Johanna Eddy
- Molecular and Cellular Biology Graduate Program, University of Washington School of Medicine, 1959 N.E. Pacific Street, Seattle, WA 98195-7650, USA
| | | |
Collapse
|
27
|
Verma A, Yadav VK, Basundra R, Kumar A, Chowdhury S. Evidence of genome-wide G4 DNA-mediated gene expression in human cancer cells. Nucleic Acids Res 2009; 37:4194-204. [PMID: 19211664 PMCID: PMC2715224 DOI: 10.1093/nar/gkn1076] [Citation(s) in RCA: 110] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Guanine-rich DNA of a particular sequence adopts four-stranded structural forms known as G-quadruplex or G4 DNA. Though in vitro formation of G4 DNA is known for several years, in vivo presence of G4 DNA was only recently noted in eukaryote telomeres. Recent bioinformatics analyses showing prevalence of G4 DNA within promoters of human and related species seems to implicate G4 DNA in a genome-wide cis-regulatory role. Herein we demonstrate that G4 DNA may present regulatory sites on a genome-wide scale by showing widespread effect on gene expression in response to the established intracellular G4 DNA-binding ligands. This is particularly relevant to genes that harbor conserved potential G4 DNA (PG4 DNA) forming sequence across human, mouse and rat promoters of orthologous genes. Genes with conserved PG4 DNA in promoters show co-regulated expression in 79 human and 61 mouse normal tissues (z-score > 3.5; P < 0.0001). Conservation of G4 DNA across related species also emphasizes the biological importance of G4 DNA and its role in transcriptional regulation of genes; shedding light on a relatively novel mechanism of regulation of gene expression in eukaryotes.
Collapse
Affiliation(s)
- Anjali Verma
- Proteomics and Structural Biology Unit, Institute of Genomics and Integrative Biology, CSIR, Delhi 110 007, India
| | | | | | | | | |
Collapse
|
28
|
Mani P, Yadav VK, Das SK, Chowdhury S. Genome-wide analyses of recombination prone regions predict role of DNA structural motif in recombination. PLoS One 2009; 4:e4399. [PMID: 19198658 PMCID: PMC2635932 DOI: 10.1371/journal.pone.0004399] [Citation(s) in RCA: 64] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2008] [Accepted: 12/17/2008] [Indexed: 11/18/2022] Open
Abstract
HapMap findings reveal surprisingly asymmetric distribution of recombinogenic regions. Short recombinogenic regions (hotspots) are interspersed between large relatively non-recombinogenic regions. This raises the interesting possibility of DNA sequence and/or other cis- elements as determinants of recombination. We hypothesized the involvement of non-canonical sequences that can result in local non-B DNA structures and tested this using the G-quadruplex DNA as a model. G-quadruplex or G4 DNA is a unique form of four-stranded non-B DNA structure that engages certain G-rich sequences, presence of such motifs has been noted within telomeres. In support of this hypothesis, genome-wide computational analyses presented here reveal enrichment of potential G4 (PG4) DNA forming sequences within 25618 human hotspots relative to 9290 coldspots (p<0.0001). Furthermore, co-occurrence of PG4 DNA within several short sequence elements that are associated with recombinogenic regions was found to be significantly more than randomly expected. Interestingly, analyses of more than 50 DNA binding factors revealed that co-occurrence of PG4 DNA with target DNA binding sites of transcription factors c-Rel, NF-kappa B (p50 and p65) and Evi-1 was significantly enriched in recombination-prone regions. These observations support involvement of G4 DNA in recombination, predicting a functional model that is consistent with duplex-strand separation induced by formation of G4 motifs in supercoiled DNA and/or when assisted by other cellular factors.
Collapse
Affiliation(s)
- Prithvi Mani
- G. N. Ramachandran Knowledge Centre for Genome Informatics, Institute of Genomics and Integrative Biology, CSIR, Delhi, India
| | - Vinod Kumar Yadav
- G. N. Ramachandran Knowledge Centre for Genome Informatics, Institute of Genomics and Integrative Biology, CSIR, Delhi, India
| | - Swapan Kumar Das
- Functional Genomics Unit, Institute of Genomics and Integrative Biology, CSIR, Delhi, India
| | - Shantanu Chowdhury
- G. N. Ramachandran Knowledge Centre for Genome Informatics, Institute of Genomics and Integrative Biology, CSIR, Delhi, India
- Proteomics and Structural Biology Unit, Institute of Genomics and Integrative Biology, CSIR, Delhi, India
- * E-mail:
| |
Collapse
|
29
|
Halder K, Halder R, Chowdhury S. Genome-wide analysis predicts DNA structural motifs as nucleosome exclusion signals. MOLECULAR BIOSYSTEMS 2009; 5:1703-12. [DOI: 10.1039/b905132e] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
|
30
|
Verma A, Halder K, Halder R, Yadav VK, Rawal P, Thakur RK, Mohd F, Sharma A, Chowdhury S. Genome-wide computational and expression analyses reveal G-quadruplex DNA motifs as conserved cis-regulatory elements in human and related species. J Med Chem 2008; 51:5641-9. [PMID: 18767830 DOI: 10.1021/jm800448a] [Citation(s) in RCA: 175] [Impact Index Per Article: 10.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
Using a combination of in silico and experimental approaches, we present evidence that the G-quadruplex (G4) motif (an alternative higher-order DNA conformation) has regulatory potential. Genome-wide analyses of 99980 human, chimpanzee, mouse, and rat promoters showed enrichment of sequence with potential to adopt G4 (potential G4 or PG4) motifs near transcription start sites (TSS; P < 0.0001), supporting earlier findings. Interestingly, we found >700 orthologously related promoters in human, mouse, and rat conserve PG4 motif(s). The corresponding genes have enriched (z score > 4.0) tissue-specific expression in 75 of 79 human tissues and are significantly overrepresented in signaling and regulation of cell-cycle (P < 10(-05)). This is supported by results from whole genome expression experiments in human HeLa S3 cells following treatment with TMPyP4 [5,10,15,20-tetra(N-methyl-4-pyridyl) porphine chloride], which is known to bind the G4 motif inside cells. Our results implicate G4-motif mediated regulation as a more general mode of transcription control than currently appreciated.
Collapse
Affiliation(s)
- Anjali Verma
- Proteomics and Structural Biology Unit, Institute of Genomics and Integrative Biology, CSIR, Mall Road, Delhi 110 007, India
| | | | | | | | | | | | | | | | | |
Collapse
|
31
|
|
32
|
Zhuang XY, Tang J, Hao YH, Tan Z. Fast detection of quadruplex structure in DNA by the intrinsic fluorescence of a single-stranded DNA binding protein. J Mol Recognit 2008; 20:386-91. [PMID: 17891754 DOI: 10.1002/jmr.847] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Single-stranded guanine-rich (G-rich) DNA can fold into a four-stranded G-quadruplex structure and such structures are implicated in important biological processes and therapeutic applications. So far, bioinformatic analysis has identified up to several hundred thousand of putative quadruplex sequences in the genome of human and other animal. Given such a large number of sequences, a fast assay would be desired to experimentally verify the structure of these sequences. Here we describe a method that identifies the quadruplex structure by a single-stranded DNA binding protein from a thermoautotrophic archaeon. This protein binds single-stranded DNA in the unfolded, but not in the folded form. Upon binding to DNA, its fluorescence can be quenched by up to 70%. Formation of quadruplex greatly reduces fluorescence quenching in a K+-dependent manner. This structure-dependent quenching provides simple and fast detection of quadruplex in DNA at low concentration without DNA labelling.
Collapse
Affiliation(s)
- Xin-ying Zhuang
- Laboratory of Biochemistry and Biophysics, College of Life Sciences, Wuhan University, Wuhan 430072, PR China
| | | | | | | |
Collapse
|
33
|
Johnson JE, Smith JS, Kozak ML, Johnson FB. In vivo veritas: using yeast to probe the biological functions of G-quadruplexes. Biochimie 2008; 90:1250-63. [PMID: 18331848 DOI: 10.1016/j.biochi.2008.02.013] [Citation(s) in RCA: 68] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2007] [Accepted: 02/07/2008] [Indexed: 12/20/2022]
Abstract
Certain guanine-rich sequences are capable of forming higher order structures known as G-quadruplexes. Moreover, particular genomic regions in a number of highly divergent organisms are enriched for such sequences, raising the possibility that G-quadruplexes form in vivo and affect cellular processes. While G-quadruplexes have been rigorously studied in vitro, whether these structures actually form in vivo and what their roles might be in the context of the cell have remained largely unanswered questions. Recent studies suggest that G-quadruplexes participate in the regulation of such varied processes as telomere maintenance, transcriptional regulation and ribosome biogenesis. Here we review studies aimed at elucidating the in vivo functions of quadruplex structures, with a particular focus on findings in yeast. In addition, we discuss the utility of yeast model systems in the study of the cellular roles of G-quadruplexes.
Collapse
Affiliation(s)
- Jay E Johnson
- Department of Pathology and Laboratory Medicine, University of Pennsylvania School of Medicine, Philadelphia, PA 19104, USA
| | | | | | | |
Collapse
|
34
|
Eddy J, Maizels N. Conserved elements with potential to form polymorphic G-quadruplex structures in the first intron of human genes. Nucleic Acids Res 2008; 36:1321-33. [PMID: 18187510 PMCID: PMC2275096 DOI: 10.1093/nar/gkm1138] [Citation(s) in RCA: 226] [Impact Index Per Article: 14.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
Abstract
To understand how potential for G-quadruplex formation might influence regulation of gene expression, we examined the 2 kb spanning the transcription start sites (TSS) of the 18 217 human RefSeq genes, distinguishing contributions of template and nontemplate strands. Regions both upstream and downstream of the TSS are G-rich, but the downstream region displays a clear bias toward G-richness on the nontemplate strand. Upstream of the TSS, much of the G-richness and potential for G-quadruplex formation derives from the presence of well-defined canonical regulatory motifs in duplex DNA, including CpG dinucleotides which are sites of regulatory methylation, and motifs recognized by the transcription factor SP1. This challenges the notion that quadruplex formation upstream of the TSS contributes to regulation of gene expression. Downstream of the TSS, G-richness is concentrated in the first intron, and on the nontemplate strand, where polymorphic sequence elements with potential to form G-quadruplex structures and which cannot be accounted for by known regulatory motifs are found in almost 3000 (16%) of the human RefSeq genes, and are conserved through frogs. These elements could in principle be recognized either as DNA or as RNA, providing structural targets for regulation at the level of transcription or RNA processing.
Collapse
Affiliation(s)
- Johanna Eddy
- Molecular and Cellular Biology Graduate Program, University of Washington, Seattle, WA 98195-7650, USA
| | | |
Collapse
|
35
|
Hershman SG, Chen Q, Lee JY, Kozak ML, Yue P, Wang LS, Johnson FB. Genomic distribution and functional analyses of potential G-quadruplex-forming sequences in Saccharomyces cerevisiae. Nucleic Acids Res 2008; 36:144-56. [PMID: 17999996 PMCID: PMC2248735 DOI: 10.1093/nar/gkm986] [Citation(s) in RCA: 224] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2007] [Revised: 10/14/2007] [Accepted: 10/19/2007] [Indexed: 11/24/2022] Open
Abstract
Although well studied in vitro, the in vivo functions of G-quadruplexes (G4-DNA and G4-RNA) are only beginning to be defined. Recent studies have demonstrated enrichment for sequences with intramolecular G-quadruplex forming potential (QFP) in transcriptional promoters of humans, chickens and bacteria. Here we survey the yeast genome for QFP sequences and similarly find strong enrichment for these sequences in upstream promoter regions, as well as weaker but significant enrichment in open reading frames (ORFs). Further, four findings are consistent with roles for QFP sequences in transcriptional regulation. First, QFP is correlated with upstream promoter regions with low histone occupancy. Second, treatment of cells with N-methyl mesoporphyrin IX (NMM), which binds G-quadruplexes selectively in vitro, causes significant upregulation of loci with QFP-possessing promoters or ORFs. NMM also causes downregulation of loci connected with the function of the ribosomal DNA (rDNA), which itself has high QFP. Third, ORFs with QFP are selectively downregulated in sgs1 mutants that lack the G4-DNA-unwinding helicase Sgs1p. Fourth, a screen for yeast mutants that enhance or suppress growth inhibition by NMM revealed enrichment for chromatin and transcriptional regulators, as well as telomere maintenance factors. These findings raise the possibility that QFP sequences form bona fide G-quadruplexes in vivo and thus regulate transcription.
Collapse
Affiliation(s)
- Steve G. Hershman
- College of Arts and Sciences and Vagelos Scholars Program, University of Pennsylvania, Department of Pathology and Laboratory Medicine, Cell and Molecular Biology Graduate Program, Penn Center for Bioinformatics, and Penn Institute on Aging, University of Pennsylvania School of Medicine, Philadelphia, PA, USA
| | - Qijun Chen
- College of Arts and Sciences and Vagelos Scholars Program, University of Pennsylvania, Department of Pathology and Laboratory Medicine, Cell and Molecular Biology Graduate Program, Penn Center for Bioinformatics, and Penn Institute on Aging, University of Pennsylvania School of Medicine, Philadelphia, PA, USA
| | - Julia Y. Lee
- College of Arts and Sciences and Vagelos Scholars Program, University of Pennsylvania, Department of Pathology and Laboratory Medicine, Cell and Molecular Biology Graduate Program, Penn Center for Bioinformatics, and Penn Institute on Aging, University of Pennsylvania School of Medicine, Philadelphia, PA, USA
| | - Marina L. Kozak
- College of Arts and Sciences and Vagelos Scholars Program, University of Pennsylvania, Department of Pathology and Laboratory Medicine, Cell and Molecular Biology Graduate Program, Penn Center for Bioinformatics, and Penn Institute on Aging, University of Pennsylvania School of Medicine, Philadelphia, PA, USA
| | - Peng Yue
- College of Arts and Sciences and Vagelos Scholars Program, University of Pennsylvania, Department of Pathology and Laboratory Medicine, Cell and Molecular Biology Graduate Program, Penn Center for Bioinformatics, and Penn Institute on Aging, University of Pennsylvania School of Medicine, Philadelphia, PA, USA
| | - Li-San Wang
- College of Arts and Sciences and Vagelos Scholars Program, University of Pennsylvania, Department of Pathology and Laboratory Medicine, Cell and Molecular Biology Graduate Program, Penn Center for Bioinformatics, and Penn Institute on Aging, University of Pennsylvania School of Medicine, Philadelphia, PA, USA
| | - F. Brad Johnson
- College of Arts and Sciences and Vagelos Scholars Program, University of Pennsylvania, Department of Pathology and Laboratory Medicine, Cell and Molecular Biology Graduate Program, Penn Center for Bioinformatics, and Penn Institute on Aging, University of Pennsylvania School of Medicine, Philadelphia, PA, USA
| |
Collapse
|
36
|
Abstract
G-quadruplex or G4 DNA, a four-stranded DNA structure formed in G-rich sequences, has been hypothesized to be a structural motif involved in gene regulation. In this study, we examined the regulatory role of potential G4 DNA motifs (PG4Ms) located in the putative transcriptional regulatory region (TRR, -500 to +500) of genes across the human genome. We found that PG4Ms in the 500-bp region downstream of the annotated transcription start site (TSS; PG4M(D500)) are associated with gene expression. Generally, PG4M(D500)-positive genes are expressed at higher levels than PG4M(D500)-negative genes, and an increased number of PG4M(D500) provides a cumulative effect. This observation was validated by controlling for attributes, including gene family, function, and promoter similarity. We also observed an asymmetric pattern of PG4M(D500) distribution between strands, whereby the frequency of PG4M(D500) in the coding strand is generally higher than that in the template strand. Further analysis showed that the presence of PG4M(D500) and its strand asymmetry are associated with significant enrichment of RNAP II at the putative TRR. On the basis of these results, we propose a model of G4 DNA-mediated stimulation of transcription with the hypothesis that PG4M(D500) contributes to gene transcription by maintaining the DNA in an open conformation, while the asymmetric distribution of PG4M(D500) considerably reduces the probability of blocking the progression of the RNA polymerase complex on the template strand. Our findings provide a comprehensive view of the regulatory function of G4 DNA in gene transcription.
Collapse
|
37
|
Zhao Y, Du Z, Li N. Extensive selection for the enrichment of G4 DNA motifs in transcriptional regulatory regions of warm blooded animals. FEBS Lett 2007; 581:1951-6. [PMID: 17462634 DOI: 10.1016/j.febslet.2007.04.017] [Citation(s) in RCA: 62] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2007] [Revised: 04/02/2007] [Accepted: 04/03/2007] [Indexed: 01/01/2023]
Abstract
A comprehensive analysis of potential G4 DNA motifs (G4Ms) in genomic regions flanking transcription start sites (TSS) was performed across 13 animal species. We found that G4Ms are significantly enriched in the transcriptional regulatory regions (TRRs) of warm-blooded animals. Further analysis of human genes in different temporal groups reveals that the enrichment is not specific to genes found only in warm-blooded species but instead exist in a wide range of genes. Our findings therefore suggest that the high prevalence of G4Ms in TRRs is extensively selected in warm-blooded animals, supporting the hypothesis that G4Ms are involved in the regulation of gene transcription.
Collapse
Affiliation(s)
- Yiqiang Zhao
- State Key Laboratory for Agrobiotechnology, China Agricultural University, Beijing, 10094, China
| | | | | |
Collapse
|