1
Yi C, Liu Q, Huang Y, Liu C, Guo X, Fan C, Zhang K, Liu Y, Han F. Non-B-form DNA is associated with centromere stability in newly-formed polyploid wheat. Sci China Life Sci 2024; 67:1479-1488. [PMID: 38639838 DOI: 10.1007/s11427-023-2513-9] [Received: 10/07/2023] [Accepted: 12/18/2023] [Indexed: 04/20/2024]
Abstract
Non-B-form DNA differs from the classic B-DNA double helix structure and plays a crucial regulatory role in replication and transcription. However, the role of non-B-form DNA in centromeres, especially in polyploid wheat, remains elusive. Here, we systematically analyzed seven non-B-form DNA motif profiles (A-phased DNA repeat, direct repeat, G-quadruplex, inverted repeat, mirror repeat, short tandem repeat, and Z-DNA) in hexaploid wheat. We found that three of these non-B-form DNA motifs were enriched at centromeric regions, especially at the CENH3-binding sites, suggesting that non-B-form DNA may create a favorable loading environment for the CENH3 nucleosome. To investigate the dynamics of centromeric non-B form DNA during the alloploidization process, we analyzed DNA secondary structure using CENH3 ChIP-seq data from newly formed allotetraploid wheat and its two diploid ancestors. We found that newly formed allotetraploid wheat formed more non-B-form DNA in centromeric regions compared with their parents, suggesting that non-B-form DNA is related to the localization of the centromeric regions in newly formed wheat. Furthermore, non-B-form DNA enriched in the centromeric regions was found to preferentially form on young LTR retrotransposons, explaining CENH3's tendency to bind to younger LTR. Collectively, our study describes the landscape of non-B-form DNA in the wheat genome, and sheds light on its potential role in the evolution of polyploid centromeres.
Affiliation(s)
- Congyang Yi
- Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, 100101, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
- Qian Liu
- Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, 100101, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
- Yuhong Huang
- Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, 100101, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
- Chang Liu
- Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, 100101, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
- Xianrui Guo
- Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, 100101, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
- Chaolan Fan
- Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, 100101, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
- Kaibiao Zhang
- Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, 100101, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
- Yang Liu
- Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, 100101, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
- Fangpu Han
- Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, 100101, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
2
Pedraza-Reyes M, Abundiz-Yañez K, Rangel-Mendoza A, Martínez LE, Barajas-Ornelas RC, Cuéllar-Cruz M, Leyva-Sánchez HC, Ayala-García VM, Valenzuela-García LI, Robleto EA. Bacillus subtilis stress-associated mutagenesis and developmental DNA repair. Microbiol Mol Biol Rev 2024; 88:e0015823. [PMID: 38551349 PMCID: PMC11332352 DOI: 10.1128/mmbr.00158-23] [Indexed: 04/04/2024] [Open Access]
Abstract
SUMMARY: The metabolic conditions that prevail during bacterial growth have evolved with the faithful operation of repair systems that recognize and eliminate DNA lesions caused by intracellular and exogenous agents. This idea is supported by the low rate of spontaneous mutations (10⁻⁹) that occur in replicating cells, maintaining genome integrity. In contrast, when growth and/or replication cease, bacteria frequently process DNA lesions in an error-prone manner. DNA repair provides cells with the tools needed for maintaining homeostasis during stressful conditions and depends on the developmental context in which repair events occur. Thus, different physiological scenarios can be anticipated. In nutritionally stressed bacteria, different components of the base excision repair pathway may process damaged DNA in an error-prone manner, promoting genetic variability. Interestingly, suppressing the mismatch repair machinery and activating specific DNA glycosylases promote stationary-phase mutations. Current evidence also suggests that in resting cells, coupling repair processes to actively transcribed genes may promote multiple genetic transactions that are advantageous for stressed cells. DNA repair during sporulation is of interest as a model to understand how transcriptional processes influence the formation of mutations in conditions where replication is halted. Current reports indicate that transcriptional coupling repair-dependent and -independent processes operate in differentiating cells to process spontaneous and induced DNA damage and that error-prone synthesis of DNA is involved in these events. These and other noncanonical ways of DNA repair that contribute to mutagenesis, survival, and evolution are reviewed in this manuscript.
Affiliation(s)
- Mario Pedraza-Reyes
- Department of Biology, Division of Natural and Exact Sciences, University of Guanajuato, Guanajuato, Mexico
- Karen Abundiz-Yañez
- Department of Biology, Division of Natural and Exact Sciences, University of Guanajuato, Guanajuato, Mexico
- Alejandra Rangel-Mendoza
- Department of Biology, Division of Natural and Exact Sciences, University of Guanajuato, Guanajuato, Mexico
- Lissett E. Martínez
- Department of Biology, Division of Natural and Exact Sciences, University of Guanajuato, Guanajuato, Mexico
- Rocío C. Barajas-Ornelas
- Department of Biology, Division of Natural and Exact Sciences, University of Guanajuato, Guanajuato, Mexico
- Mayra Cuéllar-Cruz
- Department of Biology, Division of Natural and Exact Sciences, University of Guanajuato, Guanajuato, Mexico
- Luz I. Valenzuela-García
- Department of Sustainable Engineering, Advanced Materials Research Center (CIMAV), Arroyo Seco, Durango, Mexico
3
Raimer Young HM, Hou PC, Bartosik AR, Atkin ND, Wang L, Wang Z, Ratan A, Zang C, Wang YH. DNA fragility at topologically associated domain boundaries is promoted by alternative DNA secondary structure and topoisomerase II activity. Nucleic Acids Res 2024; 52:3837-3855. [PMID: 38452213 DOI: 10.1093/nar/gkae164] [Received: 08/15/2023] [Revised: 02/03/2024] [Accepted: 02/23/2024] [Indexed: 03/09/2024] [Open Access]
Abstract
CCCTC-binding factor (CTCF) binding sites are hotspots of genome instability. Although many factors have been associated with CTCF binding site fragility, no study has integrated all fragility-related factors to understand the mechanism(s) of how they work together. Using an unbiased, genome-wide approach, we found that DNA double-strand breaks (DSBs) are enriched at strong, but not weak, CTCF binding sites in five human cell types. Energetically favorable alternative DNA secondary structures underlie strong CTCF binding sites. These structures coincided with the location of topoisomerase II (TOP2) cleavage complex, suggesting that DNA secondary structure acts as a recognition sequence for TOP2 binding and cleavage at CTCF binding sites. Furthermore, CTCF knockdown significantly increased DSBs at strong CTCF binding sites and at CTCF sites that are located at topologically associated domain (TAD) boundaries. TAD boundary-associated CTCF sites that lost CTCF upon knockdown displayed increased DSBs when compared to the gained sites, and those lost sites are overrepresented with G-quadruplexes, suggesting that the structures act as boundary insulators in the absence of CTCF, and contribute to increased DSBs. These results model how alternative DNA secondary structures facilitate recruitment of TOP2 to CTCF binding sites, providing mechanistic insight into DNA fragility at CTCF binding sites.
Affiliation(s)
- Heather M Raimer Young
- Department of Biochemistry and Molecular Genetics, University of Virginia School of Medicine, Charlottesville, VA 22908-0733, USA
- Pei-Chi Hou
- Department of Biochemistry and Molecular Genetics, University of Virginia School of Medicine, Charlottesville, VA 22908-0733, USA
- Anna R Bartosik
- Department of Biochemistry and Molecular Genetics, University of Virginia School of Medicine, Charlottesville, VA 22908-0733, USA
- Naomi D Atkin
- Department of Biochemistry and Molecular Genetics, University of Virginia School of Medicine, Charlottesville, VA 22908-0733, USA
- Lixin Wang
- Department of Biomedical Engineering, University of Virginia, Charlottesville, VA 22908, USA
- Zhenjia Wang
- Center for Public Health Genomics, University of Virginia School of Medicine, Charlottesville, VA 22908-0717, USA
- Aakrosh Ratan
- Department of Biochemistry and Molecular Genetics, University of Virginia School of Medicine, Charlottesville, VA 22908-0733, USA
- Center for Public Health Genomics, University of Virginia School of Medicine, Charlottesville, VA 22908-0717, USA
- Department of Public Health Sciences, University of Virginia School of Medicine, Charlottesville, VA 22908, USA
- University of Virginia Comprehensive Cancer Center, Charlottesville, VA 22908, USA
- Chongzhi Zang
- Department of Biochemistry and Molecular Genetics, University of Virginia School of Medicine, Charlottesville, VA 22908-0733, USA
- Center for Public Health Genomics, University of Virginia School of Medicine, Charlottesville, VA 22908-0717, USA
- Department of Public Health Sciences, University of Virginia School of Medicine, Charlottesville, VA 22908, USA
- University of Virginia Comprehensive Cancer Center, Charlottesville, VA 22908, USA
- Yuh-Hwa Wang
- Department of Biochemistry and Molecular Genetics, University of Virginia School of Medicine, Charlottesville, VA 22908-0733, USA
- University of Virginia Comprehensive Cancer Center, Charlottesville, VA 22908, USA
4
Ajit K, Alagia A, Burger K, Gullerova M. Tyrosine 1-phosphorylated RNA polymerase II transcribes PROMPTs to facilitate proximal promoter pausing and induce global transcriptional repression in response to DNA damage. Genome Res 2024; 34:201-216. [PMID: 38467418 PMCID: PMC10984383 DOI: 10.1101/gr.278644.123] [Received: 10/18/2023] [Accepted: 02/15/2024] [Indexed: 03/13/2024]
Abstract
DNA damage triggers a complex transcriptional response that involves both activation and repression of gene expression. In this study, we investigated global changes in transcription in response to ionizing irradiation (IR), which induces double-strand breaks in DNA. We used mNET-seq to profile nascent transcripts bound to different phosphorylated forms of the RNA polymerase II (RNA Pol II) C-terminal domain (CTD). We found that IR leads to global transcriptional repression of protein-coding genes, accompanied by an increase in antisense transcripts near promoters, called PROMPTs, transcribed by RNA Pol II phosphorylated on tyrosine 1 (Y1P) residue of the CTD. These Y1P-transcribed PROMPTs are enriched for PRC2 binding sites and associated with RNA Pol II proximal promoter pausing. We show the interaction between Y1P RNA Pol II and PRC2, as well as PRC2 binding to PROMPTs. Inhibition of PROMPTs or depletion of PRC2 leads to loss of transcriptional repression. Our results reveal a novel function of Y1P-dependent PROMPTs in mediating PRC2 recruitment to chromatin and RNA Pol II promoter pausing in response to DNA damage.
Affiliation(s)
- Kamal Ajit
- Sir William Dunn School of Pathology, Oxford, OX1 3RE, United Kingdom
- Adele Alagia
- Sir William Dunn School of Pathology, Oxford, OX1 3RE, United Kingdom
- Kaspar Burger
- Mildred Scheel Early Career Center for Cancer Research, University Hospital Würzburg, 97080 Würzburg, Germany
- Department of Biochemistry and Molecular Biology, Biocenter of the University of Würzburg, 97074 Würzburg, Germany
- Monika Gullerova
- Sir William Dunn School of Pathology, Oxford, OX1 3RE, United Kingdom
5
Paramonova N, Trapina I, Gradauskiene (Sitkauskiene) B, Plavina S, Tamasauskiene L, Bastyte D, Rumba-Rozenfelde I, Tapina S, Stakaitiene I, Ugenskiene R, Shih-Hsin Wu L, Wang JY, Hsieh MH, Chen PC, Sjakste N. Genetic Diversity in Bronchial Asthma Susceptibility: Exploring the Role of Vitamin D Receptor Gene Polymorphisms in Varied Geographic Contexts. Int J Mol Sci 2024; 25:1943. [PMID: 38339221 PMCID: PMC10856277 DOI: 10.3390/ijms25031943] [Received: 12/18/2023] [Revised: 01/18/2024] [Accepted: 01/28/2024] [Indexed: 02/12/2024] [Open Access]
Abstract
Bronchial asthma (BA) exhibits varying prevalence across global populations, prompting a comprehensive investigation into genetic and environmental determinants. Vitamin D is a potent immunomodulator capable of suppressing inflammatory signals in several cell types involved in the asthmatic response; it exerts effects on the immune system by binding to the nuclear vitamin D receptor (VDR). Genetic variations in the VDR gene affect serum vitamin D levels, with a possible role in BA risk. The current study aimed to examine the complex interaction of various factors (genetic background, serum vitamin D levels, and geographic location) to identify differences in the influence of these factors on the susceptibility to asthma between populations at different latitudes. Focusing on Eastern European cohorts from Latvia and Lithuania and comparing them with published data on East Asian populations, we explore the impact of VDR gene polymorphisms on BA susceptibility. Genotyping four key VDR SNPs and assessing their association with 25-hydroxyvitamin D levels, our study unveils significant associations of the studied loci with the risk of asthma, with both risk-reducing and risk-increasing effects distributed differently between Baltic and East Asian populations. The functional effects of VDR gene genetic variations are also identified in silico and discussed.
Affiliation(s)
- Natalia Paramonova
- Laboratory of Genomics and Bioinformatics, Institute of Biology, University of Latvia, LV-1004 Riga, Latvia
- Ilva Trapina
- Laboratory of Genomics and Bioinformatics, Institute of Biology, University of Latvia, LV-1004 Riga, Latvia
- Samanta Plavina
- Laboratory of Genomics and Bioinformatics, Institute of Biology, University of Latvia, LV-1004 Riga, Latvia
- Laura Tamasauskiene
- Department of Immunology and Allergology, Lithuanian University of Health Sciences, LT-50161 Kaunas, Lithuania
- Daina Bastyte
- Department of Immunology and Allergology, Lithuanian University of Health Sciences, LT-50161 Kaunas, Lithuania
- Sandra Tapina
- Faculty of Medicine, University of Latvia, LV-1586 Riga, Latvia
- Ieva Stakaitiene
- Department of Genetics and Molecular Medicine, Lithuanian University of Health Sciences, LT-50161 Kaunas, Lithuania
- Rasa Ugenskiene
- Department of Genetics and Molecular Medicine, Lithuanian University of Health Sciences, LT-50161 Kaunas, Lithuania
- Lawrence Shih-Hsin Wu
- Graduate Institute of Biomedical Sciences, China Medical University, Taichung 406040, Taiwan
- Research Center of Allergy, Immunology, and Microbiome (AIM), China Medical University Hospital, Taichung 404327, Taiwan
- Jiu-Yao Wang
- Research Center of Allergy, Immunology, and Microbiome (AIM), China Medical University Hospital, Taichung 404327, Taiwan
- Department of Allergy and Immunology, China Medical University Children’s Hospital, Taichung 404327, Taiwan
- Miao-Hsi Hsieh
- Research Center of Allergy, Immunology, and Microbiome (AIM), China Medical University Hospital, Taichung 404327, Taiwan
- Pei-Chi Chen
- Research Center of Allergy, Immunology, and Microbiome (AIM), China Medical University Hospital, Taichung 404327, Taiwan
- Nikolajs Sjakste
- Laboratory of Genomics and Bioinformatics, Institute of Biology, University of Latvia, LV-1004 Riga, Latvia
6
Dueñas Rey A, Del Pozo Valero M, Bouckaert M, Wood KA, Van den Broeck F, Daich Varela M, Thomas HB, Van Heetvelde M, De Bruyne M, Van de Sompele S, Bauwens M, Lenaerts H, Mahieu Q, Josifova D, Rivolta C, O'Keefe RT, Ellingford J, Webster AR, Arno G, Ayuso C, De Zaeytijd J, Leroy BP, De Baere E, Coppieters F. Combining a prioritization strategy and functional studies nominates 5'UTR variants underlying inherited retinal disease. Genome Med 2024; 16:7. [PMID: 38184646 PMCID: PMC10771650 DOI: 10.1186/s13073-023-01277-1] [Received: 06/16/2023] [Accepted: 12/15/2023] [Indexed: 01/08/2024] [Open Access]
Abstract
BACKGROUND: 5' untranslated regions (5'UTRs) are essential modulators of protein translation. Predicting the impact of 5'UTR variants is challenging and rarely performed in routine diagnostics. Here, we present a combined approach of a comprehensive prioritization strategy and functional assays to evaluate 5'UTR variation in two large cohorts of patients with inherited retinal diseases (IRDs). METHODS: We performed an isoform-level re-analysis of retinal RNA-seq data to identify the protein-coding transcripts of 378 IRD genes with highest expression in retina. We evaluated the coverage of their 5'UTRs by different whole exome sequencing (WES) kits. The selected 5'UTRs were analyzed in whole genome sequencing (WGS) and WES data from IRD sub-cohorts from the 100,000 Genomes Project (n = 2397 WGS) and an in-house database (n = 1682 WES), respectively. Identified variants were annotated for 5'UTR-relevant features and classified into seven categories based on their predicted functional consequence. We developed a variant prioritization strategy by integrating population frequency, specific criteria for each category, and family and phenotypic data. A selection of candidate variants underwent functional validation using diverse approaches. RESULTS: Isoform-level re-quantification of retinal gene expression revealed 76 IRD genes with a non-canonical retina-enriched isoform, of which 20 display a fully distinct 5'UTR compared to that of their canonical isoform. Depending on the probe design, 3-20% of IRD genes have 5'UTRs fully captured by WES. After analyzing these regions in both cohorts, we prioritized 11 (likely) pathogenic variants in 10 genes (ARL3, MERTK, NDP, NMNAT1, NPHP4, PAX6, PRPF31, PRPF4, RDH12, RD3), of which 7 were novel. Functional analyses further supported the pathogenicity of three variants. Mis-splicing was demonstrated for the PRPF31:c.-9+1G>T variant. The MERTK:c.-125G>A variant, overlapping a transcriptional start site, was shown to significantly reduce both luciferase mRNA levels and activity. The RDH12:c.-123C>T variant was found in cis with the hypomorphic RDH12:c.701G>A (p.Arg234His) variant in 11 patients. This 5'UTR variant, predicted to introduce an upstream open reading frame, was shown to result in reduced RDH12 protein but unaltered mRNA levels. CONCLUSIONS: This study demonstrates the importance of 5'UTR variants implicated in IRDs and provides a systematic approach for 5'UTR annotation and validation that is applicable to other inherited diseases.
Affiliation(s)
- Alfredo Dueñas Rey
- Center for Medical Genetics Ghent (CMGG), Ghent University Hospital, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Corneel Heymanslaan 10, Ghent, 9000, Belgium
- Marta Del Pozo Valero
- Center for Medical Genetics Ghent (CMGG), Ghent University Hospital, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Corneel Heymanslaan 10, Ghent, 9000, Belgium
- Department of Genetics, Instituto de Investigación Sanitaria-Fundación Jiménez Díaz, University Hospital, Universidad Autónoma de Madrid (IIS-FJD, UAM), Madrid, Spain
- Manon Bouckaert
- Center for Medical Genetics Ghent (CMGG), Ghent University Hospital, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Corneel Heymanslaan 10, Ghent, 9000, Belgium
- Katherine A Wood
- Division of Evolution, Infection and Genomics, School of Biological Sciences, Faculty of Biology, Medicines and Health, University of Manchester, Manchester, UK
- Filip Van den Broeck
- Department of Ophthalmology, Ghent University Hospital, Ghent, Belgium
- Department of Head & Skin, Ghent University, Ghent, Belgium
- Malena Daich Varela
- UCL Institute of Ophthalmology, University College London, London, UK
- Moorfields Eye Hospital, London, UK
- Huw B Thomas
- Division of Evolution, Infection and Genomics, School of Biological Sciences, Faculty of Biology, Medicines and Health, University of Manchester, Manchester, UK
- Mattias Van Heetvelde
- Center for Medical Genetics Ghent (CMGG), Ghent University Hospital, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Corneel Heymanslaan 10, Ghent, 9000, Belgium
- Marieke De Bruyne
- Center for Medical Genetics Ghent (CMGG), Ghent University Hospital, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Corneel Heymanslaan 10, Ghent, 9000, Belgium
- Stijn Van de Sompele
- Center for Medical Genetics Ghent (CMGG), Ghent University Hospital, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Corneel Heymanslaan 10, Ghent, 9000, Belgium
- Miriam Bauwens
- Center for Medical Genetics Ghent (CMGG), Ghent University Hospital, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Corneel Heymanslaan 10, Ghent, 9000, Belgium
- Hanne Lenaerts
- Center for Medical Genetics Ghent (CMGG), Ghent University Hospital, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Corneel Heymanslaan 10, Ghent, 9000, Belgium
- Quinten Mahieu
- Center for Medical Genetics Ghent (CMGG), Ghent University Hospital, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Corneel Heymanslaan 10, Ghent, 9000, Belgium
- Carlo Rivolta
- Department of Ophthalmology, University of Basel, Basel, Switzerland
- Institute of Molecular and Clinical Ophthalmology Basel (IOB), Basel, Switzerland
- Department of Genetics and Genome Biology, University of Leicester, Leicester, UK
- Raymond T O'Keefe
- Division of Evolution, Infection and Genomics, School of Biological Sciences, Faculty of Biology, Medicines and Health, University of Manchester, Manchester, UK
- Jamie Ellingford
- Division of Evolution, Infection and Genomics, School of Biological Sciences, Faculty of Biology, Medicines and Health, University of Manchester, Manchester, UK
- Genomics England, London, UK
- Manchester Centre for Genomic Medicine, St Mary's Hospital, Manchester University NHS Foundation Trust, Manchester, UK
- Andrew R Webster
- UCL Institute of Ophthalmology, University College London, London, UK
- Moorfields Eye Hospital, London, UK
- Gavin Arno
- UCL Institute of Ophthalmology, University College London, London, UK
- Moorfields Eye Hospital, London, UK
- Carmen Ayuso
- Department of Genetics, Instituto de Investigación Sanitaria-Fundación Jiménez Díaz, University Hospital, Universidad Autónoma de Madrid (IIS-FJD, UAM), Madrid, Spain
- Center for Biomedical Network Research on Rare Diseases (CIBERER), Instituto de Salud Carlos III, Madrid, Spain
- Julie De Zaeytijd
- Department of Ophthalmology, Ghent University Hospital, Ghent, Belgium
- Department of Head & Skin, Ghent University, Ghent, Belgium
- Bart P Leroy
- Center for Medical Genetics Ghent (CMGG), Ghent University Hospital, Ghent, Belgium
- Department of Ophthalmology, Ghent University Hospital, Ghent, Belgium
- Department of Head & Skin, Ghent University, Ghent, Belgium
- Division of Ophthalmology, Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Elfride De Baere
- Center for Medical Genetics Ghent (CMGG), Ghent University Hospital, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Corneel Heymanslaan 10, Ghent, 9000, Belgium
- Frauke Coppieters
- Center for Medical Genetics Ghent (CMGG), Ghent University Hospital, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Corneel Heymanslaan 10, Ghent, 9000, Belgium
- Department of Pharmaceutics, Ghent University, Ghent, Belgium
7
Kühnl F, Stadler PF, Findeiß S. Assessing the Quality of Cotranscriptional Folding Simulations. Methods Mol Biol 2024; 2726:347-376. [PMID: 38780738 DOI: 10.1007/978-1-0716-3519-3_14] [Indexed: 05/25/2024]
Abstract
Structural changes in RNAs are an important contributor to controlling gene expression not only at the posttranscriptional stage but also during transcription. A subclass of riboswitches and RNA thermometers located in the 5' region of the primary transcript regulates the downstream functional unit - usually an ORF - through premature termination of transcription. Not only do such elements occur naturally, but they are also attractive devices in synthetic biology. The possibility to design such riboswitches or RNA thermometers is thus of considerable practical interest. Since these functional RNA elements act already during transcription, it is important to model and understand the dynamics of folding and, in particular, the formation of intermediate structures concurrently with transcription. Cotranscriptional folding simulations are therefore an important step to verify the functionality of design constructs before conducting expensive and labor-intensive wet lab experiments. For RNAs, full-fledged molecular dynamics simulations are far beyond practical reach because of both the size of the molecules and the timescales of interest. Even at the simplified level of secondary structures, further approximations are necessary. The BarMap approach is based on representing the secondary structure landscape for each individual transcription step by a coarse-grained representation that only retains a small set of low-energy local minima and the energy barriers between them. The folding dynamics between two transcriptional elongation steps is modeled as a Markov process on this representation. Maps between pairs of consecutive coarse-grained landscapes make it possible to follow the folding process as it changes in response to transcription elongation. In its original implementation, the BarMap software provides a general framework to investigate RNA folding dynamics on temporally changing landscapes. It is, however, difficult to use, in particular for specific scenarios such as cotranscriptional folding. To overcome this limitation, we developed the user-friendly BarMap-QA pipeline described in detail in this contribution. It is illustrated here by an elaborate example that emphasizes the careful monitoring of several quality measures. Using an iterative workflow, a reliable and complete kinetics simulation of a synthetic, transcription-regulating riboswitch is obtained using minimal computational resources. All programs and scripts used in this contribution are free software and available for download as a source distribution for Linux® or as a platform-independent Docker® image including support for Apple macOS® and Microsoft Windows®.
Affiliation(s)
- Felix Kühnl
- Bioinformatics Group, Department of Computer Science, and Interdisciplinary Center for Bioinformatics, Leipzig University, Leipzig, Germany
- Peter F Stadler
- Bioinformatics Group, Department of Computer Science, Interdisciplinary Center of Bioinformatics, German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, Competence Center for Scalable Data Services and Solutions, and Leipzig Research Center for Civilization Diseases, Leipzig University, Leipzig, Germany
- Max Planck Institute for Mathematics in the Sciences, Leipzig, Germany
- Institute for Theoretical Chemistry, University of Vienna, Vienna, Austria
- Facultad de Ciencias, Universidad Nacional de Colombia, Bogotá, D.C., Colombia
- Santa Fe Institute, Santa Fe, NM, USA
- Sven Findeiß
- Bioinformatics Group, Department of Computer Science, and Interdisciplinary Center for Bioinformatics, Leipzig University, Leipzig, Germany
8
Bunch H, Kim D, Naganuma M, Nakagawa R, Cong A, Jeong J, Ehara H, Vu H, Chang JH, Schellenberg MJ, Sekine SI. ERK2-topoisomerase II regulatory axis is important for gene activation in immediate early genes. Nat Commun 2023; 14:8341. [PMID: 38097570 PMCID: PMC10721843 DOI: 10.1038/s41467-023-44089-y] [Received: 09/14/2022] [Accepted: 11/29/2023] [Indexed: 12/17/2023] [Open Access]
Abstract
The function of the mitogen-activated protein kinase signaling pathway is required for the activation of immediate early genes (IEGs), including EGR1 and FOS, for cell growth and proliferation. Recent studies have identified topoisomerase II (TOP2) as one of the important regulators of the transcriptional activation of IEGs. However, the mechanism underlying transcriptional regulation involving TOP2 in IEG activation has remained unknown. Here, we demonstrate that ERK2, but not ERK1, is important for IEG transcriptional activation and report a critical ELK1 binding sequence for ERK2 function at the EGR1 gene. Our data indicate that both ERK1 and ERK2 extensively phosphorylate the C-terminal domain of TOP2B at mutual and distinctive residues. Although both ERK1 and ERK2 enhance the catalytic rate of TOP2B required to relax positive DNA supercoiling, ERK2 delays TOP2B catalysis of negative DNA supercoiling. In addition, ERK1 may relax DNA supercoiling by itself. ERK2 catalytic inhibition or knock-down interferes with transcription and deregulates TOP2B in IEGs. Furthermore, we present the first cryo-EM structure of the human cell-purified TOP2B and etoposide together with the EGR1 transcriptional start site (-30 to +20) that has the strongest affinity to TOP2B within -423 to +332. The structure shows TOP2B-mediated breakage and dramatic bending of the DNA. Transcription is activated by etoposide, while it is inhibited by ICRF193 at EGR1 and FOS, suggesting that TOP2B-mediated DNA breaks favor transcriptional activation. Taken together, this study suggests that activated ERK2 phosphorylates TOP2B to regulate TOP2-DNA interactions and favor transcriptional activation in IEGs. We propose that TOP2B association, catalysis, and dissociation on its substrate DNA are important processes for regulating transcription and that ERK2-mediated TOP2B phosphorylation may be key for the catalysis and dissociation steps.
Affiliation(s)
- Heeyoun Bunch
- Department of Applied Biosciences, Kyungpook National University, Daegu, 41566, Republic of Korea.
- School of Applied Biosciences, College of Agriculture & Life Sciences, Kyungpook National University, Daegu, 41566, Republic of Korea.
| | - Deukyeong Kim
- School of Applied Biosciences, College of Agriculture & Life Sciences, Kyungpook National University, Daegu, 41566, Republic of Korea
- Department of Biological Sciences, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, 34141, Republic of Korea
| | - Masahiro Naganuma
- Laboratory for Transcription Structural Biology, RIKEN Center for Biosystems Dynamics Research, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 230-0045, Japan
| | - Reiko Nakagawa
- RIKEN BDR Laboratory for Phyloinformatics, Hyogo, 650-0047, Japan
| | - Anh Cong
- Department of Biochemistry and Molecular Biology, Mayo Clinic, Rochester, MN, 55905, USA
| | - Jaehyeon Jeong
- Department of Applied Biosciences, Kyungpook National University, Daegu, 41566, Republic of Korea
| | - Haruhiko Ehara
- Laboratory for Transcription Structural Biology, RIKEN Center for Biosystems Dynamics Research, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 230-0045, Japan
| | - Hongha Vu
- Department of Biology Education, Kyungpook National University, Daegu, 41566, Republic of Korea
| | - Jeong Ho Chang
- Department of Biology Education, Kyungpook National University, Daegu, 41566, Republic of Korea
| | - Matthew J Schellenberg
- Department of Biochemistry and Molecular Biology, Mayo Clinic, Rochester, MN, 55905, USA
| | - Shun-Ichi Sekine
- Laboratory for Transcription Structural Biology, RIKEN Center for Biosystems Dynamics Research, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 230-0045, Japan
|
9
|
Lu Y, Lee J, Li J, Allu SR, Wang J, Kim H, Bullaughey KL, Fisher SA, Nordgren CE, Rosario JG, Anderson SA, Ulyanova AV, Brem S, Chen HI, Wolf JA, Grady MS, Vinogradov SA, Kim J, Eberwine J. CHEX-seq detects single-cell genomic single-stranded DNA with catalytical potential. Nat Commun 2023; 14:7346. [PMID: 37963886 PMCID: PMC10645931 DOI: 10.1038/s41467-023-43158-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/30/2023] [Accepted: 11/02/2023] [Indexed: 11/16/2023] Open
Abstract
Genomic DNA (gDNA) undergoes structural interconversion between single- and double-stranded states during transcription, DNA repair and replication, which is critical for cellular homeostasis. We describe "CHEX-seq" which identifies the single-stranded DNA (ssDNA) in situ in individual cells. CHEX-seq uses 3'-terminal blocked, light-activatable probes to prime the copying of ssDNA into complementary DNA that is sequenced, thereby reporting the genome-wide single-stranded chromatin landscape. CHEX-seq is benchmarked in human K562 cells, and its utilities are demonstrated in cultures of mouse and human brain cells as well as immunostained spatially localized neurons in brain sections. The amount of ssDNA is dynamically regulated in response to perturbation. CHEX-seq also identifies single-stranded regions of mitochondrial DNA in single cells. Surprisingly, CHEX-seq identifies single-stranded loci in mouse and human gDNA that catalyze porphyrin metalation in vitro, suggesting a catalytic activity for genomic ssDNA. We posit that endogenous DNA enzymatic activity is a function of genomic ssDNA.
Affiliation(s)
- Youtao Lu
- Department of Biology, School of Arts and Sciences, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Jaehee Lee
- Department of Systems Pharmacology and Translational Therapeutics Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Jifen Li
- Department of Systems Pharmacology and Translational Therapeutics Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Srinivasa Rao Allu
- Department of Biochemistry and Biophysics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Jinhui Wang
- Department of Systems Pharmacology and Translational Therapeutics Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - HyunBum Kim
- Department of Systems Pharmacology and Translational Therapeutics Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Kevin L Bullaughey
- Department of Biology, School of Arts and Sciences, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Stephen A Fisher
- Department of Biology, School of Arts and Sciences, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - C Erik Nordgren
- Department of Biology, School of Arts and Sciences, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Jean G Rosario
- Department of Biology, School of Arts and Sciences, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Stewart A Anderson
- Department of Psychiatry, Children's Hospital of Philadelphia, ARC 517, 3615 Civic Center Blvd, Philadelphia, PA, 19104, USA
| | - Alexandra V Ulyanova
- Department of Neurosurgery, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Steven Brem
- Department of Neurosurgery, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - H Isaac Chen
- Department of Neurosurgery, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - John A Wolf
- Department of Neurosurgery, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - M Sean Grady
- Department of Neurosurgery, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Sergei A Vinogradov
- Department of Biochemistry and Biophysics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Junhyong Kim
- Department of Biology, School of Arts and Sciences, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - James Eberwine
- Department of Systems Pharmacology and Translational Therapeutics Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA.
|
10
|
Benegas G, Batra SS, Song YS. DNA language models are powerful predictors of genome-wide variant effects. Proc Natl Acad Sci U S A 2023; 120:e2311219120. [PMID: 37883436 PMCID: PMC10622914 DOI: 10.1073/pnas.2311219120] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Received: 07/03/2023] [Accepted: 09/08/2023] [Indexed: 10/28/2023] Open
Abstract
The expanding catalog of genome-wide association studies (GWAS) provides biological insights across a variety of species, but identifying the causal variants behind these associations remains a significant challenge. Experimental validation is both labor-intensive and costly, highlighting the need for accurate, scalable computational methods to predict the effects of genetic variants across the entire genome. Inspired by recent progress in natural language processing, unsupervised pretraining on large protein sequence databases has proven successful in extracting complex information related to proteins. These models showcase their ability to learn variant effects in coding regions using an unsupervised approach. Expanding on this idea, we here introduce the Genomic Pre-trained Network (GPN), a model designed to learn genome-wide variant effects through unsupervised pretraining on genomic DNA sequences. Our model also successfully learns gene structure and DNA motifs without any supervision. To demonstrate its utility, we train GPN on unaligned reference genomes of Arabidopsis thaliana and seven related species within the Brassicales order and evaluate its ability to predict the functional impact of genetic variants in A. thaliana by utilizing allele frequencies from the 1001 Genomes Project and a comprehensive database of GWAS. Notably, GPN outperforms predictors based on popular conservation scores such as phyloP and phastCons. Our predictions for A. thaliana can be visualized as sequence logos in the UCSC Genome Browser (https://genome.ucsc.edu/s/gbenegas/gpn-arabidopsis). We provide code (https://github.com/songlab-cal/gpn) to train GPN for any given species using its DNA sequence alone, enabling unsupervised prediction of variant effects across the entire genome.
Affiliation(s)
- Gonzalo Benegas
- Graduate Group in Computational Biology, University of California, Berkeley, CA 94720
| | | | - Yun S. Song
- Computer Science Division, University of California, Berkeley, CA 94720
- Department of Statistics, University of California, Berkeley, CA 94720
- Center for Computational Biology, University of California, Berkeley, CA 94720
|
11
|
How does precursor RNA structure influence RNA processing and gene expression? Biosci Rep 2023; 43:232489. [PMID: 36689327 PMCID: PMC9977717 DOI: 10.1042/bsr20220149] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 06/20/2022] [Revised: 01/17/2023] [Accepted: 01/23/2023] [Indexed: 01/24/2023] Open
Abstract
RNA is a fundamental biomolecule that has many purposes within cells. Due to its single-stranded and flexible nature, RNA naturally folds into complex and dynamic structures. Recent technological and computational advances have produced an explosion of RNA structural data. Many RNA structures have regulatory and functional properties. Studying the structure of nascent RNAs is particularly challenging due to their low abundance and long length, but their structures are important because they can influence RNA processing. Precursor RNA processing is a nexus of pathways that determines mature isoform composition and that controls gene expression. In this review, we examine what is known about human nascent RNA structure and the influence of RNA structure on processing of precursor RNAs. These known structures provide examples of how other nascent RNAs may be structured and show how novel RNA structures may influence RNA processing including splicing and polyadenylation. RNA structures can be targeted therapeutically to treat disease.
|
12
|
Roles of G4-DNA and G4-RNA in Class Switch Recombination and Additional Regulations in B-Lymphocytes. Molecules 2023; 28:1159. [PMID: 36770824 PMCID: PMC9921937 DOI: 10.3390/molecules28031159] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Received: 11/24/2022] [Revised: 01/18/2023] [Accepted: 01/19/2023] [Indexed: 01/26/2023] Open
Abstract
Mature B cells notably diversify immunoglobulin (Ig) production through class switch recombination (CSR), allowing the junction of distant "switch" (S) regions. CSR is initiated by activation-induced deaminase (AID), which targets cytosines adequately exposed within single-stranded DNA of transcribed targeted S regions, with a specific affinity for WRCY motifs. In mammals, G-rich sequences are additionally present in S regions, forming canonical G-quadruplex (G4) DNA structures, which favor CSR. Small molecules interacting with G4-DNA (G4 ligands) proved able to regulate CSR in B lymphocytes, either positively (such as for nucleoside diphosphate kinase isoforms) or negatively (such as for RHPS4). G4-DNA is also implicated in the control of transcription, and due to their impact on both CSR and transcriptional regulation, G4-rich sequences likely play a role in the natural history of B cell malignancies. Since G4-DNA stands at multiple locations in the genome, notably within oncogene promoters, it remains to be clarified how it can more specifically promote legitimate CSR in physiology, rather than pathogenic translocation. The specific regulatory role of G4 structures in transcribed DNA and/or in corresponding transcripts and recombination hereby appears as a major issue for understanding immune responses and lymphomagenesis.
|
13
|
Bunch H. Studying RNA Polymerase II Promoter-Proximal Pausing by In Vitro Immobilized Template and Transcription Assays. Methods Mol Biol 2023; 2693:13-24. [PMID: 37540423 DOI: 10.1007/978-1-0716-3342-7_2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 08/05/2023]
Abstract
The immobilized template assay is a versatile biochemical method for studying protein-nucleic acid interactions. Using this method, immobilized nucleic acid-associated or specific proteins can be identified and quantified by techniques such as mass spectrometry and immunoblotting. Here, a modified immobilized template assay combined with in vitro transcription assay to study the function of transcription factors and transcriptional activities at the human heat shock protein 70 (HSP70) gene is described. Notably, this method can be applied to study other important genes and transcription factors in vitro.
Affiliation(s)
- Heeyoun Bunch
- Department of Applied Biosciences, Kyungpook National University, Daegu, Republic of Korea
|
14
|
Kari H, Bandi SMS, Kumar A, Yella VR. DeePromClass: Delineator for Eukaryotic Core Promoters Employing Deep Neural Networks. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:802-807. [PMID: 35353704 DOI: 10.1109/tcbb.2022.3163418] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/04/2023]
Abstract
Computational promoter identification in eukaryotes is a classical biological problem that should be refurbished with the availability of an avalanche of experimental data and emerging deep learning technologies. The current knowledge indicates that eukaryotic core promoters display multifarious signals such as TATA-Box, Inr element, TCT, and Pause-button, etc., and structural motifs such as G-quadruplexes. In the present study, we combined the power of deep learning with a plethora of promoter motifs to delineate promoters and non-promoters gleaned from the statistical properties of DNA sequence arrangement. To this end, we implemented convolutional neural network (CNN) and long short-term memory (LSTM) recurrent neural network architecture for five model systems with [-100 to +50] segments relative to the transcription start site being the core promoter. Unlike previous state-of-the-art tools, which furnish a binary decision of promoter or non-promoter, we classify a chunk of 151mer sequence into a promoter along with the consensus signal type or a non-promoter. The combined CNN-LSTM model, which we call "DeePromClass", achieved testing accuracy of 90.6%, 93.6%, 91.8%, 86.5%, and 84.0% for S. cerevisiae, C. elegans, D. melanogaster, Mus musculus, and Homo sapiens respectively. In total, our tool provides an insightful update on next-generation promoter prediction tools for promoter biologists.
|
15
|
Avgeros C, Patsatsi A, Dimitriadis D, Malousi A, Koletsa T, Papathemeli D, Syrnioti A, Avgerou P, Lazaridou E, Tzimagiorgis G, Georgiou E. Dysregulation of Plasma miR-146a and miR-155 Expression Profile in Mycosis Fungoides Is Associated with rs2910164 and rs767649 Polymorphisms. Int J Mol Sci 2022; 24:271. [PMID: 36613718 PMCID: PMC9820385 DOI: 10.3390/ijms24010271] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/20/2022] [Revised: 12/15/2022] [Accepted: 12/18/2022] [Indexed: 12/28/2022] Open
Abstract
Diagnosis of Mycosis Fungoides (MF) may be challenging, due to its polymorphic nature. The use of miRNAs as biomarkers to assist in diagnosis has been investigated, mainly in skin lesion biopsies. The purpose of this study is to evaluate the plasma levels of miR-146a and miR-155 in MF patients and to investigate their association with SNPs of their genes. Plasma miRNAs were quantified by RT-qPCR. Genomic DNA was used for SNPs’ genotyping by Sanger sequencing. Plasma levels of miR-146a and miR-155 were significantly higher in patients vs. controls, in early MF patients vs. controls, and in advanced vs. early MF patients. Both miRNAs’ levels were significantly higher in stage IIB vs. early-stage patients. miR-155 plasma levels were significantly higher in patients with skin tumors or erythroderma. CC genotype (rs2910164 C>G) was significantly more frequent in healthy controls and associated with lower MF risk and lower miR-146a levels. The AA genotype (rs767649 T>A) was significantly more frequent in patients and correlated with increased MF risk and increased miR-155 levels. The combination of GG+AA was only detected in patients and was correlated with higher MF susceptibility. Increased miR-146a and miR-155 plasma levels in MF are an important finding for establishing putative noninvasive biomarkers. The presence of SNPs is closely associated with miRs’ expression, and possibly with disease susceptibility.
Affiliation(s)
- Chrysostomos Avgeros
- Laboratory of Biological Chemistry, School of Medicine, Faculty of Health Sciences, Aristotle University of Thessaloniki, 54124 Thessaloniki, Greece
| | - Aikaterini Patsatsi
- 2nd Dermatology Department, School of Medicine, Faculty of Health Sciences, Aristotle University of Thessaloniki, “Papageorgiou” General Hospital, 56403 Thessaloniki, Greece
- Center for Interdisciplinary Research and Innovation (CIRI-AUTH), 57001 Thessaloniki, Greece
| | - Dimitrios Dimitriadis
- School of Economics, Aristotle University of Thessaloniki, 54124 Thessaloniki, Greece
| | - Andigoni Malousi
- Laboratory of Biological Chemistry, School of Medicine, Faculty of Health Sciences, Aristotle University of Thessaloniki, 54124 Thessaloniki, Greece
- Center for Interdisciplinary Research and Innovation (CIRI-AUTH), 57001 Thessaloniki, Greece
| | - Triantafyllia Koletsa
- Department of Pathology, School of Medicine, Faculty of Health Sciences, Aristotle University of Thessaloniki, 54124 Thessaloniki, Greece
| | - Despoina Papathemeli
- 2nd Dermatology Department, School of Medicine, Faculty of Health Sciences, Aristotle University of Thessaloniki, “Papageorgiou” General Hospital, 56403 Thessaloniki, Greece
| | - Antonia Syrnioti
- Department of Pathology, School of Medicine, Faculty of Health Sciences, Aristotle University of Thessaloniki, 54124 Thessaloniki, Greece
| | - Paraskevi Avgerou
- Laboratory of Biological Chemistry, School of Medicine, Faculty of Health Sciences, Aristotle University of Thessaloniki, 54124 Thessaloniki, Greece
| | - Elizabeth Lazaridou
- 2nd Dermatology Department, School of Medicine, Faculty of Health Sciences, Aristotle University of Thessaloniki, “Papageorgiou” General Hospital, 56403 Thessaloniki, Greece
| | - Georgios Tzimagiorgis
- Laboratory of Biological Chemistry, School of Medicine, Faculty of Health Sciences, Aristotle University of Thessaloniki, 54124 Thessaloniki, Greece
- Center for Interdisciplinary Research and Innovation (CIRI-AUTH), 57001 Thessaloniki, Greece
| | - Elisavet Georgiou
- Laboratory of Biological Chemistry, School of Medicine, Faculty of Health Sciences, Aristotle University of Thessaloniki, 54124 Thessaloniki, Greece
- Center for Interdisciplinary Research and Innovation (CIRI-AUTH), 57001 Thessaloniki, Greece
- Correspondence: ; Tel.: +30-2310999171
|
16
|
High-throughput techniques enable advances in the roles of DNA and RNA secondary structures in transcriptional and post-transcriptional gene regulation. Genome Biol 2022; 23:159. [PMID: 35851062 PMCID: PMC9290270 DOI: 10.1186/s13059-022-02727-6] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Received: 02/09/2022] [Accepted: 07/07/2022] [Indexed: 12/27/2022] Open
Abstract
The most stable structure of DNA is the canonical right-handed double helix termed B DNA. However, certain environments and sequence motifs favor alternative conformations, termed non-canonical secondary structures. The roles of DNA and RNA secondary structures in transcriptional regulation remain incompletely understood. However, advances in high-throughput assays have enabled genome wide characterization of some secondary structures. Here, we describe their regulatory functions in promoters and 3’UTRs, providing insights into key mechanisms through which they regulate gene expression. We discuss their implication in human disease, and how advances in molecular technologies and emerging high-throughput experimental methods could provide additional insights.
|
17
|
Deng N, Zhang Y, Ma Z, Lin R, Cheng TH, Tang H, Snyder M, Cohen S. DSIF modulates RNA polymerase II occupancy according to template G + C content. NAR Genom Bioinform 2022; 4:lqac054. [PMID: 35910045 PMCID: PMC9326580 DOI: 10.1093/nargab/lqac054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/05/2022] [Revised: 06/03/2022] [Accepted: 07/19/2022] [Indexed: 11/12/2022] Open
Abstract
The DSIF complex comprising the Supt4h and Supt5h transcription elongation proteins clamps RNA polymerase II (RNAPII) onto DNA templates, facilitating polymerase processivity. Lowering DSIF components can differentially decrease expression of alleles containing nucleotide repeat expansions, suggesting that RNAPII transit through repeat expansions is dependent on DSIF functions. To globally identify sequence features that affect dependence of the polymerase on DSIF in human cells, we used ultra-deep ChIP-seq analysis and RNA-seq to investigate and quantify the genome-wide effects of Supt4h loss on template occupancy and transcript production. Our results indicate that RNAPII dependence on Supt4h varies according to G + C content. Effects of DSIF knockdown were prominent during transcription of sequences high in G + C but minimal for sequences low in G + C and were particularly evident for G + C-rich segments of long genes. Reanalysis of previously published ChIP-seq data obtained from mouse cells showed similar effects of template G + C composition on Supt5h actions. Our evidence that DSIF dependency varies globally in different template regions according to template sequence composition suggests that G + C content may have a role in the selectivity of Supt4h knockdown and Supt5h knockdown during transcription of gene alleles containing expansions of G + C-rich repeats.
Affiliation(s)
- Ning Deng
- Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Yue Zhang
- Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Zhihai Ma
- Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Richard Lin
- Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Tzu-Hao Cheng
- Institute of Biochemistry and Molecular Biology, National Yang Ming Chiao Tung University, Taipei 112, Taiwan
| | - Hua Tang
- Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Michael P Snyder
- Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Stanley N Cohen
- Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA
|
18
|
Vanaja A, Yella VR. Delineation of the DNA Structural Features of Eukaryotic Core Promoter Classes. ACS OMEGA 2022; 7:5657-5669. [PMID: 35224327 PMCID: PMC8867553 DOI: 10.1021/acsomega.1c04603] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Received: 08/23/2021] [Accepted: 01/27/2022] [Indexed: 05/02/2023]
Abstract
The eukaryotic transcription is orchestrated from a chunk of the DNA region stated as the core promoter. Multifarious and punctilious core promoter signals, viz., TATA-box, Inr, BREs, and Pause Button, are associated with a subset of genes and regulate their spatiotemporal expression. However, the core promoter architecture linked with these signals has not been investigated exhaustively for several species. In this study, we attempted to envisage the adaptive binding landscape of the transcription initiation machinery as a function of DNA structure. To this end, we deployed a set of k-mer based DNA structural estimates and regular expression models derived from experiments, molecular dynamic simulations, and theoretical frameworks, and high-throughput promoter data sets retrieved from the eukaryotic promoter database. We categorized protein-coding gene core promoters based on characteristic motifs at precise locations and analyzed the B-DNA structural properties and non-B-DNA structural motifs for 15 different eukaryotic genomes. We observed that Inr, BREd, and no-motif classes display common patterns of DNA sequence and structural environment. TATA-containing, BREu, and Pause Button classes show a deviant behavior with the TATA class displaying varied axial and twisting flexibility while BREu and Pause Button leaned toward G-quadruplex motif enrichment. Intriguingly, DNA meltability and shape signals are conserved irrespective of the presence or absence of distinct core promoter motifs in the majority of species. Altogether, here we delineated the conserved DNA structural signals associated with several promoter classes that may contribute to the chromatin configuration, orchestration of transcription machinery, and DNA duplex melting during the transcription process.
Affiliation(s)
- Akkinepally Vanaja
- Department of Biotechnology, Koneru Lakshmaiah Education Foundation, Vaddeswaram, Guntur 522502, Andhra Pradesh, India
- KL College of Pharmacy, Koneru Lakshmaiah Education Foundation, Vaddeswaram, Guntur 522502, Andhra Pradesh, India
| | - Venkata Rajesh Yella
- Department of Biotechnology, Koneru Lakshmaiah Education Foundation, Vaddeswaram, Guntur 522502, Andhra Pradesh, India
|
19
|
Zhang X, Garrett S, Graveley BR, Terns MP. Unique properties of spacer acquisition by the type III-A CRISPR-Cas system. Nucleic Acids Res 2021; 50:1562-1582. [PMID: 34893878 PMCID: PMC8860593 DOI: 10.1093/nar/gkab1193] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 09/18/2021] [Revised: 11/12/2021] [Accepted: 11/19/2021] [Indexed: 12/11/2022] Open
Abstract
Type III CRISPR-Cas systems have a unique mode of interference, involving crRNA-guided recognition of nascent RNA and leading to DNA and RNA degradation. How type III systems acquire new CRISPR spacers is currently not well understood. Here, we characterize CRISPR spacer uptake by a type III-A system within its native host, Streptococcus thermophilus. Adaptation by the type II-A system in the same host provided a basis for comparison. Cas1 and Cas2 proteins were critical for type III adaptation but deletion of genes responsible for crRNA biogenesis or interference did not detectably change spacer uptake patterns, except those related to host counter-selection. Unlike the type II-A system, type III spacers are acquired in a PAM- and orientation-independent manner. Interestingly, certain regions of plasmids and the host genome were particularly well-sampled during type III-A, but not type II-A, spacer uptake. These regions included the single-stranded origins of rolling-circle replicating plasmids, rRNA and tRNA encoding gene clusters, promoter regions of expressed genes and 5′ UTR regions involved in transcription attenuation. These features share the potential to form DNA secondary structures, suggesting a preferred substrate for type III adaptation. Lastly, the type III-A system adapted to and protected host cells from lytic phage infection.
Affiliation(s)
- Xinfu Zhang
- Department of Biochemistry and Molecular Biology, University of Georgia, Athens, GA 30602, USA
| | - Sandra Garrett
- Department of Genetics and Genome Sciences, Institute for Systems Genomics, University of Connecticut Health Center, Farmington, CT 06030, USA
| | - Brenton R Graveley
- Department of Genetics and Genome Sciences, Institute for Systems Genomics, University of Connecticut Health Center, Farmington, CT 06030, USA
| | - Michael P Terns
- Department of Biochemistry and Molecular Biology, University of Georgia, Athens, GA 30602, USA
- Department of Microbiology, University of Georgia, Athens, GA 30602, USA
- Department of Genetics, University of Georgia, Athens, GA 30602, USA
|
20
|
Zhang J, Cavallaro M, Hebenstreit D. Timing RNA polymerase pausing with TV-PRO-seq. CELL REPORTS METHODS 2021; 1:100083. [PMID: 34723238 PMCID: PMC8547241 DOI: 10.1016/j.crmeth.2021.100083] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/03/2019] [Revised: 08/03/2021] [Accepted: 08/18/2021] [Indexed: 11/28/2022]
Abstract
Transcription of many genes in metazoans is subject to polymerase pausing, which is the transient stop of transcriptionally engaged polymerases. This is known to mainly occur in promoter-proximal regions but it is not well understood. In particular, a genome-wide measurement of pausing times at high resolution has been lacking. We present here the time-variant precision nuclear run-on and sequencing (TV-PRO-seq) assay, an extension of the standard PRO-seq that allows us to estimate genome-wide pausing times at single-base resolution. Its application to human cells demonstrates that, proximal to promoters, polymerases pause more frequently but for shorter times than in other genomic regions. Comparison with single-cell gene expression data reveals that the polymerase pausing times are longer in highly expressed genes, while transcriptionally noisier genes have higher pausing frequencies and slightly longer pausing times. Analyses of histone modifications suggest that the marker H3K36me3 is related to the polymerase pausing.
Affiliation(s)
- Jie Zhang
- School of Life Sciences, Gibbet Hill Campus, the University of Warwick, CV4 7AL Coventry, UK
| | - Massimo Cavallaro
- School of Life Sciences, Gibbet Hill Campus, the University of Warwick, CV4 7AL Coventry, UK
- Mathematics Institute and Zeeman Institute for Systems Biology and Infectious Disease Epidemiology Research, the University of Warwick, CV4 7AL Coventry, UK
| | - Daniel Hebenstreit
- School of Life Sciences, Gibbet Hill Campus, the University of Warwick, CV4 7AL Coventry, UK
|
21
|
Yu G, Wu Y, Duan Z, Tang C, Xing H, Scharff MD, MacCarthy T. A Bayesian model based computational analysis of the relationship between bisulfite accessible single-stranded DNA in chromatin and somatic hypermutation of immunoglobulin genes. PLoS Comput Biol 2021; 17:e1009323. [PMID: 34491985 PMCID: PMC8462741 DOI: 10.1371/journal.pcbi.1009323] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 02/22/2021] [Revised: 09/24/2021] [Accepted: 08/04/2021] [Indexed: 11/19/2022] Open
Abstract
The B cells in our body generate protective antibodies by introducing somatic hypermutations (SHM) into the variable region of immunoglobulin genes (IgVs). The mutations are generated by activation induced deaminase (AID) that converts cytosine to uracil in single stranded DNA (ssDNA) generated during transcription. Attempts have been made to correlate SHM with ssDNA using bisulfite to chemically convert cytosines that are accessible in the intact chromatin of mutating B cells. These studies have been complicated by using different definitions of "bisulfite accessible regions" (BARs). Recently, deep-sequencing has provided much larger datasets of such regions but computational methods are needed to enable this analysis. Here we leveraged the deep-sequencing approach with unique molecular identifiers and developed a novel Hidden Markov Model based Bayesian Segmentation algorithm to characterize the ssDNA regions in the IGHV4-34 gene of the human Ramos B cell line. Combining hierarchical clustering and our new Bayesian model, we identified recurrent BARs in certain subregions of both top and bottom strands of this gene. Using this new system, the average size of BARs is about 15 bp. We also identified potential G-quadruplex DNA structures in this gene and found that the BARs co-locate with G-quadruplex structures in the opposite strand. Using various correlation analyses, there is not a direct site-to-site relationship between the bisulfite accessible ssDNA and all sites of SHM but most of the highly AID mutated sites are within 15 bp of a BAR. In summary, we developed a novel platform to study single stranded DNA in chromatin at a base pair resolution that reveals potential relationships among BARs, SHM and G-quadruplexes. This platform could be applied to genome wide studies in the future.
Affiliation(s)
- Guojun Yu
- Department of Cell Biology, Albert Einstein College of Medicine, Bronx, New York, United States of America
- Yingru Wu
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, New York, United States of America
- Zhi Duan
- Department of Cell Biology, Albert Einstein College of Medicine, Bronx, New York, United States of America
- Catherine Tang
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, New York, United States of America
- Haipeng Xing
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, New York, United States of America
- Matthew D. Scharff
- Department of Cell Biology, Albert Einstein College of Medicine, Bronx, New York, United States of America
- Thomas MacCarthy
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, New York, United States of America
|
22
|
G-Quadruplex in Gene Encoding Large Subunit of Plant RNA Polymerase II: A Billion-Year-Old Story. Int J Mol Sci 2021; 22:ijms22147381. [PMID: 34299001 PMCID: PMC8306923 DOI: 10.3390/ijms22147381] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Received: 05/28/2021] [Revised: 06/24/2021] [Accepted: 07/05/2021] [Indexed: 12/12/2022] Open
Abstract
G-quadruplexes have long been perceived as rare and physiologically unimportant nucleic acid structures. However, several studies have revealed their importance in molecular processes, suggesting their possible role in replication and gene expression regulation. Pathways involving G-quadruplexes are intensively studied, especially in the context of human diseases, while their involvement in gene expression regulation in plants remains largely unexplored. Here, we conducted a bioinformatic study and performed a complex circular dichroism measurement to identify a stable G-quadruplex in the gene RPB1, coding for the RNA polymerase II large subunit. We found that this G-quadruplex-forming locus is highly evolutionarily conserved amongst plants sensu lato (Archaeplastida) that share a common ancestor more than one billion years old. Finally, we discussed a new hypothesis regarding G-quadruplexes interacting with UV light in plants to potentially form an additional layer of the regulatory network.
|
23
|
Krassovsky K, Ghosh RP, Meyer BJ. Genome-wide profiling reveals functional interplay of DNA sequence composition, transcriptional activity, and nucleosome positioning in driving DNA supercoiling and helix destabilization in C. elegans. Genome Res 2021; 31:1187-1202. [PMID: 34168009 PMCID: PMC8256864 DOI: 10.1101/gr.270082.120] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Received: 08/11/2020] [Accepted: 05/25/2021] [Indexed: 12/11/2022]
Abstract
DNA topology and alternative DNA structures are implicated in regulating diverse biological processes. Although biomechanical properties of these structures have been studied extensively in vitro, characterization in vivo, particularly in multicellular organisms, is limited. We devised new methods to map DNA supercoiling and single-stranded DNA in Caenorhabditis elegans embryos and diapause larvae. To map supercoiling, we quantified the incorporation of biotinylated psoralen into DNA using high-throughput sequencing. To map single-stranded DNA, we combined permanganate treatment with genome-wide sequencing of induced double-stranded breaks. We found high levels of negative supercoiling at transcription start sites (TSSs) in embryos. GC-rich regions flanked by a sharp GC-to-AT transition delineate boundaries of supercoil propagation. In contrast to TSSs in embryos, TSSs in diapause larvae showed dramatic reductions in negative supercoiling without concomitant attenuation of transcription, suggesting developmental-stage-specific regulation. To assess whether alternative DNA structures control chromosome architecture and gene expression, we examined DNA supercoiling in the context of X-Chromosome dosage compensation. We showed that the condensin dosage compensation complex creates negative supercoils locally at its highest-occupancy binding sites but found no evidence for large-scale supercoiling domains along X Chromosomes. In contrast to transcription-coupled negative supercoiling, single-strandedness, which is most pronounced at transcript end sites, is dependent on high AT content and symmetrically positioned nucleosomes. We propose that sharp transitions in sequence composition at functional genomic elements constitute a common regulatory code and that DNA structure and propagation of torsional stress at regulatory elements are critical parameters in shaping important developmental events.
Affiliation(s)
- Kristina Krassovsky
- Department of Molecular and Cell Biology, University of California, Berkeley, California 94720-3204, USA
- Rajarshi P Ghosh
- Department of Molecular and Cell Biology, University of California, Berkeley, California 94720-3204, USA
- Howard Hughes Medical Institute, University of California, Berkeley, California 94720-3204, USA
- Barbara J Meyer
- Department of Molecular and Cell Biology, University of California, Berkeley, California 94720-3204, USA
- Howard Hughes Medical Institute, University of California, Berkeley, California 94720-3204, USA
|
24
|
Pipier A, Devaux A, Lavergne T, Adrait A, Couté Y, Britton S, Calsou P, Riou JF, Defrancq E, Gomez D. Constrained G4 structures unveil topology specificity of known and new G4 binding proteins. Sci Rep 2021; 11:13469. [PMID: 34188089 PMCID: PMC8241873 DOI: 10.1038/s41598-021-92806-8] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Received: 04/06/2021] [Accepted: 06/11/2021] [Indexed: 12/20/2022] Open
Abstract
G-quadruplexes (G4) are non-canonical secondary structures consisting of stacked tetrads of hydrogen-bonded guanine bases. An essential feature of G4 is their intrinsic polymorphic nature, which is characterized by the equilibrium between several conformations (also called topologies) and the presence of different types of loops with variable lengths. In cells, G4 functions rely on protein or enzymatic factors that recognize and promote or resolve these structures. In order to characterize new G4-dependent mechanisms, extensive research has aimed at identifying new G4 binding proteins. Using G-rich single-stranded oligonucleotides that adopt non-controlled G4 conformations, a large number of G4-binding proteins have been identified in vitro, but their specificity towards G4 topology remained unknown. Constrained G4 structures are biomolecular objects based on the use of a rigid cyclic peptide scaffold as a template for directing the intramolecular assembly of the anchored oligonucleotides into a single and stabilized G4 topology. Here, using various constrained RNA or DNA G4 as baits in human cell extracts, we establish the topology preference of several well-known G4-interacting factors. Moreover, we identify new G4-interacting proteins such as the NELF complex involved in the RNA-Pol II pausing mechanism, and we show that it impacts the clastogenic effect of the G4-ligand pyridostatin.
Affiliation(s)
- A Pipier
- Institut de Pharmacologie et Biologie Structurale, IPBS, Université de Toulouse, CNRS, UPS, Toulouse, France
- Equipe Labellisée Ligue Contre Le Cancer 2018, Toulouse, France
- A Devaux
- Département de Chimie Moléculaire, UMR CNRS 5250, Université Grenoble Alpes, 38058, Grenoble, France
- T Lavergne
- Département de Chimie Moléculaire, UMR CNRS 5250, Université Grenoble Alpes, 38058, Grenoble, France
- A Adrait
- CEA, INSERM, IRIG, BGE, Université Grenoble Alpes, 38000, Grenoble, France
- Y Couté
- CEA, INSERM, IRIG, BGE, Université Grenoble Alpes, 38000, Grenoble, France
- S Britton
- Institut de Pharmacologie et Biologie Structurale, IPBS, Université de Toulouse, CNRS, UPS, Toulouse, France
- Equipe Labellisée Ligue Contre Le Cancer 2018, Toulouse, France
- P Calsou
- Institut de Pharmacologie et Biologie Structurale, IPBS, Université de Toulouse, CNRS, UPS, Toulouse, France
- Equipe Labellisée Ligue Contre Le Cancer 2018, Toulouse, France
- J F Riou
- Structure et Instabilité des Génomes, Muséum National d'Histoire Naturelle, CNRS, INSERM, CP 26, 75005, Paris, France
- E Defrancq
- Département de Chimie Moléculaire, UMR CNRS 5250, Université Grenoble Alpes, 38058, Grenoble, France
- D Gomez
- Institut de Pharmacologie et Biologie Structurale, IPBS, Université de Toulouse, CNRS, UPS, Toulouse, France
- Equipe Labellisée Ligue Contre Le Cancer 2018, Toulouse, France
|
25
|
Non-B DNA-Forming Motifs Promote Mfd-Dependent Stationary-Phase Mutagenesis in Bacillus subtilis. Microorganisms 2021; 9:microorganisms9061284. [PMID: 34204686 PMCID: PMC8231525 DOI: 10.3390/microorganisms9061284] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 05/12/2021] [Revised: 06/08/2021] [Accepted: 06/09/2021] [Indexed: 02/07/2023] Open
Abstract
Transcription-induced mutagenic mechanisms limit genetic changes to times when expression happens and to coding DNA. It has been hypothesized that intrinsic sequences that have the potential to form alternate DNA structures, such as non-B DNA structures, influence these mechanisms. Non-B DNA structures are promoted by transcription and induce genome instability in eukaryotic cells, but their impact in bacterial genomes is less known. Here, we investigated if G4 DNA- and hairpin-forming motifs influence stationary-phase mutagenesis in Bacillus subtilis. We developed a system to measure the influence of non-B DNA on B. subtilis stationary-phase mutagenesis by deleting the wild-type argF at its chromosomal position and introducing IPTG-inducible argF alleles differing in their ability to form hairpin and G4 DNA structures into an ectopic locus. Using this system, we found that sequences predicted to form non-B DNA structures promoted mutagenesis in B. subtilis stationary-phase cells; such a response did not occur in growing conditions. We also found that the transcription-coupled repair factor Mfd promoted mutagenesis at these predicted structures. In summary, we showed that non-B DNA-forming motifs promote genetic instability, particularly in coding regions in stressed cells; therefore, non-B DNA structures may have a spatial and temporal mutagenic effect in bacteria. This study provides insights into mechanisms that prevent or promote mutagenesis and advances our understanding of processes underlying bacterial evolution.
|
26
|
Gajos M, Jasnovidova O, van Bömmel A, Freier S, Vingron M, Mayer A. Conserved DNA sequence features underlie pervasive RNA polymerase pausing. Nucleic Acids Res 2021; 49:4402-4420. [PMID: 33788942 PMCID: PMC8096220 DOI: 10.1093/nar/gkab208] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Received: 12/11/2020] [Revised: 03/05/2021] [Accepted: 03/15/2021] [Indexed: 12/17/2022] Open
Abstract
Pausing of transcribing RNA polymerase is regulated and creates opportunities to control gene expression. Research in metazoans has so far mainly focused on RNA polymerase II (Pol II) promoter-proximal pausing, leaving the pervasive nature of pausing and its regulatory potential in mammalian cells unclear. Here, we developed a pause detecting algorithm (PDA) for nucleotide-resolution occupancy data and a new native elongating transcript sequencing approach, termed nested NET-seq, that strongly reduces artifactual peaks commonly misinterpreted as pausing sites. Leveraging PDA and nested NET-seq reveals widespread genome-wide Pol II pausing at single-nucleotide resolution in human cells. Notably, the majority of Pol II pauses occur outside of promoter-proximal gene regions, primarily along the gene-body of transcribed genes. Sequence analysis combined with machine learning modeling reveals DNA sequence properties underlying widespread transcriptional pausing including a new pause motif. Interestingly, key sequence determinants of RNA polymerase pausing are conserved between human cells and bacteria. These studies indicate pervasive sequence-induced transcriptional pausing in human cells and the knowledge of exact pause locations implies potential functional roles in gene expression.
Affiliation(s)
- Martyna Gajos
- Otto-Warburg-Laboratory, Max Planck Institute for Molecular Genetics, Berlin 14195, Germany
- Department of Mathematics and Computer Science, Freie Universität Berlin, Berlin 14195, Germany
- Olga Jasnovidova
- Otto-Warburg-Laboratory, Max Planck Institute for Molecular Genetics, Berlin 14195, Germany
- Alena van Bömmel
- Department of Mathematics and Computer Science, Freie Universität Berlin, Berlin 14195, Germany
- Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Berlin 14195, Germany
- Susanne Freier
- Otto-Warburg-Laboratory, Max Planck Institute for Molecular Genetics, Berlin 14195, Germany
- Martin Vingron
- Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Berlin 14195, Germany
- Andreas Mayer
- Otto-Warburg-Laboratory, Max Planck Institute for Molecular Genetics, Berlin 14195, Germany
|
27
|
Mylonas C, Lee C, Auld AL, Cisse II, Boyer LA. A dual role for H2A.Z.1 in modulating the dynamics of RNA polymerase II initiation and elongation. Nat Struct Mol Biol 2021; 28:435-442. [PMID: 33972784 DOI: 10.1038/s41594-021-00589-3] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Received: 01/27/2021] [Accepted: 04/06/2021] [Indexed: 02/03/2023]
Abstract
RNA polymerase II (RNAPII) pausing immediately downstream of the transcription start site is a critical rate-limiting step for the expression of most metazoan genes. During pause release, RNAPII encounters a highly conserved +1 H2A.Z nucleosome, yet how this histone variant contributes to transcription is poorly understood. Here, using an inducible protein degron system combined with genomic approaches and live cell super-resolution microscopy, we show that H2A.Z.1 modulates RNAPII dynamics across most genes in murine embryonic stem cells. Our quantitative analysis shows that H2A.Z.1 slows the rate of RNAPII pause release and consequently impacts negative elongation factor dynamics as well as nascent transcription. Consequently, H2A.Z.1 also impacts re-loading of the pre-initiation complex components TFIIB and TBP. Altogether, this work provides a critical mechanistic link between H2A.Z.1 and the proper induction of mammalian gene expression programs through the regulation of RNAPII dynamics and pause release.
Affiliation(s)
- Constantine Mylonas
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA
- Choongman Lee
- Department of Physics, Massachusetts Institute of Technology, Cambridge, MA, USA
- Alexander L Auld
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA
- Ibrahim I Cisse
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA
- Department of Physics, Massachusetts Institute of Technology, Cambridge, MA, USA
- Laurie A Boyer
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA
|
28
|
Brázda V, Bartas M, Bowater RP. Evolution of Diverse Strategies for Promoter Regulation. Trends Genet 2021; 37:730-744. [PMID: 33931265 DOI: 10.1016/j.tig.2021.04.003] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Received: 01/29/2021] [Revised: 03/31/2021] [Accepted: 04/01/2021] [Indexed: 12/15/2022]
Abstract
DNA is fundamentally important for all cellular organisms due to its role as a store of hereditary genetic information. The precise and accurate regulation of gene transcription depends primarily on promoters, which vary significantly within and between genomes. Some promoters are rich in specific types of bases, while others have more varied, complex sequence characteristics. However, it is not only base sequence but also epigenetic modifications and altered DNA structure that regulate promoter activity. Significantly, many promoters across all organisms contain sequences that can form intrastrand hairpins (cruciforms) or four-stranded structures (G-quadruplex or i-motif). In this review we integrate recent studies on promoter regulation that highlight the importance of DNA structure in the evolutionary adaptation of promoter sequences.
Affiliation(s)
- Václav Brázda
- Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, 612 65 Brno, Czech Republic
- Martin Bartas
- Department of Biology and Ecology/Institute of Environmental Technologies, Faculty of Science, University of Ostrava, 710 00 Ostrava, Czech Republic
- Richard P Bowater
- School of Biological Sciences, University of East Anglia, Norwich Research Park, Norwich NR4 7TJ, UK
|
29
|
Walker CR, Scally A, De Maio N, Goldman N. Short-range template switching in great ape genomes explored using pair hidden Markov models. PLoS Genet 2021; 17:e1009221. [PMID: 33651813 PMCID: PMC7954356 DOI: 10.1371/journal.pgen.1009221] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Received: 11/12/2020] [Revised: 03/12/2021] [Accepted: 02/10/2021] [Indexed: 12/14/2022] Open
Abstract
Many complex genomic rearrangements arise through template switch errors, which occur in DNA replication when there is a transient polymerase switch to an alternate template nearby in three-dimensional space. While typically investigated at kilobase-to-megabase scales, the genomic and evolutionary consequences of this mutational process are not well characterised at smaller scales, where they are often interpreted as clusters of independent substitutions, insertions and deletions. Here we present an improved statistical approach using pair hidden Markov models, and use it to detect and describe short-range template switches underlying clusters of mutations in the multi-way alignment of hominid genomes. Using robust statistics derived from evolutionary genomic simulations, we show that template switch events have been widespread in the evolution of the great apes’ genomes and provide a parsimonious explanation for the presence of many complex mutation clusters in their phylogenetic context. Larger-scale mechanisms of genome rearrangement are typically associated with structural features around breakpoints, and accordingly we show that atypical patterns of secondary structure formation and DNA bending are present at the initial template switch loci. Our methods improve on previous non-probabilistic approaches for computational detection of template switch mutations, allowing the statistical significance of events to be assessed. By specifying realistic evolutionary parameters based on the genomes and taxa involved, our methods can be readily adapted to other intra- or inter-species comparisons.
DNA replication is an imperfect process which causes the mutations that give rise to genetic diversity during the evolution of genomes. While many mutations are independent, single-nucleotide substitutions or small insertions and deletions, some mutations arise as nonindependent clusters of substitutions and larger scale chromosomal rearrangements. Large-scale rearrangements (also called structural variants) in particular can have a profound impact on genome evolution and contribute to both germline and somatic disease in humans. The replication-based mechanisms underlying structural variation typically involve a polymerase switch event in which a large segment of DNA is copied using a template from an alternate location in the genome. Methods for identifying these template switch mutations lack the power to detect smaller scale rearrangements which can arise through the same replication-based pathways. Here we outline a model which can detect and assess the statistical significance of such small-scale template switches within their evolutionary context. We show that these events are widespread in the evolution of great apes and that the genomic features associated with these small-scale rearrangements are similar to those of large-scale structural variants.
Affiliation(s)
- Conor R. Walker
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, United Kingdom
- Department of Genetics, University of Cambridge, Cambridge, United Kingdom
- Aylwyn Scally
- Department of Genetics, University of Cambridge, Cambridge, United Kingdom
- Nicola De Maio
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, United Kingdom
- Nick Goldman
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, United Kingdom
|
30
|
Luan J, Xiang G, Gómez-García PA, Tome JM, Zhang Z, Vermunt MW, Zhang H, Huang A, Keller CA, Giardine BM, Zhang Y, Lan Y, Lis JT, Lakadamyali M, Hardison RC, Blobel GA. Distinct properties and functions of CTCF revealed by a rapidly inducible degron system. Cell Rep 2021; 34:108783. [PMID: 33626344 PMCID: PMC7999233 DOI: 10.1016/j.celrep.2021.108783] [Citation(s) in RCA: 43] [Impact Index Per Article: 14.3] [Received: 05/09/2020] [Revised: 11/25/2020] [Accepted: 02/02/2021] [Indexed: 02/07/2023] Open
Abstract
CCCTC-binding factor (CTCF) is a conserved zinc finger transcription factor implicated in a wide range of functions, including genome organization, transcription activation, and elongation. To explore the basis for CTCF functional diversity, we coupled an auxin-induced degron system with precision nuclear run-on. Unexpectedly, oriented CTCF motifs in gene bodies are associated with transcriptional stalling in a manner independent of bound CTCF. Moreover, CTCF at different binding sites (CBSs) displays highly variable resistance to degradation. Motif sequence does not significantly predict degradation behavior, but location at chromatin boundaries and chromatin loop anchors, as well as co-occupancy with cohesin, are associated with delayed degradation. Single-molecule tracking experiments link chromatin residence time to CTCF degradation kinetics, which has ramifications regarding architectural CTCF functions. Our study highlights the heterogeneity of CBSs, uncovers properties specific to architecturally important CBSs, and provides insights into the basic processes of genome organization and transcription regulation.
Affiliation(s)
- Jing Luan
- Medical Scientist Training Program, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
- Guanjue Xiang
- Department of Biochemistry and Molecular Biology, Pennsylvania State University, University Park, PA 16802, USA
- Pablo Aurelio Gómez-García
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
- Jacob M Tome
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA
- Zhe Zhang
- Department of Biomedical and Health Informatics, Children's Hospital of Pennsylvania, Philadelphia, PA, USA
- Marit W Vermunt
- Division of Hematology, The Children's Hospital of Pennsylvania, Philadelphia, PA, USA
- Haoyue Zhang
- Division of Hematology, The Children's Hospital of Pennsylvania, Philadelphia, PA, USA
- Anran Huang
- Division of Hematology, The Children's Hospital of Pennsylvania, Philadelphia, PA, USA
- Cheryl A Keller
- Department of Biochemistry and Molecular Biology, Pennsylvania State University, University Park, PA 16802, USA
- Belinda M Giardine
- Department of Biochemistry and Molecular Biology, Pennsylvania State University, University Park, PA 16802, USA
- Yu Zhang
- Department of Statistics, Pennsylvania State University, University Park, PA 16802, USA
- Yemin Lan
- Penn Epigenetics Institute, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
- John T Lis
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA
- Melike Lakadamyali
- Department of Physiology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
- Ross C Hardison
- Department of Biochemistry and Molecular Biology, Pennsylvania State University, University Park, PA 16802, USA
- Gerd A Blobel
- Division of Hematology, The Children's Hospital of Pennsylvania, Philadelphia, PA, USA
|
31
|
Noe Gonzalez M, Blears D, Svejstrup JQ. Causes and consequences of RNA polymerase II stalling during transcript elongation. Nat Rev Mol Cell Biol 2021; 22:3-21. [PMID: 33208928 DOI: 10.1038/s41580-020-00308-8] [Citation(s) in RCA: 104] [Impact Index Per Article: 34.7] [Accepted: 10/08/2020] [Indexed: 02/07/2023]
Abstract
The journey of RNA polymerase II (Pol II) as it transcribes a gene is anything but a smooth ride. Transcript elongation is discontinuous and can be perturbed by intrinsic regulatory barriers, such as promoter-proximal pausing, nucleosomes, RNA secondary structures and the underlying DNA sequence. More substantial blocking of Pol II translocation can be caused by other physiological circumstances and extrinsic obstacles, including other transcribing polymerases, the replication machinery and several types of DNA damage, such as bulky lesions and DNA double-strand breaks. Although numerous different obstacles cause Pol II stalling or arrest, the cell somehow distinguishes between them and invokes different mechanisms to resolve each roadblock. Resolution of Pol II blocking can be as straightforward as temporary backtracking and transcription elongation factor S-II (TFIIS)-dependent RNA cleavage, or as drastic as premature transcription termination or degradation of polyubiquitylated Pol II and its associated nascent RNA. In this Review, we discuss the current knowledge of how these different Pol II stalling contexts are distinguished by the cell, how they overlap with each other, how they are resolved and how, when unresolved, they can cause genome instability.
Affiliation(s)
- Melvin Noe Gonzalez
- Mechanisms of Transcription Laboratory, The Francis Crick Institute, London, UK
- Department of Cellular and Molecular Medicine, University of Copenhagen, Copenhagen, Denmark
- Daniel Blears
- Mechanisms of Transcription Laboratory, The Francis Crick Institute, London, UK
- Department of Cellular and Molecular Medicine, University of Copenhagen, Copenhagen, Denmark
- Jesper Q Svejstrup
- Mechanisms of Transcription Laboratory, The Francis Crick Institute, London, UK
- Department of Cellular and Molecular Medicine, University of Copenhagen, Copenhagen, Denmark
|
32
|
Szlachta K, Manukyan A, Raimer HM, Singh S, Salamon A, Guo W, Lobachev KS, Wang YH. Topoisomerase II contributes to DNA secondary structure-mediated double-stranded breaks. Nucleic Acids Res 2020; 48:6654-6671. [PMID: 32501506 PMCID: PMC7337936 DOI: 10.1093/nar/gkaa483] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Received: 04/21/2020] [Revised: 05/20/2020] [Accepted: 06/01/2020] [Indexed: 12/12/2022] Open
Abstract
DNA double-stranded breaks (DSBs) trigger human genome instability, therefore identifying what factors contribute to DSB induction is critical for our understanding of human disease etiology. Using an unbiased, genome-wide approach, we found that genomic regions with the ability to form highly stable DNA secondary structures are enriched for endogenous DSBs in human cells. Human genomic regions predicted to form non-B-form DNA induced gross chromosomal rearrangements in yeast and displayed high indel frequency in human genomes. The extent of instability in both analyses is in concordance with the structure forming ability of these regions. We also observed an enrichment of DNA secondary structure-prone sites overlapping transcription start sites (TSSs) and CCCTC-binding factor (CTCF) binding sites, and uncovered an increase in DSBs at highly stable DNA secondary structure regions, in response to etoposide, an inhibitor of topoisomerase II (TOP2) re-ligation activity. Importantly, we found that TOP2 deficiency in both yeast and human leads to a significant reduction in DSBs at structure-prone loci, and that sites of TOP2 cleavage have a greater ability to form highly stable DNA secondary structures. This study reveals a direct role for TOP2 in generating secondary structure-mediated DNA fragility, advancing our understanding of mechanisms underlying human genome instability.
Affiliation(s)
- Karol Szlachta
- Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, VA 22908-0733, USA
- Arkadi Manukyan
- Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, VA 22908-0733, USA
- Heather M Raimer
- Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, VA 22908-0733, USA
- Sandeep Singh
- Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, VA 22908-0733, USA
- Anita Salamon
- Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, VA 22908-0733, USA
- Wenying Guo
- School of Biological Sciences and Institute for Bioengineering and Bioscience, Georgia Institute of Technology, Atlanta, GA 30332, USA
- Kirill S Lobachev
- School of Biological Sciences and Institute for Bioengineering and Bioscience, Georgia Institute of Technology, Atlanta, GA 30332, USA
- Yuh-Hwa Wang
- Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, VA 22908-0733, USA
|
33
|
Tan Y, Li Y, Tang F. Oncogenic seRNA functional activation: a novel mechanism of tumorigenesis. Mol Cancer 2020; 19:74. [PMID: 32278350 PMCID: PMC7149907 DOI: 10.1186/s12943-020-01195-5] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Received: 12/29/2019] [Accepted: 03/30/2020] [Indexed: 02/06/2023]
Abstract
seRNA is a noncoding RNA (ncRNA) transcribed from an active super-enhancer (SE), through which the SE exerts biological functions and participates in various physiological and pathological processes. seRNA recruits cofactors, RNA polymerase II and mediator to form and stabilize the chromatin loop between the SE and the promoter region, which regulates transcription of target genes. In tumorigenesis, DNA insertion, deletion, translocation, focal amplification and carcinogenic factors mediate oncogenic SE generation; the oncogenic SE is in turn transcribed into tumor-related seRNA, termed oncogenic seRNA. Oncogenic seRNA participates in tumorigenesis by activating various signaling pathways. Recent reports show that oncogenic seRNA is implicated in a wide range of cytopathological processes in cancer progression, including cell proliferation, apoptosis, autophagy, epithelial-mesenchymal transition, extracellular matrix stiffness and angiogenesis. In this article, we comprehensively summarize the characteristics and functions of seRNA, with emphasis on the inducible formation of oncogenic seRNA and its functional mechanisms. Lastly, we introduce some research strategies for oncogenic seRNA and discuss perspectives on cancer therapy that targets it.
Affiliation(s)
- Yuan Tan
- Department of Clinical Laboratory and Hunan Key Laboratory of Oncotarget gene, Hunan Cancer Hospital & The affiliated Cancer Hospital of Xiangya School of Medicine, Central South University, Changsha, 410013, China
- Yuejin Li
- Department of Clinical Laboratory and Hunan Key Laboratory of Oncotarget gene, Hunan Cancer Hospital & The affiliated Cancer Hospital of Xiangya School of Medicine, Central South University, Changsha, 410013, China
- Faqing Tang
- Department of Clinical Laboratory and Hunan Key Laboratory of Oncotarget gene, Hunan Cancer Hospital & The affiliated Cancer Hospital of Xiangya School of Medicine, Central South University, Changsha, 410013, China
|
34
|
Singh S, Szlachta K, Manukyan A, Raimer HM, Dinda M, Bekiranov S, Wang YH. Pausing sites of RNA polymerase II on actively transcribed genes are enriched in DNA double-stranded breaks. J Biol Chem 2020; 295:3990-4000. [PMID: 32029477 DOI: 10.1074/jbc.ra119.011665] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Received: 10/28/2019] [Revised: 02/05/2020] [Indexed: 12/16/2022]
Abstract
DNA double-stranded breaks (DSBs) are strongly associated with active transcription, and promoter-proximal pausing of RNA polymerase II (Pol II) is a critical step in transcriptional regulation. Mapping the distribution of DSBs along actively expressed genes and identifying the location of DSBs relative to pausing sites can provide mechanistic insights into transcriptional regulation. Using genome-wide DNA break mapping/sequencing techniques at single-nucleotide resolution in human cells, we found that DSBs are preferentially located around transcription start sites of highly transcribed and paused genes and that Pol II promoter-proximal pausing sites are enriched in DSBs. We observed that DSB frequency at pausing sites increases as the strength of pausing increases, regardless of whether the pausing sites are near or far from annotated transcription start sites. Inhibition of topoisomerase I and II by camptothecin and etoposide treatment, respectively, increased DSBs at the pausing sites as the concentrations of drugs increased, demonstrating the involvement of topoisomerases in DSB generation at the pausing sites. DNA breaks generated by topoisomerases are short-lived because of the religation activity of these enzymes, which these drugs inhibit; therefore, the observation of increased DSBs with increasing drug doses at pausing sites indicated active recruitment of topoisomerases to these sites. Furthermore, the enrichment and locations of DSBs at pausing sites were shared among different cell types, suggesting that Pol II promoter-proximal pausing is a common regulatory mechanism. Our findings support a model in which topoisomerases participate in Pol II promoter-proximal pausing and indicate that DSBs at pausing sites contribute to transcriptional activation.
Affiliation(s)
- Sandeep Singh
- Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, Virginia 22908
- Karol Szlachta
- Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, Virginia 22908
- Arkadi Manukyan
- Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, Virginia 22908
- Heather M Raimer
- Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, Virginia 22908
- Manikarna Dinda
- Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, Virginia 22908
- Stefan Bekiranov
- Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, Virginia 22908
- Yuh-Hwa Wang
- Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, Virginia 22908
|
35
|
Atkin ND, Raimer HM, Wang YH. Broken by the Cut: A Journey into the Role of Topoisomerase II in DNA Fragility. Genes (Basel) 2019; 10:E791. [PMID: 31614754 PMCID: PMC6826763 DOI: 10.3390/genes10100791] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Received: 09/14/2019] [Revised: 10/05/2019] [Accepted: 10/10/2019] [Indexed: 02/07/2023]
Abstract
DNA topoisomerase II (TOP2) plays a critical role in many processes such as replication and transcription, where it resolves DNA structures and relieves torsional stress. Recent evidence demonstrated the association of TOP2 with topologically associated domains (TAD) boundaries and CCCTC-binding factor (CTCF) binding sites. At these sites, TOP2 promotes interactions between enhancers and gene promoters, and relieves torsional stress that accumulates at these physical barriers. Interestingly, in executing its enzymatic function, TOP2 contributes to DNA fragility through re-ligation failure, which results in persistent DNA breaks when unrepaired or illegitimately repaired. Here, we discuss the biological processes for which TOP2 is required and the steps at which it can introduce DNA breaks. We describe the repair processes that follow removal of TOP2 adducts and the resultant broken DNA ends, and present how these processes can contribute to disease-associated mutations. Furthermore, we examine the involvement of TOP2-induced breaks in the formation of oncogenic translocations of leukemia and papillary thyroid cancer, as well as the role of TOP2 and proteins which repair TOP2 adducts in other diseases. The participation of TOP2 in generating persistent DNA breaks and leading to diseases such as cancer, could have an impact on disease treatment and prevention.
Affiliation(s)
- Naomi D Atkin
- Department of Biochemistry and Molecular Genetics, School of Medicine, University of Virginia, Charlottesville, VA 22908, USA
- Heather M Raimer
- Department of Biochemistry and Molecular Genetics, School of Medicine, University of Virginia, Charlottesville, VA 22908, USA
- Yuh-Hwa Wang
- Department of Biochemistry and Molecular Genetics, School of Medicine, University of Virginia, Charlottesville, VA 22908, USA
|
36
|
Abstract
In this review, Core et al. discuss the recent advances in our understanding of the early steps in Pol II transcription, highlighting the events and factors involved in the establishment and release of paused Pol II. They also discuss a number of unanswered questions about the regulation and function of Pol II pausing. Precise spatio–temporal control of gene activity is essential for organismal development, growth, and survival in a changing environment. Decisive steps in gene regulation involve the pausing of RNA polymerase II (Pol II) in early elongation, and the controlled release of paused polymerase into productive RNA synthesis. Here we describe the factors that enable pausing and the events that trigger Pol II release into the gene. We also discuss open questions in the field concerning the stability of paused Pol II, nucleosomes as obstacles to elongation, and potential roles of pausing in defining the precision and dynamics of gene expression.
Affiliation(s)
- Leighton Core
- Department of Molecular and Cell Biology, Institute of Systems Genomics, University of Connecticut, Storrs, Connecticut 06269, USA
- Karen Adelman
- Department of Biological Chemistry and Molecular Pharmacology, Blavatnik Institute, Harvard Medical School, Boston, Massachusetts 02115, USA
|
37
|
Bartas M, Čutová M, Brázda V, Kaura P, Šťastný J, Kolomazník J, Coufal J, Goswami P, Červeň J, Pečinka P. The Presence and Localization of G-Quadruplex Forming Sequences in the Domain of Bacteria. Molecules 2019; 24:molecules24091711. [PMID: 31052562 PMCID: PMC6539912 DOI: 10.3390/molecules24091711] [Citation(s) in RCA: 66] [Impact Index Per Article: 13.2] [Received: 04/16/2019] [Revised: 04/30/2019] [Accepted: 05/01/2019] [Indexed: 01/09/2023]
Abstract
The role of local DNA structures in the regulation of basic cellular processes is an emerging field of research. Amongst local non-B DNA structures, the significance of G-quadruplexes was demonstrated in the last decade, and their presence and functional relevance have been demonstrated in many genomes, including the human genome. In this study, we analyzed the presence and locations of G-quadruplex-forming sequences by G4Hunter in all complete bacterial genomes available in the NCBI database. G-quadruplex-forming sequences were identified in all species; however, the frequency differed significantly across evolutionary groups. The highest frequency of G-quadruplex-forming sequences was detected in the subgroup Deinococcus-Thermus, and the lowest frequency in Thermotogae. G-quadruplex-forming sequences are non-randomly distributed and are favored in various evolutionary groups. G-quadruplex-forming sequences are enriched in ncRNA segments, followed by mRNAs. Analyses of surrounding sequences showed G-quadruplex-forming sequences around tRNA and regulatory sequences. These data point to the unique and non-random localization of G-quadruplex-forming sequences in bacterial genomes.
Affiliation(s)
- Martin Bartas
- Department of Biology and Ecology/Institute of Environmental Technologies, Faculty of Science, University of Ostrava, 710 00 Ostrava, Czech Republic.
- Michaela Čutová
- Faculty of Chemistry, Brno University of Technology, Purkyňova 118, 612 00 Brno, Czech Republic.
- Václav Brázda
- Faculty of Chemistry, Brno University of Technology, Purkyňova 118, 612 00 Brno, Czech Republic.
- Institute of Biophysics, Academy of Sciences of the Czech Republic v.v.i., Královopolská 135, 612 65 Brno, Czech Republic.
- Patrik Kaura
- Faculty of Mechanical Engineering, Brno University of Technology, Technicka 2896/2, 616 69 Brno, Czech Republic.
- Jiří Šťastný
- Faculty of Mechanical Engineering, Brno University of Technology, Technicka 2896/2, 616 69 Brno, Czech Republic.
- Department of Informatics, Mendel University in Brno, Zemedelska 1665/1, 61300 Brno, Czech Republic.
- Jan Kolomazník
- Department of Informatics, Mendel University in Brno, Zemedelska 1665/1, 61300 Brno, Czech Republic.
- Jan Coufal
- Institute of Biophysics, Academy of Sciences of the Czech Republic v.v.i., Královopolská 135, 612 65 Brno, Czech Republic.
- Pratik Goswami
- Institute of Biophysics, Academy of Sciences of the Czech Republic v.v.i., Královopolská 135, 612 65 Brno, Czech Republic.
- Jiří Červeň
- Department of Biology and Ecology/Institute of Environmental Technologies, Faculty of Science, University of Ostrava, 710 00 Ostrava, Czech Republic.
- Petr Pečinka
- Department of Biology and Ecology/Institute of Environmental Technologies, Faculty of Science, University of Ostrava, 710 00 Ostrava, Czech Republic.
|
38
|
Kaushal S, Freudenreich CH. The role of fork stalling and DNA structures in causing chromosome fragility. Genes Chromosomes Cancer 2019; 58:270-283. [PMID: 30536896 DOI: 10.1002/gcc.22721] [Citation(s) in RCA: 42] [Impact Index Per Article: 8.4] [Received: 09/25/2018] [Revised: 11/13/2018] [Accepted: 12/03/2018] [Indexed: 12/19/2022]
Abstract
Alternative non-B form DNA structures, also called secondary structures, can form in certain DNA sequences under conditions that produce single-stranded DNA, such as during replication, transcription, and repair. Direct links between secondary structure formation, replication fork stalling, and genomic instability have been found for many repeated DNA sequences that cause disease when they expand. Common fragile sites (CFSs) are known to be AT-rich and break under replication stress, yet the molecular basis for their fragility is still being investigated. Over the past several years, new evidence has linked both the formation of secondary structures and transcription to fork stalling and fragility of CFSs. How these two events may synergize to cause fragility and the role of nuclease cleavage at secondary structures in rare and CFSs are discussed here. We also highlight evidence for a new hypothesis that secondary structures at CFSs not only initiate fragility but also inhibit healing, resulting in their characteristic appearance.
Affiliation(s)
- Simran Kaushal
- Department of Biology, Tufts University, Medford, Massachusetts
- Catherine H Freudenreich
- Department of Biology, Tufts University, Medford, Massachusetts
- Program in Genetics, Sackler School of Graduate Biomedical Sciences, Tufts University, Boston, Massachusetts
|
39
|
Developing Novel G-Quadruplex Ligands: from Interaction with Nucleic Acids to Interfering with Nucleic Acid-Protein Interaction. Molecules 2019; 24:molecules24030396. [PMID: 30678288 PMCID: PMC6384609 DOI: 10.3390/molecules24030396] [Citation(s) in RCA: 72] [Impact Index Per Article: 14.4] [Received: 11/28/2018] [Revised: 01/10/2019] [Accepted: 01/22/2019] [Indexed: 12/20/2022]
Abstract
G-quadruplex is a special secondary structure of nucleic acids formed in guanine-rich sequences of the genome. G-quadruplexes have been shown to be involved in the regulation of replication, DNA damage repair, and transcription and translation of oncogenes or other cancer-related genes. Therefore, targeting G-quadruplexes has become a promising novel anti-tumor strategy. Different kinds of small molecules targeting G-quadruplexes have been designed, synthesized, and identified as potential anti-tumor agents, including molecules that bind directly to the G-quadruplex and molecules that interfere with the binding between G-quadruplex structures and their binding proteins. This review explores the feasibility of G-quadruplex ligands acting as anti-tumor drugs, from basic research to application. Because helicases are the best-defined G-quadruplex-related proteins, we emphasize the extensive research on the relationship between helicases and G-quadruplexes and its implications for drug design.
|