1
|
Mandler MD, Maligireddy SS, Guiblet WM, Fitzsimmons CM, McDonald KS, Warrell DL, Batista PJ. The modification landscape of P. aeruginosa tRNAs. RNA 2024:rna.080004.124. [PMID: 38684317 DOI: 10.1261/rna.080004.124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/26/2024] [Accepted: 04/09/2024] [Indexed: 05/02/2024]
Abstract
RNA modifications have a substantial impact on tRNA function, with modifications in the anticodon loop contributing to translational fidelity and modifications in the tRNA core impacting structural stability. In bacteria, tRNA modifications are crucial for responding to stress and regulating the expression of virulence factors. Although tRNA modifications are well-characterized in a few model organisms, our knowledge of tRNA modifications in human pathogens, such as Pseudomonas aeruginosa, remains limited. Here we leveraged two orthogonal approaches to build a reference landscape of tRNA modifications in E. coli, which enabled us to identify similar modifications in P. aeruginosa. Our analysis revealed a substantial degree of conservation between the two organisms, while also uncovering potential sites of tRNA modification in P. aeruginosa tRNAs that are not present in E. coli. The mutational signature at one of these sites, position 46 of tRNAGln1(UUG) is dependent on the P. aeruginosa homolog of TapT, the enzyme responsible for the 3-(3-amino-3-carboxypropyl) uridine (acp3U) modification. Identifying which modifications are present on different tRNAs will uncover the pathways impacted by the different tRNA modifying enzymes, some of which play roles in determining virulence and pathogenicity.
Collapse
Affiliation(s)
- Mariana D Mandler
- Laboratory of Cell Biology, Center for Cancer Research, National Cancer Institute, National Institutes of Health
| | - Siddhardha S Maligireddy
- Laboratory of Cell Biology, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Wilfried M Guiblet
- Laboratory of Cell Biology, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Christina M Fitzsimmons
- Laboratory of Cell Biology, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Kayla S McDonald
- Laboratory of Cell Biology, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Delayna L Warrell
- Laboratory of Molecular Biology, Center for Cancer Research, National Cancer Institute, National Institues of Health
| | - Pedro J Batista
- Laboratory of Cell Biology, Center for Cancer Research, National Cancer Institute, National Institutes of Health;
| |
Collapse
|
2
|
Robey RW, Fitzsimmons CM, Guiblet WM, Frye WJE, González Dalmasy JM, Wang L, Russell DA, Huff LM, Perciaccante AJ, Ali-Rahmani F, Lipsey CC, Wade HM, Mitchell AV, Maligireddy SS, Terrero D, Butcher D, Edmondson EF, Jenkins LM, Nikitina T, Zhurkin VB, Tiwari AK, Piscopio AD, Totah RA, Bates SE, Arda HE, Gottesman MM, Batista PJ. The Methyltransferases METTL7A and METTL7B Confer Resistance to Thiol-Based Histone Deacetylase Inhibitors. Mol Cancer Ther 2024; 23:464-477. [PMID: 38151817 DOI: 10.1158/1535-7163.mct-23-0144] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2023] [Revised: 08/25/2023] [Accepted: 12/20/2023] [Indexed: 12/29/2023]
Abstract
Histone deacetylase inhibitors (HDACi) are part of a growing class of epigenetic therapies used for the treatment of cancer. Although HDACis are effective in the treatment of T-cell lymphomas, treatment of solid tumors with this class of drugs has not been successful. Overexpression of the multidrug resistance protein P-glycoprotein (P-gp), encoded by ABCB1, is known to confer resistance to the HDACi romidepsin in vitro, yet increased ABCB1 expression has not been associated with resistance in patients, suggesting that other mechanisms of resistance arise in the clinic. To identify alternative mechanisms of resistance to romidepsin, we selected MCF-7 breast cancer cells with romidepsin in the presence of the P-gp inhibitor verapamil to reduce the likelihood of P-gp-mediated resistance. The resulting cell line, MCF-7 DpVp300, does not express P-gp and was found to be selectively resistant to romidepsin but not to other HDACis such as belinostat, panobinostat, or vorinostat. RNA-sequencing analysis revealed upregulation of the mRNA coding for the putative methyltransferase, METTL7A, whose paralog, METTL7B, was previously shown to methylate thiol groups on hydrogen sulfide and captopril. As romidepsin has a thiol as the zinc-binding moiety, we hypothesized that METTL7A could inactivate romidepsin and other thiol-based HDACis via methylation of the thiol group. We demonstrate that expression of METTL7A or METTL7B confers resistance to thiol-based HDACis and that both enzymes are capable of methylating thiol-containing HDACis. We thus propose that METTL7A and METTL7B confer resistance to thiol-based HDACis by methylating and inactivating the zinc-binding thiol.
Collapse
Affiliation(s)
- Robert W Robey
- Laboratory of Cell Biology, Center for Cancer Research, National Cancer Institute, NIH, Bethesda, Maryland
| | - Christina M Fitzsimmons
- Laboratory of Cell Biology, Center for Cancer Research, National Cancer Institute, NIH, Bethesda, Maryland
| | - Wilfried M Guiblet
- Laboratory of Cell Biology, Center for Cancer Research, National Cancer Institute, NIH, Bethesda, Maryland
| | - William J E Frye
- Laboratory of Cell Biology, Center for Cancer Research, National Cancer Institute, NIH, Bethesda, Maryland
| | - José M González Dalmasy
- Laboratory of Cell Biology, Center for Cancer Research, National Cancer Institute, NIH, Bethesda, Maryland
| | - Li Wang
- Laboratory of Receptor Biology and Gene Expression, Developmental Genomics Group, Center for Cancer Research, NCI, NIH, Bethesda, Maryland
| | - Drake A Russell
- Department of Medicinal Chemistry, University of Washington, Seattle, Washington
| | - Lyn M Huff
- Laboratory of Cell Biology, Center for Cancer Research, National Cancer Institute, NIH, Bethesda, Maryland
| | - Andrew J Perciaccante
- Laboratory of Cell Biology, Center for Cancer Research, National Cancer Institute, NIH, Bethesda, Maryland
| | - Fatima Ali-Rahmani
- Laboratory of Cell Biology, Center for Cancer Research, National Cancer Institute, NIH, Bethesda, Maryland
| | - Crystal C Lipsey
- Laboratory of Cell Biology, Center for Cancer Research, National Cancer Institute, NIH, Bethesda, Maryland
| | - Heidi M Wade
- Laboratory of Cell Biology, Center for Cancer Research, National Cancer Institute, NIH, Bethesda, Maryland
| | - Allison V Mitchell
- Laboratory of Cell Biology, Center for Cancer Research, National Cancer Institute, NIH, Bethesda, Maryland
| | - Siddhardha S Maligireddy
- Laboratory of Cell Biology, Center for Cancer Research, National Cancer Institute, NIH, Bethesda, Maryland
| | - David Terrero
- Department of Pharmacology and Experimental Therapeutics, Department of Cancer Cell and Cancer Biology, University of Toledo, Toledo, Ohio
| | - Donna Butcher
- Molecular Histopathology Laboratory, Frederick National Laboratory for Cancer Research, Frederick, Maryland
| | - Elijah F Edmondson
- Molecular Histopathology Laboratory, Frederick National Laboratory for Cancer Research, Frederick, Maryland
| | - Lisa M Jenkins
- Laboratory of Cell Biology, Center for Cancer Research, National Cancer Institute, NIH, Bethesda, Maryland
| | - Tatiana Nikitina
- Laboratory of Cell Biology, Center for Cancer Research, National Cancer Institute, NIH, Bethesda, Maryland
| | - Victor B Zhurkin
- Laboratory of Cell Biology, Center for Cancer Research, National Cancer Institute, NIH, Bethesda, Maryland
| | - Amit K Tiwari
- Department of Pharmacology and Experimental Therapeutics, Department of Cancer Cell and Cancer Biology, University of Toledo, Toledo, Ohio
| | | | - Rheem A Totah
- Department of Medicinal Chemistry, University of Washington, Seattle, Washington
| | - Susan E Bates
- Division of Hematology/Oncology, Department of Medicine, Columbia University Medical Center, New York, New York
- Hematology/Oncology Research Department, James J. Peters Department of Veterans Affairs Medical Center, New York, New York
| | - H Efsun Arda
- Laboratory of Receptor Biology and Gene Expression, Developmental Genomics Group, Center for Cancer Research, NCI, NIH, Bethesda, Maryland
| | - Michael M Gottesman
- Laboratory of Cell Biology, Center for Cancer Research, National Cancer Institute, NIH, Bethesda, Maryland
| | - Pedro J Batista
- Laboratory of Cell Biology, Center for Cancer Research, National Cancer Institute, NIH, Bethesda, Maryland
| |
Collapse
|
3
|
Jacob DR, Guiblet WM, Mamayusupova H, Shtumpf M, Ciuta I, Ruje L, Gretton S, Bikova M, Correa C, Dellow E, Agrawal SP, Shafiei N, Drobysevskaja A, Armstrong CM, Lam JDG, Vainshtein Y, Clarkson CT, Thorn GJ, Sohn K, Pradeepa MM, Chandrasekharan S, Brooke GN, Klenova E, Zhurkin VB, Teif VB. Nucleosome reorganisation in breast cancer tissues. Clin Epigenetics 2024; 16:50. [PMID: 38561804 PMCID: PMC10986098 DOI: 10.1186/s13148-024-01656-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2023] [Accepted: 03/11/2024] [Indexed: 04/04/2024] Open
Abstract
BACKGROUND Nucleosome repositioning in cancer is believed to cause many changes in genome organisation and gene expression. Understanding these changes is important to elucidate fundamental aspects of cancer. It is also important for medical diagnostics based on cell-free DNA (cfDNA), which originates from genomic DNA regions protected from digestion by nucleosomes. RESULTS We have generated high-resolution nucleosome maps in paired tumour and normal tissues from the same breast cancer patients using MNase-assisted histone H3 ChIP-seq and compared them with the corresponding cfDNA from blood plasma. This analysis has detected single-nucleosome repositioning at key regulatory regions in a patient-specific manner and common cancer-specific patterns across patients. The nucleosomes gained in tumour versus normal tissue were particularly informative of cancer pathways, with ~ 20-fold enrichment at CpG islands, a large fraction of which marked promoters of genes encoding DNA-binding proteins. The tumour tissues were characterised by a 5-10 bp decrease in the average distance between nucleosomes (nucleosome repeat length, NRL), which is qualitatively similar to the differences between pluripotent and differentiated cells. This effect was correlated with gene activity, differential DNA methylation and changes in local occupancy of linker histone variants H1.4 and H1X. CONCLUSIONS Our study offers a novel resource of high-resolution nucleosome maps in breast cancer patients and reports for the first time the effect of systematic decrease of NRL in paired tumour versus normal breast tissues from the same patient. Our findings provide a new mechanistic understanding of nucleosome repositioning in tumour tissues that can be valuable for patient diagnostics, stratification and monitoring.
Collapse
Affiliation(s)
- Divya R Jacob
- School of Life Sciences, University of Essex, Wivenhoe Park, Colchester, CO4 3SQ, UK
| | - Wilfried M Guiblet
- Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, 20892, USA
| | - Hulkar Mamayusupova
- School of Life Sciences, University of Essex, Wivenhoe Park, Colchester, CO4 3SQ, UK
| | - Mariya Shtumpf
- School of Life Sciences, University of Essex, Wivenhoe Park, Colchester, CO4 3SQ, UK
| | - Isabella Ciuta
- School of Life Sciences, University of Essex, Wivenhoe Park, Colchester, CO4 3SQ, UK
| | - Luminita Ruje
- School of Life Sciences, University of Essex, Wivenhoe Park, Colchester, CO4 3SQ, UK
| | - Svetlana Gretton
- School of Life Sciences, University of Essex, Wivenhoe Park, Colchester, CO4 3SQ, UK
- School of Engineering, Arts, Science and Technology, University of Suffolk, James Hehir Building, University Avenue, Ipswich, Suffolk, IP3 0FS, UK
| | - Milena Bikova
- School of Life Sciences, University of Essex, Wivenhoe Park, Colchester, CO4 3SQ, UK
| | - Clark Correa
- School of Life Sciences, University of Essex, Wivenhoe Park, Colchester, CO4 3SQ, UK
| | - Emily Dellow
- School of Life Sciences, University of Essex, Wivenhoe Park, Colchester, CO4 3SQ, UK
| | - Shivam P Agrawal
- School of Life Sciences, University of Essex, Wivenhoe Park, Colchester, CO4 3SQ, UK
| | - Navid Shafiei
- School of Life Sciences, University of Essex, Wivenhoe Park, Colchester, CO4 3SQ, UK
| | | | - Chris M Armstrong
- School of Life Sciences, University of Essex, Wivenhoe Park, Colchester, CO4 3SQ, UK
| | - Jonathan D G Lam
- School of Life Sciences, University of Essex, Wivenhoe Park, Colchester, CO4 3SQ, UK
| | - Yevhen Vainshtein
- Fraunhofer-Institut für Grenzflächen- und Bioverfahrenstechnik IGB, Nobelstraße 12, 70569, Stuttgart, Germany
| | - Christopher T Clarkson
- School of Life Sciences, University of Essex, Wivenhoe Park, Colchester, CO4 3SQ, UK
- University College London, Gower St, Bloomsbury, London, WC1E 6BT, UK
| | - Graeme J Thorn
- School of Life Sciences, University of Essex, Wivenhoe Park, Colchester, CO4 3SQ, UK
- Barts Cancer Institute, Queen Mary University of London, Charterhouse Square, London, EC1M 6BQ, UK
| | - Kai Sohn
- Fraunhofer-Institut für Grenzflächen- und Bioverfahrenstechnik IGB, Nobelstraße 12, 70569, Stuttgart, Germany
| | - Madapura M Pradeepa
- School of Life Sciences, University of Essex, Wivenhoe Park, Colchester, CO4 3SQ, UK
- Blizard Institute, Barts and the London School of Medicine and Dentistry, Queen Mary University of London, London, E1 2AT, UK
| | - Sankaran Chandrasekharan
- Colchester General Hospital, East Suffolk and North Essex NHS Foundation Trust, Turner Road, Colchester, CO4 5JL, UK
| | - Greg N Brooke
- School of Life Sciences, University of Essex, Wivenhoe Park, Colchester, CO4 3SQ, UK
| | - Elena Klenova
- School of Life Sciences, University of Essex, Wivenhoe Park, Colchester, CO4 3SQ, UK
| | - Victor B Zhurkin
- Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, 20892, USA.
| | - Vladimir B Teif
- School of Life Sciences, University of Essex, Wivenhoe Park, Colchester, CO4 3SQ, UK.
| |
Collapse
|
4
|
Mandler MD, Maligireddy SS, Guiblet WM, Fitzsimmons CM, McDonald KS, Warrell DL, Batista PJ. The modification landscape of P. aeruginosa tRNAs. bioRxiv 2024:2024.02.21.581370. [PMID: 38529508 PMCID: PMC10962704 DOI: 10.1101/2024.02.21.581370] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/27/2024]
Abstract
RNA modifications have a substantial impact on tRNA function, with modifications in the anticodon loop contributing to translational fidelity and modifications in the tRNA core impacting structural stability. In bacteria, tRNA modifications are crucial for responding to stress and regulating the expression of virulence factors. Although tRNA modifications are well-characterized in a few model organisms, our knowledge of tRNA modifications in human pathogens, such as Pseudomonas aeruginosa , remains limited. Here we leveraged two orthogonal approaches to build a reference landscape of tRNA modifications in E. coli , which enabled us to identify similar modifications in P. aeruginosa . Our analysis revealed a substantial degree of conservation between the two organisms, while also uncovering potential sites of tRNA modification in P. aeruginosa tRNAs that are not present in E. coli . The mutational signature at one of these sites, position 46 of tRNA Gln1(UUG) is dependent on the P. aeruginosa homolog of TapT, the enzyme responsible for the 3-(3-amino-3-carboxypropyl) uridine (acp 3 U) modification. Identifying which modifications are present on different tRNAs will uncover the pathways impacted by the different tRNA modifying enzymes, some of which play roles in determining virulence and pathogenicity.
Collapse
|
5
|
Nikitina T, Guiblet WM, Cui F, Zhurkin VB. Histone N-tails modulate sequence-specific positioning of nucleosomes. bioRxiv 2023:2023.11.30.569460. [PMID: 38076910 PMCID: PMC10705531 DOI: 10.1101/2023.11.30.569460] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/17/2023]
Abstract
The precise mechanisms governing sequence-dependent positioning of nucleosomes on DNA remain unknown in detail. Existing algorithms, taking into account the sequence-dependent deformability of DNA and its interactions with the histone globular domains, predict rotational setting of only 65% of human nucleosomes mapped in vivo. To uncover novel factors responsible for the nucleosome positioning, we analyzed potential involvement of the histone N-tails in this process. To this aim, we reconstituted the H2A/H4 N-tailless nucleosomes on human BRCA1 DNA (~100 kb) and compared their positions and sequences with those of the wild-type nucleosomes. In the case of H2A tailless nucleosomes, the AT content of DNA sequences is changed locally at superhelical location (SHL) ±4, while maintaining the same rotational setting as their wild-type counterparts. Conversely, the H4 tailless nucleosomes display widespread changes of the AT content near SHL ±1 and SHL ±2, where the H4 N-tails interact with DNA. Furthermore, a substantial number of H4 tailless nucleosomes exhibit rotational setting opposite to that of the wild-type nucleosomes. Thus, our findings strongly suggest that the histone N-tails are operative in selection of nucleosome positions, which may have wide-ranging implications for epigenetic modulation of chromatin states.
Collapse
Affiliation(s)
- Tatiana Nikitina
- National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, US
| | - Wilfried M. Guiblet
- National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, US
| | - Feng Cui
- Thomas H. Gosnell School of Life Sciences, Rochester Institute of Technology, Rochester, NY, USA
| | - Victor B. Zhurkin
- National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, US
| |
Collapse
|
6
|
Weissensteiner MH, Cremona MA, Guiblet WM, Stoler N, Harris RS, Cechova M, Eckert KA, Chiaromonte F, Huang YF, Makova KD. Accurate sequencing of DNA motifs able to form alternative (non-B) structures. Genome Res 2023; 33:907-922. [PMID: 37433640 PMCID: PMC10519405 DOI: 10.1101/gr.277490.122] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2022] [Accepted: 05/04/2023] [Indexed: 07/13/2023]
Abstract
Approximately 13% of the human genome at certain motifs have the potential to form noncanonical (non-B) DNA structures (e.g., G-quadruplexes, cruciforms, and Z-DNA), which regulate many cellular processes but also affect the activity of polymerases and helicases. Because sequencing technologies use these enzymes, they might possess increased errors at non-B structures. To evaluate this, we analyzed error rates, read depth, and base quality of Illumina, Pacific Biosciences (PacBio) HiFi, and Oxford Nanopore Technologies (ONT) sequencing at non-B motifs. All technologies showed altered sequencing success for most non-B motif types, although this could be owing to several factors, including structure formation, biased GC content, and the presence of homopolymers. Single-nucleotide mismatch errors had low biases in HiFi and ONT for all non-B motif types but were increased for G-quadruplexes and Z-DNA in all three technologies. Deletion errors were increased for all non-B types but Z-DNA in Illumina and HiFi, as well as only for G-quadruplexes in ONT. Insertion errors for non-B motifs were highly, moderately, and slightly elevated in Illumina, HiFi, and ONT, respectively. Additionally, we developed a probabilistic approach to determine the number of false positives at non-B motifs depending on sample size and variant frequency, and applied it to publicly available data sets (1000 Genomes, Simons Genome Diversity Project, and gnomAD). We conclude that elevated sequencing errors at non-B DNA motifs should be considered in low-read-depth studies (single-cell, ancient DNA, and pooled-sample population sequencing) and in scoring rare variants. Combining technologies should maximize sequencing accuracy in future studies of non-B DNA.
Collapse
Affiliation(s)
| | - Marzia A Cremona
- Department of Operations and Decision Systems, Université Laval, Quebec, Quebec G1V0A6, Canada
- Population Health and Optimal Health Practices, CHU de Québec-Université Laval Research Center, Québec, Quebec G1V4G2, Canada
- Center for Medical Genomics, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Wilfried M Guiblet
- Department of Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Laboratory of Cell Biology, NCI-CCR, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Nicholas Stoler
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Robert S Harris
- Department of Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Monika Cechova
- Department of Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Faculty of Informatics, Masaryk University, 60200 Brno, Czech Republic
| | - Kristin A Eckert
- Center for Medical Genomics, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Department of Pathology, The Pennsylvania State University, College of Medicine, Hershey, Pennsylvania 17033, USA
| | - Francesca Chiaromonte
- Center for Medical Genomics, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Department of Statistics, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Institute of Economics and L'EMbeDS, Sant'Anna School of Advanced Studies, Pisa 56127, Italy
| | - Yi-Fei Huang
- Department of Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Center for Medical Genomics, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Kateryna D Makova
- Department of Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA;
- Center for Medical Genomics, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| |
Collapse
|
7
|
Guiblet WM, DeGiorgio M, Cheng X, Chiaromonte F, Eckert KA, Huang YF, Makova KD. Selection and thermostability suggest G-quadruplexes are novel functional elements of the human genome. Genome Res 2021; 31:1136-1149. [PMID: 34187812 PMCID: PMC8256861 DOI: 10.1101/gr.269589.120] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2020] [Accepted: 05/24/2021] [Indexed: 12/11/2022]
Abstract
Approximately 1% of the human genome has the ability to fold into G-quadruplexes (G4s)-noncanonical strand-specific DNA structures forming at G-rich motifs. G4s regulate several key cellular processes (e.g., transcription) and have been hypothesized to participate in others (e.g., firing of replication origins). Moreover, G4s differ in their thermostability, and this may affect their function. Yet, G4s may also hinder replication, transcription, and translation and may increase genome instability and mutation rates. Therefore, depending on their genomic location, thermostability, and functionality, G4 loci might evolve under different selective pressures, which has never been investigated. Here we conducted the first genome-wide analysis of G4 distribution, thermostability, and selection. We found an overrepresentation, high thermostability, and purifying selection for G4s within genic components in which they are expected to be functional-promoters, CpG islands, and 5' and 3' UTRs. A similar pattern was observed for G4s within replication origins, enhancers, eQTLs, and TAD boundary regions, strongly suggesting their functionality. In contrast, G4s on the nontranscribed strand of exons were underrepresented, were unstable, and evolved neutrally. In general, G4s on the nontranscribed strand of genic components had lower density and were less stable than those on the transcribed strand, suggesting that the former are avoided at the RNA level. Across the genome, purifying selection was stronger at stable G4s. Our results suggest that purifying selection preserves the sequences of functional G4s, whereas nonfunctional G4s are too costly to be tolerated in the genome. Thus, G4s are emerging as fundamental, functional genomic elements.
Collapse
Affiliation(s)
- Wilfried M Guiblet
- Bioinformatics and Genomics Graduate Program, Penn State University, University Park, Pennsylvania 16802, USA
| | - Michael DeGiorgio
- Department of Computer and Electrical Engineering and Computer Science, Florida Atlantic University, Boca Raton, Florida 33431, USA
| | - Xiaoheng Cheng
- Department of Biology, Penn State University, University Park, Pennsylvania 16802, USA
| | - Francesca Chiaromonte
- Department of Statistics, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Center for Medical Genomics, Penn State University, University Park and Hershey, Pennsylvania 16802, USA
- Sant'Anna School of Advanced Studies, 56127 Pisa, Italy
| | - Kristin A Eckert
- Center for Medical Genomics, Penn State University, University Park and Hershey, Pennsylvania 16802, USA
- Department of Pathology, Penn State University, College of Medicine, Hershey, Pennsylvania 17033, USA
| | - Yi-Fei Huang
- Department of Biology, Penn State University, University Park, Pennsylvania 16802, USA
- Center for Medical Genomics, Penn State University, University Park and Hershey, Pennsylvania 16802, USA
| | - Kateryna D Makova
- Department of Biology, Penn State University, University Park, Pennsylvania 16802, USA
- Center for Medical Genomics, Penn State University, University Park and Hershey, Pennsylvania 16802, USA
| |
Collapse
|
8
|
Guiblet WM, Cremona MA, Harris RS, Chen D, Eckert KA, Chiaromonte F, Huang YF, Makova KD. Non-B DNA: a major contributor to small- and large-scale variation in nucleotide substitution frequencies across the genome. Nucleic Acids Res 2021; 49:1497-1516. [PMID: 33450015 PMCID: PMC7897504 DOI: 10.1093/nar/gkaa1269] [Citation(s) in RCA: 37] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2020] [Revised: 12/14/2020] [Accepted: 01/11/2021] [Indexed: 12/12/2022] Open
Abstract
Approximately 13% of the human genome can fold into non-canonical (non-B) DNA structures (e.g. G-quadruplexes, Z-DNA, etc.), which have been implicated in vital cellular processes. Non-B DNA also hinders replication, increasing errors and facilitating mutagenesis, yet its contribution to genome-wide variation in mutation rates remains unexplored. Here, we conducted a comprehensive analysis of nucleotide substitution frequencies at non-B DNA loci within noncoding, non-repetitive genome regions, their ±2 kb flanking regions, and 1-Megabase windows, using human-orangutan divergence and human single-nucleotide polymorphisms. Functional data analysis at single-base resolution demonstrated that substitution frequencies are usually elevated at non-B DNA, with patterns specific to each non-B DNA type. Mirror, direct and inverted repeats have higher substitution frequencies in spacers than in repeat arms, whereas G-quadruplexes, particularly stable ones, have higher substitution frequencies in loops than in stems. Several non-B DNA types also affect substitution frequencies in their flanking regions. Finally, non-B DNA explains more variation than any other predictor in multiple regression models for diversity or divergence at 1-Megabase scale. Thus, non-B DNA substantially contributes to variation in substitution frequencies at small and large scales. Our results highlight the role of non-B DNA in germline mutagenesis with implications to evolution and genetic diseases.
Collapse
Affiliation(s)
- Wilfried M Guiblet
- Bioinformatics and Genomics Graduate Program, Penn State University, UniversityPark, PA 16802, USA
| | - Marzia A Cremona
- Department of Statistics, The Pennsylvania State University, University Park, PA 16802, USA
- Department of Operations and Decision Systems, Université Laval, Canada
- CHU de Québec – Université Laval Research Center, Canada
| | - Robert S Harris
- Department of Biology, Penn State University, University Park, PA 16802, USA
| | - Di Chen
- Intercollege Graduate Degree Program in Genetics, Huck Institutes of the Life Sciences, Penn State University, UniversityPark, PA 16802, USA
| | - Kristin A Eckert
- Department of Pathology, Penn State University, College of Medicine, Hershey, PA 17033, USA
- Center for Medical Genomics, Penn State University, University Park and Hershey, PA, USA
| | - Francesca Chiaromonte
- Department of Statistics, The Pennsylvania State University, University Park, PA 16802, USA
- Center for Medical Genomics, Penn State University, University Park and Hershey, PA, USA
- EMbeDS, Sant’Anna School of Advanced Studies, 56127 Pisa, Italy
| | - Yi-Fei Huang
- Department of Biology, Penn State University, University Park, PA 16802, USA
- Center for Medical Genomics, Penn State University, University Park and Hershey, PA, USA
| | - Kateryna D Makova
- Department of Biology, Penn State University, University Park, PA 16802, USA
- Center for Medical Genomics, Penn State University, University Park and Hershey, PA, USA
| |
Collapse
|
9
|
Guiblet WM, Cremona MA, Cechova M, Harris RS, Kejnovská I, Kejnovsky E, Eckert K, Chiaromonte F, Makova KD. Long-read sequencing technology indicates genome-wide effects of non-B DNA on polymerization speed and error rate. Genome Res 2018; 28:1767-1778. [PMID: 30401733 PMCID: PMC6280752 DOI: 10.1101/gr.241257.118] [Citation(s) in RCA: 42] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2018] [Accepted: 10/30/2018] [Indexed: 12/14/2022]
Abstract
DNA conformation may deviate from the classical B-form in ∼13% of the human genome. Non-B DNA regulates many cellular processes; however, its effects on DNA polymerization speed and accuracy have not been investigated genome-wide. Such an inquiry is critical for understanding neurological diseases and cancer genome instability. Here, we present the first simultaneous examination of DNA polymerization kinetics and errors in the human genome sequenced with Single-Molecule Real-Time (SMRT) technology. We show that polymerization speed differs between non-B and B-DNA: It decelerates at G-quadruplexes and fluctuates periodically at disease-causing tandem repeats. Analyzing polymerization kinetics profiles, we predict and validate experimentally non-B DNA formation for a novel motif. We demonstrate that several non-B motifs affect sequencing errors (e.g., G-quadruplexes increase error rates), and that sequencing errors are positively associated with polymerase slowdown. Finally, we show that highly divergent G4 motifs have pronounced polymerization slowdown and high sequencing error rates, suggesting similar mechanisms for sequencing errors and germline mutations.
Collapse
Affiliation(s)
- Wilfried M Guiblet
- Bioinformatics and Genomics Graduate Program, Penn State University, University Park, Pennsylvania 16802, USA
| | - Marzia A Cremona
- Department of Statistics, Penn State University, University Park, Pennsylvania 16802, USA
| | - Monika Cechova
- Department of Biology, Penn State University, University Park, Pennsylvania 16802, USA
| | - Robert S Harris
- Department of Biology, Penn State University, University Park, Pennsylvania 16802, USA
| | - Iva Kejnovská
- Department of Biophysics of Nucleic Acids, Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, 612 65 Brno, Czech Republic
| | - Eduard Kejnovsky
- Department of Plant Developmental Genetics, Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, 612 65 Brno, Czech Republic
| | - Kristin Eckert
- Department of Pathology, Penn State University, College of Medicine, Hershey, Pennsylvania 17033, USA
| | - Francesca Chiaromonte
- Department of Statistics, Penn State University, University Park, Pennsylvania 16802, USA.,Sant'Anna School of Advanced Studies, 56127 Pisa, Italy
| | - Kateryna D Makova
- Department of Biology, Penn State University, University Park, Pennsylvania 16802, USA
| |
Collapse
|
10
|
Limou S, Nelson GW, Lecordier L, An P, O'hUigin CS, David VA, Binns-Roemer EA, Guiblet WM, Oleksyk TK, Pays E, Kopp JB, Winkler CA. Sequencing rare and common APOL1 coding variants to determine kidney disease risk. Kidney Int 2015; 88:754-63. [PMID: 25993319 PMCID: PMC4591109 DOI: 10.1038/ki.2015.151] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2014] [Revised: 03/26/2015] [Accepted: 04/02/2015] [Indexed: 01/07/2023]
Abstract
A third of African Americans with sporadic focal segmental glomerulosclerosis (FSGS) or HIV-associated nephropathy (HIVAN) do not carry APOL1 renal risk genotypes. This raises the possibility that other APOL1 variants may contribute to kidney disease. To address this question, we sequenced all APOL1 exons in 1, 437 Americans of African and European decent, including 464 patients with biopsy-proven FSGS/HIVAN. Testing for association with 33 common and rare variants with FSGS/HIVAN revealed no association independent of strong recessive G1 and G2 effects. Seeking additional variants that might have been under selection by pathogens and could represent candidates for kidney disease risk, we also sequenced an additional 1, 112 individuals representing 53 global populations. Except for G1 and G2, none of the 7 common codon-altering variants showed evidence of selection or could restore lysis against trypanosomes causing human African trypanosomiasis. Thus, only APOL1 G1 and G2 confer renal risk and other common and rare APOL1 missense variants, including the archaic G3 haplotype, do not contribute to sporadic FSGS and HIVAN in the United States population. Hence, in most potential clinical or screening applications, our study suggests that sequencing APOL1 exons is unlikely to bring additional information compared to genotyping only APOL1 G1 and G2 risk alleles.
Collapse
Affiliation(s)
- Sophie Limou
- Molecular Genetic Epidemiology Section, Basic Research Laboratory, Basic Science Program, NCI, Leidos Biomedical Research, Frederick National Laboratory, Frederick, Maryland, USA
| | - George W Nelson
- Center for Cancer Research Informatics Core, Leidos Biomedical Research, Frederick National Laboratory, Frederick, Maryland, USA
| | - Laurence Lecordier
- Laboratory of Molecular Parasitology, IBMM, Université Libre de Bruxelles (ULB), Gosselies, Belgium
| | - Ping An
- Molecular Genetic Epidemiology Section, Basic Research Laboratory, Basic Science Program, NCI, Leidos Biomedical Research, Frederick National Laboratory, Frederick, Maryland, USA
| | - Colm S O'hUigin
- Laboratory of Experimental Immunology, Cancer and Inflammation Program, NCI, Leidos Biomedical Research, Frederick National Laboratory, Frederick, Maryland, USA
| | - Victor A David
- Molecular Genetic Epidemiology Section, Basic Research Laboratory, Basic Science Program, NCI, Frederick National Laboratory, Frederick, Maryland, USA
| | - Elizabeth A Binns-Roemer
- Molecular Genetic Epidemiology Section, Basic Research Laboratory, Basic Science Program, NCI, Leidos Biomedical Research, Frederick National Laboratory, Frederick, Maryland, USA
| | - Wilfried M Guiblet
- Caribbean Genome Center, Department of Biology, University of Puerto Rico at Mayagüez, Mayagüez, Puerto Rico
| | - Taras K Oleksyk
- Caribbean Genome Center, Department of Biology, University of Puerto Rico at Mayagüez, Mayagüez, Puerto Rico
| | - Etienne Pays
- Laboratory of Molecular Parasitology, IBMM, Université Libre de Bruxelles (ULB), Gosselies, Belgium
| | - Jeffrey B Kopp
- Kidney Disease Section, NIDDK, NIH, Bethesda, Maryland, USA
| | - Cheryl A Winkler
- Molecular Genetic Epidemiology Section, Basic Research Laboratory, Basic Science Program, NCI, Leidos Biomedical Research, Frederick National Laboratory, Frederick, Maryland, USA
| |
Collapse
|
11
|
Guiblet WM, Zhao K, O'Brien SJ, Massey SE, Roca AL, Oleksyk TK. SmileFinder: a resampling-based approach to evaluate signatures of selection from genome-wide sets of matching allele frequency data in two or more diploid populations. Gigascience 2015; 4:1. [PMID: 25838885 PMCID: PMC4382839 DOI: 10.1186/2047-217x-4-1] [Citation(s) in RCA: 102] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/01/2014] [Accepted: 10/23/2014] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Adaptive alleles may rise in frequency as a consequence of positive selection, creating a pattern of decreased variation in the neighboring loci, known as a selective sweep. When the region containing this pattern is compared to another population with no history of selection, a rise in variance of allele frequencies between populations is observed. One challenge presented by large genome-wide datasets is the ability to differentiate between patterns that are remnants of natural selection from those expected to arise at random and/or as a consequence of selectively neutral demographic forces acting in the population. FINDINGS SmileFinder is a simple program that looks for diversity and divergence patterns consistent with selection sweeps by evaluating allele frequencies in windows, including neighboring loci from two or more populations of a diploid species against the genome-wide neutral expectation. The program calculates the mean of heterozygosity and FST in a set of sliding windows of incrementally increasing sizes, and then builds a resampled distribution (the baseline) of random multi-locus sets matched to the sizes of sliding windows, using an unrestricted sampling. Percentiles of the values in the sliding windows are derived from the superimposed resampled distribution. The resampling can easily be scaled from 1 K to 100 M; the higher the number, the more precise the percentiles ascribed to the extreme observed values. CONCLUSIONS The output from SmileFinder can be used to plot percentile values to look for population diversity and divergence patterns that may suggest past actions of positive selection along chromosome maps, and to compare lists of suspected candidate genes under random gene sets to test for the overrepresentation of these patterns among gene categories. Both applications of the algorithm have already been used in published studies. Here we present a publicly available, open source program that will serve as a useful tool for preliminary scans of selection using worldwide databases of human genetic variation, as well as population datasets for many non-human species, from which such data is rapidly emerging with the advent of new genotyping and sequencing technologies.
Collapse
Affiliation(s)
- Wilfried M Guiblet
- Biology Department, University of Puerto Rico at Mayagüez, Mayagüez, 00680 Puerto Rico
| | - Kai Zhao
- Department of Animal Sciences, University of Illinois at Urbana-Champaign, Urbana, 61801 Illinois USA
| | - Stephen J O'Brien
- Theodosius Dobzhansky Center for Genome Bioinformatics, St. Petersburg University, St. Petersburg, 199034 Russia ; Oceanographic Center, Nova Southeastern University, Ft. Lauderdale, 33004 Florida USA
| | - Steven E Massey
- Biology Department, University of Puerto Rico at Rio Piedras, Rio Piedras, 00931 Puerto Rico
| | - Alfred L Roca
- Department of Animal Sciences, University of Illinois at Urbana-Champaign, Urbana, 61801 Illinois USA
| | - Taras K Oleksyk
- Biology Department, University of Puerto Rico at Mayagüez, Mayagüez, 00680 Puerto Rico
| |
Collapse
|