1
|
Semyonov DA, Eltsov IV, Nechipurenko YD. A New Bias Site for Epigenetic Modifications: How Non-Canonical GC Base Pairs Favor Mechanochemical Cleavage of DNA. Bioessays 2020; 42:e2000051. [PMID: 32830350 DOI: 10.1002/bies.202000051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2020] [Revised: 07/23/2020] [Indexed: 11/11/2022]
Abstract
Properties of non-canonical GC base pairs and their relations with mechanochemical cleavage of DNA are analyzed. A hypothesis of the involvement of the transient GC wobble base pairs both in the mechanisms of the mechanochemical cleavage of DNA and epigenetic mechanisms involving of 5-methylcytosine, is proposed. The hypothesis explains the increase in the frequency of the breaks of the sugar-phosphate backbone of DNA after cytosines, the asymmetric character of these breaks, and an increase in break frequency in CpG after cytosine methylation. As an alternative hypothesis, probable implication of GC+ Hoogsteen base pairs is considered, which now exemplify the best-studied non-canonical GC base pairs in the DNA double helix. Also see the video abstract here https://youtu.be/EUunVWL0ptw.
Collapse
Affiliation(s)
- Denis A Semyonov
- Institute of Biophysics, Institute of Biophysics, Siberian Branch of Russian Academy of Science., Akademgorodok 50, Krasnoyarsk, 660036, Russia
| | | | - Yury D Nechipurenko
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, Russia
| |
Collapse
|
2
|
Poulos RC, Olivier J, Wong JWH. The interaction between cytosine methylation and processes of DNA replication and repair shape the mutational landscape of cancer genomes. Nucleic Acids Res 2017; 45:7786-7795. [PMID: 28531315 PMCID: PMC5737810 DOI: 10.1093/nar/gkx463] [Citation(s) in RCA: 60] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2017] [Accepted: 05/15/2017] [Indexed: 12/31/2022] Open
Abstract
Methylated cytosines (5mCs) are frequently mutated in the genome. However, no studies have yet comprehensively analysed mutation–methylation associations across cancer types. Here we analyse 916 cancer genomes, together with tissue type-specific methylation and replication timing data. We describe a strong mutation–methylation association across colorectal cancer subtypes, most interestingly in samples with microsatellite instability (MSI) or Polymerase epsilon (POLE) exonuclease domain mutations. By analysing genomic regions with differential mismatch repair (MMR) efficiency, we suggest a possible role for MMR in the correction of 5mC deamination events, potentially accounting for the high rate of 5mC mutation accumulation in MSI tumours. Additionally, we propose that mutant POLE asserts a mutator phenotype specifically at 5mCs, and we find coding mutation hotspots in POLE-mutant cancers at highly-methylated CpGs in the tumour-suppressor genes APC and TP53. Finally, using multivariable regression models, we demonstrate that different cancers exhibit distinct mutation–methylation associations, with DNA repair influencing such associations in certain cancer genomes. Taken together, we find differential associations with methylation that are vital for accurately predicting expected mutation loads across cancer types. Our findings reveal links between methylation and common mutation and repair processes, with these mechanisms defining a key part of the mutational landscape of cancer genomes.
Collapse
Affiliation(s)
- Rebecca C Poulos
- Prince of Wales Clinical School and Lowy Cancer Research Centre, UNSW Sydney, NSW 2052, Australia
| | - Jake Olivier
- School of Mathematics and Statistics, The Red Centre, UNSW Sydney, NSW 2052, Australia
| | - Jason W H Wong
- Prince of Wales Clinical School and Lowy Cancer Research Centre, UNSW Sydney, NSW 2052, Australia
| |
Collapse
|
3
|
Upadhyay M, Sharma N, Vivekanandan P. Systematic CpT (ApG) depletion and CpG excess are unique genomic signatures of large DNA viruses infecting invertebrates. PLoS One 2014; 9:e111793. [PMID: 25369195 PMCID: PMC4219779 DOI: 10.1371/journal.pone.0111793] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2014] [Accepted: 10/02/2014] [Indexed: 11/18/2022] Open
Abstract
Differences in the relative abundance of dinucleotides, if any may provide important clues on host-driven evolution of viruses. We studied dinucleotide frequencies of large DNA viruses infecting vertebrates (n = 105; viruses infecting mammals = 99; viruses infecting aves = 6; viruses infecting reptiles = 1) and invertebrates (n = 88; viruses infecting insects = 84; viruses infecting crustaceans = 4). We have identified systematic depletion of CpT(ApG) dinucleotides and over-representation of CpG dinucleotides as the unique genomic signature of large DNA viruses infecting invertebrates. Detailed investigation of this unique genomic signature suggests the existence of invertebrate host-induced pressures specifically targeting CpT(ApG) and CpG dinucleotides. The depletion of CpT dinucleotides among large DNA viruses infecting invertebrates is at least in part, explained by non-canonical DNA methylation by the infected host. Our findings highlight the role of invertebrate host-related factors in shaping virus evolution and they also provide the necessary framework for future studies on evolution, epigenetics and molecular biology of viruses infecting this group of hosts.
Collapse
Affiliation(s)
- Mohita Upadhyay
- Kusuma School of Biological Sciences, Indian Institute of Technology Delhi, New Delhi, India
| | - Neha Sharma
- Kusuma School of Biological Sciences, Indian Institute of Technology Delhi, New Delhi, India
| | - Perumal Vivekanandan
- Kusuma School of Biological Sciences, Indian Institute of Technology Delhi, New Delhi, India
- * E-mail:
| |
Collapse
|
4
|
Maumus F, Quesneville H. Ancestral repeats have shaped epigenome and genome composition for millions of years in Arabidopsis thaliana. Nat Commun 2014; 5:4104. [PMID: 24954583 PMCID: PMC4090718 DOI: 10.1038/ncomms5104] [Citation(s) in RCA: 54] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2013] [Accepted: 05/13/2014] [Indexed: 12/19/2022] Open
Abstract
Little is known about the evolution of repeated sequences over long periods of time. Using two independent approaches, we show that the majority of the repeats found in the Arabidopsis thaliana genome are ancient and likely to derive from the retention of fragments deposited during ancestral bursts that occurred early in the Brassicaceae evolution. We determine that the majority of young repeats are found in pericentromeric domains, while older copies are frequent in the gene-rich regions. Our results further suggest that the DNA methylation of repeats through small RNA-mediated pathways can last over prolonged periods of time. We also illustrate the way repeated sequences are composted by mutations towards genomic dark matter over time, probably driven by the deamination of methylcytosines, which also have an impact on epigenomic landscapes. Overall, we show that the ancient proliferation of repeat families has long-term consequences on A. thaliana biology and genome composition. Repeated sequences are common in genomes, yet little is known about the long-term evolution of repeats in plants. Here, Maumus and Quesneville show that most of the repeated sequences in the model plant, Arabidopsis thaliana, are old and that many small RNAs correspond to old repeats.
Collapse
Affiliation(s)
- Florian Maumus
- INRA, UR1164 URGI-Research Unit in Genomics-Info, INRA de Versailles-Grignon, Route de Saint-Cyr, Versailles 78026, France
| | - Hadi Quesneville
- INRA, UR1164 URGI-Research Unit in Genomics-Info, INRA de Versailles-Grignon, Route de Saint-Cyr, Versailles 78026, France
| |
Collapse
|
5
|
CpG dinucleotide frequencies reveal the role of host methylation capabilities in parvovirus evolution. J Virol 2013; 87:13816-24. [PMID: 24109231 DOI: 10.1128/jvi.02515-13] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
Parvoviruses are rapidly evolving viruses that infect a wide range of hosts, including vertebrates and invertebrates. Extensive methylation of the parvovirus genome has been recently demonstrated. A global pattern of methylation of CpG dinucleotides is seen in vertebrate genomes, compared to "fractional" methylation patterns in invertebrate genomes. It remains unknown if the loss of CpG dinucleotides occurs in all viruses of a given DNA virus family that infect host species spanning across vertebrates and invertebrates. We investigated the link between the extent of CpG dinucleotide depletion among autonomous parvoviruses and the evolutionary lineage of the infected host. We demonstrate major differences in the relative abundance of CpG dinucleotides among autonomous parvoviruses which share similar genome organization and common ancestry, depending on the infected host species. Parvoviruses infecting vertebrate hosts had significantly lower relative abundance of CpG dinucleotides than parvoviruses infecting invertebrate hosts. The strong correlation of CpG dinucleotide depletion with the gain in TpG/CpA dinucleotides and the loss of TpA dinucleotides among parvoviruses suggests a major role for CpG methylation in the evolution of parvoviruses. Our data present evidence that links the relative abundance of CpG dinucleotides in parvoviruses to the methylation capabilities of the infected host. In sum, our findings support a novel perspective of host-driven evolution among autonomous parvoviruses.
Collapse
|
6
|
Peifer M, Karro JE, von Grünberg HH. Is there an acceleration of the CpG transition rate during the mammalian radiation? Bioinformatics 2008; 24:2157-64. [PMID: 18662928 PMCID: PMC2553435 DOI: 10.1093/bioinformatics/btn391] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2008] [Revised: 07/27/2008] [Accepted: 07/27/2008] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION In this article we build a model of the CpG dinucleotide substitution rate and use it to challenge the claim that, that rate underwent a sudden mammalian-specific increase approximately 90 million years ago. The evidence supporting this hypothesis comes from the application of a model of neutral substitution rates able to account for elevated CpG dinucleotide substitution rates. With the initial goal of improving that model's accuracy, we introduced a modification enabling us to account for boundary effects arising by the truncation of the Markov field, as well as improving the optimization procedure required for estimating the substitution rates. RESULTS When using this modified method to reproduce the supporting analysis, the evidence of the rate shift vanished. Our analysis suggests that the CpG-specific rate has been constant over the relevant time period and that the asserted acceleration of the CpG rate is likely an artifact of the original model.
Collapse
Affiliation(s)
- M Peifer
- Institute of Chemistry, Karl-Franzens University Graz, Graz, Austria.
| | | | | |
Collapse
|
7
|
Fryxell KJ, Moon WJ. CpG mutation rates in the human genome are highly dependent on local GC content. Mol Biol Evol 2004; 22:650-8. [PMID: 15537806 DOI: 10.1093/molbev/msi043] [Citation(s) in RCA: 131] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
CpG dinucleotides mutate at a high rate because cytosine is vulnerable to deamination, cytosines in CpG dinucleotides are often methylated, and deamination of 5-methylcytosine (5mC) produces thymidine. Previous experiments have shown that DNA melting is the rate-limiting step in cytosine deamination. Here we show, through the analysis of human single-nucleotide polymorphisms (SNPs), that the mutation rate produced by 5mC deamination is highly dependent on local GC content. In fact, linear regression analysis showed that the log(10) of the 5mC mutation rates (inferred from SNP frequencies) had slopes of -3 when graphed with respect to the GC content of neighboring sequences. This is the ideal slope that would be expected if the correlation between CpG underrepresentation and GC content had been solely caused by DNA melting. Moreover, this same result was obtained regardless of the SNP locations (all SNPs versus only SNPs in noncoding intergenic regions, excluding CpG islands) and regardless of the lengths over which GC content was calculated (SNP sequences with a modal length of 564 bp versus genomic contigs with a modal length of 163 kb). Several alternative interpretations are discussed.
Collapse
Affiliation(s)
- Karl J Fryxell
- Center for Biomedical Genomics and Informatics, Department of Molecular and Microbiology, George Mason University, Manassas, Virginia, USA.
| | | |
Collapse
|
8
|
Fryxell KJ, Zuckerkandl E. Cytosine deamination plays a primary role in the evolution of mammalian isochores. Mol Biol Evol 2000; 17:1371-83. [PMID: 10958853 DOI: 10.1093/oxfordjournals.molbev.a026420] [Citation(s) in RCA: 130] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
DNA melting is rate-limiting for cytosine deamination, from which we infer that the rate of cytosine deamination should decline twofold for each 10% increase in GC content. Analysis of human DNA sequence data confirms that this is the case for 5-methylcytosine. Several lines of evidence further confirm that it is also the case for unmethylated cytosine and that cytosine deamination causes the majority of all C-->T and G-->A transitions in mammals. Thus, cytosine deamination and DNA base composition each affect the other, forming a positive feedback loop that facilitates divergent genetic drift to high or low GC content. Because a 10 degrees C increase in temperature in vitro increases the rate of cytosine deamination 5. 7-fold, cytosine deamination must be highly dependent on body temperature, which is consistent with the dramatic differences between the isochores of warm-blooded versus cold-blooded vertebrates. Because this process involves both DNA melting and positive feedback, it would be expected to spread progressively (in evolutionary time) down the length of the chromosome, which is consistent with the large size of isochores in modern mammals.
Collapse
Affiliation(s)
- K J Fryxell
- Department of Biology, George Mason University, Fairfax, Virginia 22030, USA.
| | | |
Collapse
|
9
|
Belloni M, Uberti D, Rizzini C, Ferrari-Toninelli G, Rizzonelli P, Jiricny J, Spano P, Memo M. Distribution and kainate-mediated induction of the DNA mismatch repair protein MSH2 in rat brain. Neuroscience 2000; 94:1323-31. [PMID: 10625070 DOI: 10.1016/s0306-4522(99)00380-2] [Citation(s) in RCA: 19] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
DNA repair is one of the most essential systems for maintaining the inherited nucleotide sequence of genomic DNA over time. Repair of DNA damage would be particularly important in neurons, because these cells are among the longest-living cells in the body. MSH2 is one of the proteins which are involved in the recognition and repair of a specific type of DNA damage that is characterized by pair mismatches. We studied the distribution of MSH2 in rat brain by immunohistochemical analysis. We found the level of MSH2 expression in rat brain to be clearly heterogeneous. The highest intensity of staining was found in the pyramidal neurons of the hippocampus and in the entorhinal and frontoparietal cortices. Positive cells were observed in the substantia nigra pars compacta, in cerebellar granular and Purkinje cells, and in the motor neurons of the spinal cord. We investigated the possible modulation of MSH2 expression after injection of kainate. Systemic administration of kainate induces various behavioural alterations and a typical pattern of neuropathology, with cell death in the hippocampal pyramidal neurons of the CA3/CA4 fields. Kainate injection also resulted in a marked, dose-dependent increase of MSH2 immunoreactivity in the hippocampal neurons of the CA3/CA4 fields. The effect was specific, since no changes in immunoreactivity were detected in the dentate gyrus nor in other brain areas. In summary, our data suggest that a mismatch DNA repair system, of which MSH2 protein is a representative component, is heterogeneously expressed in the rat brain and specifically induced by an experimental paradigm of excitotoxicity.
Collapse
Affiliation(s)
- M Belloni
- Department of Biomedical Sciences and Biotechnologies, School of Medicine, University of Brescia, Italy
| | | | | | | | | | | | | | | |
Collapse
|
10
|
Marietta C, Palombo F, Gallinari P, Jiricny J, Brooks PJ. Expression of long-patch and short-patch DNA mismatch repair proteins in the embryonic and adult mammalian brain. BRAIN RESEARCH. MOLECULAR BRAIN RESEARCH 1998; 53:317-20. [PMID: 9473709 DOI: 10.1016/s0169-328x(97)00311-2] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Expression of the DNA mismatch repair (MMR) pathway was examined in the adult and developing rat brain. Rat homologues of human GTBP and MSH2, which are essential components of the post-replicative DNA MMR system, were identified in nuclear extracts from the adult and developing rat brain. Developmental studies showed that both GTBP and MSH2 levels were higher in nuclei isolated from the embryonic brain (day 16) than adult brain. However, this difference was not as dramatic as the difference in the number of proliferating cells. Levels of thymine DNA glycosylase (TDG), the enzyme which catalyzes the first step in short patch G:T mismatch repair, were also decreased in adult compared to embryonic brain. In the adult brain, MMR proteins were elevated in nuclear extracts enriched for neuronal nuclei. These results suggest that adult brain cells have the capacity to carry out DNA mismatch repair, in spite of a lack of ongoing DNA replication.
Collapse
Affiliation(s)
- C Marietta
- Section on Molecular Neurobiology, Laboratory of Neurogenetics, NIH/DICBR/NIAAA/LNG, 12420 Parklawn Drive, Room 451 MSC #8110, Bethesda, MD 20892-8110, USA
| | | | | | | | | |
Collapse
|