1
|
Guiblet WM, Cremona MA, Harris RS, Chen D, Eckert KA, Chiaromonte F, Huang YF, Makova KD. Non-B DNA: a major contributor to small- and large-scale variation in nucleotide substitution frequencies across the genome. Nucleic Acids Res 2021; 49:1497-1516. [PMID: 33450015 PMCID: PMC7897504 DOI: 10.1093/nar/gkaa1269] [Citation(s) in RCA: 58] [Impact Index Per Article: 19.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2020] [Revised: 12/14/2020] [Accepted: 01/11/2021] [Indexed: 12/12/2022] Open
Abstract
Approximately 13% of the human genome can fold into non-canonical (non-B) DNA structures (e.g. G-quadruplexes, Z-DNA, etc.), which have been implicated in vital cellular processes. Non-B DNA also hinders replication, increasing errors and facilitating mutagenesis, yet its contribution to genome-wide variation in mutation rates remains unexplored. Here, we conducted a comprehensive analysis of nucleotide substitution frequencies at non-B DNA loci within noncoding, non-repetitive genome regions, their ±2 kb flanking regions, and 1-Megabase windows, using human-orangutan divergence and human single-nucleotide polymorphisms. Functional data analysis at single-base resolution demonstrated that substitution frequencies are usually elevated at non-B DNA, with patterns specific to each non-B DNA type. Mirror, direct and inverted repeats have higher substitution frequencies in spacers than in repeat arms, whereas G-quadruplexes, particularly stable ones, have higher substitution frequencies in loops than in stems. Several non-B DNA types also affect substitution frequencies in their flanking regions. Finally, non-B DNA explains more variation than any other predictor in multiple regression models for diversity or divergence at 1-Megabase scale. Thus, non-B DNA substantially contributes to variation in substitution frequencies at small and large scales. Our results highlight the role of non-B DNA in germline mutagenesis with implications to evolution and genetic diseases.
Collapse
Affiliation(s)
- Wilfried M Guiblet
- Bioinformatics and Genomics Graduate Program, Penn State University, UniversityPark, PA 16802, USA
| | - Marzia A Cremona
- Department of Statistics, The Pennsylvania State University, University Park, PA 16802, USA
- Department of Operations and Decision Systems, Université Laval, Canada
- CHU de Québec – Université Laval Research Center, Canada
| | - Robert S Harris
- Department of Biology, Penn State University, University Park, PA 16802, USA
| | - Di Chen
- Intercollege Graduate Degree Program in Genetics, Huck Institutes of the Life Sciences, Penn State University, UniversityPark, PA 16802, USA
| | - Kristin A Eckert
- Department of Pathology, Penn State University, College of Medicine, Hershey, PA 17033, USA
- Center for Medical Genomics, Penn State University, University Park and Hershey, PA, USA
| | - Francesca Chiaromonte
- Department of Statistics, The Pennsylvania State University, University Park, PA 16802, USA
- Center for Medical Genomics, Penn State University, University Park and Hershey, PA, USA
- EMbeDS, Sant’Anna School of Advanced Studies, 56127 Pisa, Italy
| | - Yi-Fei Huang
- Department of Biology, Penn State University, University Park, PA 16802, USA
- Center for Medical Genomics, Penn State University, University Park and Hershey, PA, USA
| | - Kateryna D Makova
- Department of Biology, Penn State University, University Park, PA 16802, USA
- Center for Medical Genomics, Penn State University, University Park and Hershey, PA, USA
| |
Collapse
|
2
|
Deshmukh AL, Porro A, Mohiuddin M, Lanni S, Panigrahi GB, Caron MC, Masson JY, Sartori AA, Pearson CE. FAN1, a DNA Repair Nuclease, as a Modifier of Repeat Expansion Disorders. J Huntingtons Dis 2021; 10:95-122. [PMID: 33579867 PMCID: PMC7990447 DOI: 10.3233/jhd-200448] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
FAN1 encodes a DNA repair nuclease. Genetic deficiencies, copy number variants, and single nucleotide variants of FAN1 have been linked to karyomegalic interstitial nephritis, 15q13.3 microdeletion/microduplication syndrome (autism, schizophrenia, and epilepsy), cancer, and most recently repeat expansion diseases. For seven CAG repeat expansion diseases (Huntington's disease (HD) and certain spinocerebellar ataxias), modification of age of onset is linked to variants of specific DNA repair proteins. FAN1 variants are the strongest modifiers. Non-coding disease-delaying FAN1 variants and coding disease-hastening variants (p.R507H and p.R377W) are known, where the former may lead to increased FAN1 levels and the latter have unknown effects upon FAN1 functions. Current thoughts are that ongoing repeat expansions in disease-vulnerable tissues, as individuals age, promote disease onset. Fan1 is required to suppress against high levels of ongoing somatic CAG and CGG repeat expansions in tissues of HD and FMR1 transgenic mice respectively, in addition to participating in DNA interstrand crosslink repair. FAN1 is also a modifier of autism, schizophrenia, and epilepsy. Coupled with the association of these diseases with repeat expansions, this suggests a common mechanism, by which FAN1 modifies repeat diseases. Yet how any of the FAN1 variants modify disease is unknown. Here, we review FAN1 variants, associated clinical effects, protein structure, and the enzyme's attributed functional roles. We highlight how variants may alter its activities in DNA damage response and/or repeat instability. A thorough awareness of the FAN1 gene and FAN1 protein functions will reveal if and how it may be targeted for clinical benefit.
Collapse
Affiliation(s)
- Amit L Deshmukh
- Program of Genetics & Genome Biology, The Hospital for Sick Children, The Peter Gilgan Centre for Research and Learning, Toronto, Ontario, Canada
| | - Antonio Porro
- Institute of Molecular Cancer Research, University of Zurich, Zurich, Switzerland
| | - Mohiuddin Mohiuddin
- Program of Genetics & Genome Biology, The Hospital for Sick Children, The Peter Gilgan Centre for Research and Learning, Toronto, Ontario, Canada
| | - Stella Lanni
- Program of Genetics & Genome Biology, The Hospital for Sick Children, The Peter Gilgan Centre for Research and Learning, Toronto, Ontario, Canada
| | - Gagan B Panigrahi
- Program of Genetics & Genome Biology, The Hospital for Sick Children, The Peter Gilgan Centre for Research and Learning, Toronto, Ontario, Canada
| | - Marie-Christine Caron
- Department of Molecular Biology, Medical Biochemistry and Pathology; Laval University Cancer Research Center, Québec City, Quebec, Canada.,Genome Stability Laboratory, CHU de Québec Research Center, HDQ Pavilion, Oncology Division, Québec City, Quebec, Canada
| | - Jean-Yves Masson
- Department of Molecular Biology, Medical Biochemistry and Pathology; Laval University Cancer Research Center, Québec City, Quebec, Canada.,Genome Stability Laboratory, CHU de Québec Research Center, HDQ Pavilion, Oncology Division, Québec City, Quebec, Canada
| | - Alessandro A Sartori
- Institute of Molecular Cancer Research, University of Zurich, Zurich, Switzerland
| | - Christopher E Pearson
- Program of Genetics & Genome Biology, The Hospital for Sick Children, The Peter Gilgan Centre for Research and Learning, Toronto, Ontario, Canada.,University of Toronto, Program of Molecular Genetics, Toronto, Ontario, Canada
| |
Collapse
|
3
|
Khristich AN, Mirkin SM. On the wrong DNA track: Molecular mechanisms of repeat-mediated genome instability. J Biol Chem 2020; 295:4134-4170. [PMID: 32060097 PMCID: PMC7105313 DOI: 10.1074/jbc.rev119.007678] [Citation(s) in RCA: 148] [Impact Index Per Article: 37.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Expansions of simple tandem repeats are responsible for almost 50 human diseases, the majority of which are severe, degenerative, and not currently treatable or preventable. In this review, we first describe the molecular mechanisms of repeat-induced toxicity, which is the connecting link between repeat expansions and pathology. We then survey alternative DNA structures that are formed by expandable repeats and review the evidence that formation of these structures is at the core of repeat instability. Next, we describe the consequences of the presence of long structure-forming repeats at the molecular level: somatic and intergenerational instability, fragility, and repeat-induced mutagenesis. We discuss the reasons for gender bias in intergenerational repeat instability and the tissue specificity of somatic repeat instability. We also review the known pathways in which DNA replication, transcription, DNA repair, and chromatin state interact and thereby promote repeat instability. We then discuss possible reasons for the persistence of disease-causing DNA repeats in the genome. We describe evidence suggesting that these repeats are a payoff for the advantages of having abundant simple-sequence repeats for eukaryotic genome function and evolvability. Finally, we discuss two unresolved fundamental questions: (i) why does repeat behavior differ between model systems and human pedigrees, and (ii) can we use current knowledge on repeat instability mechanisms to cure repeat expansion diseases?
Collapse
Affiliation(s)
| | - Sergei M Mirkin
- Department of Biology, Tufts University, Medford, Massachusetts 02155.
| |
Collapse
|
4
|
Kaushal S, Freudenreich CH. The role of fork stalling and DNA structures in causing chromosome fragility. Genes Chromosomes Cancer 2019; 58:270-283. [PMID: 30536896 DOI: 10.1002/gcc.22721] [Citation(s) in RCA: 42] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2018] [Revised: 11/13/2018] [Accepted: 12/03/2018] [Indexed: 12/19/2022] Open
Abstract
Alternative non-B form DNA structures, also called secondary structures, can form in certain DNA sequences under conditions that produce single-stranded DNA, such as during replication, transcription, and repair. Direct links between secondary structure formation, replication fork stalling, and genomic instability have been found for many repeated DNA sequences that cause disease when they expand. Common fragile sites (CFSs) are known to be AT-rich and break under replication stress, yet the molecular basis for their fragility is still being investigated. Over the past several years, new evidence has linked both the formation of secondary structures and transcription to fork stalling and fragility of CFSs. How these two events may synergize to cause fragility and the role of nuclease cleavage at secondary structures in rare and CFSs are discussed here. We also highlight evidence for a new hypothesis that secondary structures at CFSs not only initiate fragility but also inhibit healing, resulting in their characteristic appearance.
Collapse
Affiliation(s)
- Simran Kaushal
- Department of Biology, Tufts University, Medford, Massachusetts
| | - Catherine H Freudenreich
- Department of Biology, Tufts University, Medford, Massachusetts.,Program in Genetics, Sackler School of Graduate Biomedical Sciences, Tufts University, Boston, Massachusetts
| |
Collapse
|
5
|
Contracting CAG/CTG repeats using the CRISPR-Cas9 nickase. Nat Commun 2016; 7:13272. [PMID: 27827362 PMCID: PMC5105158 DOI: 10.1038/ncomms13272] [Citation(s) in RCA: 55] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2016] [Accepted: 09/12/2016] [Indexed: 12/15/2022] Open
Abstract
CAG/CTG repeat expansions cause over 13 neurological diseases that remain without a cure. Because longer tracts cause more severe phenotypes, contracting them may provide a therapeutic avenue. No currently known agent can specifically generate contractions. Using a GFP-based chromosomal reporter that monitors expansions and contractions in the same cell population, here we find that inducing double-strand breaks within the repeat tract causes instability in both directions. In contrast, the CRISPR-Cas9 D10A nickase induces mainly contractions independently of single-strand break repair. Nickase-induced contractions depend on the DNA damage response kinase ATM, whereas ATR inhibition increases both expansions and contractions in a MSH2- and XPA-dependent manner. We propose that DNA gaps lead to contractions and that the type of DNA damage present within the repeat tract dictates the levels and the direction of CAG repeat instability. Our study paves the way towards deliberate induction of CAG/CTG repeat contractions in vivo. The expansion of trinucleotide repeats has been linked to several neurodegenerative disorders. Here, the authors show that the CRISPR-Cas9 nuclease induces both expansions and contractions of the repeat region, whereas the nickase leads predominantly to contractions.
Collapse
|
6
|
Absence of MutSβ leads to the formation of slipped-DNA for CTG/CAG contractions at primate replication forks. DNA Repair (Amst) 2016; 42:107-18. [PMID: 27155933 DOI: 10.1016/j.dnarep.2016.04.002] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2015] [Revised: 03/22/2016] [Accepted: 04/05/2016] [Indexed: 11/22/2022]
Abstract
Typically disease-causing CAG/CTG repeats expand, but rare affected families can display high levels of contraction of the expanded repeat amongst offspring. Understanding instability is important since arresting expansions or enhancing contractions could be clinically beneficial. The MutSβ mismatch repair complex is required for CAG/CTG expansions in mice and patients. Oddly, by unknown mechanisms MutSβ-deficient mice incur contractions instead of expansions. Replication using CTG or CAG as the lagging strand template is known to cause contractions or expansions respectively; however, the interplay between replication and repair leading to this instability remains unclear. Towards understanding how repeat contractions may arise, we performed in vitro SV40-mediated replication of repeat-containing plasmids in the presence or absence of mismatch repair. Specifically, we separated repair from replication: Replication mediated by MutSβ- and MutSα-deficient human cells or cell extracts produced slipped-DNA heteroduplexes in the contraction- but not expansion-biased replication direction. Replication in the presence of MutSβ disfavoured the retention of replication products harbouring slipped-DNA heteroduplexes. Post-replication repair of slipped-DNAs by MutSβ-proficient extracts eliminated slipped-DNAs. Thus, a MutSβ-deficiency likely enhances repeat contractions because MutSβ protects against contractions by repairing template strand slip-outs. Replication deficient in LigaseI or PCNA-interaction mutant LigaseI revealed slipped-DNA formation at lagging strands. Our results reveal that distinct mechanisms lead to expansions or contractions and support inhibition of MutSβ as a therapeutic strategy to enhance the contraction of expanded repeats.
Collapse
|
7
|
Holder IT, Wagner S, Xiong P, Sinn M, Frickey T, Meyer A, Hartig JS. Intrastrand triplex DNA repeats in bacteria: a source of genomic instability. Nucleic Acids Res 2015; 43:10126-42. [PMID: 26450966 PMCID: PMC4666352 DOI: 10.1093/nar/gkv1017] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2015] [Accepted: 09/21/2015] [Indexed: 01/10/2023] Open
Abstract
Repetitive nucleic acid sequences are often prone to form secondary structures distinct from B-DNA. Prominent examples of such structures are DNA triplexes. We observed that certain intrastrand triplex motifs are highly conserved and abundant in prokaryotic genomes. A systematic search of 5246 different prokaryotic plasmids and genomes for intrastrand triplex motifs was conducted and the results summarized in the ITxF database available online at http://bioinformatics.uni-konstanz.de/utils/ITxF/. Next we investigated biophysical and biochemical properties of a particular G/C-rich triplex motif (TM) that occurs in many copies in more than 260 bacterial genomes by CD and nuclear magnetic resonance spectroscopy as well as in vivo footprinting techniques. A characterization of putative properties and functions of these unusually frequent nucleic acid motifs demonstrated that the occurrence of the TM is associated with a high degree of genomic instability. TM-containing genomic loci are significantly more rearranged among closely related Escherichia coli strains compared to control sites. In addition, we found very high frequencies of TM motifs in certain Enterobacteria and Cyanobacteria that were previously described as genetically highly diverse. In conclusion we link intrastrand triplex motifs with the induction of genomic instability. We speculate that the observed instability might be an adaptive feature of these genomes that creates variation for natural selection to act upon.
Collapse
Affiliation(s)
- Isabelle T Holder
- Department of Chemistry and Konstanz Research School Chemical Biology (KoRS-CB), University of Konstanz, Universitätsstrasse 10, 78457 Konstanz, Germany
| | - Stefanie Wagner
- Department of Chemistry and Konstanz Research School Chemical Biology (KoRS-CB), University of Konstanz, Universitätsstrasse 10, 78457 Konstanz, Germany
| | - Peiwen Xiong
- Department of Biology, University of Konstanz, Universitätsstrasse 10, 78457 Konstanz, Germany
| | - Malte Sinn
- Department of Chemistry and Konstanz Research School Chemical Biology (KoRS-CB), University of Konstanz, Universitätsstrasse 10, 78457 Konstanz, Germany
| | - Tancred Frickey
- Department of Biology, University of Konstanz, Universitätsstrasse 10, 78457 Konstanz, Germany
| | - Axel Meyer
- Department of Biology, University of Konstanz, Universitätsstrasse 10, 78457 Konstanz, Germany
| | - Jörg S Hartig
- Department of Chemistry and Konstanz Research School Chemical Biology (KoRS-CB), University of Konstanz, Universitätsstrasse 10, 78457 Konstanz, Germany
| |
Collapse
|
8
|
Axford MM, Wang YH, Nakamori M, Zannis-Hadjopoulos M, Thornton CA, Pearson CE. Detection of slipped-DNAs at the trinucleotide repeats of the myotonic dystrophy type I disease locus in patient tissues. PLoS Genet 2013; 9:e1003866. [PMID: 24367268 PMCID: PMC3868534 DOI: 10.1371/journal.pgen.1003866] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2012] [Accepted: 08/25/2013] [Indexed: 12/16/2022] Open
Abstract
Slipped-strand DNAs, formed by out-of-register mispairing of repeat units on complementary strands, were proposed over 55 years ago as transient intermediates in repeat length mutations, hypothesized to cause at least 40 neurodegenerative diseases. While slipped-DNAs have been characterized in vitro, evidence of slipped-DNAs at an endogenous locus in biologically relevant tissues, where instability varies widely, is lacking. Here, using an anti-DNA junction antibody and immunoprecipitation, we identify slipped-DNAs at the unstable trinucleotide repeats (CTG)n•(CAG)n of the myotonic dystrophy disease locus in patient brain, heart, muscle and other tissues, where the largest expansions arise in non-mitotic tissues such as cortex and heart, and are smallest in the cerebellum. Slipped-DNAs are shown to be present on the expanded allele and in chromatinized DNA. Slipped-DNAs are present as clusters of slip-outs along a DNA, with each slip-out having 1–100 extrahelical repeats. The allelic levels of slipped-DNA containing molecules were significantly greater in the heart over the cerebellum (relative to genomic equivalents of pre-IP input DNA) of a DM1 individual; an enrichment consistent with increased allelic levels of slipped-DNA structures in tissues having greater levels of CTG instability. Surprisingly, this supports the formation of slipped-DNAs as persistent mutation products of repeat instability, and not merely as transient mutagenic intermediates. These findings further our understanding of the processes of mutation and genetic variation. Over 30 diseases are caused by the expansion of a trinucleotide repeat (TNR) in a specific gene, including the most common adult-onset form of muscular dystrophy, myotonic dystrophy (DM1). The mechanistic contributors to this unstable (TNR) expansion are not fully known, although since the discovery of these types of diseases over twenty years ago, the extrusion of the expanded repeats into mutagenic slipped-DNA conformations has been hypothesized. Here, we show the presence of slipped-DNA at the DM1 disease locus in various patient tissues. The allelic amounts of slipped-DNA in tissues correlate with overall levels of repeat instability. Slipped-DNA was also found to form in clusters along a tract of expanded repeats, which has been previously shown in vitro to impede DNA repair. This is the first evidence for slipped-DNA formation at an endogenous disease-causing gene in patient tissues.
Collapse
Affiliation(s)
- Michelle M. Axford
- Genetics & Genome Biology, The Hospital for Sick Children, Toronto, Ontario, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
| | - Yuh-Hwa Wang
- Department of Biochemistry, Wake Forest University School of Medicine, Winston-Salem, North Carolina, United States of America
| | - Masayuki Nakamori
- Department of Neurology, University of Rochester School of Medicine and Dentistry, Rochester, New York, United States of America
| | - Maria Zannis-Hadjopoulos
- Goodman Cancer Research Centre and Department of Biochemistry, McGill University, Montreal, Quebec, Canada
| | - Charles A. Thornton
- Department of Neurology, University of Rochester School of Medicine and Dentistry, Rochester, New York, United States of America
| | - Christopher E. Pearson
- Genetics & Genome Biology, The Hospital for Sick Children, Toronto, Ontario, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
- * E-mail:
| |
Collapse
|
9
|
Stevens JR, Lahue EE, Li GM, Lahue RS. Trinucleotide repeat expansions catalyzed by human cell-free extracts. Cell Res 2013; 23:565-72. [PMID: 23337586 PMCID: PMC3616437 DOI: 10.1038/cr.2013.12] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Trinucleotide repeat expansions cause 17 heritable human neurological disorders. In some diseases, somatic expansions occur in non-proliferating tissues such as brain where DNA replication is limited. This finding stimulated significant interest in replication-independent expansion mechanisms. Aberrant DNA repair is a likely source, based in part on mouse studies showing that somatic expansions are provoked by the DNA repair protein MutSβ (Msh2-Msh3 complex). Biochemical studies to date used cell-free extracts or purified DNA repair proteins to yield partial reactions at triplet repeats. The findings included expansions on one strand but not the other, or processing of DNA hairpin structures thought to be important intermediates in the expansion process. However, it has been difficult to recapitulate complete expansions in vitro, and the biochemical role of MutSβ remains controversial. Here, we use a novel in vitro assay to show that human cell-free extracts catalyze expansions and contractions of trinucleotide repeats without the requirement for DNA replication. The extract promotes a size range of expansions that is similar to certain diseases, and triplet repeat length and sequence govern expansions in vitro as in vivo. MutSβ stimulates expansions in the extract, consistent with aberrant repair of endogenous DNA damage as a source of expansions. Overall, this biochemical system retains the key characteristics of somatic expansions in humans and mice, suggesting that this important mutagenic process can be restored in the test tube.
Collapse
Affiliation(s)
- Jennifer R Stevens
- Centre for Chromosome Biology, School of Natural Sciences, National University of Ireland Galway, Distillery Road, Galway, Ireland
| | | | | | | |
Collapse
|
10
|
Slean MM, Reddy K, Wu B, Nichol Edamura K, Kekis M, Nelissen FHT, Aspers RLEG, Tessari M, Schärer OD, Wijmenga SS, Pearson CE. Interconverting conformations of slipped-DNA junctions formed by trinucleotide repeats affect repair outcome. Biochemistry 2013; 52:773-85. [PMID: 23339280 PMCID: PMC3566650 DOI: 10.1021/bi301369b] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Expansions of (CTG)·(CAG) repeated DNAs are the mutagenic cause of 14 neurological diseases, likely arising through the formation and processing of slipped-strand DNAs. These transient intermediates of repeat length mutations are formed by out-of-register mispairing of repeat units on complementary strands. The three-way slipped-DNA junction, at which the excess repeats slip out from the duplex, is a poorly understood feature common to these mutagenic intermediates. Here, we reveal that slipped junctions can assume a surprising number of interconverting conformations where the strand opposite the slip-out either is fully base paired or has one or two unpaired nucleotides. These unpaired nucleotides can also arise opposite either of the nonslipped junction arms. Junction conformation can affect binding by various structure-specific DNA repair proteins and can also alter correct nick-directed repair levels. Junctions that have the potential to contain unpaired nucleotides are repaired with a significantly higher efficiency than constrained fully paired junctions. Surprisingly, certain junction conformations are aberrantly repaired to expansion mutations: misdirection of repair to the non-nicked strand opposite the slip-out leads to integration of the excess slipped-out repeats rather than their excision. Thus, slipped-junction structure can determine whether repair attempts lead to correction or expansion mutations.
Collapse
Affiliation(s)
- Meghan M Slean
- Program of Genetics and Genome Biology, The Hospital for Sick Children, Toronto, Ontario, Canada
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
11
|
Fonville NC, Ward RM, Mittelman D. Stress-induced modulators of repeat instability and genome evolution. J Mol Microbiol Biotechnol 2012; 21:36-44. [PMID: 22248541 DOI: 10.1159/000332748] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
Abstract
Evolution hinges on the ability of organisms to adapt to their environment. A key regulator of adaptability is mutation rate, which must be balanced to maintain genome fidelity while permitting sufficient plasticity to cope with environmental changes. Multiple mechanisms govern an organism's mutation rate. Constitutive mechanisms include mutator alleles that drive global, permanent increases in mutation rates, but these changes are confined to the subpopulation that carries the mutator allele. Other mechanisms focus mutagenesis in time and space to improve the chances that adaptive mutations can spread through the population. For example, environmental stress can induce mechanisms that transiently relax the fidelity of DNA repair to bring about a temporary increase in mutation rates during times when an organism experiences a reduced fitness for its surroundings, as has been demonstrated for double-strand break repair in Escherichia coli. Still, other mechanisms control the spatial distribution of mutations by directing changes to especially mutable sequences in the genome. In eukaryotic cells, for example, the stress-sensitive chaperone Hsp90 can regulate the length of trinucleotide repeats to fine-tune gene function and can regulate the mobility of transposable elements to enable larger functional changes. Here, we review the regulation of mutation rate, with special emphasis on the roles of tandem repeats and environmental stress in genome evolution.
Collapse
|
12
|
Völker J, Plum G, Klump HH, Breslauer KJ. Energetic coupling between clustered lesions modulated by intervening triplet repeat bulge loops: allosteric implications for DNA repair and triplet repeat expansion. Biopolymers 2010; 93:355-69. [PMID: 19890964 PMCID: PMC3902826 DOI: 10.1002/bip.21343] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
Clusters of closely spaced oxidative DNA lesions present challenges to the cellular repair machinery. When located in opposing strands, base excision repair (BER) of such lesions can lead to double strand DNA breaks (DSB). Activation of BER and DSB repair pathways has been implicated in inducing enhanced expansion of triplet repeat sequences. We show here that energy coupling between distal lesions (8oxodG and/or abasic sites) in opposing DNA strands can be modulated by a triplet repeat bulge loop located between the lesion sites. We find this modulation to be dependent on the identity of the lesions (8oxodG vs. abasic site) and the positions of the lesions (upstream vs. downstream) relative to the intervening bulge loop domain. We discuss how such bulge loop-mediated lesion crosstalk might influence repair processes, while favoring DNA expansion, the genotype of triplet repeat diseases.
Collapse
Affiliation(s)
- Jens Völker
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, 610 Taylor Rd, Piscataway, NJ 08854
| | - G.Eric Plum
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, 610 Taylor Rd, Piscataway, NJ 08854
- IBET Inc, 1507 Chambers Road, Suite 301, Columbus, OH 43212
| | - Horst H. Klump
- Department of Molecular and Cell Biology, University of Cape Town, Private Bag, Rondebosch 7800, South Africa
| | - Kenneth J. Breslauer
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, 610 Taylor Rd, Piscataway, NJ 08854
- The Cancer Institute of New Jersey, New Brunswick, NJ 08901
| |
Collapse
|
13
|
Zhao J, Bacolla A, Wang G, Vasquez KM. Non-B DNA structure-induced genetic instability and evolution. Cell Mol Life Sci 2010; 67:43-62. [PMID: 19727556 PMCID: PMC3017512 DOI: 10.1007/s00018-009-0131-2] [Citation(s) in RCA: 305] [Impact Index Per Article: 21.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2009] [Revised: 07/22/2009] [Accepted: 08/11/2009] [Indexed: 11/26/2022]
Abstract
Repetitive DNA motifs are abundant in the genomes of various species and have the capacity to adopt non-canonical (i.e., non-B) DNA structures. Several non-B DNA structures, including cruciforms, slipped structures, triplexes, G-quadruplexes, and Z-DNA, have been shown to cause mutations, such as deletions, expansions, and translocations in both prokaryotes and eukaryotes. Their distributions in genomes are not random and often co-localize with sites of chromosomal breakage associated with genetic diseases. Current genome-wide sequence analyses suggest that the genomic instabilities induced by non-B DNA structure-forming sequences not only result in predisposition to disease, but also contribute to rapid evolutionary changes, particularly in genes associated with development and regulatory functions. In this review, we describe the occurrence of non-B DNA-forming sequences in various species, the classes of genes enriched in non-B DNA-forming sequences, and recent mechanistic studies on DNA structure-induced genomic instability to highlight their importance in genomes.
Collapse
Affiliation(s)
- Junhua Zhao
- Department of Carcinogenesis, Science Park-Research Division, The University of Texas M.D. Anderson Cancer Center, 1808 Park Road 1-C, P.O. Box 389, Smithville, TX 78957 USA
| | - Albino Bacolla
- Department of Carcinogenesis, Science Park-Research Division, The University of Texas M.D. Anderson Cancer Center, 1808 Park Road 1-C, P.O. Box 389, Smithville, TX 78957 USA
| | - Guliang Wang
- Department of Carcinogenesis, Science Park-Research Division, The University of Texas M.D. Anderson Cancer Center, 1808 Park Road 1-C, P.O. Box 389, Smithville, TX 78957 USA
| | - Karen M. Vasquez
- Department of Carcinogenesis, Science Park-Research Division, The University of Texas M.D. Anderson Cancer Center, 1808 Park Road 1-C, P.O. Box 389, Smithville, TX 78957 USA
| |
Collapse
|
14
|
Wang G, Vasquez KM. Models for chromosomal replication-independent non-B DNA structure-induced genetic instability. Mol Carcinog 2009; 48:286-98. [PMID: 19123200 PMCID: PMC2766916 DOI: 10.1002/mc.20508] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Regions of genomic DNA containing repetitive nucleotide sequences can adopt a number of different structures in addition to the canonical B-DNA form: many of these non-B DNA structures are causative factors in genetic instability and human disease. Although chromosomal DNA replication through such repetitive sequences has been considered a major cause of non-B form DNA structure-induced genetic instability, it is also observed in non-proliferative tissues. In this review, we discuss putative mechanisms responsible for the mutagenesis induced by non-B DNA structures in the absence of chromosomal DNA replication.
Collapse
Affiliation(s)
- Guliang Wang
- Department of Carcinogenesis, University of Texas M.D. Anderson Cancer Center, Science Park-Research Division, 1808 Park Road 1-C, Smithville, TX 78957
| | - Karen M. Vasquez
- Department of Carcinogenesis, University of Texas M.D. Anderson Cancer Center, Science Park-Research Division, 1808 Park Road 1-C, Smithville, TX 78957
| |
Collapse
|
15
|
Pollard LM, Bourn RL, Bidichandani SI. Repair of DNA double-strand breaks within the (GAA*TTC)n sequence results in frequent deletion of the triplet-repeat sequence. Nucleic Acids Res 2008; 36:489-500. [PMID: 18045804 PMCID: PMC2241870 DOI: 10.1093/nar/gkm1066] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2007] [Revised: 11/07/2007] [Accepted: 11/12/2007] [Indexed: 11/13/2022] Open
Abstract
Friedreich ataxia is caused by an expanded (GAA*TTC)n sequence, which is unstable during intergenerational transmission and in most patient tissues, where it frequently undergoes large deletions. We investigated the effect of DSB repair on instability of the (GAA*TTC)n sequence. Linear plasmids were transformed into Escherichia coli so that each colony represented an individual DSB repair event. Repair of a DSB within the repeat resulted in a dramatic increase in deletions compared with circular templates, but DSB repair outside the repeat tract did not affect instability. Repair-mediated deletions were independent of the orientation and length of the repeat, the location of the break within the repeat or the RecA status of the strain. Repair at the center of the repeat resulted in deletion of approximately half of the repeat tract, and repair at an off-center location produced deletions that were equivalent in length to the shorter of the two repeats flanking the DSB. This is consistent with a single-strand annealing mechanism of DSB repair, and implicates erroneous DSB repair as a mechanism for genetic instability of the (GAA*TTC)n sequence. Our data contrast significantly with DSB repair within (CTG*CAG)n repeats, indicating that repair-mediated instability is dependent on the sequence of the triplet repeat.
Collapse
Affiliation(s)
- Laura M. Pollard
- Department of Biochemistry and Molecular Biology and Department of Pediatrics, University of Oklahoma Health Sciences Center, Oklahoma City, OK 73104, USA
| | - Rebecka L. Bourn
- Department of Biochemistry and Molecular Biology and Department of Pediatrics, University of Oklahoma Health Sciences Center, Oklahoma City, OK 73104, USA
| | - Sanjay I. Bidichandani
- Department of Biochemistry and Molecular Biology and Department of Pediatrics, University of Oklahoma Health Sciences Center, Oklahoma City, OK 73104, USA
| |
Collapse
|
16
|
Faux NG, Huttley GA, Mahmood K, Webb GI, Garcia de la Banda M, Whisstock JC. RCPdb: An evolutionary classification and codon usage database for repeat-containing proteins. Genome Res 2007; 17:1118-27. [PMID: 17567984 PMCID: PMC1899123 DOI: 10.1101/gr.6255407] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
Over 3% of human proteins contain single amino acid repeats (repeat-containing proteins, RCPs). Many repeats (homopeptides) localize to important proteins involved in transcription, and the expansion of certain repeats, in particular poly-Q and poly-A tracts, can also lead to the development of neurological diseases. Previous studies have suggested that the homopeptide makeup is a result of the presence of G+C-rich tracts in the encoding genes and that expansion occurs via replication slippage. Here, we have performed a large-scale genomic analysis of the variation of the genes encoding RCPs in 13 species and present these data in an online database (http://repeats.med.monash.edu.au/genetic_analysis/). This resource allows rapid comparison and analysis of RCPs, homopeptides, and their underlying genetic tracts across the eukaryotic species considered. We report three major findings. First, there is a bias for a small subset of codons being reiterated within homopeptides, and there is no G+C or A+T bias relative to the organism's transcriptome. Second, single base pair transversions from the homocodon are unusually common and may represent a mechanism of reducing the rate of homopeptide mutations. Third, homopeptides that are conserved across different species lie within regions that are under stronger purifying selection in contrast to nonconserved homopeptides.
Collapse
Affiliation(s)
- Noel G. Faux
- Protein Crystallography Unit, Department of Biochemistry and Molecular Biology, Monash University, Clayton Campus, Melbourne, Victoria 3800, Australia
- Victorian Bioinformatics Consortium, Monash University, Clayton Campus, Melbourne, Victoria 3800, Australia
- ARC Centre for Structural and Functional Microbial Genomics, Monash University, Clayton Campus, Melbourne, Victoria 3800, Australia
| | - Gavin A. Huttley
- John Curtin School of Medical Research, Australian National University, Canberra, Australian National Territory 0200, Australia
| | - Khalid Mahmood
- Protein Crystallography Unit, Department of Biochemistry and Molecular Biology, Monash University, Clayton Campus, Melbourne, Victoria 3800, Australia
- Victorian Bioinformatics Consortium, Monash University, Clayton Campus, Melbourne, Victoria 3800, Australia
- ARC Centre for Structural and Functional Microbial Genomics, Monash University, Clayton Campus, Melbourne, Victoria 3800, Australia
| | - Geoffrey I. Webb
- Victorian Bioinformatics Consortium, Monash University, Clayton Campus, Melbourne, Victoria 3800, Australia
- School of Computer Science and Software Engineering, Monash University, Clayton Campus, Melbourne, Victoria 3800, Australia
| | - Maria Garcia de la Banda
- Victorian Bioinformatics Consortium, Monash University, Clayton Campus, Melbourne, Victoria 3800, Australia
- School of Computer Science and Software Engineering, Monash University, Clayton Campus, Melbourne, Victoria 3800, Australia
- Corresponding authors.E-mail ; fax 61 3 9905 4699.E-mail ; fax 61 3 9905 4699
| | - James C. Whisstock
- Protein Crystallography Unit, Department of Biochemistry and Molecular Biology, Monash University, Clayton Campus, Melbourne, Victoria 3800, Australia
- Victorian Bioinformatics Consortium, Monash University, Clayton Campus, Melbourne, Victoria 3800, Australia
- ARC Centre for Structural and Functional Microbial Genomics, Monash University, Clayton Campus, Melbourne, Victoria 3800, Australia
- Corresponding authors.E-mail ; fax 61 3 9905 4699.E-mail ; fax 61 3 9905 4699
| |
Collapse
|
17
|
Kosmider B, Wells RD. Double-strand breaks in the myotonic dystrophy type 1 and the fragile X syndrome triplet repeat sequences induce different types of mutations in DNA flanking sequences in Escherichia coli. Nucleic Acids Res 2006; 34:5369-82. [PMID: 17012280 PMCID: PMC1636463 DOI: 10.1093/nar/gkl612] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The putative role of double-strand breaks (DSBs) created in vitro by restriction enzyme cleavage in or near CGG*CCG or CTG*CAG repeat tracts on their genetic instabilities, both within the repeats and in their flanking sequences, was investigated in an Escherichia coli plasmid system. DSBs at TRS junctions with the vector generated a large number of mutagenic events in flanking sequences whereas DSBs within the repeats elicited no similar products. A substantial enhancement in the number of mutants was caused by transcription of the repeats and by the absence of recombination functions (recA-, recBC-). Surprisingly, DNA sequence analyses on mutant clones revealed the presence of only single deletions of 0.4-1.6 kb including the TRS and the flanking sequence from plasmids originally containing (CGG*CCG)43 but single, double and multiple deletions as well as insertions were found for plasmids originally containing (CTG*CAG)n (where n = 43 or 70). Non-B DNA structures (slipped structures with loops, cruciforms, triplexes and tetraplexes) as well as microhomologies are postulated to participate in the recombination and/or repair processes.
Collapse
Affiliation(s)
| | - Robert D. Wells
- To whom correspondence should be addressed. Tel: +1 713 677 7651; Fax: +1 713 677 7689;
| |
Collapse
|
18
|
Wojciechowska M, Napierala M, Larson JE, Wells RD. Non-B DNA conformations formed by long repeating tracts of myotonic dystrophy type 1, myotonic dystrophy type 2, and Friedreich's ataxia genes, not the sequences per se, promote mutagenesis in flanking regions. J Biol Chem 2006; 281:24531-43. [PMID: 16793772 DOI: 10.1074/jbc.m603888200] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
The expansions of long repeating tracts of CTG.CAG, CCTG.CAGG, and GAA.TTC are integral to the etiology of myotonic dystrophy type 1 (DM1), myotonic dystrophy type 2 (DM2), and Friedreich's ataxia (FRDA). Essentially all studies on the molecular mechanisms of this expansion process invoke an important role for non-B DNA conformations which may be adopted by these repeat sequences. We have directly evaluated the role(s) of the repeating sequences per se, or of the non-B DNA conformations formed by these sequences, in the mutagenic process. Studies in Escherichia coli and three types of mammalian (COS-7, CV-1, and HEK-293) fibroblast-like cells revealed that conditions which promoted the formation of the non-B DNA structures enhanced the genetic instabilities, both within the repeat sequences and in the flanking sequences of up to approximately 4 kbp. The three strategies utilized included: the in vivo modulation of global negative supercoil density using topA and gyrB mutant E. coli strains; the in vivo cleavage of hairpin loops, which are an obligate consequence of slipped-strand structures, cruciforms, and intramolecular triplexes, by inactivation of the SbcC protein; and by genetic instability studies with plasmids containing long repeating sequence inserts that do, and do not, adopt non-B DNA structures in vitro. Hence, non-B DNA conformations are critical for these mutagenesis mechanisms.
Collapse
Affiliation(s)
- Marzena Wojciechowska
- Institute of Biosciences and Technology, Center for Genome Research, Texas A&M University System Health Science Center, Houston, Texas 77030, USA
| | | | | | | |
Collapse
|
19
|
Bacolla A, Wojciechowska M, Kosmider B, Larson JE, Wells RD. The involvement of non-B DNA structures in gross chromosomal rearrangements. DNA Repair (Amst) 2006; 5:1161-70. [PMID: 16807140 DOI: 10.1016/j.dnarep.2006.05.032] [Citation(s) in RCA: 70] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
Abstract
Non-B DNA conformations adopted by certain types of DNA sequences promote genetic instabilities, especially gross rearrangements including translocations. We conclude the following: (a) slipped (hairpin) structures, cruciforms, triplexes, tetraplexes and i-motifs, and left-handed Z-DNA are formed in chromosomes and elicit profound genetic consequences via recombination-repair, (b) repeating sequences, probably in their non-B conformations, cause gross genomic rearrangements (translocations, deletions, insertions, inversions, and duplications), and (c) these rearrangements are the genetic basis for numerous human diseases including polycystic kidney disease, adrenoleukodystrophy, follicular lymphomas, and spermatogenic failure.
Collapse
Affiliation(s)
- Albino Bacolla
- Institute of Biosciences and Technology, Center for Genome Research, The Texas A&M University System Health Science Center, Texas Medical Center, 2121 West Holcombe Blvd., Houston, TX 77030, USA.
| | | | | | | | | |
Collapse
|
20
|
Kim SH, Pytlos MJ, Sinden RR. Replication restart: a pathway for (CTG).(CAG) repeat deletion in Escherichia coli. Mutat Res 2006; 595:5-22. [PMID: 16472829 DOI: 10.1016/j.mrfmmm.2005.07.010] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2005] [Revised: 07/01/2005] [Accepted: 07/01/2005] [Indexed: 11/20/2022]
Abstract
(CTG)n.(CAG)n repeats undergo deletion at a high rate in plasmids in Escherichia coli in a process that involves RecA and RecB. In addition, DNA replication fork progression can be blocked during synthesis of (CTG)n.(CAG)n repeats. Replication forks stalled at (CTG)n.(CAG)n repeats may be rescued by replication restart that involves recombination as well as enzymes involved in replication and DNA repair, and this process may be responsible for the high rate of repeat deletion in E. coli. To test this hypothesis (CAG)n.(CTG)n deletion rates were measured in several E. coli strains carrying mutations involved in replication restart. (CAG)n.(CTG)n deletion rates were decreased, relative to the rates in wild type cells, in strains containing mutations in priA, recG, ruvAB, and recO. Mutations in priB and priC resulted in small reductions in deletion rates. In a recF strain, rates were decreased when (CAG)n comprised the leading template strand, but rates were increased when (CTG)n comprised the leading template. Deletion rates were increased slightly in a recJ strain. The mutational spectra for most mutant strains were altered relative to those in parental strains. In addition, purified PriA and RecG proteins showed unexpected binding to single-stranded, duplex, and forked DNAs containing (CAG)n and/or (CTG)n loop-outs in various positions. The results presented are consistent with an interpretation that the high rates of trinucleotide repeat instability observed in E. coli result from the attempted restart of replication forks stalled at (CAG)n.(CTG)n repeats.
Collapse
Affiliation(s)
- Seung-Hwan Kim
- Laboratory of DNA Structure and Mutagenesis, Center for Genome Research, Institute of Biosciences and Technology, Texas A&M University System Health Science Center, 2121 West Holcombe Blvd., Houston, TX 77030-3303, USA
| | | | | |
Collapse
|
21
|
Abstract
Repetitive DNA sequences are abundant in eukaryotic genomes, and many of these sequences have the potential to adopt non-B DNA conformations. Genes harboring non-B DNA structure-forming sequences increase the risk of genetic instability and thus are associated with human diseases. In this review, we discuss putative mechanisms responsible for genetic instability events occurring at these non-B DNA structures, with a focus on hairpins, left-handed Z-DNA, and intramolecular triplexes or H-DNA. Slippage and misalignment are the most common events leading to DNA structure-induced mutagenesis. However, a number of other mechanisms of genetic instability have been proposed based on the finding that these structures not only induce expansions and deletions, but can also induce DNA strand breaks and rearrangements. The available data implicate a variety of proteins, such as mismatch repair proteins, nucleotide excision repair proteins, topoisomerases, and structure specific-nucleases in the processing of these mutagenic DNA structures. The potential mechanisms of genetic instability induced by these structures and their contribution to human diseases are discussed.
Collapse
Affiliation(s)
- Guliang Wang
- Department of Carcinogenesis, University of Texas M.D. Anderson Cancer Center, Science Park-Research Division, 1808 Park Road 1-C, P.O. Box 389, Smithville, 78957, USA
| | | |
Collapse
|
22
|
Pearson CE, Nichol Edamura K, Cleary JD. Repeat instability: mechanisms of dynamic mutations. Nat Rev Genet 2005; 6:729-42. [PMID: 16205713 DOI: 10.1038/nrg1689] [Citation(s) in RCA: 645] [Impact Index Per Article: 33.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Disease-causing repeat instability is an important and unique form of mutation that is linked to more than 40 neurological, neurodegenerative and neuromuscular disorders. DNA repeat expansion mutations are dynamic and ongoing within tissues and across generations. The patterns of inherited and tissue-specific instability are determined by both gene-specific cis-elements and trans-acting DNA metabolic proteins. Repeat instability probably involves the formation of unusual DNA structures during DNA replication, repair and recombination. Experimental advances towards explaining the mechanisms of repeat instability have broadened our understanding of this mutational process. They have revealed surprising ways in which metabolic pathways can drive or protect from repeat instability.
Collapse
Affiliation(s)
- Christopher E Pearson
- Program of Genetics and Genomic Biology, The Hospital for Sick Children, 15-312, TMDT, 101 College Street, East Tower, Toronto, Ontario M5G 1L7, Canada.
| | | | | |
Collapse
|
23
|
Pelletier R, Farrell BT, Miret JJ, Lahue RS. Mechanistic features of CAG*CTG repeat contractions in cultured cells revealed by a novel genetic assay. Nucleic Acids Res 2005; 33:5667-76. [PMID: 16199754 PMCID: PMC1240116 DOI: 10.1093/nar/gki880] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
Trinucleotide repeats (TNRs) undergo high frequency mutagenesis to cause at least 15 neurodegenerative diseases. To understand better the molecular mechanisms of TNR instability in cultured cells, a new genetic assay was created using a shuttle vector. The shuttle vector contains a promoter-TNR-reporter gene construct whose expression is dependent on TNR length. The vector harbors the SV40 ori and large T antigen gene, allowing portability between primate cell lines. The shuttle vector is propagated in cultured cells, then recovered and analyzed in yeast using selection for reporter gene expression. We show that (CAG•CTG)25−33 contracts at frequencies as high as 1% in 293T and 293 human cells and in COS-1 monkey cells, provided that the plasmid undergoes replication. Hairpin-forming capacity of the repeat sequence stimulated contractions. Evidence for a threshold was observed between 25 and 33 repeats in COS-1 cells, where contraction frequencies increased sharply (up 720%) over a narrow range of repeat lengths. Expression of the mismatch repair protein Mlh1 does not correlate with repeat instability, suggesting contractions are independent of mismatch repair in our system. Together, these findings recapitulate certain features of human genetics and therefore establish a novel cell culture system to help provide new mechanistic insights into CAG•CTG repeat instability.
Collapse
Affiliation(s)
| | - Brian T. Farrell
- Department of Pathology and Microbiology, University of Nebraska Medical CenterBox 986805, Omaha, NE 68198-6805, USA
| | | | - Robert S. Lahue
- To whom correspondence should be addressed. Tel: +1 402 559 4619; Fax: +1 402 559 8270;
| |
Collapse
|
24
|
Hebert ML, Wells RD. Roles of double-strand breaks, nicks, and gaps in stimulating deletions of CTG.CAG repeats by intramolecular DNA repair. J Mol Biol 2005; 353:961-79. [PMID: 16213518 DOI: 10.1016/j.jmb.2005.09.023] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2005] [Revised: 08/30/2005] [Accepted: 09/09/2005] [Indexed: 11/19/2022]
Abstract
A series of plasmids harboring CTG.CAG repeats with double-strand breaks (DSB), single-strand nicks, or single-strand gaps (15 or 30 nucleotides) within the repeat regions were used to determine their capacity to induce genetic instabilities. These plasmids were introduced into Escherichia coli in the presence of a second plasmid containing a sequence that could support homologous recombination repair between the two plasmids. The transfer of a point mutation from the second to the first plasmid was used to monitor homologous recombination (gene conversion). Only DSBs increased the overall genetic instability. This instability took place by intramolecular repair, which was not dependent on RuvA. Double-strand break-induced instabilities were partially stabilized by a mutation in recF. Gaps of 30 nt formed a distinct 30 nt deletion product, whereas single strand nicks and gaps of 15 nt did not induce expansions or deletions. Formation of this deletion product required the CTG.CAG repeats to be present in the single-stranded region and was stimulated by E.coli DNA ligase, but was not dependent upon the RecFOR pathway. Models are presented to explain the intramolecular repair-induced instabilities and the formation of the 30 nt deletion product.
Collapse
Affiliation(s)
- Micheal L Hebert
- Center for Genome Research, Institute of Biosciences and Technology, Texas A and M University System Health Science Center, 2121 W. Holcombe Blvd., Houston, TX 77030-3303, USA
| | | |
Collapse
|
25
|
Wells RD, Dere R, Hebert ML, Napierala M, Son LS. Advances in mechanisms of genetic instability related to hereditary neurological diseases. Nucleic Acids Res 2005; 33:3785-98. [PMID: 16006624 PMCID: PMC1174910 DOI: 10.1093/nar/gki697] [Citation(s) in RCA: 185] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Substantial progress has been realized in the past several years in our understanding of the molecular mechanisms responsible for the expansions and deletions (genetic instabilities) of repeating tri-, tetra- and pentanucleotide repeating sequences associated with a number of hereditary neurological diseases. These instabilities occur by replication, recombination and repair processes, probably acting in concert, due to slippage of the DNA complementary strands relative to each other. The biophysical properties of the folded-back repeating sequence strands play a critical role in these instabilities. Non-B DNA structural elements (hairpins and slipped structures, DNA unwinding elements, tetraplexes, triplexes and sticky DNA) are described. The replication mechanisms are influenced by pausing of the replication fork, orientation of the repeat strands, location of the repeat sequences relative to replication origins and the flap endonuclease. Methyl-directed mismatch repair, nucleotide excision repair, and repair of damage caused by mutagens are discussed. Genetic recombination and double-strand break repair advances in Escherichia coli, yeast and mammalian models are reviewed. Furthermore, the newly discovered capacities of certain triplet repeat sequences to cause gross chromosomal rearrangements are discussed.
Collapse
Affiliation(s)
- Robert D Wells
- Center for Genome Research, Institute of Biosciences and Technology, Texas A&M University System Health Science Center, Texas Medical Center, 2121 W. Holcombe Blvd, Houston, TX 77030, USA.
| | | | | | | | | |
Collapse
|
26
|
Lin Y, Dion V, Wilson JH. A novel selectable system for detecting expansion of CAG.CTG repeats in mammalian cells. Mutat Res 2005; 572:123-31. [PMID: 15790495 DOI: 10.1016/j.mrfmmm.2005.01.013] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2004] [Revised: 01/05/2005] [Accepted: 01/06/2005] [Indexed: 11/17/2022]
Abstract
CAG.CTG repeat expansions cause more than a dozen neurodegenerative diseases in humans. To define the mechanism of repeat instability in mammalian cells we developed a selectable assay to detect expansions of CAG.CTG triplet repeats in Chinese hamster ovary (CHO) cells. We showed previously that long tracts of CAG.CTG repeats, embedded in an intron of the APRT gene, kill expression of the gene, rendering the cells APRT-. By contrast, tracts with fewer than 34 repeats allow sufficient expression to give APRT+ cells. Although it should be possible to use APRT+ cells with short repeats to assay for expansion events by selecting for APRT- cells, we find that APRT+ cells with 31 repeats are not killed by the standard APRT- selection protocol, most likely because they produce too little Aprt to incorporate sufficient 8-azaadenine into their adenine pool. To overcome this problem, we devised a new selection, which increases the proportion of the adenine pool contributed by the salvage pathway by partially inhibiting the de novo pathway. We show that APRT- CHO cells with 61 or 95 CAG.CTG repeats survive this selection, whereas cells with 31 repeats die. Using this selection system, we can select for expansion to as few as 39 repeats. Thus, this assay can monitor expansions across the critical boundary from the longest lengths of normal alleles to the shortest lengths of disease alleles.
Collapse
Affiliation(s)
- Yunfu Lin
- Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA
| | | | | |
Collapse
|
27
|
Wojciechowska M, Bacolla A, Larson JE, Wells RD. The Myotonic Dystrophy Type 1 Triplet Repeat Sequence Induces Gross Deletions and Inversions. J Biol Chem 2005; 280:941-52. [PMID: 15489504 DOI: 10.1074/jbc.m410427200] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
The capacity of (CTG.CAG)n and (GAA.TTC)n repeat tracts in plasmids to induce mutations in DNA flanking regions was evaluated in Escherichia coli. Long repeats of these sequences are involved in the etiology of myotonic dystrophy type 1 and Friedreich's ataxia, respectively. Long (CTG.CAG)n (where n = 98 and 175) caused the deletion of most, or all, of the repeats and the flanking GFP gene. Deletions of 0.6-1.8 kbp were found as well as inversions. Shorter repeat tracts (where n = 0 or 17) were essentially inert, as observed for the (GAA.TTC)176-containing plasmid. The orientation of the triplet repeat sequence (TRS) relative to the unidirectional origin of replication had a pronounced effect, signaling the participation of replication and/or repair systems. Also, when the TRS was transcribed, the level of deletions was greatly elevated. Under certain conditions, 30-50% of the products contained gross deletions. DNA sequence analyses of the breakpoint junctions in 47 deletions revealed the presence of 1-8-bp direct or inverted homologies in all cases. Also, the presence of non-B folded conformations (i.e. slipped structures, cruciforms, or triplexes) at or near the breakpoints was predicted in all cases. This genetic behavior, which was previously unrecognized for a TRS, may provide the basis for a new type of instability of the myotonic dystrophy protein kinase (DMPK) gene in patients with a full mutation.
Collapse
Affiliation(s)
- Marzena Wojciechowska
- Center for Genome Research Institute of Biosciences and Technology, Texas A & M University System Health Science Center, Texas Medical Center, Houston, Texas 77030, USA
| | | | | | | |
Collapse
|
28
|
Hashem VI, Pytlos MJ, Klysik EA, Tsuji K, Khajavi M, Khajav M, Ashizawa T, Sinden RR. Chemotherapeutic deletion of CTG repeats in lymphoblast cells from DM1 patients. Nucleic Acids Res 2004; 32:6334-46. [PMID: 15576360 PMCID: PMC535684 DOI: 10.1093/nar/gkh976] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023] Open
Abstract
Myotonic dystrophy type 1 (DM1) is caused by the expansion of a (CTG).(CAG) repeat in the DMPK gene on chromosome 19q13.3. At least 17 neurological diseases have similar genetic mutations, the expansion of DNA repeats. In most of these disorders, the disease severity is related to the length of the repeat expansion, and in DM1 the expanded repeat undergoes further elongation in somatic and germline tissues. At present, in this class of diseases, no therapeutic approach exists to prevent or slow the repeat expansion and thereby reduce disease severity or delay disease onset. We present initial results testing the hypothesis that repeat deletion may be mediated by various chemotherapeutic agents. Three lymphoblast cell lines derived from two DM1 patients treated with either ethylmethanesulfonate (EMS), mitomycin C, mitoxantrone or doxorubicin, at therapeutic concentrations, accumulated deletions following treatment. Treatment with EMS frequently prevented the repeat expansion observed during growth in culture. A significant reduction of CTG repeat length by 100-350 (CTG).(CAG) repeats often occurred in the cell population following treatment with these drugs. Potential mechanisms of drug-induced deletion are presented.
Collapse
Affiliation(s)
- Vera I Hashem
- Center for Genome Research, Institute of Biosciences and Technology, Texas A&M University System Health Sciences Center, 2121 West Holcombe Boulevard, Houston, TX 77030-3303, USA
| | | | | | | | | | | | | | | |
Collapse
|
29
|
Dere R, Napierala M, Ranum LPW, Wells RD. Hairpin Structure-forming Propensity of the (CCTG·CAGG) Tetranucleotide Repeats Contributes to the Genetic Instability Associated with Myotonic Dystrophy Type 2. J Biol Chem 2004; 279:41715-26. [PMID: 15292165 DOI: 10.1074/jbc.m406415200] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
The genetic instabilities of (CCTG.CAGG)(n) tetranucleotide repeats were investigated to evaluate the molecular mechanisms responsible for the massive expansions found in myotonic dystrophy type 2 (DM2) patients. DM2 is caused by an expansion of the repeat from the normal allele of 26 to as many as 11,000 repeats. Genetic expansions and deletions were monitored in an African green monkey kidney cell culture system (COS-7 cells) as a function of the length (30, 114, or 200 repeats), orientation, or proximity of the repeat tracts to the origin (SV40) of replication. As found for CTG.CAG repeats related to DM1, the instabilities were greater for the longer tetranucleotide repeat tracts. Also, the expansions and deletions predominated when cloned in orientation II (CAGG on the leading strand template) rather than I and when cloned proximal rather than distal to the replication origin. Biochemical studies on synthetic d(CAGG)(26) and d(CCTG)(26) as models of unpaired regions of the replication fork revealed that d(CAGG)(26) has a marked propensity to adopt a defined base paired hairpin structure, whereas the complementary d(CCTG)(26) lacks this capacity. The effect of orientation described above differs from all previous results with three triplet repeat sequences (including CTG.CAG), which are also involved in the etiologies of other hereditary neurological diseases. However, similar to the triplet repeat sequences, the ability of one of the two strands to form a more stable folded structure, in our case the CAGG strand, explains this unorthodox "reversed" behavior.
Collapse
Affiliation(s)
- Ruhee Dere
- Institute of Biosciences and Technology, Center for Genome Research, Texas A and M University System Health Science Center, Texas Medical Center, Houston, Texas 77030-3303, USA
| | | | | | | |
Collapse
|
30
|
Napierala M, Dere R, Vetcher A, Wells RD. Structure-dependent Recombination Hot Spot Activity of GAA·TTC Sequences from Intron 1 of the Friedreich's Ataxia Gene. J Biol Chem 2004; 279:6444-54. [PMID: 14625270 DOI: 10.1074/jbc.m309596200] [Citation(s) in RCA: 52] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open
Abstract
The recombinational properties of long GAA.TTC repeating sequences were analyzed in Escherichia coli to gain further insights into the molecular mechanisms of the genetic instability of this tract as possibly related to the etiology of Friedreich's ataxia. Intramolecular and intermolecular recombination studies showed that the frequency of recombination between the GAA.TTC tracts was as much as 15 times higher than the non-repeating control sequences. Homologous, intramolecular recombination between GAA.TTC tracts and GAAGGA.TCCTTC repeats also occurred with a very high frequency (approximately 0.8%). Biochemical analyses of the recombination products demonstrated the expansions and deletions of the GAA.TTC repeats. These results, together with our previous studies on the CTG.CAG sequences, suggest that the recombinational hot spot characteristics may be a common feature of all triplet repeat sequences. Unexpectedly, we found that the recombination properties of the GAA.TTC tracts were unique, compared with CTG.CAG repeats, because they depended on the DNA secondary structure polymorphism. Increasing the length of the GAA.TTC repeats decreased the intramolecular recombination frequency between these tracts. Also, a correlation was found between the propensity of the GAA.TTC tracts to adopt the sticky DNA conformation and the inhibition of intramolecular recombination. The use of novobiocin to modulate the intracellular DNA topology, i.e. the lowering of the negative superhelical density, repressed the formation of the sticky DNA structure, thereby restoring the expected positive correlation between the length of the GAA.TTC tracts and the frequency of intramolecular recombination. Hence, our results demonstrate that sticky DNA exists and functions in E. coli.
Collapse
Affiliation(s)
- Marek Napierala
- Institute of Biosciences and Technology, Center for Genome Research, Texas A&M University System Health Science Center, Texas Medical Center, Houston, Texas 77030-3303, USA
| | | | | | | |
Collapse
|