1
|
Deshmukh AL, Porro A, Mohiuddin M, Lanni S, Panigrahi GB, Caron MC, Masson JY, Sartori AA, Pearson CE. FAN1, a DNA Repair Nuclease, as a Modifier of Repeat Expansion Disorders. J Huntingtons Dis 2021; 10:95-122. [PMID: 33579867 PMCID: PMC7990447 DOI: 10.3233/jhd-200448] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
FAN1 encodes a DNA repair nuclease. Genetic deficiencies, copy number variants, and single nucleotide variants of FAN1 have been linked to karyomegalic interstitial nephritis, 15q13.3 microdeletion/microduplication syndrome (autism, schizophrenia, and epilepsy), cancer, and most recently repeat expansion diseases. For seven CAG repeat expansion diseases (Huntington's disease (HD) and certain spinocerebellar ataxias), modification of age of onset is linked to variants of specific DNA repair proteins. FAN1 variants are the strongest modifiers. Non-coding disease-delaying FAN1 variants and coding disease-hastening variants (p.R507H and p.R377W) are known, where the former may lead to increased FAN1 levels and the latter have unknown effects upon FAN1 functions. Current thoughts are that ongoing repeat expansions in disease-vulnerable tissues, as individuals age, promote disease onset. Fan1 is required to suppress against high levels of ongoing somatic CAG and CGG repeat expansions in tissues of HD and FMR1 transgenic mice respectively, in addition to participating in DNA interstrand crosslink repair. FAN1 is also a modifier of autism, schizophrenia, and epilepsy. Coupled with the association of these diseases with repeat expansions, this suggests a common mechanism, by which FAN1 modifies repeat diseases. Yet how any of the FAN1 variants modify disease is unknown. Here, we review FAN1 variants, associated clinical effects, protein structure, and the enzyme's attributed functional roles. We highlight how variants may alter its activities in DNA damage response and/or repeat instability. A thorough awareness of the FAN1 gene and FAN1 protein functions will reveal if and how it may be targeted for clinical benefit.
Collapse
Affiliation(s)
- Amit L. Deshmukh
- Program of Genetics & Genome Biology, The Hospital for Sick Children, The Peter Gilgan Centre for Research and Learning, Toronto, Ontario, Canada
| | - Antonio Porro
- Institute of Molecular Cancer Research, University of Zurich, Zurich, Switzerland
| | - Mohiuddin Mohiuddin
- Program of Genetics & Genome Biology, The Hospital for Sick Children, The Peter Gilgan Centre for Research and Learning, Toronto, Ontario, Canada
| | - Stella Lanni
- Program of Genetics & Genome Biology, The Hospital for Sick Children, The Peter Gilgan Centre for Research and Learning, Toronto, Ontario, Canada
| | - Gagan B. Panigrahi
- Program of Genetics & Genome Biology, The Hospital for Sick Children, The Peter Gilgan Centre for Research and Learning, Toronto, Ontario, Canada
| | - Marie-Christine Caron
- Department of Molecular Biology, Medical Biochemistry and Pathology; Laval University Cancer Research Center, Québec City, Quebec, Canada
- Genome Stability Laboratory, CHU de Québec Research Center, HDQ Pavilion, Oncology Division, Québec City, Quebec, Canada
| | - Jean-Yves Masson
- Department of Molecular Biology, Medical Biochemistry and Pathology; Laval University Cancer Research Center, Québec City, Quebec, Canada
- Genome Stability Laboratory, CHU de Québec Research Center, HDQ Pavilion, Oncology Division, Québec City, Quebec, Canada
| | | | - Christopher E. Pearson
- Program of Genetics & Genome Biology, The Hospital for Sick Children, The Peter Gilgan Centre for Research and Learning, Toronto, Ontario, Canada
- University of Toronto, Program of Molecular Genetics, Toronto, Ontario, Canada
| |
Collapse
|
2
|
Abstract
Huntington's disease (HD) is caused by a CAG repeat expansion in the HTT gene. Repeat length can change over time, both in individual cells and between generations, and longer repeats may drive pathology. Cellular DNA repair systems have long been implicated in CAG repeat instability but recent genetic evidence from humans linking DNA repair variants to HD onset and progression has reignited interest in this area. The DNA damage response plays an essential role in maintaining genome stability, but may also license repeat expansions in the context of HD. In this chapter we summarize the methods developed to assay CAG repeat expansion/contraction in vitro and in cells, and review the DNA repair genes tested in mouse models of HD. While none of these systems is currently ideal, new technologies, such as long-read DNA sequencing, should improve the sensitivity of assays to assess the effects of DNA repair pathways in HD. Improved assays will be essential precursors to high-throughput testing of small molecules that can alter specific steps in DNA repair pathways and perhaps ameliorate expansion or enhance contraction of the HTT CAG repeat.
Collapse
|
3
|
Chromosomal directionality of DNA mismatch repair in Escherichia coli. Proc Natl Acad Sci U S A 2015; 112:9388-93. [PMID: 26170312 DOI: 10.1073/pnas.1505370112] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Defects in DNA mismatch repair (MMR) result in elevated mutagenesis and in cancer predisposition. This disease burden arises because MMR is required to correct errors made in the copying of DNA. MMR is bidirectional at the level of DNA strand polarity as it operates equally well in the 5' to 3' and the 3' to 5' directions. However, the directionality of MMR with respect to the chromosome, which comprises parental DNA strands of opposite polarity, has been unknown. Here, we show that MMR in Escherichia coli is unidirectional with respect to the chromosome. Our data demonstrate that, following the recognition of a 3-bp insertion-deletion loop mismatch, the MMR machinery searches for the first hemimethylated GATC site located on its origin-distal side, toward the replication fork, and that resection then proceeds back toward the mismatch and away from the replication fork. This study provides support for a tight coupling between MMR and DNA replication.
Collapse
|
4
|
Jackson A, Okely EA, Leach DRF. Expansion of CAG repeats in Escherichia coli is controlled by single-strand DNA exonucleases of both polarities. Genetics 2014; 198:509-17. [PMID: 25081568 PMCID: PMC4196609 DOI: 10.1534/genetics.114.168245] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The expansion of CAG·CTG repeat tracts is responsible for several neurodegenerative diseases, including Huntington disease and myotonic dystrophy. Understanding the molecular mechanism of CAG·CTG repeat tract expansion is therefore important if we are to develop medical interventions limiting expansion rates. Escherichia coli provides a simple and tractable model system to understand the fundamental properties of these DNA sequences, with the potential to suggest pathways that might be conserved in humans or to highlight differences in behavior that could signal the existence of human-specific factors affecting repeat array processing. We have addressed the genetics of CAG·CTG repeat expansion in E. coli and shown that these repeat arrays expand via an orientation-independent mechanism that contrasts with the orientation dependence of CAG·CTG repeat tract contraction. The helicase Rep contributes to the orientation dependence of repeat tract contraction and limits repeat tract expansion in both orientations. However, RuvAB-dependent fork reversal, which occurs in a rep mutant, is not responsible for the observed increase in expansions. The frequency of repeat tract expansion is controlled by both the 5'-3' exonuclease RecJ and the 3'-5' exonuclease ExoI, observations that suggest the importance of both 3'and 5' single-strand ends in the pathway of CAG·CTG repeat tract expansion. We discuss the relevance of our results to two competing models of repeat tract expansion.
Collapse
Affiliation(s)
- Adam Jackson
- Institute of Cell Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, EH9 3JR, United Kingdom
| | - Ewa A Okely
- Institute of Cell Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, EH9 3JR, United Kingdom
| | - David R F Leach
- Institute of Cell Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, EH9 3JR, United Kingdom
| |
Collapse
|
5
|
Structural studies of DNA end detection and resection in homologous recombination. Cold Spring Harb Perspect Biol 2014; 6:a017962. [PMID: 25081516 DOI: 10.1101/cshperspect.a017962] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
DNA double-strand breaks are repaired by two major pathways, homologous recombination or nonhomologous end joining. The commitment to one or the other pathway proceeds via different steps of resection of the DNA ends, which is controlled and executed by a set of DNA double-strand break sensors, endo- and exonucleases, helicases, and DNA damage response factors. The molecular choreography of the underlying protein machinery is beginning to emerge. In this review, we discuss the early steps of genetic recombination and double-strand break sensing with an emphasis on structural and molecular studies.
Collapse
|
6
|
Abstract
Bacterial genomes are remarkably stable from one generation to the next but are plastic on an evolutionary time scale, substantially shaped by horizontal gene transfer, genome rearrangement, and the activities of mobile DNA elements. This implies the existence of a delicate balance between the maintenance of genome stability and the tolerance of genome instability. In this review, we describe the specialized genetic elements and the endogenous processes that contribute to genome instability. We then discuss the consequences of genome instability at the physiological level, where cells have harnessed instability to mediate phase and antigenic variation, and at the evolutionary level, where horizontal gene transfer has played an important role. Indeed, this ability to share DNA sequences has played a major part in the evolution of life on Earth. The evolutionary plasticity of bacterial genomes, coupled with the vast numbers of bacteria on the planet, substantially limits our ability to control disease.
Collapse
|
7
|
Sloan DB, Moran NA. The evolution of genomic instability in the obligate endosymbionts of whiteflies. Genome Biol Evol 2013; 5:783-93. [PMID: 23542079 PMCID: PMC3673631 DOI: 10.1093/gbe/evt044] [Citation(s) in RCA: 50] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Many insects depend on ancient associations with intracellular bacteria to perform essential metabolic functions. These endosymbionts exhibit striking examples of convergence in genome architecture, including a high degree of structural stability that is not typical of their free-living counterparts. However, the recently sequenced genome of the obligate whitefly endosymbiont Portiera revealed features that distinguish it from other ancient insect associates, such as a low gene density and the presence of perfectly duplicated sequences. Here, we report the comparative analysis of Portiera genome sequences both within and between host species. In one whitefly lineage (Bemisia tabaci), we identify large-scale structural polymorphisms in the Portiera genome that exist even within individual insects. This variation is likely mediated by recombination across identical repeats that are maintained by gene conversion. The complete Portiera genome sequence from a distantly related whitefly host (Trialeurodes vaporarium) confirms a history of extensive genome rearrangement in this ancient endosymbiont. Using gene-order-based phylogenetic analysis, we show that the majority of rearrangements have occurred in the B. tabaci lineage, coinciding with an increase in the rate of nucleotide substitutions, a proliferation of short tandem repeats (microsatellites) in intergenic regions, and the loss of many widely conserved genes involved in DNA replication, recombination, and repair. These results indicate that the loss of recombinational machinery is unlikely to be the cause of the extreme structural conservation that is generally observed in obligate endosymbiont genomes and that large, repetitive intergenic regions are an important substrate for genomic rearrangements.
Collapse
Affiliation(s)
- Daniel B Sloan
- Department of Ecology and Evolutionary Biology, Yale University, USA.
| | | |
Collapse
|
8
|
Zhou K, Aertsen A, Michiels CW. The role of variable DNA tandem repeats in bacterial adaptation. FEMS Microbiol Rev 2013; 38:119-41. [PMID: 23927439 DOI: 10.1111/1574-6976.12036] [Citation(s) in RCA: 100] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2013] [Revised: 07/13/2013] [Accepted: 07/26/2013] [Indexed: 01/05/2023] Open
Abstract
DNA tandem repeats (TRs), also designated as satellite DNA, are inter- or intragenic nucleotide sequences that are repeated two or more times in a head-to-tail manner. Because TR tracts are prone to strand-slippage replication and recombination events that cause the TR copy number to increase or decrease, loci containing TRs are hypermutable. An increasing number of examples illustrate that bacteria can exploit this instability of TRs to reversibly shut down or modulate the function of specific genes, allowing them to adapt to changing environments on short evolutionary time scales without an increased overall mutation rate. In this review, we discuss the prevalence and distribution of inter- and intragenic TRs in bacteria and the mechanisms of their instability. In addition, we review evidence demonstrating a role of TR variations in bacterial adaptation strategies, ranging from immune evasion and tissue tropism to the modulation of environmental stress tolerance. Nevertheless, while bioinformatic analysis reveals that most bacterial genomes contain a few up to several dozens of intra- and intergenic TRs, only a small fraction of these have been functionally studied to date.
Collapse
Affiliation(s)
- Kai Zhou
- Department of Microbial and Molecular Systems (M²S), Faculty of Bioscience Engineering, Laboratory of Food Microbiology and Leuven Food Science and Nutrition Research Centre (LFoRCe), KU Leuven, Leuven, Belgium
| | | | | |
Collapse
|
9
|
Liu G, Leffak M. Instability of (CTG)n•(CAG)n trinucleotide repeats and DNA synthesis. Cell Biosci 2012; 2:7. [PMID: 22369689 PMCID: PMC3310812 DOI: 10.1186/2045-3701-2-7] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2011] [Accepted: 02/27/2012] [Indexed: 12/21/2022] Open
Abstract
Expansion of (CTG)n•(CAG)n trinucleotide repeat (TNR) microsatellite sequences is the cause of more than a dozen human neurodegenerative diseases. (CTG)n and (CAG)n repeats form imperfectly base paired hairpins that tend to expand in vivo in a length-dependent manner. Yeast, mouse and human models confirm that (CTG)n•(CAG)n instability increases with repeat number, and implicate both DNA replication and DNA damage response mechanisms in (CTG)n•(CAG)n TNR expansion and contraction. Mutation and knockdown models that abrogate the expression of individual genes might also mask more subtle, cumulative effects of multiple additional pathways on (CTG)n•(CAG)n instability in whole animals. The identification of second site genetic modifiers may help to explain the variability of (CTG)n•(CAG)n TNR instability patterns between tissues and individuals, and offer opportunities for prognosis and treatment.
Collapse
Affiliation(s)
- Guoqi Liu
- Department of Biochemistry and Molecular Biology, Boonshoft School of Medicine, Wright State University, Dayton, OH 45435, USA.
| | | |
Collapse
|
10
|
DNA tandem repeat instability in the Escherichia coli chromosome is stimulated by mismatch repair at an adjacent CAG·CTG trinucleotide repeat. Proc Natl Acad Sci U S A 2010; 107:22582-6. [PMID: 21149728 DOI: 10.1073/pnas.1012906108] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023] Open
Abstract
Approximately half the human genome is composed of repetitive DNA sequences classified into microsatellites, minisatellites, tandem repeats, and dispersed repeats. These repetitive sequences have coevolved within the genome but little is known about their potential interactions. Trinucleotide repeats (TNRs) are a subclass of microsatellites that are implicated in human disease. Expansion of CAG·CTG TNRs is responsible for Huntington disease, myotonic dystrophy, and a number of spinocerebellar ataxias. In yeast DNA double-strand break (DSB) formation has been proposed to be associated with instability and chromosome fragility at these sites and replication fork reversal (RFR) to be involved either in promoting or in preventing instability. However, the molecular basis for chromosome fragility of repetitive DNA remains poorly understood. Here we show that a CAG·CTG TNR array stimulates instability at a 275-bp tandem repeat located 6.3 kb away on the Escherichia coli chromosome. Remarkably, this stimulation is independent of both DNA double-strand break repair (DSBR) and RFR but is dependent on a functional mismatch repair (MMR) system. Our results provide a demonstration, in a simple model system, that MMR at one type of repetitive DNA has the potential to influence the stability of another. Furthermore, the mechanism of this stimulation places a limit on the universality of DSBR or RFR models of instability and chromosome fragility at CAG·CTG TNR sequences. Instead, our data suggest that explanations of chromosome fragility should encompass the possibility of chromosome gaps formed during MMR.
Collapse
|
11
|
Herrmann D, Rose E, Müller U, Wagner R. Microarray-based STR genotyping using RecA-mediated ligation. Nucleic Acids Res 2010; 38:e172. [PMID: 20682559 PMCID: PMC2943619 DOI: 10.1093/nar/gkq657] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
We describe a novel assay capable of accurately determining the length of short tandem repeat (STR) alleles. STR genotyping is achieved utilizing RecA-mediated ligation (RML), which combines the high fidelity of RecA-mediated homology searching with allele-specific ligation. RecA catalyzes the pairing of synthetic oligonucleotides with one strand of a double-stranded DNA target, in this case a PCR amplicon. Ligation occurs only when two adjacent oligonucleotides are base paired to the STR region without any overlap or gap. RecA activity is required to overcome the inherent difficulty of annealing repeated sequences in register. This assay is capable of determining STR genotypes of human samples, is easily adapted to high throughput or automated systems and can have widespread utility in diagnostic and forensic applications.
Collapse
Affiliation(s)
- David Herrmann
- Gene Check Inc., Greeley, CO 80634 and UWILA International Consulting, Alachua, FL 32615, USA
| | | | | | | |
Collapse
|
12
|
Andreoni F, Darmon E, Poon WCK, Leach DRF. Overexpression of the single-stranded DNA-binding protein (SSB) stabilises CAG*CTG triplet repeats in an orientation dependent manner. FEBS Lett 2010; 584:153-8. [PMID: 19925793 DOI: 10.1016/j.febslet.2009.11.042] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2009] [Accepted: 11/09/2009] [Indexed: 11/18/2022]
Abstract
The stability and deletion-size-distribution profiles of leading strand (CAG)(75) and (CTG)(137) trinucleotide repeat arrays inserted in the Escherichia coli chromosome were investigated upon overexpression of the single-stranded DNA-binding protein (SSB) and in mutant strains deficient for the SbcCD (Rad51/Mre11) nuclease. SSB overexpression increases the stability of the (CAG)(75) repeat array and leads to a loss of the bias towards large deletions for the same array. Furthermore, the absence of SbcCD leads to a reduction in the number of large deletions in strains containing the (CTG)(137) repeat array.
Collapse
Affiliation(s)
- Federica Andreoni
- Institute of Cell Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, UK
| | | | | | | |
Collapse
|
13
|
SRS2 and SGS1 prevent chromosomal breaks and stabilize triplet repeats by restraining recombination. Nat Struct Mol Biol 2009; 16:159-67. [PMID: 19136956 DOI: 10.1038/nsmb.1544] [Citation(s) in RCA: 81] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2008] [Accepted: 12/04/2008] [Indexed: 01/30/2023]
Abstract
Several molecular mechanisms have been proposed to explain trinucleotide repeat expansions. Here we show that in yeast srs2Delta cells, CTG repeats undergo both expansions and contractions, and they show increased chromosomal fragility. Deletion of RAD52 or RAD51 suppresses these phenotypes, suggesting that recombination triggers trinucleotide repeat instability in srs2Delta cells. In sgs1Delta cells, CTG repeats undergo contractions and increased fragility by a mechanism partially dependent on RAD52 and RAD51. Analysis of replication intermediates revealed abundant joint molecules at the CTG repeats during S phase. These molecules migrate similarly to reversed replication forks, and their presence is dependent on SRS2 and SGS1 but not RAD51. Our results suggest that Srs2 promotes fork reversal in repetitive sequences, preventing repeat instability and fragility. In the absence of Srs2 or Sgs1, DNA damage accumulates and is processed by homologous recombination, triggering repeat rearrangements.
Collapse
|
14
|
Richard GF, Kerrest A, Dujon B. Comparative genomics and molecular dynamics of DNA repeats in eukaryotes. Microbiol Mol Biol Rev 2008; 72:686-727. [PMID: 19052325 PMCID: PMC2593564 DOI: 10.1128/mmbr.00011-08] [Citation(s) in RCA: 334] [Impact Index Per Article: 20.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Repeated elements can be widely abundant in eukaryotic genomes, composing more than 50% of the human genome, for example. It is possible to classify repeated sequences into two large families, "tandem repeats" and "dispersed repeats." Each of these two families can be itself divided into subfamilies. Dispersed repeats contain transposons, tRNA genes, and gene paralogues, whereas tandem repeats contain gene tandems, ribosomal DNA repeat arrays, and satellite DNA, itself subdivided into satellites, minisatellites, and microsatellites. Remarkably, the molecular mechanisms that create and propagate dispersed and tandem repeats are specific to each class and usually do not overlap. In the present review, we have chosen in the first section to describe the nature and distribution of dispersed and tandem repeats in eukaryotic genomes in the light of complete (or nearly complete) available genome sequences. In the second part, we focus on the molecular mechanisms responsible for the fast evolution of two specific classes of tandem repeats: minisatellites and microsatellites. Given that a growing number of human neurological disorders involve the expansion of a particular class of microsatellites, called trinucleotide repeats, a large part of the recent experimental work on microsatellites has focused on these particular repeats, and thus we also review the current knowledge in this area. Finally, we propose a unified definition for mini- and microsatellites that takes into account their biological properties and try to point out new directions that should be explored in a near future on our road to understanding the genetics of repeated sequences.
Collapse
Affiliation(s)
- Guy-Franck Richard
- Institut Pasteur, Unité de Génétique Moléculaire des Levures, CNRS, URA2171, Université Pierre et Marie Curie, UFR927, 25 rue du Dr. Roux, F-75015, Paris, France.
| | | | | |
Collapse
|
15
|
Bacolla A, Larson JE, Collins JR, Li J, Milosavljevic A, Stenson PD, Cooper DN, Wells RD. Abundance and length of simple repeats in vertebrate genomes are determined by their structural properties. Genome Res 2008; 18:1545-53. [PMID: 18687880 DOI: 10.1101/gr.078303.108] [Citation(s) in RCA: 83] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Microsatellites are abundant in vertebrate genomes, but their sequence representation and length distributions vary greatly within each family of repeats (e.g., tetranucleotides). Biophysical studies of 82 synthetic single-stranded oligonucleotides comprising all tetra- and trinucleotide repeats revealed an inverse correlation between the stability of folded-back hairpin and quadruplex structures and the sequence representation for repeats > or =30 bp in length in nine vertebrate genomes. Alternatively, the predicted energies of base-stacking interactions correlated directly with the longest length distributions in vertebrate genomes. Genome-wide analyses indicated that unstable sequences, such as CAG:CTG and CCG:CGG, were over-represented in coding regions and that micro/minisatellites were recruited in genes involved in transcription and signaling pathways, particularly in the nervous system. Microsatellite instability (MSI) is a hallmark of cancer, and length polymorphism within genes can confer susceptibility to inherited disease. Sequences that manifest the highest MSI values also displayed the strongest base-stacking interactions; analyses of 62 tri- and tetranucleotide repeat-containing genes associated with human genetic disease revealed enrichments similar to those noted for micro/minisatellite-containing genes. We conclude that DNA structure and base-stacking determined the number and length distributions of microsatellite repeats in vertebrate genomes over evolutionary time and that micro/minisatellites have been recruited to participate in both gene and protein function.
Collapse
Affiliation(s)
- Albino Bacolla
- Institute of Biosciences and Technology, Center for Genome Research, Texas A&M University Health Science Center, Houston, Texas 77030, USA.
| | | | | | | | | | | | | | | |
Collapse
|
16
|
Delagoutte E, Goellner GM, Guo J, Baldacci G, McMurray CT. Single-stranded DNA-binding protein in vitro eliminates the orientation-dependent impediment to polymerase passage on CAG/CTG repeats. J Biol Chem 2008; 283:13341-56. [PMID: 18263578 DOI: 10.1074/jbc.m800153200] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Small insertions and deletions of trinucleotide repeats (TNRs) can occur by polymerase slippage and hairpin formation on either template or newly synthesized strands during replication. Although not predicted by a slippage model, deletions occur preferentially when 5'-CTG is in the lagging strand template and are highly favored over insertion events in rapidly replicating cells. The mechanism for the deletion bias and the orientation dependence of TNR instability is poorly understood. We report here that there is an orientation-dependent impediment to polymerase progression on 5'-CAG and 5'-CTG repeats that can be relieved by the binding of single-stranded DNA-binding protein. The block depends on the primary sequence of the TNR but does not correlate with the thermodynamic stability of hairpins. The orientation-dependent block of polymerase passage is the strongest when 5'-CAG is the template. We propose a "template-push" model in which the slow speed of DNA polymerase across the 5'-CAG leading strand template creates a threat to helicase-polymerase coupling. To prevent uncoupling, the TNR template is pushed out and by-passed. Hairpins do not cause the block, but appear to occur as a consequence of polymerase pass-over.
Collapse
Affiliation(s)
- Emmanuelle Delagoutte
- Génotoxicologie et Cycle Cellulaire, Institut Curie, CNRS, Université Paris-Sud 11, 91405 Orsay Cedex, France.
| | | | | | | | | |
Collapse
|
17
|
Abstract
Unstable repeats are associated with various types of cancer and have been implicated in more than 40 neurodegenerative disorders. Trinucleotide repeats are located in non-coding and coding regions of the genome. Studies of bacteria, yeast, mice and man have helped to unravel some features of the mechanism of trinucleotide expansion. Looped DNA structures comprising trinucleotide repeats are processed during replication and/or repair to generate deletions or expansions. Most in vivo data are consistent with a model in which expansion and deletion occur by different mechanisms. In mammals, microsatellite instability is complex and appears to be influenced by genetic, epigenetic and developmental factors.
Collapse
|