1
|
Xu P, Zhang J, Pan F, Mahn C, Roland C, Sagui C, Weninger K. Frustration Between Preferred States of Complementary Trinucleotide Repeat DNA Hairpins Anticorrelates with Expansion Disease Propensity. J Mol Biol 2023; 435:168086. [PMID: 37024008 PMCID: PMC10191799 DOI: 10.1016/j.jmb.2023.168086] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2022] [Revised: 03/21/2023] [Accepted: 03/30/2023] [Indexed: 04/08/2023]
Abstract
DNA trinucleotide repeat (TRs) expansion beyond a threshold often results in human neurodegenerative diseases. The mechanisms causing expansions remain unknown, although the tendency of TR ssDNA to self-associate into hairpins that slip along their length is widely presumed related. Here we apply single molecule FRET (smFRET) experiments and molecular dynamics simulations to determine conformational stabilities and slipping dynamics for CAG, CTG, GAC and GTC hairpins. Tetraloops are favored in CAG (89%), CTG (89%) and GTC (69%) while GAC favors triloops. We also determined that TTG interrupts near the loop in the CTG hairpin stabilize the hairpin against slipping. The different loop stabilities have implications for intermediate structures that may form when TR-containing duplex DNA opens. Opposing hairpins in the (CAG) ∙ (CTG) duplex would have matched stability whereas opposing hairpins in a (GAC) ∙ (GTC) duplex would have unmatched stability, introducing frustration in the (GAC) ∙ (GTC) opposing hairpins that could encourage their resolution to duplex DNA more rapidly than in (CAG) ∙ (CTG) structures. Given that the CAG and CTG TR can undergo large, disease-related expansion whereas the GAC and GTC sequences do not, these stability differences can inform and constrain models of expansion mechanisms of TR regions.
Collapse
Affiliation(s)
- Pengning Xu
- Department of Physics, North Carolina State University, Raleigh, NC 27695, USA. https://twitter.com/@XPengning
| | - Jiahui Zhang
- Department of Physics, North Carolina State University, Raleigh, NC 27695, USA
| | - Feng Pan
- Department of Physics, North Carolina State University, Raleigh, NC 27695, USA
| | - Chelsea Mahn
- Department of Physics, North Carolina State University, Raleigh, NC 27695, USA
| | - Christopher Roland
- Department of Physics, North Carolina State University, Raleigh, NC 27695, USA
| | - Celeste Sagui
- Department of Physics, North Carolina State University, Raleigh, NC 27695, USA.
| | - Keith Weninger
- Department of Physics, North Carolina State University, Raleigh, NC 27695, USA.
| |
Collapse
|
2
|
Mitsuhashi S, Frith MC, Matsumoto N. Genome-wide survey of tandem repeats by nanopore sequencing shows that disease-associated repeats are more polymorphic in the general population. BMC Med Genomics 2021; 14:17. [PMID: 33413375 PMCID: PMC7791882 DOI: 10.1186/s12920-020-00853-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2020] [Accepted: 12/08/2020] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND Tandem repeats are highly mutable and contribute to the development of human disease by a variety of mechanisms. It is difficult to predict which tandem repeats may cause a disease. One hypothesis is that changeable tandem repeats are the source of genetic diseases, because disease-causing repeats are polymorphic in healthy individuals. However, it is not clear whether disease-causing repeats are more polymorphic than other repeats. METHODS We performed a genome-wide survey of the millions of human tandem repeats using publicly available long read genome sequencing data from 21 humans. We measured tandem repeat copy number changes using tandem-genotypes. Length variation of known disease-associated repeats was compared to other repeat loci. RESULTS We found that known Mendelian disease-causing or disease-associated repeats, especially CAG and 5'UTR GGC repeats, are relatively long and polymorphic in the general population. We also show that repeat lengths of two disease-causing tandem repeats, in ATXN3 and GLS, are correlated with near-by GWAS SNP genotypes. CONCLUSIONS We provide a catalog of polymorphic tandem repeats across a variety of repeat unit lengths and sequences, from long read sequencing data. This method especially if used in genome wide association study, may indicate possible new candidates of pathogenic or biologically important tandem repeats in human genomes.
Collapse
Affiliation(s)
- Satomi Mitsuhashi
- Department of Human Genetics, Yokohama City University Graduate School of Medicine, Fukuura 3-9, Kanazawa-ku, Yokohama, 236-0004, Japan.
- Department of Genomic Function and Diversity, Medical Research Institute, Tokyo Medical and Dental University, M&D Tower 24F, 1-5-45 Yushima, Bunkyo-ku, Tokyo, 113-8510, Japan.
| | - Martin C Frith
- Artificial Intelligence Research Center, National Institute of Advanced Industrial Science and Technology (AIST), Tokyo, Japan
- Graduate School of Frontier Sciences, University of Tokyo, Chiba, Japan
- Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), AIST, Tokyo, Japan
| | - Naomichi Matsumoto
- Department of Human Genetics, Yokohama City University Graduate School of Medicine, Fukuura 3-9, Kanazawa-ku, Yokohama, 236-0004, Japan.
| |
Collapse
|
3
|
Robertson AL, Bottomley SP. A method for the incremental expansion of polyglutamine repeats in recombinant proteins. Methods Mol Biol 2013; 1017:73-83. [PMID: 23719908 DOI: 10.1007/978-1-62703-438-8_5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]
Abstract
The polyglutamine diseases are caused by the expansion of CAG repeats. A key step in understanding the disease mechanisms, at the DNA and protein level, is the ability to produce recombinant proteins with specific length glutamine tracts which is a time-consuming first step in setting up in vitro systems to study the effects of polyglutamine expansion. Described here is a PCR-based method for the amplification of CAG repeats, which we used to incrementally extend CAG length by 3-5 repeats per cycle. This method could be translated into various contexts where amplification of repeating elements is necessary.
Collapse
Affiliation(s)
- Amy L Robertson
- Department of Biochemistry and Molecular Biology, Monash University, Clayton, VIC, Australia
| | | |
Collapse
|
4
|
Robertson AL, Bate MA, Androulakis SG, Bottomley SP, Buckle AM. PolyQ: a database describing the sequence and domain context of polyglutamine repeats in proteins. Nucleic Acids Res 2010; 39:D272-6. [PMID: 21059684 PMCID: PMC3013692 DOI: 10.1093/nar/gkq1100] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022] Open
Abstract
The polyglutamine diseases are caused in part by a gain-of-function mechanism of neuronal toxicity involving protein conformational changes that result in the formation and deposition of β-sheet rich aggregates. Recent evidence suggests that the misfolding mechanism is context-dependent, and that properties of the host protein, including the domain architecture and location of the repeat tract, can modulate aggregation. In order to allow the bioinformatic investigation of the context of polyglutamines, we have constructed a database, PolyQ (http://pxgrid.med.monash.edu.au/polyq). We have collected the sequences of all human proteins containing runs of seven or more glutamine residues and annotated their sequences with domain information. PolyQ can be interrogated such that the sequence context of polyglutamine repeats in disease and non-disease associated proteins can be investigated.
Collapse
Affiliation(s)
- Amy L Robertson
- Department of Biochemistry and Molecular Biology, Faculty of Medicine, Nursing and Health Sciences, School of Biomedical Sciences, Monash University, Clayton, Victoria 3800, Australia
| | | | | | | | | |
Collapse
|
5
|
Mochmann LH, Wells RD. Transcription influences the types of deletion and expansion products in an orientation-dependent manner from GAC*GTC repeats. Nucleic Acids Res 2004; 32:4469-79. [PMID: 15317871 PMCID: PMC516059 DOI: 10.1093/nar/gkh787] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
The genetic instability of (GAC*GTC)n (where n = 6-74) was investigated in an Escherichia coli-based plasmid system. Prior work implicated the instability of a (GAC*GTC)5 tract in the cartilage oligomeric matrix protein (COMP) gene to the 4, 6 or 7mers in the etiology of pseudoachondroplasia and multiple epiphyseal dysplasia. The effects of triplet repeat length and orientation were studied after multiple replication cycles in vivo. A transcribed plasmid containing (GAC*GTC)49 repeats led to large deletions (>3 repeats) after propagation in E.coli; however, if transcription was silenced by the LacI(Q) repressor, small expansions and deletions (<3 repeats) predominated the mutation spectra. In contrast, propagation of similar length but opposing orientation (GTC*GAC)53 containing plasmid led to small instabilities that were unaffected by the repression of transcription. Thus, by inhibiting transcription, the genetic instability of (GAC*GTC)49 repeats did not significantly differ from the opposing orientation, (GTC*GAC)53. We postulate that small instabilities of GAC*GTC repeats are achieved through replicative slippage, whereas large deletion events are found when GAC*GTC repeats are transcribed. Herein, we report the first genetic study on GAC*GTC repeat instability describing two types of mutational patterns that can be partitioned by transcription modulation. Along with prior biophysical data, these results lay the initial groundwork for understanding the genetic processes responsible for triplet repeat mutations in the COMP gene.
Collapse
Affiliation(s)
- Liliana H Mochmann
- Institute of Biosciences and Technology, Center for Genome Research, Texas A&M University System Health Science Center, Texas Medical Center, 2121 W. Holcombe Blvd, Houston, TX 77030, USA
| | | |
Collapse
|
6
|
Oma Y, Kino Y, Sasagawa N, Ishiura S. Intracellular localization of homopolymeric amino acid-containing proteins expressed in mammalian cells. J Biol Chem 2004; 279:21217-22. [PMID: 14993218 DOI: 10.1074/jbc.m309887200] [Citation(s) in RCA: 53] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Many human proteins have homopolymeric amino acid (HPAA) tracts, which are involved in protein-protein interactions and also have intrinsic polymerization properties. Polyglutamine or polyalanine expansions cause several neurodegenerative diseases. To examine the properties of HPAAs, we expressed 20 kinds of 30-residue HPAA fused to the C terminus of yellow fluorescent protein in mammalian cells. Specific localization was observed depending on the HPAA. Polyarginine and polylysine aggregated in the nucleus. Polyalanine, polyhistidine, polyisoleucine, polyleucine, polymethionine, polyphenylalanine, polythreonine, polytryptophan, and polyvaline localized in the cytoplasm, and some of these HPAAs formed aggregate(s). Hydrophobic HPAAs such as polyisoleucine, polyleucine, polyphenylalanine, and polyvaline were found as one major aggregate or cumulus in the perinuclear region. Western blot analysis indicated that hydrophobic HPAA tracts appear to oligomerize and form high molecular weight complexes. These results indicate that hydrophobicity itself may trigger the oligomerization and aggregation of proteins when overexpressed in cells. Our experiments provide novel insights into the nature of the HPAAs that are often seen in human and other organisms.
Collapse
Affiliation(s)
- Yoko Oma
- Department of Life Sciences, Graduate School of Arts and Sciences, The University of Tokyo, Tokyo 153-8902, Japan
| | | | | | | |
Collapse
|
7
|
Veeraraghavan J, Rossi ML, Bambara RA. Analysis of DNA replication intermediates suggests mechanisms of repeat sequence expansion. J Biol Chem 2003; 278:42854-66. [PMID: 12902352 DOI: 10.1074/jbc.m305137200] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
We previously developed a system to investigate the mechanism of repeat sequence expansion during eukaryotic Okazaki fragment processing. Upstream and downstream primers were annealed to a complementary template to overlap across a CAG repeat region. Annealing by the competing primers lead to structural intermediates that ligated to expand the repeat segment. When an equal number of repeats overlapped on the upstream and downstream primers, a 2-fold expansion was expected, but no expansion occurred. We show here that such substrates do not expand irrespective of their repeat length. To reveal mechanism, we tested different hairpin loop intermediates expected to form and facilitate ligation. Substrates configured to form large loops in either the upstream or downstream primer alone allowed expansion. Large or small fixed position single loops allowed expansion when located at least six nucleotides up- or downstream of the nick. Fixed loops in both primers, simulating a double loop intermediate, allowed expansion as long as each loop was nine nucleotides from the nick. Thus, neither the double loop configuration required to form with equal length overlaps nor the large single loop configuration are fundamental structural impediments to expansion. We propose a model for the expansion mechanism based on the relative stabilities of single loop, double loop, hairpin, and flap intermediates that is consistent with the observed expansion efficiency of equal and unequal overlap substrates. The model suggests that the equilibrium concentration of double loop intermediates is so vanishingly small that they are not likely contributors to sequence expansion.
Collapse
Affiliation(s)
- Janaki Veeraraghavan
- Department of Biochemistry and Biophysics and the Cancer Center, University of Rochester School of Medicine and Dentistry, Rochester, New York 14624, USA
| | | | | |
Collapse
|
8
|
Liu Y, Bambara RA. Analysis of human flap endonuclease 1 mutants reveals a mechanism to prevent triplet repeat expansion. J Biol Chem 2003; 278:13728-39. [PMID: 12554738 DOI: 10.1074/jbc.m212061200] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023] Open
Abstract
Flap endonuclease 1 (FEN1), involved in the joining of Okazaki fragments, has been proposed to restrain DNA repeat sequence expansion, a process associated with aging and disease. Here we analyze properties of human FEN1 having mutations at two conserved glycines (G66S and G242D) causing defects in nuclease activity. Introduction of these mutants into yeast led to sequence expansions. Reconstituting triplet repeat expansion in vitro, we previously found that DNA ligase I promotes expansion, but FEN1 prevents the ligation that forms expanded products. Here we show that among the intermediates that could generate sequence expansion, a bubble is necessary for ligation to produce the expansion product. Severe exonuclease defects in the mutant FEN1 suggested that the inability to degrade bubbles exonucleolytically leads to expansion. However, even wild type FEN1 exonuclease cannot compete with DNA ligase I to degrade a bubble structure before it can be ligated. Instead, we propose that FEN1 suppresses sequence expansion by degrading flaps that equilibrate with bubbles, thereby reducing bubble concentration. In this way FEN1 employs endonuclease rather than exonuclease to prevent expansions. A model is presented describing the roles of DNA structure, DNA ligase I, and FEN1 in sequence expansion.
Collapse
Affiliation(s)
- Yuan Liu
- Department of Biochemistry and Biophysics, University of Rochester School of Medicine and Dentistry, New York 14642, USA
| | | |
Collapse
|
9
|
Parniewski P, Staczek P. Molecular mechanisms of TRS instability. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2003; 516:1-25. [PMID: 12611433 DOI: 10.1007/978-1-4615-0117-6_1] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/26/2023]
Abstract
To date several neurodegenerative disorders including myotonic dystrophy, Huntington's disease, Kennedy's disease, fragile X syndrome, spinocerebellar ataxias or Friedreich's ataxia have been linked to the expanding trinucleotide sequences. Although phenotypic features vary among these debilitating diseases, the structural abnormalities of the triplet repeat containing DNA sequences is the primary cause for all of these disorders. Expansions of the CAG repeat within coding regions of miscellaneous genes result in the synthesis of aberrant proteins containing enormously long polyglutamine stretches. Such proteins acquire toxic functions and/or may direct cells into the apoptotic cycle. On the other hand, massive expansions of various triplet repeats (i.e., CTG/CAG, CGG/CCG/, GAA/TTC) inside the noncoding regions lead to the silencing of transcription and therefore affect expression of the adjacent genes. The repetitive character of TRS allows stretches of such tracts to form slipped-stranded structures, self-complementary hairpins, triplexes or more complex configurations called "sticky DNA", which are not equally processed by some cellular mechanisms, as compared to random DNA. It is likely that the instability of the short TRS (below the threshold level) occurs due to the SILC pathway, which is driven by the DNA slippage. Accumulation of the short expansions leads to the disease premutation state where the MLC pathway becomes predominant. Independent of which mechanism is involved in the MLC pathway (replication, transcription, repair or recombination) the process of complementary strand synthesis is crucial for the TRS instability. Generally, dependent on the location of the tract which has higher potential to form secondary DNA structure, further processing of such tract may result in expansions (secondary structure formed at the newly synthesized strand) or deletions (structure present on the template strand). Analyses of molecular mechanisms of the TRS genetic instability using bacteria, yeast, cell lines and transgenic animals as models allowed the scientists to better understand the role of some major cellular processes in the development of neurodegenerative disorders in humans. However, it is necessary to remember that most of these investigations were focused on the involvement of each particular process separately. Much less of this work though was dedicated to the search for the interactions between such cellular systems that in effect could result in different rate of TRS expansions. Thus, more intensive studies are necessary in order to fully understand the phenomenon ofthe dynamic mutations leading to the human hereditary neurodegenerative diseases.
Collapse
Affiliation(s)
- Pawel Parniewski
- Centre for Microbiology and Virology, Polish Academy of Sciences, ul. Lodowa 106, 93-232 Lódz, Poland
| | | |
Collapse
|
10
|
Völker J, Makube N, Plum GE, Klump HH, Breslauer KJ. Conformational energetics of stable and metastable states formed by DNA triplet repeat oligonucleotides: implications for triplet expansion diseases. Proc Natl Acad Sci U S A 2002; 99:14700-5. [PMID: 12417759 PMCID: PMC137482 DOI: 10.1073/pnas.222519799] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
We have embedded the hexameric triplet repeats (CAG)(6) and (CTG)(6) between two (GC)(3) domains to produce two 30-mer hairpins with the sequences d[(GC)(3)(CAG)(6)(GC)(3)] and d[(GC)(3)(CTG)(6)(GC)(3)]. This construct reduces the conformational space available to these repetitive DNA sequences. We find that the (CAG)(6) and (CTG)(6) repeats form stable, ordered, single-stranded structures. These structures are stabilized at 62 degrees C by an average enthalpy per base of 1.38 kcal.mol(-1) for the CAG triplet and 2.87 kcal.mol(-1) for the CTG triplet, while being entropically destabilized by 3.50 cal.K(-1).mol(-1) for the CAG triplet and 7.6 cal.K(-1).mol(-1) for the CTG triplet. Remarkably, these values correspond, respectively, to 1/3 (for CAG) and 2/3 (for CTG) of the enthalpy and entropy per base values associated with Watson-Crick base pairs. We show that the presence of the loop structure kinetically inhibits duplex formation from the two complementary 30-mer hairpins, even though the duplex is the thermodynamically more stable state. Duplex formation, however, does occur at elevated temperatures. We propose that this thermally induced formation of a more stable duplex results from thermal disruption of the single-stranded order, thereby allowing the complementary domains to associate (perhaps via "kissing hairpins"). Our melting profiles show that, once duplex formation has occurred, the hairpin intermediate state cannot be reformed, consistent with our interpretation of kinetically trapped hairpin structures. The duplex formed by the two complementary oligonucleotides does not have any unusual optical or thermodynamic properties. By contrast, the very stable structures formed by the individual single-stranded triplet repeat sequences are thermally and thermodynamically unusual. We discuss this stable, triplet repeat, single-stranded structure and its interconversion with duplex in terms of triplet expansion diseases.
Collapse
Affiliation(s)
- J Völker
- Department of Chemistry and Chemical Biology, Rutgers, State University of New Jersey, 610 Taylor Road, Piscataway 08854, USA
| | | | | | | | | |
Collapse
|
11
|
Pinheiro P, Scarlett G, Rodger A, Rodger PM, Murray A, Brown T, Newbury SF, McClellan JA. Structures of CUG repeats in RNA. Potential implications for human genetic diseases. J Biol Chem 2002; 277:35183-90. [PMID: 12077125 DOI: 10.1074/jbc.m202235200] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Triplet repeats that cause human genetic diseases have been shown to exhibit unusual compact structures in DNA, and in this paper we show that similar structures exist in shorter "normal length" CNG RNA. CUG and control RNAs were made chemically and by in vitro transcription. We find that "normal" short CUG RNAs migrate anomalously fast on non-denaturing gels, compared with control oligos of similar base composition. By contrast, longer tracts approaching clinically relevant lengths appear to form higher order structures. The CD spectrum of shorter tracts is similar to triplex and pseudoknot nucleic acid structures and different from classical hairpin spectra. A model is outlined that enables the base stacking features of poly(r(G-C))(2).poly(r(U)) or poly(d(G-C))(2).poly(d(T)) triplexes to be achieved, even by a single 15-mer.
Collapse
Affiliation(s)
- Philip Pinheiro
- Biophysics Laboratories, School of Biological Sciences, University of Portsmouth, St. Michael's Building, White Swan Road, Portsmouth, PO1 2DT, United Kingdom
| | | | | | | | | | | | | | | |
Collapse
|
12
|
Lee Y, Oh MR, Kim CH, Hwang HZ, Kim JS, Song SM, Jin DK. A simple method for the detection of neurologic disorders associated with CAG repeat expansion using PCR-microtiter plate hybridization. J Biotechnol 2002; 95:215-23. [PMID: 12007862 DOI: 10.1016/s0168-1656(02)00024-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
A new screening method was developed for the detection of CAG expanded alleles in patients with hereditary ataxia using polymerase chain reaction-based microtiter plate hybridization (PCR-MPH). The system can be applied to detect pathologic alleles by hybridization with the immobilized (CAG)48 repeat probe derived from the unrelated gene 'ERDA1' except for the CAG repeats. We examined 10 individuals with SCA3, 10 with Huntington disease and 30 normal controls (31 controls for SCA3) using this method. The results showed that a clear discrimination was possible in all cases. We suggest that this system be made available for mass screening of patients with hereditary ataxia disorders. This report is the first to demonstrate that a PCR-MPH system can be successfully applied to DNA size differentiation in addition to base pair mismatches. Also, our design of the probe is unique in that the probe motif stem from the unrelated gene sequence and not from the synthetic oligonucleotides.
Collapse
Affiliation(s)
- Y Lee
- Department of Clinical Research Center, Samsung Biomedical Research Institute, Sungkyunkwan University, Samsung Medical Center, 50 Ilwon-dong, kangnam-ku, 135-710, Seoul, South Korea
| | | | | | | | | | | | | |
Collapse
|
13
|
Jithesh PV, Singh P, Joshi R. Molecular dynamics studies of trinucleotide repeat DNA involved in neurodegenerative disorders. J Biomol Struct Dyn 2001; 19:479-95. [PMID: 11790146 DOI: 10.1080/07391102.2001.10506756] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Abstract
Expansion of trinucleotide repeat DNA of the classes CAG-CTG, CGG-CCG and GAA-TTC are found to be associated with several neurodegenerative disorders. Different mechanisms have been attributed to the expansion of triplets, mainly involving the formation of alternate secondary structures by such repeats. This paper reports the molecular dynamics simulation of triplet repeat DNA sequences to study the basic structural features of DNA that are responsible for the formation of structures such as hairpins and slip-strand DNA leading to expansion. All the triplet repeat sequences studied were found to be more flexible compared to the control sequence unassociated with disease. Moreover, flexibility was found to be in the order CAG-CTG > CGG-CCG approximately GAA-TTC, the highly flexible CAG-CTG repeat being the most common cause of neurodegenerative disorders. In another simulation, a single G-C to T-A mutation at the 9th position of the CAG-CTG repeat exhibited a reduction in bending compared to the pure 15-mer CAG-CTG repeat. EPM1 dodecamer repeat associated with the pathogenesis of progressive myoclonus epilepsy was also simulated and showed flexible nature suggesting a similar expansion mechanism.
Collapse
Affiliation(s)
- P V Jithesh
- Scientific and Engineering Computing Group, Centre for Development of Advanced Computing, Pune University Campus, Pune, Maharashtra 411007, India
| | | | | |
Collapse
|
14
|
Rolfsmeier ML, Dixon MJ, Pessoa-Brandão L, Pelletier R, Miret JJ, Lahue RS. Cis-elements governing trinucleotide repeat instability in Saccharomyces cerevisiae. Genetics 2001; 157:1569-79. [PMID: 11290713 PMCID: PMC1461582 DOI: 10.1093/genetics/157.4.1569] [Citation(s) in RCA: 42] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Trinucleotide repeat (TNR) instability in humans is governed by unique cis-elements. One element is a threshold, or minimal repeat length, conferring frequent mutations. Since thresholds have not been directly demonstrated in model systems, their molecular nature remains uncertain. Another element is sequence specificity. Unstable TNR sequences are almost always CNG, whose hairpin-forming ability is thought to promote instability by inhibiting DNA repair. To understand these cis-elements further, TNR expansions and contractions were monitored by yeast genetic assays. A threshold of approximately 15--17 repeats was observed for CTG expansions and contractions, indicating that thresholds function in organisms besides humans. Mutants lacking the flap endonuclease Rad27p showed little change in the expansion threshold, suggesting that this element is not altered by the presence or absence of flap processing. CNG or GNC sequences yielded frequent mutations, whereas A-T rich sequences were substantially more stable. This sequence analysis further supports a hairpin-mediated mechanism of TNR instability. Expansions and contractions occurred at comparable rates for CTG tract lengths between 15 and 25 repeats, indicating that expansions can comprise a significant fraction of mutations in yeast. These results indicate that several unique cis-elements of human TNR instability are functional in yeast.
Collapse
Affiliation(s)
- M L Rolfsmeier
- Eppley Institute for Research in Cancer and Allied Diseases, University of Nebraska Medical Center, Omaha, Nebraska 68198-6805, USA
| | | | | | | | | | | |
Collapse
|
15
|
Schumacher S, Pinet I, Bichara M. Modulation of transcription reveals a new mechanism of triplet repeat instability in Escherichia coli. J Mol Biol 2001; 307:39-49. [PMID: 11243802 DOI: 10.1006/jmbi.2000.4489] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Many human hereditary disease genes are associated with the expansion of triplet repeat sequences. In Escherichia coli (CTG/CAG) triplet repeat sequences are unstable and we have developed a plasmid-based assay enabling us to observe and quantify both expansions and deletions. In this work, we have investigated the role of transcription on the instability of a (CTG/CAG) insert containing 64 repeats. Using this assay, we show that induction of transcription results in a significant increase in the frequency of long deletions and a reduction in the frequency of long expansions. On the other hand, overproduction of transcription repressor molecules leads to an increase in both expansions and deletions. In this latter case, we propose that the increased instability is due to the arrest of replication progression by the interaction of the repressor molecule with its cognate operator and subsequent generations of DNA strand breaks.
Collapse
Affiliation(s)
- S Schumacher
- Cancérogénèse et Mutagénèse Moléculaire et Structurale UPR 9003, CNRS, Pôle API, Boulevard Sébastien Brant, 67400 Strasbourg-Illkirch, France
| | | | | |
Collapse
|
16
|
Bowater RP, Wells RD. The intrinsically unstable life of DNA triplet repeats associated with human hereditary disorders. PROGRESS IN NUCLEIC ACID RESEARCH AND MOLECULAR BIOLOGY 2001; 66:159-202. [PMID: 11051764 DOI: 10.1016/s0079-6603(00)66029-4] [Citation(s) in RCA: 98] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Expansions of specific DNA triplet repeats are the cause of an increasing number of hereditary neurological disorders in humans. In some diseases, such as Huntington's and several spinocerebellar ataxias, the repetitive DNA sequences are translated into long tracts of the same amino acid (usually glutamine), which alters interactions with cellular constituents and leads to the development of disease. For other disorders, including common genetic disorders such as myotonic dystrophy and fragile X syndrome, the DNA repeat is located in noncoding regions of transcribed sequences and disease is probably caused by altered gene expression. In studies in lower organisms, mammalian cells, and transgenic mice, high frequencies of length changes (increases and decreases) occur in long DNA triplet repeats. These observations are similar to other types of repetitive DNA sequences, which also undergo frequent length changes at genomic loci. A variety of processes acting on DNA influence the genetic stability of DNA triplet repeats, including replication, recombination, repair, and transcription. It is not yet known how these different multienzyme systems interact to produce the genetic mutation of expanded repeats. In vitro studies have identified that DNA triplet repeats can adopt several unusual DNA structures, including hairpins, triplexes, quadruplexes, slipped structures, and highly flexible and writhed helices. The formation of stable unusual structures within the cell is likely to disturb DNA metabolism and be a critical intermediate in the molecular mechanism(s) leading to genetic instabilities of DNA repeats and, hence, to disease pathogenesis.
Collapse
Affiliation(s)
- R P Bowater
- Molecular Biology Sector, School of Biological Sciences, University of East Anglia, Norwich, United Kingdom
| | | |
Collapse
|
17
|
Abstract
At least eight neurodegenerative diseases result from expansions of polyglutamine tracts encoded by CAG trinucleotide repeats. Although polyglutamine diseases typically have onset after age 50 in humans, these diseases can be modeled in animals and in cell culture by using highly expanded repeats to accelerate the pathogenesis. Unfortunately, current methods for preparing recombinant constructs with large glutamine tracts either alter the coding region adjacent to the repeat or yield highly unstable pure CAG repeats. We have developed a technique for expanding repeats that results in a more stable mix of CAG and CAA glutamine codons. We expect this technique to allow rapid preparation of highly expand repeats suitable for stable animal and cell culture models for any of the polyglutamine repeat diseases.
Collapse
Affiliation(s)
- M F Peters
- Department of Psychiatry, The Johns Hopkins University School of Medicine, Baltimore, MD 21205-2196, USA
| | | |
Collapse
|
18
|
Abstract
The expansion of triplet repeat sequences (TRS) associated with hereditary neurological diseases is believed from prior studies to be due to DNA replication. This report demonstrates that the expansion of (CTG.CAG)(n) in vivo also occurs by homologous recombination as shown by biochemical and genetic studies. A two-plasmid recombination system was established in Escherichia coli with derivatives of pUC19 (harboring the ampicillin resistance gene) and pACYC184 (harboring the tetracycline resistance gene). The derivatives contained various triplet repeat inserts ((CTG.CAG), (CGG.CCG), (GAA.TTC), (GTC.GAC), and (GTG.CAC)) of different lengths, orientations, and extents of interruptions and a control non-repetitive sequence. The availability of the two drug resistance genes and of several unique restriction sites on the plasmids enabled rigorous genetic and biochemical analyses. The requirements for recombination at the TRS include repeat lengths >30, the presence of CTG.CAG on both plasmids, and recA and recBC. Sequence analyses on a number of DNA products isolated from individual colonies directly demonstrated the crossing-over and expansion of the homologous CTG.CAG regions. Furthermore, inversion products of the type [(CTG)(13)(CAG)(67)].[(CTG)(67)(CAG)(13)] were isolated as the apparent result of "illegitimate" recombination events on intrahelical pseudoknots. This work establishes the relationships between CTG.CAG sequences, multiple fold expansions, genetic recombination, formation of new recombinant DNA products, and the presence of both drug resistance genes. Thus, if these reactions occur in humans, unequal crossing-over or gene conversion may also contribute to the expansions responsible for anticipation associated with several hereditary neurological syndromes.
Collapse
Affiliation(s)
- J P Jakupciak
- Institute of Biosciences and Technology, Center for Genome Research, Texas A&M University, Texas Medical Center, Houston, Texas 77030, USA
| | | |
Collapse
|
19
|
Abstract
The sequences are related to evolution in several ways. First, they carry traces of a distant past. Two sequence features point to the earliest sequence organization. The universal hidden GCU-periodical pattern in mRNA suggests the earliest codons: GCU and its nine-point-change derivatives. They code for seven amino acids that by several criteria are also the oldest. Together it makes the earliest form of the triplet code, still recognizable in the extant sequences. Another feature present in the sequences, apparently, since separation of prokaryotes and eukaryotes, is hidden genome segmentation. Both protein-coding and noncoding sequences appear to have been formed by fusion of standard size units, about 360 bp (120 aa) in eukaryotes and 450 bp (150 aa) in prokaryotes. Presumably, the units have been functioning at some stage of evolution as autonomous single-gene size elements. There are sequence designs that promote evolution. One such design suitable for fast adaptation is the tandem repetition of identical sequences, so that their copy numbers in the repeat arrays would modulate (tune) the expression of nearby genes. The tandem repeat expansion diseases illustrate this mechanism in a dramatic way: overtuning of the respective gene expression leads to the disease.
Collapse
Affiliation(s)
- E N Trifonov
- Department of Structural Biology, Weizmann Institute of Science, Rehovot, Israel.
| |
Collapse
|
20
|
Grabczyk E, Usdin K. Generation of microgram quantities of trinucleotide repeat tracts of defined length, interspersion pattern, and orientation. Anal Biochem 1999; 267:241-3. [PMID: 9918680 DOI: 10.1006/abio.1998.2962] [Citation(s) in RCA: 20] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Affiliation(s)
- E Grabczyk
- Laboratory of Molecular and Cellular Biology, National Institute of Diabetes, Digestive, and Kidney Diseases, Bethesda, Maryland, 20892-0830, USA
| | | |
Collapse
|
21
|
Parniewski P, Bacolla A, Jaworski A, Wells RD. Nucleotide excision repair affects the stability of long transcribed (CTG*CAG) tracts in an orientation-dependent manner in Escherichia coli. Nucleic Acids Res 1999; 27:616-23. [PMID: 9862988 PMCID: PMC148223 DOI: 10.1093/nar/27.2.616] [Citation(s) in RCA: 76] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
The influence of nucleotide excision repair (NER), the principal in vivo repair system for DNA damages, was investigated in Escherichia coli with uvrA, uvrB and uvrAuvrB mutants with the triplet repeat sequences (TRS) involved in myotonic dystrophy, the fragile X syndrome and Friedreich's ataxia. (CTG*CAG)175was more stable when the (CTG) strand was transcribed than when the (CAG) strand was transcribed in the alternate orientation. A lack of the UvrA protein dramatically increases the instability of this TRS in vivo as compared with the stability of the same sequence in uvrB mutant, which produces an intact UvrA protein. We propose that transcription transiently dissociates the triplet repeat complementary strands enabling the non-transcribed strand to fold into a hairpin conformation which is then sufficiently stable that replication bypasses the hairpin to give large deletions. If the TRS was not transcribed, fewer deletions were observed. Alternatively, in the uvrA-mutant, the hairpins existing on the lagging strand will suffer bypass DNA synthesis to generate deleted molecules. Hence, NER, functionally similar in both prokaryotes and eukaryotes, is an important factor in the genetic instabilities of long transcribed TRS implicated in human hereditary neuro-logical diseases.
Collapse
Affiliation(s)
- P Parniewski
- Institute of Biosciences and Technology, Center for Genome Research, Department of Biochemistry and Biophysics, Texas A&M University, 2121 West Holcombe Boulevard, Houston, TX 77030-3303, USA
| | | | | | | |
Collapse
|
22
|
Morel P, Reverdy C, Michel B, Ehrlich SD, Cassuto E. The role of SOS and flap processing in microsatellite instability in Escherichia coli. Proc Natl Acad Sci U S A 1998; 95:10003-8. [PMID: 9707590 PMCID: PMC21451 DOI: 10.1073/pnas.95.17.10003] [Citation(s) in RCA: 39] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Mutations affecting mismatch repair result in elevated frequencies of microsatellite length alteration in prokaryotes and eukaryotes. However, the finding that microsatellite instability is found often in cells with a functional mismatch repair system prompted a search for other factors of tract alteration. In the present report, we show that, in Escherichia coli, poly(AC/TG) tracts are destabilized by mutations that induce SOS. These observations may have implications for eukaryotic cells because recent results suggest the existence of a mammalian SOS response analogous to that in prokaryotes. In addition, a defect in the 5'-3' exonuclease domain of DNA polymerase I, homologous to the mammalian FEN1 and the yeast RAD27 nucleases, leads to a marked increase in repeat expansions characteristic of several genetic disorders. Finally, we found that the combination of a proofreading defect with mismatch repair deficiency results in extreme microsatellite instability.
Collapse
Affiliation(s)
- P Morel
- Génétique Microbienne, Institut National de la Recherche Agronomique, Domaine de Vilvert, 78352 Jouy en Josas Cedex, France
| | | | | | | | | |
Collapse
|
23
|
Wells RD, Bacolla A, Bowater RP. Instabilities of triplet repeats: factors and mechanisms. Results Probl Cell Differ 1998; 21:133-65. [PMID: 9670316 DOI: 10.1007/978-3-540-69680-3_4] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Affiliation(s)
- R D Wells
- Institute of Biosciences and Technology, Texas A&M University, Texas Medical Center, Houston 77030-3303, USA
| | | | | |
Collapse
|
24
|
Ohshima K, Montermini L, Wells RD, Pandolfo M. Inhibitory effects of expanded GAA.TTC triplet repeats from intron I of the Friedreich ataxia gene on transcription and replication in vivo. J Biol Chem 1998; 273:14588-95. [PMID: 9603975 DOI: 10.1074/jbc.273.23.14588] [Citation(s) in RCA: 219] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Friedreich ataxia (FRDA) is associated with the expansion of a GAA. TTC triplet repeat in the first intron of the frataxin gene, resulting in reduced levels of frataxin mRNA and protein. To investigate the mechanisms by which the intronic expansion produces its effect, GAA.TTC repeats of various lengths (9 to 270 triplets) were cloned in both orientations in the intron of a reporter gene. Plasmids containing these repeats were transiently transfected into COS-7 cells. A length- and orientation-dependent inhibition of reporter gene expression was observed. RNase protection and Northern blot analyses showed very low levels of mature mRNA when longer GAA repeats were transcribed, with no accumulation of primary transcript. Replication of plasmids carrying long GAA.TTC tracts (approximately 250 triplets) was greatly inhibited in COS-7 cells compared with plasmids carrying (GAA.TTC)9 and (GAA.TTC)90. Replication inhibition was five times greater for the plasmid whose transcript contains (GAA)230 than for the plasmid whose transcript contains (UUC)270. Our in vivo investigation revealed that expanded GAA.TTC repeats from intron I of the FRDA gene inhibit transcription rather than post-transcriptional RNA processing and also interfere with replication. The molecular basis for these effects may be the formation of non-B DNA structures.
Collapse
Affiliation(s)
- K Ohshima
- Centre de Recherche Louis-Charles Simard, C.H.U.M., Campus Notre-Dame, 1560 rue Sherbrooke est, Montréal, Québec H2L 4M1, Canada
| | | | | | | |
Collapse
|
25
|
Abstract
mRNA sequences are known to carry a hidden periodical pattern (GCU)n, which may be considered a remnant of sequence organization of mRNA early in its evolution, dominated by codons for alanine and their point mutation derivatives. A similar pattern is characteristic of the master (consensus) tRNA sequence derived in 1981 by Eigen and Winkler-Oswatitsch. The master tRNA sequence is thought to represent one of the earliest mRNA. From analysis of literature and from our own calculations presented in this work, the (GCU)n pattern appears to be the most expandable in the norm and in disease. The speculation is put forward that (GCU)n and polyalanine have been key players at the beginning of the triplet code, and the first codons, apart from the GCU triplet, were point change derivatives of the generic triplet GCU, coding for amino acids present in the early prebiotic-biotic environment. The set of the earliest amino acids is derived on the basis of structural simplicity, presence in imitated prebiotic conditions and involvement with class II aminoacyl-tRNA synthetases. The set consists of six amino acids: Ala, Asp, Gly, Pro, Ser and Thr. All these amino acids are, indeed, encoded by the GCU triplet and its derivatives, as predicted. Thus, the pairs GCN (Ala), GAU (Asp), GGU (Gly), CCU (Pro), UCU (Ser) and ACU (Thr) can be viewed as an early triplet code.
Collapse
Affiliation(s)
- E N Trifonov
- Department of Structural Biology, The Weizmann Institute of Science, Rehovot, Israel.
| | | |
Collapse
|
26
|
Bowater RP, Jaworski A, Larson JE, Parniewski P, Wells RD. Transcription increases the deletion frequency of long CTG.CAG triplet repeats from plasmids in Escherichia coli. Nucleic Acids Res 1997; 25:2861-8. [PMID: 9207036 PMCID: PMC146811 DOI: 10.1093/nar/25.14.2861] [Citation(s) in RCA: 76] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
Induction of transcription into long CTG.CAG repeats contained on plasmids in Escherichia coli is shown to increase the frequency of deletions within the repeat sequences. This elevated genetic instability was detected because active transcription into the triplet repeat influenced the growth transitions of the host cell, allowing advantageous growth for cells harboring plasmids with deleted repeat sequences. The variety of deletion products observed in separate cultures suggests that transcription altered the metabolism of the DNA in a manner that produced random length changes in the repeat sequence. For cultures containing plasmids without active transcription into the triplet repeat, or those maintained in exponential growth, deletions occurred within the repeat at a lower frequency (5-20-fold lower). In these incubations the extent of deletions was proportional to the number of cell divisions and many repeat lengths were observed within each culture, suggesting that the decrease in average repeat length at long incubation times was due to multiple small deletions. These observations show that deletions within long CTG.CAG repeats contained on plasmids in E.coli occur via more than one pathway and their level of genetic instability is altered by the enzymatic processes occurring upon the DNA.
Collapse
Affiliation(s)
- R P Bowater
- Albert B. Alkek Institute of Biosciences and Technology, Texas A&M University, Center for Genome Research, Department of Biochemistry and Biophysics, Texas Medical Center, 2121 West Holcombe Boulevard, Houston, TX 77030, USA
| | | | | | | | | |
Collapse
|
27
|
Bacolla A, Gellibolian R, Shimizu M, Amirhaeri S, Kang S, Ohshima K, Larson JE, Harvey SC, Stollar BD, Wells RD. Flexible DNA: genetically unstable CTG.CAG and CGG.CCG from human hereditary neuromuscular disease genes. J Biol Chem 1997; 272:16783-92. [PMID: 9201983 DOI: 10.1074/jbc.272.27.16783] [Citation(s) in RCA: 91] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
The properties of duplex CTG.CAG and CGG.CCG, which are involved in the etiology of several hereditary neurodegenerative diseases, were investigated by a variety of methods, including circularization kinetics, apparent helical repeat determination, and polyacrylamide gel electrophoresis. The bending moduli were 1.13 x 10(-19) erg.cm for CTG and 1.27 x 10(-19) erg.cm for CGG, approximately 40% less than for random B-DNA. Also, the persistence lengths of the triplet repeat sequences were approximately 60% the value for random B-DNA. However, the torsional moduli and the helical repeats were 2.3 x 10(-19) erg.cm and 10.4 base pairs (bp)/turn for CTG and 2.4 x 10(-19) erg.cm and 10.3 bp/turn for CGG, respectively, all within the range for random B-DNA. Determination of the apparent helical repeat by the band shift assay indicated that the writhe of the repeats was different from that of random B-DNA. In addition, molecules of 224-245 bp in length (64-71 triplet repeats) were able to form topological isomers upon cyclization. The low bending moduli are consistent with predictions from crystallographic variations in slide, roll, and tilt. No unpaired bases or non-B-DNA structures could be detected by chemical and enzymatic probe analyses, two-dimensional agarose gel electrophoresis, and immunological studies. Hence, CTG and CGG are more flexible and highly writhed than random B-DNA and thus would be expected to act as sinks for the accumulation of superhelical density.
Collapse
Affiliation(s)
- A Bacolla
- Center for Genome Research, Institute of Biosciences and Technology, Texas A & M University, Texas Medical Center, 2121 Holcombe Blvd., Houston, Texas 77030, USA
| | | | | | | | | | | | | | | | | | | |
Collapse
|
28
|
Deissler H, Wilm M, Genç B, Schmitz B, Ternes T, Naumann F, Mann M, Doerfler W. Rapid protein sequencing by tandem mass spectrometry and cDNA cloning of p20-CGGBP. A novel protein that binds to the unstable triplet repeat 5'-d(CGG)n-3' in the human FMR1 gene. J Biol Chem 1997; 272:16761-8. [PMID: 9201980 DOI: 10.1074/jbc.272.27.16761] [Citation(s) in RCA: 38] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
The autonomous expansion of the unstable 5'-d(CGG)n-3' repeat in the 5'-untranslated region of the human FMR1 gene leads to the fragile X syndrome, one of the most frequent causes of mental retardation in human males. We have recently described the isolation of a protein p20-CGGBP that binds sequence-specifically to the double-stranded trinucleotide repeat 5'-d(CGG)-3' (Deissler, H., Behn-Krappa, A., and Doerfler, W. (1996) J. Biol. Chem. 271, 4327-4334). We demonstrate now that the p20-CGGBP can also bind to an interrupted repeat sequence. Peptide sequence tags of p20-CGGBP obtained by nanoelectrospray mass spectrometry were screened against an expressed sequence tag data base, retrieving a clone that contained the full-length coding sequence for p20-CGGBP. A bacterially expressed fusion protein p20-CGGBP-6xHis exhibits a binding pattern to the double-stranded 5'-d(CGG)n-3' repeat similar to that of the authentic p20-CGGBP. This novel protein lacks any overall homology to other known proteins but carries a putative nuclear localization signal. The p20-CGGBP gene is conserved among mammals but shows no homology to non-vertebrate species. The gene encoding the sequence for the new protein has been mapped to human chromosome 3.
Collapse
Affiliation(s)
- H Deissler
- Institut für Genetik, Universität zu Köln, D-50931 Köln, Federal Republic of Germany
| | | | | | | | | | | | | | | |
Collapse
|
29
|
Ohshima K, Wells RD. Hairpin formation during DNA synthesis primer realignment in vitro in triplet repeat sequences from human hereditary disease genes. J Biol Chem 1997; 272:16798-806. [PMID: 9201985 DOI: 10.1074/jbc.272.27.16798] [Citation(s) in RCA: 81] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
Genetic expansion of DNA triplet repeat sequences (TRS) found in neurogenetic disorders may be due to abnormal DNA replication. We have previously observed strong DNA synthesis pausings at specific loci within the long tracts (> approximately 70 repeats) of CTG.CAG and CGG.CCG as well as GTC.GAC by primer extensions in vitro using DNA polymerases (the Klenow fragment of Escherichia coli DNA polymerase I, the modified T7 DNA polymerase (Sequenase), and human DNA polymerase beta). Herein, we have isolated and analyzed the products of stalled synthesis found at approximately 30-40 triplets from the beginning of the TRS. DNA sequence analyses revealed that the stalled products contained short tracts of homogeneous TRS (6-12 repeats) in the middle of the sequence corresponding to the flanking region of the primer-template sequence. The sequence at the 3'-side terminated at the end of the primer, indicating that the primer molecule had served as a template. In addition, chemical probe and polyacrylamide gel electrophoretic analyses revealed that the stalled products existed in hairpin structures. We postulate that these products of the DNA polymerases are caused by the existence of an unusual DNA conformation(s) within the TRS, during the in vitro DNA synthesis, enhancing the DNA slippages and the hairpin formations in the TRS due to primer realignment. The consequence of these steps is DNA synthesis to the end of the primer and termination. Primer realignment including hairpin formation may play an important intermediate role in the replication of TRS in vivo to elicit genetic expansions.
Collapse
Affiliation(s)
- K Ohshima
- Center for Genome Research, Institute of Biosciences and Technology, Texas A & M University, Texas Medical Center, 2121 W. Holcombe Blvd., Houston, Texas 77030-3303, USA
| | | |
Collapse
|
30
|
Bulle F, Chiannilkulchai N, Pawlak A, Weissenbach J, Gyapay G, Guellaën G. Identification and chromosomal localization of human genes containing CAG/CTG repeats expressed in testis and brain. Genome Res 1997; 7:705-15. [PMID: 9253599 DOI: 10.1101/gr.7.7.705] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Human genes containing triplet repeats have been demonstrated to be involved in several neurodegenerative diseases by expansion of the repeat in succeeding generations. To identify novel genes involved in such pathologies, we have isolated transcripts containing (CAG/CTG)n repeats using two approaches. First, we screened 4 x 10(6) clones representing 10 copies of a human testis cDNA library using a (CAG)14 oligonucleotide probe. Among the 910 clones identified, the 243 clones with the strongest hybridization signal were sequenced partially from 3' or 5' ends. This provided us with 251 partial sequences that grouped into clusters corresponding to 39 genes, of which 19 represent unknown species. Second, we selected 203 additional ESTs containing (CAG/CTG)n repeats representing 121 clusters from the IMAGE consortium infant brain cDNA library. From these two series of sequences, we have localized 95 genes on human chromosomes using a panel of whole genome radiation hybrid (Genebridge 4). These genes are located on all of the chromosomes except for chromosome X, the highest density being observed on chromosome 19.
Collapse
Affiliation(s)
- F Bulle
- Institut National de la Santé et de la Recherche Médicale (INSERM), Unité 99, Hôpital Henri Mondor, Créteil, France.
| | | | | | | | | | | |
Collapse
|
31
|
Abstract
Triplet repeat expansion diseases (TREDs) are characterized by the coincidence of disease manifestation with amplification of d(CAG. CTG), d(CGG.CCG) or d(GAA.TTC) repeats contained within specific genes. Amplification of triplet repeats continues in offspring of affected individuals, which generally results in progressive severity of the disease and/or an earlier age of onset, phenomena clinically referred to as 'anticipation'. Recent biophysical and biochemical studies reveal that five of the six [d(CGG)n, d(CCG)n, (CAG)n, d(CTG)n and d(GAA)n] complementary sequences that are associated with human disease form stable hairpin structures. Although the triplet repeat sequences d(GAC)n and d(GTC)n also form hairpins, repeats of the double-stranded forms of these sequences are conspicuously absent from DNA sequence databases and are not anticipated to be associated with human disease. With the exception of d(GAG)n and d(GTG)n, the remaining triplet repeat sequences are unlikely to form hairpin structures at physiological salt and temperature. The details of hairpin structures containing trinucleotide repeats are summarized and discussed with respect to potential mechanisms of triplet repeat expansion and d(CGG.CCG) n methylation/demethylation.
Collapse
Affiliation(s)
- M Mitas
- Department of Biochemistry and Molecular Biology, Oklahoma State University, 246 Noble Research Center, Stillwater, OK 74078, USA.
| |
Collapse
|
32
|
Aoki T, Koch KS, Leffert HL. Attenuation of gene expression by a trinucleotide repeat-rich tract from the terminal exon of the rat hepatic polymeric immunoglobulin receptor gene. J Mol Biol 1997; 267:229-36. [PMID: 9096221 DOI: 10.1006/jmbi.1997.0890] [Citation(s) in RCA: 25] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
A 359 bp terminal exon fragment of the rat polymeric immunoglobulin receptor gene has been tested for biological effects. The fragment contains an S1 nuclease-sensitive microsatellite with d(GGA) and d(GAA) trinucleotide repeats that are expressed discordantly in the 3'UTRs of liver mRNAs encoded by the single copy gene. When human A293 cells are transfected with expression plasmids carrying this fragment in forward orientations, flanking or replacing poly(A) cassettes in the 3' ends of the transcription units, luciferase reporter gene expression is attenuated 47 to 59% or 98.5%, respectively. In contrast, when the fragment is tested similarly in reverse orientation, there is significantly less or no attenuation of gene expression. These observations, and computer models of partial triplet repeat DNA tertiary and RNA secondary structures, suggest that this fragment might regulate gene expression by orientation and position-dependent mechanisms at transcriptional and post-transcriptional levels.
Collapse
Affiliation(s)
- T Aoki
- Department of Biochemistry, Faculty of Pharmaceutical Sciences, Health Sciences University of Hokkaido, Ishikari-Tobetsu, Japan
| | | | | |
Collapse
|
33
|
Ohshima K, Kang S, Larson JE, Wells RD. TTA.TAA triplet repeats in plasmids form a non-H bonded structure. J Biol Chem 1996; 271:16784-91. [PMID: 8663378 DOI: 10.1074/jbc.271.28.16784] [Citation(s) in RCA: 38] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open
Abstract
CTG.CAG, CGG.CCG, and AAG.CTT triplet repeats proximal to or in disease genes expand by a non-Mendelian genetic process to cause several human hereditary syndromes. As part of our physical, biological, and genetic studies on the 10 possible triplet repeats, we discovered that the TTA.TAA repeat, isolated from the upstream region of the variant surface glycoprotein gene of Trypanosoma brucei, shows a propensity to adopt a non-H bonded structure under appropriate conditions. The other nine triplet repeat sequences do not exhibit this property. (TTA.TAA)n, where n = 90, 60, 30, and 18, cloned into pUC19 was studied by chemical and enzymatic probes as well as two-dimensional gel electrophoretic analyses under a variety of conditions. The helix opening was observed for all four inserts in supercoiled plasmids as a function of temperature, pH, metal ions, and buffer conditions using OsO4, diethyl pyrocarbonate, and chloroacetaldehyde probes. This unusual property of the TTA.TAA repeat suggests that it plays a different role from the other nine triplet repeats in gene expression.
Collapse
Affiliation(s)
- K Ohshima
- Department of Biochemistry and Biophysics, Texas A&M University, Texas Medical Center, Houston, Texas 77030-3303, USA
| | | | | | | |
Collapse
|
34
|
Ohshima K, Kang S, Larson JE, Wells RD. Cloning, characterization, and properties of seven triplet repeat DNA sequences. J Biol Chem 1996; 271:16773-83. [PMID: 8663377 DOI: 10.1074/jbc.271.28.16773] [Citation(s) in RCA: 82] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open
Abstract
Several neuromuscular and neurodegenerative diseases are caused by genetically unstable triplet repeat sequences (CTG.CAG, CGG.CCG, or AAG.CTT) in or near the responsible genes. We implemented novel cloning strategies with chemically synthesized oligonucleotides to clone seven of the triplet repeat sequences (GTA.TAC, GAT.ATC, GTT.AAC, CAC.GTG, AGG.CCT, TCG.CGA, and AAG.CTT), and the adjoining paper (Ohshima, K., Kang, S., Larson, J. E., and Wells, R. D.(1996) J. Biol. Chem. 271, 16784-16791) describes studies on TTA.TAA. This approach in conjunction with in vivo expansion studies in Escherichia coli enabled the preparation of at least 81 plasmids containing the repeat sequences with lengths of approximately 16 up to 158 triplets in both orientations with varying extents of polymorphisms. The inserts were characterized by DNA sequencing as well as DNA polymerase pausings, two-dimensional agarose gel electrophoresis, and chemical probe analyses to evaluate the capacity to adopt negative supercoil induced non-B DNA conformations. AAG.CTT and AGG.CCT form intramolecular triplexes, and the other five repeat sequences do not form any previously characterized non-B structures. However, long tracts of TCG.CGA showed strong inhibition of DNA synthesis at specific loci in the repeats as seen in the cases of CTG.CAG and CGG.CCG (Kang, S., Ohshima, K., Shimizu, M., Amirhaeri, S., and Wells, R. D.(1995) J. Biol. Chem. 270, 27014-27021). This work along with other studies (Wells, R. D.(1996) J. Biol. Chem. 271, 2875-2878) on CTG.CAG, CGG.CCG, and TTA.TAA makes available long inserts of all 10 triplet repeat sequences for a variety of physical, molecular biological, genetic, and medical investigations. A model to explain the reduction in mRNA abundance in Friedreich's ataxia based on intermolecular triplex formation is proposed.
Collapse
Affiliation(s)
- K Ohshima
- Department of Biochemistry and Biophysics, Texas A&M University, Texas Medical Center, Houston, Texas 77030-3303, USA
| | | | | | | |
Collapse
|
35
|
Affiliation(s)
- R D Wells
- Institute of Biosciences and Technology, Center for Genome Research, Department of Biochemistry and Biophysics, Texas A&M University, Houston, Texas 77030-3303, USA
| |
Collapse
|