1
|
Wanford JJ, Green LR, Aidley J, Bayliss CD. Phasome analysis of pathogenic and commensal Neisseria species expands the known repertoire of phase variable genes, and highlights common adaptive strategies. PLoS One 2018; 13:e0196675. [PMID: 29763438 PMCID: PMC5953494 DOI: 10.1371/journal.pone.0196675] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2018] [Accepted: 04/17/2018] [Indexed: 12/02/2022] Open
Abstract
Pathogenic Neisseria are responsible for significantly higher levels of morbidity and mortality than their commensal relatives despite having similar genetic contents. Neisseria possess a disparate arsenal of surface determinants that facilitate host colonisation and evasion of the immune response during persistent carriage. Adaptation to rapid changes in these hostile host environments is enabled by phase variation (PV) involving high frequency, stochastic switches in expression of surface determinants. In this study, we analysed 89 complete and 79 partial genomes, from the NCBI and Neisseria PubMLST databases, representative of multiple pathogenic and commensal species of Neisseria using PhasomeIt, a new program that identifies putatively phase-variable genes and homology groups by the presence of simple sequence repeats (SSR). We detected a repertoire of 884 putative PV loci with maxima of 54 and 47 per genome in gonococcal and meningococcal isolates, respectively. Most commensal species encoded a lower number of PV genes (between 5 and 30) except N. lactamica wherein the potential for PV (36–82 loci) was higher, implying that PV is an adaptive mechanism for persistence in this species. We also characterised the repeat types and numbers in both pathogenic and commensal species. Conservation of SSR-mediated PV was frequently observed in outer membrane proteins or modifiers of outer membrane determinants. Intermittent and weak selection for evolution of SSR-mediated PV was suggested by poor conservation of tracts with novel PV genes often occurring in only one isolate. Finally, we describe core phasomes—the conserved repertoires of phase-variable genes—for each species that identify overlapping but distinctive adaptive strategies for the pathogenic and commensal members of the Neisseria genus.
Collapse
Affiliation(s)
- Joseph J. Wanford
- Department of Genetics and Genome Biology, University of Leicester, Leicestershire, United Kingdom
- * E-mail:
| | - Luke R. Green
- Department of Genetics and Genome Biology, University of Leicester, Leicestershire, United Kingdom
| | - Jack Aidley
- Department of Genetics and Genome Biology, University of Leicester, Leicestershire, United Kingdom
| | - Christopher D. Bayliss
- Department of Genetics and Genome Biology, University of Leicester, Leicestershire, United Kingdom
| |
Collapse
|
2
|
Liljegren MM, de Muinck EJ, Trosvik P. Microsatellite Length Scoring by Single Molecule Real Time Sequencing - Effects of Sequence Structure and PCR Regime. PLoS One 2016; 11:e0159232. [PMID: 27414800 PMCID: PMC4945053 DOI: 10.1371/journal.pone.0159232] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2016] [Accepted: 06/29/2016] [Indexed: 12/02/2022] Open
Abstract
Microsatellites are DNA sequences consisting of repeated, short (1–6 bp) sequence motifs that are highly mutable by enzymatic slippage during replication. Due to their high intrinsic variability, microsatellites have important applications in population genetics, forensics, genome mapping, as well as cancer diagnostics and prognosis. The current analytical standard for microsatellites is based on length scoring by high precision electrophoresis, but due to increasing efficiency next-generation sequencing techniques may provide a viable alternative. Here, we evaluated single molecule real time (SMRT) sequencing, implemented in the PacBio series of sequencing apparatuses, as a means of microsatellite length scoring. To this end we carried out multiplexed SMRT sequencing of plasmid-carried artificial microsatellites of varying structure under different pre-sequencing PCR regimes. For each repeat structure, reads corresponding to the target length dominated. We found that pre-sequencing amplification had large effects on scoring accuracy and error distribution relative to controls, but that the effects of the number of amplification cycles were generally weak. In line with expectations enzymatic slippage decreased proportionally with microsatellite repeat unit length and increased with repetition number. Finally, we determined directional mutation trends, showing that PCR and SMRT sequencing introduced consistent but opposing error patterns in contraction and expansion of the microsatellites on the repeat motif and single nucleotide level.
Collapse
Affiliation(s)
- Mikkel Meyn Liljegren
- Centre for Ecological and Evolutionary Synthesis, Dept. of Biosciences, University of Oslo, Oslo, Norway
| | - Eric Jacques de Muinck
- Centre for Ecological and Evolutionary Synthesis, Dept. of Biosciences, University of Oslo, Oslo, Norway
| | - Pål Trosvik
- Centre for Ecological and Evolutionary Synthesis, Dept. of Biosciences, University of Oslo, Oslo, Norway
- * E-mail:
| |
Collapse
|
3
|
Comparison of southern Chinese Han and Brazilian Caucasian mutation rates at autosomal short tandem repeat loci used in human forensic genetics. Int J Legal Med 2013; 128:1-9. [PMID: 23549598 DOI: 10.1007/s00414-013-0847-2] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2012] [Accepted: 03/05/2013] [Indexed: 10/27/2022]
Abstract
The short tandem repeat (STR) loci used in human genetic studies are characterized by having relatively high mutation rates. In particular, to ensure an appropriate evaluation of genetic evidence in parentage and forensic analyses, it is essential to have accurate estimates of the mutation rates associated with the commonly used autosomal and sex chromosome STR loci. Differences in STR mutation rates between different ethnic groups should also be determined. Mutation data from two laboratories working with different ethnic groups were extracted from many meiotic transmissions ascertained for 15 autosomal STR loci currently used in forensic routine. Forty-five thousand and eighty-five trios were checked for the biological consistency of maternity and paternity through the analysis of a minimum of 15 loci. Mutations were scored as paternal, maternal, or ambiguous according to the most parsimonious explanation for the inconsistency, using always the least requiring hypothesis in terms of number of repeat differences. The main findings are: (a) the overall mutation rate across the 15 loci was 9.78 × 10(-4) per gamete per generation (95% CI = 9.30 × 10(-4)-1.03 × 10(-3)), and with just 48 (out of 1,587) exceptions, all of the mutations were single-step; (b) repeat gains were more frequent than losses; (c) longer alleles were found to be more mutable; and (d) the mutation rates differ at some loci between the two ethnic groups. Large worldwide meiotic transmission datasets are still needed to measure allele-specific mutation rates at the STR loci consensually used in forensic genetics.
Collapse
|
4
|
Kumar P, Nagarajaram HA. A study on mutational dynamics of simple sequence repeats in relation to mismatch repair system in prokaryotic genomes. J Mol Evol 2012; 74:127-39. [PMID: 22415400 DOI: 10.1007/s00239-012-9491-6] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2011] [Accepted: 02/15/2012] [Indexed: 11/30/2022]
Abstract
Mutational bias toward expansion or contraction of simple sequence repeats (SSRs) is referred to as directionality of SSR evolution. In this communication, we report the mutational bias exhibited by mononucleotide SSRs occurring in the non-coding regions of several prokaryotic genomes. Our investigations revealed that the strains or species lacking mismatch repair (MMR) system generally show higher number of polymorphic SSRs than those species/strains having MMR system. An exception to this observation was seen in the mycobacterial genomes that are MMR deficient where only a few SSR tracts were seen with mutations. This low incidence of SSR mutations even in the MMR-deficient background could be attributed to the high fidelity of the DNA polymerases as a consequence of high generation time of the mycobacteria. MMR system-deficient species generally did not show any bias toward mononucleotide SSR expansions or contractions indicating a neutral evolution of SSRs in these species. The MMR-proficient species in which the observed mutations correspond to secondary mutations showed bias toward contraction of polymononucleotide tracts, perhaps, indicating low efficiency of MMR system to repair SSR-induced slippage errors on template strands. This bias toward deletion in the mononucleotide SSR tracts might be a probable reason behind scarcity for long poly A|T and G|C tracts in prokaryotic systems which are mostly MMR proficient. In conclusion, our study clearly demonstrates mutational dynamics of SSRs in relation to the presence/absence of MMR system in the prokaryotic system.
Collapse
Affiliation(s)
- Pankaj Kumar
- Laboratory of Computational Biology, Centre for DNA Fingerprinting and Diagnostics (CDFD), Tuljaguda Complex, 4-1-714, Mozamjahi Rd, Nampally, Hyderabad, 500 001, India
| | | |
Collapse
|
5
|
Kelkar YD, Strubczewski N, Hile SE, Chiaromonte F, Eckert KA, Makova KD. What is a microsatellite: a computational and experimental definition based upon repeat mutational behavior at A/T and GT/AC repeats. Genome Biol Evol 2010; 2:620-35. [PMID: 20668018 PMCID: PMC2940325 DOI: 10.1093/gbe/evq046] [Citation(s) in RCA: 86] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Microsatellites are abundant in eukaryotic genomes and have high rates of strand slippage-induced repeat number alterations. They are popular genetic markers, and their mutations are associated with numerous neurological diseases. However, the minimal number of repeats required to constitute a microsatellite has been debated, and a definition of a microsatellite that considers its mutational behavior has been lacking. To define a microsatellite, we investigated slippage dynamics for a range of repeat sizes, utilizing two approaches. Computationally, we assessed length polymorphism at repeat loci in ten ENCODE regions resequenced in four human populations, assuming that the occurrence of polymorphism reflects strand slippage rates. Experimentally, we determined the in vitro DNA polymerase-mediated strand slippage error rates as a function of repeat number. In both approaches, we compared strand slippage rates at tandem repeats with the background slippage rates. We observed two distinct modes of mutational behavior. At small repeat numbers, slippage rates were low and indistinguishable from background measurements. A marked transition in mutability was observed as the repeat array lengthened, such that slippage rates at large repeat numbers were significantly higher than the background rates. For both mononucleotide and dinucleotide microsatellites studied, the transition length corresponded to a similar number of nucleotides (approximately 10). Thus, microsatellite threshold is determined not by the presence/absence of strand slippage at repeats but by an abrupt alteration in slippage rates relative to background. These findings have implications for understanding microsatellite mutagenesis, standardization of genome-wide microsatellite analyses, and predicting polymorphism levels of individual microsatellite loci.
Collapse
|
6
|
Smirnov GB. Repeats in bacterial genome: Evolutionary considerations. MOLECULAR GENETICS, MICROBIOLOGY AND VIROLOGY 2010. [DOI: 10.3103/s0891416810020023] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
|
7
|
Campregher C, Scharl T, Nemeth M, Honeder C, Jascur T, Boland CR, Gasche C. The nucleotide composition of microsatellites impacts both replication fidelity and mismatch repair in human colorectal cells. Hum Mol Genet 2010; 19:2648-57. [PMID: 20421367 PMCID: PMC2883347 DOI: 10.1093/hmg/ddq175] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Microsatellite instability is a key mechanism of colon carcinogenesis. We have previously studied mutations within a (CA)13 microsatellite using an enhanced green fluorescent protein (EGFP)-based reporter assay that allows the distinction of replication errors and mismatch repair (MMR) activity. Here we utilize this assay to compare mutations of mono- and dinucleotide repeats in human colorectal cells. HCT116 and HCT116+chr3 cells were stably transfected with EGFP-based plasmids harboring A10, G10, G16, (CA)13 and (CA)26 repeats. EGFP-positive mutant fractions were quantitated by flow cytometry, mutation rates were calculated and the mutant spectrum was analyzed by cycle sequencing. EGFP fluorescence pattern changed with the microsatellite's nucleotide sequence and cell type and clonal variations were observed in mononucleotide repeats. Replication errors (as calculated in HCT116) at A10 repeats were 5–10-fold higher than in G10, G16 were 30-fold higher than G10 and (CA)26 were 10-fold higher than (CA)13. The mutation rates in hMLH1-proficient HCT116+chr3 were 30–230-fold lower than in HCT116. MMR was more efficient in G16 than in A10 clones leading to a higher stability of poly-G tracts. Mutation spectra revealed predominantly 1-unit deletions in A10, (CA)13 and G10 and 2-unit deletions or 1-unit insertion in (CA)26. These findings indicate that both replication fidelity and MMR are affected by the microsatellite's nucleotide composition.
Collapse
|
8
|
Chung H, Lopez CG, Holmstrom J, Young DJ, Lai JF, Ream-Robinson D, Carethers JM. Both microsatellite length and sequence context determine frameshift mutation rates in defective DNA mismatch repair. Hum Mol Genet 2010; 19:2638-47. [PMID: 20418486 DOI: 10.1093/hmg/ddq151] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
It is generally accepted that longer microsatellites mutate more frequently in defective DNA mismatch repair (MMR) than shorter microsatellites. Indeed, we have previously observed that the A10 microsatellite of transforming growth factor beta type II receptor (TGFBR2) frameshifts -1 bp at a faster rate than the A8 microsatellite of activin type II receptor (ACVR2), although both genes become frameshift-mutated in >80% of MMR-defective colorectal cancers. To experimentally determine the effect of microsatellite length upon frameshift mutation in gene-specific sequence contexts, we altered the microsatellite length within TGFBR2 exon 3 and ACVR2 exon 10, generating A7, A10 and A13 constructs. These constructs were cloned 1 bp out of frame of EGFP, allowing a -1 bp frameshift to drive EGFP expression, and stably transfected into MMR-deficient cells. Subsequent non-fluorescent cells were sorted, cultured for 7-35 days and harvested for EGFP analysis and DNA sequencing. Longer microsatellites within TGFBR2 and ACVR2 showed significantly higher mutation rates than shorter ones, with TGFBR2 A13, A10 and A7 frameshifts measured at 22.38x10(-4), 2.17x10(-4) and 0.13x10(-4), respectively. Surprisingly, shorter ACVR2 constructs showed three times higher mutation rates at A7 and A10 lengths than identical length TGFBR2 constructs but comparably lower at the A13 length, suggesting influences from both microsatellite length as well as the sequence context. Furthermore, the TGFBR2 A13 construct mutated into 33% A11 sequences (-2 bp) in addition to expected A12 (-1 bp), indicating that this construct undergoes continual subsequent frameshift mutation. These data demonstrate experimentally that both the length of a mononucleotide microsatellite and its sequence context influence mutation rate in defective DNA MMR.
Collapse
Affiliation(s)
- Heekyung Chung
- Department of Medicine, University of California, San Diego, CA, USA
| | | | | | | | | | | | | |
Collapse
|
9
|
Spampinato CP, Gomez RL, Galles C, Lario LD. From bacteria to plants: a compendium of mismatch repair assays. Mutat Res 2009; 682:110-28. [PMID: 19622396 DOI: 10.1016/j.mrrev.2009.07.001] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2009] [Revised: 06/16/2009] [Accepted: 07/13/2009] [Indexed: 10/20/2022]
Abstract
Mismatch repair (MMR) system maintains genome integrity by correcting mispaired or unpaired bases which have escaped the proofreading activity of DNA polymerases. The basic features of the pathway have been highly conserved throughout evolution, although the nature and number of the proteins involved in the mechanism vary from prokaryotes to eukaryotes and even between humans and plants. Cells deficient in MMR genes have been observed to display a mutator phenotype characterized by an increased rate in spontaneous mutation, instability of microsatellite sequences and illegitimate recombination between diverged DNA sequences. Studies of the mutator phenotype have demonstrated a critical role for the MMR system in mutation avoidance and genetic stability. Here, we briefly review our current knowledge of the MMR mechanism and then focus on the in vivo biochemical and genetic assays used to investigate the function of the MMR proteins in processing DNA mismatches generated during replication and mitotic recombination in Escherichia coli, Saccharomyces cerevisiae, Homo sapiens and Arabidopsis thaliana. An overview of the biochemical assays developed to study mismatch correction in vitro is also provided.
Collapse
Affiliation(s)
- Claudia P Spampinato
- Centro de Estudios Fotosintéticos y Bioquímicos, Facultad de Ciencias Bioquímicas y Farmacéuticas, Universidad Nacional de Rosario, Argentina.
| | | | | | | |
Collapse
|
10
|
Eckert KA, Hile SE. Every microsatellite is different: Intrinsic DNA features dictate mutagenesis of common microsatellites present in the human genome. Mol Carcinog 2009; 48:379-88. [PMID: 19306292 DOI: 10.1002/mc.20499] [Citation(s) in RCA: 81] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
Microsatellite sequences are ubiquitous in the human genome and are important regulators of genome function. Here, we examine the mutational mechanisms governing the stability of highly abundant mono-, di-, and tetranucleotide microsatellites. Microsatellite mutation rate estimates from pedigree analyses and experimental models range from a low of approximately 10(-6) to a high of approximately 10(-2) mutations per locus per generation. The vast majority of observed mutational variation can be attributed to features intrinsic to the allele itself, including motif size, length, and sequence composition. A greater than linear relationship between motif length and mutagenesis has been observed in several model systems. Motif sequence differences contribute up to 10-fold to the variation observed in human cell mutation rates. The major mechanism of microsatellite mutagenesis is strand slippage during DNA synthesis. DNA polymerases produce errors within microsatellites at a frequency that is 10- to 100-fold higher than the frequency of frameshifts in coding sequences. Motif sequence significantly affects both polymerase error rate and specificity, resulting in strand biases within complementary microsatellites. Importantly, polymerase errors within microsatellites include base substitutions, deletions, and complex mutations, all of which produced interrupted alleles from pure microsatellites. Postreplication mismatch repair efficiency is affected by microsatellite motif size and sequence, also contributing to the observed variation in microsatellite mutagenesis. Inhibition of DNA synthesis within common microsatellites is highly sequence-dependent, and is positively correlated with the production of errors. DNA secondary structure within common microsatellites can account for some DNA polymerase pause sites, and may be an important factor influencing mutational specificity.
Collapse
Affiliation(s)
- Kristin A Eckert
- Department of Pathology, The Jake Gittlen Cancer Research Foundation, The Pennsylvania State University College of Medicine, 500 University Drive, PA, USA
| | | |
Collapse
|
11
|
Shah SN, Eckert KA. Human postmeiotic segregation 2 exhibits biased repair at tetranucleotide microsatellite sequences. Cancer Res 2009; 69:1143-9. [PMID: 19155293 DOI: 10.1158/0008-5472.can-08-3499] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Abstract
The mismatch repair (MMR) system plays a major role in removing DNA polymerization errors, and loss of this pathway results in hereditary cancers characterized by microsatellite instability. We investigated microsatellite stability during DNA replication within human postmeiotic segregation 2 (hPMS2)-deficient and proficient human lymphoblastoid cell lines. Using a shuttle vector assay, we measured mutation rates at reporter cassettes containing defined mononucleotide, dinucleotide, and tetranucleotide microsatellite sequences. A mutator phenotype was observed in the hPMS2-deficient cell line. The mutation rate of vectors containing [G/C](10) or [GT/CA](10) alleles was elevated 20-fold to 40-fold in hPMS2-deficient cells, relative to an hPMS2-expressing cell line. We observed a 6-fold and 12-fold relative increase in mutation rate of [TTTC/AAAG](9) and [TTCC/AAGG](9) sequences, respectively, in hPMS2-deficient cells. Mutational specificity analyses suggested that repair by hPMS2 is biased. In the absence of hPMS2, a greater number of microsatellite expansion versus deletion mutations was observed, and expansion rates of the tetranucleotide alleles were similar. In the presence of hPMS2, we observed a 29-fold decrease in the [TTCC/AAGG](9) expansion rate but only a 6-fold decrease for the [TTTC/AAAG](9) allele. Our data indicate that hPMS2 is more protective of tetranucleotide expansions than deletions and that hPMS2 displays a sequence bias, wherein [TTCC/AAGG] sequences are stabilized to a greater extent than [TTTC/AAAG]. Our results allow for greater accuracy during identification of MMR defects by providing a mutational signature characteristic of hPMS2 defect. This study also provides clues to possible mechanisms of repair by hPMS2 in the context of the MMR system.
Collapse
Affiliation(s)
- Sandeep N Shah
- Department of Pathology, Gittlen Cancer Research Foundation and Intercollege Graduate Degree Program in Genetics, Pennsylvania State University College of Medicine, Hershey, Pennsylvania 17033, USA
| | | |
Collapse
|
12
|
Recombination frequency in plasmid DNA containing direct repeats—predictive correlation with repeat and intervening sequence length. Plasmid 2008; 60:159-65. [DOI: 10.1016/j.plasmid.2008.06.004] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2008] [Revised: 06/09/2008] [Accepted: 06/24/2008] [Indexed: 11/19/2022]
|
13
|
Kumar GA, Woodhall MR, Hood DW, Moxon ER, Bayliss CD. RecJ, ExoI and RecG are required for genome maintenance but not for generation of genetic diversity by repeat-mediated phase variation in Haemophilus influenzae. Mutat Res 2008; 640:46-53. [PMID: 18242643 DOI: 10.1016/j.mrfmmm.2007.12.002] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2007] [Revised: 10/25/2007] [Accepted: 12/11/2007] [Indexed: 05/25/2023]
Abstract
High levels of genetic diversity are generated in Haemophilus influenzae populations through DNA repeat-mediated phase variation and recombination with DNA fragments acquired by uptake from the external milieu. Conversely, multiple pathways for maintenance of the genome sequence are encoded in H. influenzae genomes. In Escherichia coli, mutations in single-stranded-DNA exonucleases destabilise tandem DNA repeats whilst inactivation of recG can stabilise repeat tracts. These enzymes also have varying effects on recombination. Deletion mutations were constructed in H. influenzae genes encoding homologs of ExoI, RecJ and RecG whilst ExoVII was refractory to mutation. Inactivation of RecJ and RecG, but not ExoI, increased sensitivity to irradiation with ultraviolet light. An increase in spontaneous mutation rate was not observed in single mutants but only when both RecJ and ExoI were mutated. None of the single- or double-mutations increased or decreased the rates of slippage in tetranucleotide repeat tracts. Furthermore, the exonuclease mutants did not exhibit significant defects in horizontal gene transfer. We conclude that RecJ, ExoI and RecG are required for maintenance of the H. influenzae genome but none of these enzymes influence the generation of genetic diversity through mutations in the tetranucleotide repeat tracts of this species.
Collapse
Affiliation(s)
- Gaurav A Kumar
- Molecular Infectious Diseases Group, Weatherall Institute of Molecular Medicine, John Radcliffe Hospital, Headington, Oxford OX3 9DS, UK
| | | | | | | | | |
Collapse
|
14
|
Xu Z, Gutierrez L, Hitchens M, Scherer S, Sater AK, Wells DE. Distribution of polymorphic and non-polymorphic microsatellite repeats in Xenopus tropicalis. Bioinform Biol Insights 2008; 2:157-69. [PMID: 19812773 PMCID: PMC2735965 DOI: 10.4137/bbi.s561] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
The results of our bioinformatics analysis have found over 91,000 di-, tri-, and tetranucleotide microsatellites in our survey of 25% of the X. tropicalis genome, suggesting there may be over 360,000 within the entire genome. Within the X. tropicalis genome, dinucleotide (78.7%) microsatellites vastly out numbered tri- and tetranucleotide microsatellites. Similarly, AT-rich repeats are overwhelmingly dominant. The four AT-only motifs (AT, AAT, AAAT, and AATT) account for 51,858 out of 91,304 microsatellites found. Individually, AT microsatellites were the most common repeat found, representing over half of all di-, tri-, and tetranucleotide microsatellites. This contrasts with data from other studies, which show that AC is the most frequent microsatellite in vertebrate genomes (Toth et al. 2000). In addition, we have determined the rate of polymorphism for 5,128 non-redundant microsatellites, embedded in unique sequences. Interestingly, this subgroup of microsatellites was determined to have significantly longer repeats than genomic microsatellites as a whole. In addition, microsatellite loci with tandem repeat lengths more than 30 bp exhibited a significantly higher degree of polymorphism than other loci. Pairwise comparisons show that tetranucleotide microsatellites have the highest polymorphic rates. In addition, AAT and ATC showed significant higher polymorphism than other trinucleotide microsatellites, while AGAT and AAAG were significantly more polymorphic than other tetranucleotide microsatellites.
Collapse
Affiliation(s)
- Zhenkang Xu
- Department of Biology and Biochemistry, University of Houston, Houston, TX 77204, USA
| | | | | | | | | | | |
Collapse
|
15
|
Boyer JC, Hawk JD, Stefanovic L, Farber RA. Sequence-dependent effect of interruptions on microsatellite mutation rate in mismatch repair-deficient human cells. Mutat Res 2007; 640:89-96. [PMID: 18242644 DOI: 10.1016/j.mrfmmm.2007.12.005] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2007] [Revised: 11/21/2007] [Accepted: 12/11/2007] [Indexed: 11/18/2022]
Abstract
Although microsatellite mutation rates generally increase with increasing length of the repeat tract, interruptions in a microsatellite may stabilize it. We have performed a direct analysis of the effect of microsatellite interruptions on mutation rate and spectrum in cultured mammalian cells. Two mononucleotide sequences (G(17) and A(17)) and a dinucleotide [(CA)(17)] were compared with interrupted repeats of the same size and with sequences of 8 repeat units. MMR-deficient (MMR(-)) cells were used for these studies to eliminate effects of this repair process. Mutation rates were determined by fluctuation analysis on cells containing a microsatellite sequence at the 5' end of an antibiotic-resistance gene; the vector carrying this sequence was integrated in the genome of the cells. In general, interrupted sequences had lower mutation rates than perfect ones of the same size, but the magnitude of the difference was dependent upon the sequence of the interrupting base(s). Some interrupted repeats had mutation rates that were lower than those of perfect sequences of the same length but similar to those of half the length. This suggests that interrupting bases effectively divide microsatellites into smaller repeat runs with mutational characteristics different from those of the corresponding full-length microsatellite. We conclude that interruptions decrease microsatellite mutation rate and influence the spectrum of frameshift mutations. The sequence of the interrupting base(s) determines the magnitude of the effect on mutation rate.
Collapse
Affiliation(s)
- Jayne C Boyer
- Department of Pathology and Laboratory Medicine, University of North Carolina at Chapel Hill, CB #7525, Chapel Hill, NC 27599, United States.
| | | | | | | |
Collapse
|
16
|
Hile SE, Eckert KA. DNA polymerase kappa produces interrupted mutations and displays polar pausing within mononucleotide microsatellite sequences. Nucleic Acids Res 2007; 36:688-96. [PMID: 18079151 PMCID: PMC2241860 DOI: 10.1093/nar/gkm1089] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open
Abstract
Microsatellites are ubiquitously present in eukaryotic genomes and are implicated as positive factors in evolution. At the nucleotide level, microsatellites undergo slippage events that alter allele length and base changes that interrupt the repetitive tract. We examined DNA polymerase errors within a [T]11 microsatellite using an in vitro assay that preferentially detects mutations other than unit changes. We observed that human DNA polymerase kappa (Pol κ) inserts dGMP and dCMP within the [T]11 mononucleotide repeat, producing an interrupted 12-bp allele. Polymerase β produced such interruptions at a lower frequency. These data demonstrate that DNA polymerases are capable of directly producing base interruptions within microsatellites. At the molecular level, expanded microsatellites have been implicated in DNA replication fork stalling. Using an in vitro primer extension assay, we observed sequence-specific synthesis termination by DNA polymerases within mononucleotides. Quantitatively, intense, polar pausing was observed for both pol κ and polymerase α-primase within a [T]11 allele. A mechanism is proposed in which pausing results from DNA bending within the duplex stem of the nascent DNA. Our data support the concept of a microsatellite life-cycle, and are consistent with the models in which DNA sequence or secondary structures contributes to non-uniform rates of replication fork progression.
Collapse
Affiliation(s)
- Suzanne E Hile
- Department of Pathology, Gittlen Cancer Research Foundation, The Pennsylvania State University College of Medicine, 500 University Drive, Hershey, PA 17033, USA
| | | |
Collapse
|
17
|
Kassai-Jáger E, Ortutay C, Tóth G, Vellai T, Gáspári Z. Distribution and evolution of short tandem repeats in closely related bacterial genomes. Gene 2007; 410:18-25. [PMID: 18191346 DOI: 10.1016/j.gene.2007.11.006] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2007] [Revised: 11/08/2007] [Accepted: 11/16/2007] [Indexed: 11/27/2022]
Abstract
Simultaneous identification and comparison of perfect and imperfect microsatellites within a genome is a valuable tool both to overcome the lack of a consensus definition of SSRs and to assess repeat history. Detailed analysis of the overall distribution of perfect and imperfect microsatellites in closely related bacterial taxa is expected to give new insight into the evolution of prokaryotic genomes. We have performed a genome-wide analysis of microsatellite distribution in four Escherichia coli and seven Chlamydial strains. Chlamydial strains generally have a higher density of SSRs and show greater intra-group differences of SSR distribution patterns than E. coli genomes. In most investigated genomes the distribution of the total lengths of matching perfect and imperfect trinucleotide repeats are highly similar, with the notable exception of C. muridarum. Closely related strains show more similar repeat distribution patterns than strains separated by a longer divergence time. The discrepancy between the preferred classes of perfect and imperfect repeats in C. muridarum implies accelerated evolution of SSRs in this particular strain. Our results suggest that microsatellites, although considerably less abundant than in eukaryotic genomes, may nevertheless play an important role in the evolution of prokaryotic genomes and several gene families.
Collapse
Affiliation(s)
- Edit Kassai-Jáger
- Department of Genetics, Eötvös Loránd University, Pázmány Péter sétány 1/C, H-1117 Budapest, Hungary
| | | | | | | | | |
Collapse
|
18
|
King MR, Vimr RP, Steenbergen SM, Spanjaard L, Plunkett G, Blattner FR, Vimr ER. Escherichia coli K1-specific bacteriophage CUS-3 distribution and function in phase-variable capsular polysialic acid O acetylation. J Bacteriol 2007; 189:6447-56. [PMID: 17601779 PMCID: PMC1951898 DOI: 10.1128/jb.00657-07] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Escherichia coli K1 is the leading cause of human neonatal sepsis and meningitis and is important in other clinical syndromes of both humans and domestic animals; in this strain the polysialic acid capsule (K1 antigen) functions by inhibiting innate immunity. Recent discovery of the phase-variable capsular O acetylation mechanism indicated that the O-acetyltransferase gene, neuO, is carried on a putative K1-specific prophage designated CUS-3 (E. L. Deszo, S. M. Steenbergen, D. I. Freedberg, and E. R. Vimr, Proc. Natl. Acad. Sci. USA 102:5564-5569, 2005). Here we describe the isolation and characterization of a CUS-3 derivative (CUS-3a), demonstrating its morphology, lysogenization of a sensitive host, and the distribution of CUS-3 among a collection of 111 different K1 strains. The 40,207-bp CUS-3 genome was annotated from the strain RS218 genomic DNA sequence, indicating that most of the 63 phage open reading frames have their closest homologues in one of seven different lambdoid phages. Translational fusion of a reporter lacZ fragment to the hypervariable poly-Psi domain facilitated measurement of phase variation frequencies, indicating no significant differences between switch rates or effects on rates of the methyl-directed mismatch repair system. PCR analysis of poly-Psi domain length indicated preferential loss or gain of single 5'-AAGACTC-3' nucleotide repeats. Analysis of a K1 strain previously reported as "locked on" indicated a poly-Psi region with the least number of heptad repeats compatible with in-frame neuO expression. The combined results establish CUS-3 as an active mobile contingency locus in E. coli K1, indicating its capacity to mediate population-wide capsule variation.
Collapse
Affiliation(s)
- Michael R King
- Laboratory of Sialobiology and Comparative Metabolomics, Department of Pathobiology, University of Illinois at Urbana-Champain, Urbana, IL 61802, USA
| | | | | | | | | | | | | |
Collapse
|
19
|
Weng X, Wang Z, Liu J, Kimura M, Black WC, Brennan PJ, Li H, Vissa VD. Identification and distribution of Mycobacterium leprae genotypes in a region of high leprosy prevalence in China: a 3-year molecular epidemiological study. J Clin Microbiol 2007; 45:1728-34. [PMID: 17428944 PMCID: PMC1933063 DOI: 10.1128/jcm.00018-07] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2007] [Revised: 02/05/2007] [Accepted: 03/28/2007] [Indexed: 11/20/2022] Open
Abstract
Multiple-locus variable-number tandem-repeat (VNTR) analysis (MLVA) has been proposed as a means of strain typing for tracking the transmission of leprosy. However, empirical data for a defined population are lacking. To this end, a study was initiated to assess the diversity and distribution of prevalent Mycobacterium leprae strains in Qiubei County, Yunnan Province, People's Republic of China, where the annual detection rate of leprosy is 10-fold higher than the national average rate. Sixty-eight newly diagnosed leprosy patients were included in the study. MLVA at eight M. leprae loci was applied using DNA extracts from skin biopsies. The number of alleles per locus ranged from 4 to 24, providing adequate strain discrimination. MLVA strain typing identified several clusters of patients whose M. leprae specimens shared similar VNTR profiles. Two of these clusters were comprised of patients who resided predominantly in the north and northwest parts of Qiubei County. Furthermore, it was found that multicase families are common in this county: 23 of the 68 patients were from 11 families. Intrafamilial VNTR profiles closely matched within six families, although they were different between the families. Moreover, VNTR patterns related to those found in some multicase families were also detected in patients in the same or adjacent townships, indicating the utility of VNTR strain typing to identify and detect short-range transmission events. Social contact through village markets is proposed as a means of transmission.
Collapse
Affiliation(s)
- Xiaoman Weng
- Beijing Friendship Hospital-Affiliate of Capital University of Medical Sciences, Beijing Tropical Medicine Research Institute, 95 Yong An Road, Beijing 100050, People's Republic of China
| | | | | | | | | | | | | | | |
Collapse
|
20
|
Jacob KD, Eckert KA. Escherichia coli DNA polymerase IV contributes to spontaneous mutagenesis at coding sequences but not microsatellite alleles. Mutat Res 2007; 619:93-103. [PMID: 17397877 PMCID: PMC2703455 DOI: 10.1016/j.mrfmmm.2007.02.007] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2006] [Revised: 02/16/2007] [Accepted: 02/26/2007] [Indexed: 11/21/2022]
Abstract
Slipped strand mispairing during DNA synthesis is one proposed mechanism for microsatellite or short tandem repeat (STR) mutation. However, the DNA polymerase(s) responsible for STR mutagenesis have not been determined. In this study, we investigated the effect of the Escherichia colidinB gene product (Pol IV) on mononucleotide and dinucleotide repeat stability, using an HSV-tk gene episomal reporter system for microsatellite mutations. For the control vector (HSV-tk gene only) we observed a statistically significant 3.5-fold lower median mutation frequency in dinB(-) than dinB(+) cells (p<0.001, Wilcoxon Mann Whitney Test). For vectors containing an in-frame mononucleotide allele ([G/C](10)) or either of two dinucleotide alleles ([GT/CA](10) and [TC/AG](11)) we observed no statistically significant difference in the overall HSV-tk mutation frequency observed between dinB(+) and dinB(-) strains. To determine if a mutational bias exists for mutations made by Pol IV, mutational spectra were generated for each STR vector and strain. No statistically significant differences between strains were observed for either the proportion of mutational events at the STR or STR specificity among the three vectors. However, the specificity of mutational events at the STR alleles in each strain varied in a statistically significant manner as a consequence of microsatellite sequence. Our results indicate that while Pol IV contributes to spontaneous mutations within the HSV-tk coding sequence, Pol IV does not play a significant role in spontaneous mutagenesis at [G/C](10), [GT/CA](10), or [TC/AG](11) microsatellite alleles. Our data demonstrate that in a wild type genetic background, the major factor influencing microsatellite mutagenesis is the allelic sequence composition.
Collapse
Affiliation(s)
| | - Kristin A. Eckert
- Corresponding Author Information: 500 University Drive, H059 – Gittlen Cancer Research Foundation, Hershey, PA 17033, Phone: (717) 531-4056, Fax: (717) 531-5634, E-mail:
| |
Collapse
|
21
|
Buschiazzo E, Gemmell NJ. The rise, fall and renaissance of microsatellites in eukaryotic genomes. Bioessays 2006; 28:1040-50. [PMID: 16998838 DOI: 10.1002/bies.20470] [Citation(s) in RCA: 190] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Microsatellites are among the most versatile of genetic markers, being used in an impressive number of biological applications. However, the evolutionary dynamics of these markers remain a source of contention. Almost 20 years after the discovery of these ubiquitous simple sequences, new genomic data are clarifying our understanding of the structure, distribution and variability of microsatellites in genomes, especially for the eukaryotes. While these new data provide a great deal of descriptive information about the nature and abundance of microsatellite sequences within eukaryotic genomes, there have been few attempts to synthesise this information to develop a global concept of evolution. This review provides an up-to-date account of the mutational processes, biases and constraints believed to be involved in the evolution of microsatellites, particularly with respect to the creation and degeneration of microsatellites, which we assert may be broadly viewed as a life cycle. In addition, we identify areas of contention that require further research and propose some possible directions for future investigation.
Collapse
Affiliation(s)
- Emmanuel Buschiazzo
- School of Biological Sciences, University of Canterbury, Christchurch, New Zealand.
| | | |
Collapse
|
22
|
Noller AC, McEllistrem MC, Shutt KA, Harrison LH. Locus-specific mutational events in a multilocus variable-number tandem repeat analysis of Escherichia coli O157:H7. J Clin Microbiol 2006; 44:374-7. [PMID: 16455886 PMCID: PMC1392644 DOI: 10.1128/jcm.44.2.374-377.2006] [Citation(s) in RCA: 49] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Multilocus variable-number tandem repeat analysis (MLVA) is a validated molecular subtyping method for detecting and evaluating Escherichia coli O157:H7 outbreaks. In a previous study, five outbreaks with a total of 21 isolates were examined by MLVA. Nearly 20% of the epidemiologically linked strains were single-locus variants (SLV) of their respective predominant outbreak clone. This result prompted an investigation into the mutation rates of the seven MLVA loci (TR1 to TR7). With an outbreak strain that was an SLV at the TR1 locus of the predominant clone, parallel and serial batch culture experiments were performed. In a parallel experiment, none (0/384) of the strains analyzed had mutations at the seven MLVA loci. In contrast, in the two 5-day serial experiments, 4.3% (41/960) of the strains analyzed had a significant variation in at least one of these loci (P < 0.001). The TR2 locus accounted for 85.3% (35/41) of the mutations, with an average mutation rate of 3.5 x 10(-3); the mutations rates for TR1 and TR5 were 10-fold lower. Single additions accounted for 77.1% (27/35) of the mutation events in TR2 and all (6/6) of the additions in TR1 and TR5. The remaining four loci had no slippage events detected. The mutation rates were locus specific and may impact the interpretation of MLVA data for epidemiologic investigations.
Collapse
Affiliation(s)
- Anna C Noller
- Infectious Diseases Epidemiology Research Unit, University of Pittsburgh Graduate School of Public Health and School of Medicine, 521 Parran Hall, 130 DeSoto St., Pittsburgh, PA 15261, USA
| | | | | | | |
Collapse
|
23
|
Abstract
Hypermutable tandem repeat sequences (TRSs) are present in the genomes of both prokaryotic and eukaryotic organisms. Numerous studies have been conducted in several laboratories over the past decade to investigate the mechanisms responsible for expansions and contractions of microsatellites (a subset of TRSs with a repeat length of 1-6 nucleotides) in the model prokaryotic organism Escherichia coli. Both the frequency of tandem repeat instability (TRI), and the types of mutational events that arise, are markedly influenced by the DNA sequence of the repeat, the number of unit repeats, and the types of cellular pathways that process the TRS. DNA strand slippage is a general mechanism invoked to explain instability in TRSs. Misaligned DNA sequences are stabilized both by favorable base pairing of complementary sequences and by the propensity of TRSs to form relatively stable secondary structures. Several cellular processes, including replication, recombination and a variety of DNA repair pathways, have been shown to interact with such structures and influence TRI in bacteria. This paper provides an overview of our current understanding of mechanisms responsible for TRI in bacteria, with an emphasis on studies that have been carried out in E. coli. In addition, new experimental data are presented, suggesting that TLS polymerases (PolII, PolIV and PolV) do not contribute significantly to TRI in E. coli.
Collapse
Affiliation(s)
- M Bichara
- Département Intégrité du Génome de l'UMR 7175, PolAP1, Boulevard Sébastien Brant 67400, Strasbourg-Illkirch, France
| | | | | |
Collapse
|
24
|
Affiliation(s)
- Hans Ellegren
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Norbyvägen 18D, SE-752 36 Uppsala, Sweden.
| |
Collapse
|
25
|
Bayliss CD, Sweetman WA, Moxon ER. Mutations in Haemophilus influenzae mismatch repair genes increase mutation rates of dinucleotide repeat tracts but not dinucleotide repeat-driven pilin phase variation rates. J Bacteriol 2004; 186:2928-35. [PMID: 15126452 PMCID: PMC400628 DOI: 10.1128/jb.186.10.2928-2935.2004] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
High-frequency, reversible switches in expression of surface antigens, referred to as phase variation (PV), are characteristic of Haemophilus influenzae. PV enables this bacterial species, an obligate commensal and pathogen of the human upper respiratory tract, to adapt to changes in the host environment. Phase-variable hemagglutinating pili are expressed by many H. influenzae isolates. PV involves alterations in the number of 5' TA repeats located between the -10 and -35 promoter elements of the overlapping, divergently orientated promoters of hifA and hifBCDE, whose products mediate biosynthesis and assembly of pili. Dinucleotide repeat tracts are destabilized by mismatch repair (MMR) mutations in Escherichia coli. The influence of mutations in MMR genes of H. influenzae strain Rd on dinucleotide repeat-mediated PV rates was investigated by using reporter constructs containing 20 5' AT repeats. Mutations in mutS, mutL, and mutH elevated rates approximately 30-fold, while rates in dam and uvrD mutants were increased 14- and 3-fold, respectively. PV rates of constructs containing 10 to 12 5' AT repeats were significantly elevated in mutS mutants of H. influenzae strains Rd and Eagan. An intact hif locus was found in 14 and 12% of representative nontypeable H. influenzae isolates associated with either otitis media or carriage, respectively. Nine or more tandem 5' TA repeats were present in the promoter region. Surprisingly, inactivation of mutS in two serotype b H. influenzae strains did not alter pilin PV rates. Thus, although functionally analogous to the E. coli MMR pathway and active on dinucleotide repeat tracts, defects in H. influenzae MMR do not affect 5' TA-mediated pilin PV.
Collapse
Affiliation(s)
- Christopher D Bayliss
- Molecular Infectious Diseases Group, Department of Paediatrics, Weatherall Institute for Molecular Medicine, University of Oxford, John Radcliffe Hospital, Headington, Oxford OX3 9DU, United Kingdom.
| | | | | |
Collapse
|
26
|
Dupuy BM, Stenersen M, Egeland T, Olaisen B. Y-chromosomal microsatellite mutation rates: differences in mutation rate between and within loci. Hum Mutat 2004; 23:117-124. [PMID: 14722915 DOI: 10.1002/humu.10294] [Citation(s) in RCA: 90] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Precise estimates of mutation rates at Y-chromosomal microsatellite STR (short tandem repeat) loci make an important basis for paternity diagnostics and dating of Y chromosome lineage origins. There are indications of considerable locus mutation rate variability between (inter-) and within (intra-) loci. We have studied nine Y-STR loci-DYS19, DYS389I/II, DYS390, DYS391, DYS392, DYS393, DYS385, and DYS388-in 1,766 father-son pairs of confirmed paternity (a total of 15,894 meioses). Five biallelic markers were also analyzed in the fathers-Tat, YAP, 12f2, SRY1532, and 92R7-defining haplogroups 1, 2, 3, 4, 9, and 16, respectively. A total of 36 fragment length mutations were observed: 24 gains (22 single-step, two double-step) and 12 single-step losses. Thus, there was a significant surplus of gains (p=0.045). Overall, the mutation rate was positively correlated to STR repeat length and there was a significant relative excess of losses in long alleles and gains in short alleles (p=0.043). In contrast to the situation in autosomal STR loci and in MSY-1, no noteworthy correlation between mutation rate and the father's age at the child's birth was observed. We observed significant interlocus differences in Y-STR mutation rates (p<0.01). The number of observed mutations ranged from zero in DYS392 to eight in DYS391 and DYS390. We have also demonstrated obvious differences in mutation rates between the haplogroups studied (p=0.024), a phenomenon that is a reflection of the dependence of mutation rate on allele size. Our study has thus demonstrated the necessity of not only locus-specific, but even allele-specific, mutation rate estimates for forensic and population genetic purposes, and provides a considerable basis for such estimates.
Collapse
Affiliation(s)
- B Myhre Dupuy
- Institute of Forensic Medicine, University of Oslo, Rikshospitalet, Oslo, Norway
| | - M Stenersen
- Institute of Forensic Medicine, University of Oslo, Rikshospitalet, Oslo, Norway
| | - T Egeland
- Biostatistics, Rikshospitalet, Oslo, Norway
| | - B Olaisen
- Institute of Forensic Medicine, University of Oslo, Rikshospitalet, Oslo, Norway
| |
Collapse
|
27
|
Hile SE, Eckert KA. Positive correlation between DNA polymerase alpha-primase pausing and mutagenesis within polypyrimidine/polypurine microsatellite sequences. J Mol Biol 2004; 335:745-59. [PMID: 14687571 DOI: 10.1016/j.jmb.2003.10.075] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
Microsatellite DNA sequences are ubiquitous in the human genome, and mutation rates of these repetitive sequences vary with respect to DNA sequence as well as length. We have analyzed polymerase-DNA interactions as a function of microsatellite sequence, using polypyrimidine/polypurine di- and tetranucleotide alleles representative of those found in the human genome. Using an in vitro primer extension assay and the mammalian DNA polymerase alpha-primase complex, we have observed a polymerase termination profile for each microsatellite that is unique to that allele. Interestingly, a periodic termination profile with an interval size (9-11 nucleotides) unrelated to microsatellite unit length was observed for the [TC](20) and [TTCC](9) templates. In contrast, a unit-punctuated polymerase termination profile was found for the longer polypurine templates. We detected strong polymerase pauses within the [TC](20) allele at low reaction pH which were eliminated by the addition of deaza-dGTP, consistent with these specific pauses being a consequence of triplex DNA formation during DNA synthesis. Quantitatively, a strand bias was observed in the primer extension assay, in that polymerase synthesis termination is more intense when the polypurine sequence serves as the template, relative to its complementary polypyrimidine sequence. The HSV-tk forward mutation assay was utilized to determine the corresponding polymerase alpha-primase error frequencies and specificities at the microsatellite alleles. A higher microsatellite polymerase error frequency (50x10(-4) to 60x10(-4)) was measured when polypurine sequences serve as templates for DNA synthesis, relative to the polypyrimidine template (18x10(-4)). Thus, a positive correlation exists between polymerase alpha-primase pausing and mutagenesis within microsatellite DNA alleles.
Collapse
Affiliation(s)
- Suzanne E Hile
- Department of Pathology, Gittlen Cancer Research Institute, The Pennsylvania State University College of Medicine, Hershey Medical Center, 500 University Drive, Hershey, PA 17033, USA
| | | |
Collapse
|
28
|
Torres-Cruz J, van der Woude MW. Slipped-strand mispairing can function as a phase variation mechanism in Escherichia coli. J Bacteriol 2004; 185:6990-4. [PMID: 14617664 PMCID: PMC262711 DOI: 10.1128/jb.185.23.6990-6994.2003] [Citation(s) in RCA: 48] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open
Abstract
Slipped-strand mispairing (SSM) has not been identified as a mechanism of phase variation in Escherichia coli. Using a reporter gene, we show that sequences that cause phase variation by SSM in Haemophilus influenzae also lead to phase variation when introduced onto the chromosome of E. coli, and the frequencies of switching are in the biologically relevant range. Thus, the absence of SSM-mediated phase variation in E. coli does not appear to be due to a mechanistic constraint.
Collapse
Affiliation(s)
- Joshua Torres-Cruz
- Department of Microbiology, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA
| | | |
Collapse
|
29
|
Greig C, Jacobson DP, Banks MA. New tetranucleotide microsatellites for fine-scale discrimination among endangered chinook salmon (Oncorhynchus tshawytscha). ACTA ACUST UNITED AC 2003. [DOI: 10.1046/j.1471-8286.2003.00455.x] [Citation(s) in RCA: 304] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
30
|
Rocha EPC. An appraisal of the potential for illegitimate recombination in bacterial genomes and its consequences: from duplications to genome reduction. Genome Res 2003; 13:1123-32. [PMID: 12743022 PMCID: PMC403640 DOI: 10.1101/gr.966203] [Citation(s) in RCA: 59] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
An exhaustive search for shortly spaced repeats in 74 bacterial chromosomes reveals that they are much more numerous than is usually acknowledged. These repeats were divided into five classes: close repeats (CRs), tandem repeats (TRs), simple sequence repeats (SSRs), spaced interspersed direct repeats, and "others." CRs are widespread and constitute the most abundant class, particularly in coding sequences. The other classes are less frequent, but each individual element shows a higher potential for recombination, when the number of repeats and their distances are taken into account. SSRs and TRs are more frequent in pathogens, as expected given their role in contingency loci, but are also widespread in the other bacteria. The analysis of CRs shows that they have an important role in the evolution of genomes, namely by generating duplications and deletions. Several cases compatible with a significant role of small CRs in the formation of large repeats were detected. Also, gene deletion in Buchnera correlates with repeat density, suggesting that CRs may lead to sequence deletion in general and genome reductive evolution of obligatory intracellular bacteria in particular. The assembly of these results indicates that shortly spaced repeats are key players in the dynamics of genome evolution.
Collapse
Affiliation(s)
- Eduardo P C Rocha
- Unité Génétique des Génomes Bactériens, Institut Pasteur, 75724 Paris Cedex 15, France.
| |
Collapse
|
31
|
Vigouroux Y, Jaqueth JS, Matsuoka Y, Smith OS, Beavis WD, Smith JSC, Doebley J. Rate and pattern of mutation at microsatellite loci in maize. Mol Biol Evol 2002; 19:1251-60. [PMID: 12140237 DOI: 10.1093/oxfordjournals.molbev.a004186] [Citation(s) in RCA: 171] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Microsatellites are important tools for plant breeding, genetics, and evolution, but few studies have analyzed their mutation pattern in plants. In this study, we estimated the mutation rate for 142 microsatellite loci in maize (Zea mays subsp. mays) in two different experiments of mutation accumulation. The mutation rate per generation was estimated to be 7.7 x 10(-4) for microsatellites with dinucleotide repeat motifs, with a 95% confidence interval from 5.2 x 10(-4) to 1.1 x 10(-3). For microsatellites with repeat motifs of more than 2 bp in length, no mutations were detected; so we could only estimate the upper 95% confidence limit of 5.1 x 10(-5) for the mutation rate. For dinucleotide repeat microsatellites, we also determined that the variance of change in the number of repeats (sigma(m)2) is 3.2. We sequenced 55 of the 73 observed mutations, and all mutations proved to be changes in the number of repeats in the microsatellite or in mononucleotide tracts flanking the microsatellite. There is a higher probability to mutate to an allele of larger size. There is heterogeneity in the mutation rate among dinucleotide microsatellites and a positive correlation between the number of repeats in the progenitor allele and the mutation rate. The microsatellite-based estimate of the effective population size of maize is more than an order of magnitude less than previously reported values based on nucleotide sequence variation.
Collapse
Affiliation(s)
- Yves Vigouroux
- Department of Genetics, University of Wisconsin, Madison 53706, USA
| | | | | | | | | | | | | |
Collapse
|
32
|
Eckert KA, Yan G, Hile SE. Mutation rate and specificity analysis of tetranucleotide microsatellite DNA alleles in somatic human cells. Mol Carcinog 2002; 34:140-50. [PMID: 12112308 DOI: 10.1002/mc.10058] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
We have systematically varied microsatellite sequence composition to determine the effects of repeat unit size, G+C content, and DNA secondary structure on microsatellite stability in human cells. The microsatellites were inserted in frame within the 5' region of the herpes simplex virus thymidine kinase (HSV-tk) gene. The polypyrimidine/polypurine microsatellites displayed enhanced S1 nuclease sensitivity in vitro, consistent with the formation of non-B-form DNA structures. Microsatellite mutagenesis studies were performed with a shuttle vector system in which inactivating HSV-tk mutations are measured after replication in a nontumorigenic cell line. A significant increase in the HSV-tk mutation frequency per cell generation was observed after insertion of [TTCC/AAGG]9, [TTTC/AAAG]9, or [TCTA/AGAT]9 sequences (P <or= 0.0002), relative to the HSV-tk gene control. We observed that the G + C content of the microsatellite may affect mutagenesis, as the mean microsatellite mutation rates of the [TTTC/AAAG]9 and [TCTA/AGAT]9 alleles were sevenfold and 11-fold higher, respectively, than the [TTCC/AAGG]9 allele. A bias toward expansion mutations was noted for the majority of clones bearing the [TTCC/AAGG]9 allele as well as a [TC/AG]17 microsatellite of similar allele length. The mean microsatellite mutation rate of the [TTCC/AAGG]9 allele did not differ significantly from that for a [TC/AG]11 allele, demonstrating that these tetranucleotide and dinucleotide alleles are of equivalent stability. It is known that microsatellite mutagenesis is affected by the number of repeat units within an allele. Our data suggest that additional biochemical factors may regulate both the rate and specificity of somatic cell microsatellite mutagenesis.
Collapse
Affiliation(s)
- Kristin A Eckert
- Jake Gittlen Cancer Research Institute, Department of Pathology, Pennsylvania State University, College of Medicine, Hershey, Pennsylvania 17033, USA
| | | | | |
Collapse
|
33
|
Rocha EPC, Blanchard A. Genomic repeats, genome plasticity and the dynamics of Mycoplasma evolution. Nucleic Acids Res 2002; 30:2031-42. [PMID: 11972343 PMCID: PMC113839 DOI: 10.1093/nar/30.9.2031] [Citation(s) in RCA: 140] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Mycoplasmas evolved by a drastic reduction in genome size, but their genomes contain numerous repeated sequences with important roles in their evolution. We have established a bioinformatic strategy to detect the major recombination hot-spots in the genomes of Mycoplasma pneumoniae, Mycoplasma genitalium, Ureaplasma urealyticum and Mycoplasma pulmonis. This allowed the identification of large numbers of potentially variable regions, as well as a comparison of the relative recombination potentials of different genomic regions. Different trends are perceptible among mycoplasmas, probably due to different functional and structural constraints. The largest potential for illegitimate recombination in M.pulmonis is found at the vsa locus and its comparison in two different strains reveals numerous changes since divergence. On the other hand, the main M.pneumoniae and M.genitalium adhesins rely on large distant repeats and, hence, homologous recombination for variation. However, the relation between the existence of repeats and antigenic variation is not necessarily straightforward, since repeats of P1 adhesin were found to be anti-correlated with epitopes recognized by patient antibodies. These different strategies have important consequences for the structures of genomes, since large distant repeats correlate well with the major chromosomal rearrangements. Probably to avoid such events, mycoplasmas strongly avoid inverse repeats, in comparison to co-oriented repeats.
Collapse
|
34
|
Bayliss CD, van de Ven T, Moxon ER. Mutations in polI but not mutSLH destabilize Haemophilus influenzae tetranucleotide repeats. EMBO J 2002; 21:1465-76. [PMID: 11889052 PMCID: PMC125930 DOI: 10.1093/emboj/21.6.1465] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Haemophilus influenzae (Hi), an obligate upper respiratory tract commensal/pathogen, uses phase variation (PV) to adapt to host environment changes. Switching occurs by slippage of nucleotide repeats (microsatellites) within genes coding for virulence molecules. Most such microsatellites in Hi are tetranucleotide repeats, but an exception is the dinucleotide repeats in the pilin locus. To investigate the effects on PV rates of mutations in genes for mismatch repair (MMR), insertion/deletion mutations of mutS, mutL, mutH, dam, polI, uvrD, mfd and recA were constructed in Hi strain Rd. Only inactivation of polI destabilized tetranucleotide (5'AGTC) repeat tracts of chromosomally located reporter constructs, whereas inactivation of mutS, but not polI, destabilized dinucleotide (5'AT) repeats. Deletions of repeats were predominant in polI mutants, which we propose are due to end-joining occurring without DNA polymerization during polI-deficient Okazaki fragment processing. The high prevalence of tetranucleotides mediating PV is an exceptional feature of the Hi genome. The refractoriness to MMR of hypermutation in Hi tetranucleotides facilitates adaptive switching without the deleterious increase in global mutation rates that accompanies a mutator genotype.
Collapse
Affiliation(s)
- Christopher D Bayliss
- Molecular Infectious Diseases Group, Department of Paediatrics, Weatherall Institute for Molecular Medicine, John Radcliffe Hospital, Headington, Oxford OX3 9DU, UK.
| | | | | |
Collapse
|
35
|
Metzgar D, Liu L, Hansen C, Dybvig K, Wills C. Domain-level differences in microsatellite distribution and content result from different relative rates of insertion and deletion mutations. Genome Res 2002; 12:408-13. [PMID: 11875028 PMCID: PMC155286 DOI: 10.1101/gr.198602] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Microsatellites (short tandem polynucleotide repeats) are found throughout eukaryotic genomes at frequencies many orders of magnitude higher than the frequencies predicted to occur by chance. Most of these microsatellites appear to have evolved in a generally neutral manner. In contrast, microsatellites are generally absent from bacterial genomes except in locations where they provide adaptive functional variability, and these appear to have evolved under selection. We demonstrate a mutational bias towards deletion (repeat contraction) in a native chromosomal microsatellite of the bacterium Mycoplasma gallisepticum, through the collection and analysis of independent mutations in the absence of natural selection. Using this and similar existing data from two other bacterial species and four eukaryotic species, we find strong evidence that deletion biases resulting in repeat contraction are common in bacteria, while eukaryotic microsatellites generally experience unbiased mutation or a bias towards insertion (repeat expansion). This difference in mutational bias suggests that eukaryotic microsatellites should generally expand wherever selection does not exclude them, whereas bacterial microsatellites should be driven to extinction by mutational pressure wherever they are not maintained by selection. This is consistent with observed bacterial and eukaryotic microsatellite distributions. Hence, mutational biases that differ between eukaryotes and bacteria can account for many of the observed differences in microsatellite DNA content and distribution found in these two groups of organisms.
Collapse
Affiliation(s)
- David Metzgar
- Division of Biology, University of California at San Diego, San Diego, CA 92093-0116, USA.
| | | | | | | | | |
Collapse
|
36
|
Snyder LAS, Butcher SA, Saunders NJ. Comparative whole-genome analyses reveal over 100 putative phase-variable genes in the pathogenic Neisseria spp. MICROBIOLOGY (READING, ENGLAND) 2001; 147:2321-2332. [PMID: 11496009 DOI: 10.1099/00221287-147-8-2321] [Citation(s) in RCA: 106] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Previously, a complete genome analysis of Neisseria meningitidis strain MC58 revealed the largest repertoire of putative phase-variable genes described in any species to date. Initial comparisons with two incomplete Neisseria spp. genome sequences available at that time revealed differences in the repeats associated with these genes in the form of polymorphisms, the absence of the potentially unstable elements in some alleles, and in the repertoire of the genes that were present. Analyses of the complete genomes of N. meningitidis strain Z2491 and Neisseria gonorrhoeae strain FA1090 have been performed and are combined with a comprehensive comparative analysis between the three available complete genome sequences. This has increased the sensitivity of these searches and provided additional contextual information that facilitates the interpretation of the functional consequences of repeat instability. This analysis identified: (i) 68 phase-variable gene candidates in N. meningitidis strain Z2491, rather than the 27 previously reported; (ii) 83 candidates in N. gonorrhoeae strain FA1090; and (iii) 82 candidates in N. meningitidis strain MC58, including an additional 19 identified through cross-comparisons with the other two strains. In addition to the 18 members of the opa gene family, a repertoire of 119 putative phase-variable genes is described, indicating a huge potential for diversification mediated by this mechanism of gene switching in these species that is central to their interactions with the host and environmental transitions. Eighty-two of these are either known (14) or strong (68) candidates for phase variation, which together with the opa genes make a total of 100 identified genes. The repertoires of the genes identified in this analysis diverge from the different species groupings, indicating horizontal exchange that significantly affects the species and strain complements of these genes.
Collapse
Affiliation(s)
- Lori A S Snyder
- The Sir William Dunn School of Pathology, University of Oxford, South Parks Road, Oxford OX1 3RE, UK1
| | - Sarah A Butcher
- Oxford University Bioinformatics Centre, The Sir William Dunn School of Pathology, University of Oxford, South Parks Road, Oxford OX1 3RE, UK2
| | - Nigel J Saunders
- The Sir William Dunn School of Pathology, University of Oxford, South Parks Road, Oxford OX1 3RE, UK1
| |
Collapse
|