1
|
Kristmundsdottir S, Jonsson H, Hardarson MT, Palsson G, Beyter D, Eggertsson HP, Gylfason A, Sveinbjornsson G, Holley G, Stefansson OA, Halldorsson GH, Olafsson S, Arnadottir GA, Olason PI, Eiriksson O, Masson G, Thorsteinsdottir U, Rafnar T, Sulem P, Helgason A, Gudbjartsson DF, Halldorsson BV, Stefansson K. Sequence variants affecting the genome-wide rate of germline microsatellite mutations. Nat Commun 2023; 14:3855. [PMID: 37386006 PMCID: PMC10310707 DOI: 10.1038/s41467-023-39547-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2022] [Accepted: 06/16/2023] [Indexed: 07/01/2023] Open
Abstract
Microsatellites are polymorphic tracts of short tandem repeats with one to six base-pair (bp) motifs and are some of the most polymorphic variants in the genome. Using 6084 Icelandic parent-offspring trios we estimate 63.7 (95% CI: 61.9-65.4) microsatellite de novo mutations (mDNMs) per offspring per generation, excluding one bp repeats motifs (homopolymers) the estimate is 48.2 mDNMs (95% CI: 46.7-49.6). Paternal mDNMs occur at longer repeats than maternal ones, which are in turn larger with a mean size of 3.4 bp vs 3.1 bp for paternal ones. mDNMs increase by 0.97 (95% CI: 0.90-1.04) and 0.31 (95% CI: 0.25-0.37) per year of father's and mother's age at conception, respectively. Here, we find two independent coding variants that associate with the number of mDNMs transmitted to offspring; The minor allele of a missense variant (allele frequency (AF) = 1.9%) in MSH2, a mismatch repair gene, increases transmitted mDNMs from both parents (effect: 13.1 paternal and 7.8 maternal mDNMs). A synonymous variant (AF = 20.3%) in NEIL2, a DNA damage repair gene, increases paternally transmitted mDNMs (effect: 4.4 mDNMs). Thus, the microsatellite mutation rate in humans is in part under genetic control.
Collapse
Affiliation(s)
- Snaedis Kristmundsdottir
- deCODE genetics / Amgen Inc., Reykjavik, Iceland
- School of Technology, Reykjavik University, Reykjavik, Iceland
| | | | - Marteinn T Hardarson
- deCODE genetics / Amgen Inc., Reykjavik, Iceland
- School of Technology, Reykjavik University, Reykjavik, Iceland
| | | | - Doruk Beyter
- deCODE genetics / Amgen Inc., Reykjavik, Iceland
| | | | | | | | | | | | - Gisli H Halldorsson
- deCODE genetics / Amgen Inc., Reykjavik, Iceland
- School of Engineering and Natural Sciences, University of Iceland, Reykjavik, Iceland
| | | | - Gudny A Arnadottir
- deCODE genetics / Amgen Inc., Reykjavik, Iceland
- Faculty of Medicine, School of Health Sciences, University of Iceland, Reykjavik, Iceland
| | | | | | - Gisli Masson
- deCODE genetics / Amgen Inc., Reykjavik, Iceland
| | - Unnur Thorsteinsdottir
- deCODE genetics / Amgen Inc., Reykjavik, Iceland
- Faculty of Medicine, School of Health Sciences, University of Iceland, Reykjavik, Iceland
| | | | | | - Agnar Helgason
- deCODE genetics / Amgen Inc., Reykjavik, Iceland
- Department of Anthropology, University of Iceland, Reykjavik, Iceland
| | - Daniel F Gudbjartsson
- deCODE genetics / Amgen Inc., Reykjavik, Iceland
- School of Engineering and Natural Sciences, University of Iceland, Reykjavik, Iceland
| | - Bjarni V Halldorsson
- deCODE genetics / Amgen Inc., Reykjavik, Iceland.
- School of Technology, Reykjavik University, Reykjavik, Iceland.
| | | |
Collapse
|
2
|
Guo YH, Halasan LC, Wang HY, Lin HC. High migratory propensity constitutes a single stock of an exploited cutlassfish species in the Northwest Pacific: A microsatellite approach. PLoS One 2022; 17:e0265548. [PMID: 35298539 PMCID: PMC8929604 DOI: 10.1371/journal.pone.0265548] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2021] [Accepted: 03/03/2022] [Indexed: 11/19/2022] Open
Abstract
Cutlassfishes, also known as hairtails, include multiple predatory fishes of the family Trichiuridae. They constitute a top marine fish commodity globally, yet the knowledge about their composition and intraspecific genetic structures is still limited. Trichiurus japonicus accounts for a major amount in the northwest Pacific fishery. Previous studies based on mitochondrial DNA markers reported incongruences in its population structure, hence prompting the need for high-resolution markers and avoiding possible shortcomings in its management. Here we genotyped ten novel de novo-assembled transcriptome-derived microsatellite markers on a total of 150 samples across five major fishing grounds (encompassing latitudes 22-39°N). These markers presented a high number of alleles and heterozygosity compared to other marine fishes, corresponding to the large effective population size of ~20,000 per location and cohort differentiation. Population structuring analyses suggested T. japonicus to be a homogenous well-mixed population. This configuration is likely attributed to the majority of its effective population migrates across locations, and the absence of oceanographic barriers at the continental shelves. Qingdao with reportedly high ocean productivity could be a genetic pseudosink based on the high heterozygosity and migratory preference. Moreover, the results of sign tests suggest that T. japonicus experienced a recent bottleneck likely concurrent with historical glaciation events. Further, we demonstrated satisfactory cross-amplifications of our markers on several congeners, indicating a great promise to use these markers to study the population genetics of trichiurids. Together, our findings will serve as an essential groundwork for enhancing resource conservation and management of cutlassfishes.
Collapse
Affiliation(s)
- Yu-Hong Guo
- Department of Marine Biotechnology and Resources, National Sun Yat-sen University, Kaohsiung, Taiwan
| | - Lorenzo C. Halasan
- Department of Marine Biotechnology and Resources, National Sun Yat-sen University, Kaohsiung, Taiwan
| | - Hui-Yu Wang
- Institute of Oceanography, National Taiwan University, Taipei, Taiwan
| | - Hsiu-Chin Lin
- Department of Marine Biotechnology and Resources, National Sun Yat-sen University, Kaohsiung, Taiwan
- Doctoral Degree Program in Marine Biotechnology, National Sun Yat-sen University, Kaohsiung, Taiwan
| |
Collapse
|
3
|
Mahfooz S, Singh P, Akhter Y. A comparative study of microsatellites among crocodiles and development of genomic resources for the critically endangered Indian gharial. Genetica 2022; 150:67-75. [DOI: 10.1007/s10709-021-00148-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2021] [Accepted: 12/09/2021] [Indexed: 11/29/2022]
|
4
|
Lei Y, Zhou Y, Price M, Song Z. Genome-wide characterization of microsatellite DNA in fishes: survey and analysis of their abundance and frequency in genome-specific regions. BMC Genomics 2021; 22:421. [PMID: 34098869 PMCID: PMC8186053 DOI: 10.1186/s12864-021-07752-6] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2021] [Accepted: 05/24/2021] [Indexed: 12/14/2022] Open
Abstract
BACKGROUND Microsatellite repeats are ubiquitous in organism genomes and play an important role in the chromatin organization, regulation of gene activity, recombination and DNA replication. Although microsatellite distribution patterns have been studied in most phylogenetic lineages, they are unclear in fish species. RESULTS Here, we present the first systematic examination of microsatellite distribution in coding and non-coding regions of 14 fish genomes. Our study showed that the number and type of microsatellites displayed nonrandom distribution for both intragenic and intergenic regions, suggesting that they have potential roles in transcriptional or translational regulation and DNA replication slippage theories alone were insufficient to explain the distribution patterns. Our results showed that microsatellites are dominant in non-coding regions. The total number of microsatellites ranged from 78,378 to 1,012,084, and the relative density varied from 4925.76 bp/Mb to 25,401.97 bp/Mb. Overall, (A + T)-rich repeats were dominant. The dependence of repeat abundance on the length of the repeated unit (1-6 nt) showed a great similarity decrease, whereas more tri-nucleotide repeats were found in exonic regions than tetra-nucleotide repeats of most species. Moreover, the incidence of different repeated types appeared species- and genomic-specific. These results highlight potential mechanisms for maintaining microsatellite distribution, such as selective forces and mismatch repair systems. CONCLUSIONS Our data could be beneficial for the studies of genome evolution and microsatellite DNA evolutionary dynamics, and facilitate the exploration of microsatellites structural, function, composition mode and molecular markers development in these species.
Collapse
Affiliation(s)
- Yi Lei
- Sichuan Key Laboratory of Conservation Biology on Endangered Wildlife, College of Life Sciences, Sichuan University, Chengdu, 610065, People's Republic of China
| | - Yu Zhou
- Sichuan Key Laboratory of Conservation Biology on Endangered Wildlife, College of Life Sciences, Sichuan University, Chengdu, 610065, People's Republic of China
| | - Megan Price
- Sichuan Key Laboratory of Conservation Biology on Endangered Wildlife, College of Life Sciences, Sichuan University, Chengdu, 610065, People's Republic of China
| | - Zhaobin Song
- Sichuan Key Laboratory of Conservation Biology on Endangered Wildlife, College of Life Sciences, Sichuan University, Chengdu, 610065, People's Republic of China.
- Key Laboratory of Bio-Resources and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, 610065, People's Republic of China.
| |
Collapse
|
5
|
Gits-Muselli M, Campagne P, Desnos-Ollivier M, Le Pape P, Bretagne S, Morio F, Alanio A. Comparison of MultiLocus Sequence Typing (MLST) and Microsatellite Length Polymorphism (MLP) for Pneumocystis jirovecii genotyping. Comput Struct Biotechnol J 2020; 18:2890-2896. [PMID: 33163149 PMCID: PMC7593342 DOI: 10.1016/j.csbj.2020.10.005] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2020] [Revised: 09/28/2020] [Accepted: 10/02/2020] [Indexed: 01/02/2023] Open
Abstract
Pneumocystis jirovecii is an atypical fungus responsible for severe respiratory infections, often reported as local outbreaks in immunocompromised patients. Epidemiology of this infection, and transmission risk emphasises the need for developing genotyping techniques. Currently, two methods have emerged: Multilocus Sequence typing (MLST) and microsatellite length polymorphism (MLP). Here we compare an MLST strategy, including 2 nuclear loci and 2 mitochondrial loci, with an MLP strategy including 6 nuclear markers using 37 clinical PCR-positive respiratory samples from two French hospitals. Pneumocystis jirovecii MLST and MLP provided 30 and 35 different genotypes respectively. A higher number of mixed infections was detected using MLP (48.6% vs. 13.5% respectively; p = 0.002). Only one MLP marker (STR279) was statistically associated with the geographical origin of samples. Haplotype network inferred using the available genotypes yielded expanded network for MLP, characterized by more mutational steps as compared to MLST, suggesting that the MLP approach is more resolutive to separate genotypes. The correlation between genetic distances calculated based on MLST and MLP was modest with a R2 value = 0.32 (p < 0.001). Finally, both genotyping methods fulfilled important criteria: (i) a discriminatory power from 97.5% to 99.5% and (ii) being quick and convenient genotyping tools. While MLP appeared highly resolutive regarding genotypes mixture within samples, using one genotyping method rather than the other may also depend on the context (i.e., MLST for investigation of suspected clonal outbreaks versus MLP for population structure study) as well as local facilities.
Collapse
Affiliation(s)
- Maud Gits-Muselli
- Laboratoire de Parasitologie-Mycologie; AP-HP, Groupe Hospitalier Saint-Louis-Lariboisière-Fernand-Widal, Paris, France.,Université de Paris, Sorbonne Paris Cité, Paris, France.,Institut Pasteur, CNRS, unité de Mycologie Moléculaire, Centre National de référence Mycoses invasives et Antifongiques (CNRMA), UMR2000, Paris, France
| | - Pascal Campagne
- Hub of Bioinformatics and Biostatistics - Département Biologie Computationnelle, Institut Pasteur, USR 3756 CNRS, Paris, France
| | - Marie Desnos-Ollivier
- Institut Pasteur, CNRS, unité de Mycologie Moléculaire, Centre National de référence Mycoses invasives et Antifongiques (CNRMA), UMR2000, Paris, France
| | - Patrice Le Pape
- Laboratoire de Parasitologie-Mycologie, Institut de Biologie, CHU Nantes, Nantes, France.,Département de Parasitologie et Mycologie Médicale, EA1155 IICiMed, Institut de Recherche en Santé 2, Université de Nantes, Nantes Atlantique Universités, Nantes, France
| | - Stéphane Bretagne
- Laboratoire de Parasitologie-Mycologie; AP-HP, Groupe Hospitalier Saint-Louis-Lariboisière-Fernand-Widal, Paris, France.,Université de Paris, Sorbonne Paris Cité, Paris, France.,Institut Pasteur, CNRS, unité de Mycologie Moléculaire, Centre National de référence Mycoses invasives et Antifongiques (CNRMA), UMR2000, Paris, France
| | - Florent Morio
- Laboratoire de Parasitologie-Mycologie, Institut de Biologie, CHU Nantes, Nantes, France.,Département de Parasitologie et Mycologie Médicale, EA1155 IICiMed, Institut de Recherche en Santé 2, Université de Nantes, Nantes Atlantique Universités, Nantes, France
| | - Alexandre Alanio
- Laboratoire de Parasitologie-Mycologie; AP-HP, Groupe Hospitalier Saint-Louis-Lariboisière-Fernand-Widal, Paris, France.,Université de Paris, Sorbonne Paris Cité, Paris, France.,Institut Pasteur, CNRS, unité de Mycologie Moléculaire, Centre National de référence Mycoses invasives et Antifongiques (CNRMA), UMR2000, Paris, France
| |
Collapse
|
6
|
Varshney S, Sagwekar M, Pavan-Kumar A, Das R, Gireesh-Babu P, Chaudhari A, Krishna G. Development and characterisation of novel polymorphic microsatellite loci in the freshwater mussel Lamellidens marginalis (Lamarck, 1819) using next generation sequencing. MOLLUSCAN RESEARCH 2020. [DOI: 10.1080/13235818.2020.1799147] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Affiliation(s)
- Shubham Varshney
- Division of Fish Genetics and Biotechnology, ICAR-Central Institute of Fisheries Education, Mumbai, India
| | - Madhavi Sagwekar
- Division of Fish Genetics and Biotechnology, ICAR-Central Institute of Fisheries Education, Mumbai, India
| | - A. Pavan-Kumar
- Division of Fish Genetics and Biotechnology, ICAR-Central Institute of Fisheries Education, Mumbai, India
| | - Rekha Das
- Division of Fisheries, ICAR Research Complex for NEH Region, Tripura Centre, Lembucherra, India
| | - P. Gireesh-Babu
- Division of Fish Genetics and Biotechnology, ICAR-Central Institute of Fisheries Education, Mumbai, India
| | - Aparna Chaudhari
- Division of Fish Genetics and Biotechnology, ICAR-Central Institute of Fisheries Education, Mumbai, India
| | - Gopal Krishna
- Division of Fish Genetics and Biotechnology, ICAR-Central Institute of Fisheries Education, Mumbai, India
| |
Collapse
|
7
|
El Bakkali A, Essalouh L, Tollon C, Rivallan R, Mournet P, Moukhli A, Zaher H, Mekkaoui A, Hadidou A, Sikaoui L, Khadari B. Characterization of Worldwide Olive Germplasm Banks of Marrakech (Morocco) and Córdoba (Spain): Towards management and use of olive germplasm in breeding programs. PLoS One 2019; 14:e0223716. [PMID: 31622375 PMCID: PMC6797134 DOI: 10.1371/journal.pone.0223716] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2019] [Accepted: 09/26/2019] [Indexed: 11/09/2022] Open
Abstract
Olive (Olea europaea L.) is a major fruit crop in the Mediterranean Basin. Ex-situ olive management is essential to ensure optimal use of genetic resources in breeding programs. The Worldwide Olive Germplasm Bank of Córdoba (WOGBC), Spain, and Marrakech (WOGBM), Morocco, are currently the largest existing olive germplasm collections. Characterization, identification, comparison and authentication of all accessions in both collections could thus provide useful information for managing olive germplasm for its preservation, exchange within the scientific community and use in breeding programs. Here we applied 20 microsatellite markers (SSR) and 11 endocarp morphological traits to discriminate and authenticate 1091 olive accessions belonging to WOGBM and WOGBC (554 and 537, respectively). Of all the analyzed accessions, 672 distinct SSR profiles considered as unique genotypes were identified, but only 130 were present in both collections. Combining SSR markers and endocarp traits led to the identification of 535 cultivars (126 in common) and 120 authenticated cultivars. No significant differences were observed between collections regarding the allelic richness and diversity index. We concluded that the genetic diversity level was stable despite marked contrasts in varietal composition between collections, which could be explained by their different collection establishment conditions. This highlights the extent of cultivar variability within WOGBs. Moreover, we detected 192 mislabeling errors, 72 of which were found in WOGBM. A total of 228 genotypes as molecular variants of 74 cultivars, 79 synonyms and 39 homonyms as new cases were identified. Both collections were combined to define the nested core collections of 55, 121 and 150 sample sizes proposed for further studies. This study was a preliminary step towards managing and mining the genetic diversity in both collections while developing collaborations between olive research teams to conduct association mapping studies by exchanging and phenotyping accessions in contrasted environmental sites.
Collapse
Affiliation(s)
- Ahmed El Bakkali
- INRA, UR Amélioration des Plantes et Conservation des Ressources Phyto-génétiques, Meknès, Morocco
| | - Laila Essalouh
- AGAP, University Montpellier, CIRAD, INRA, Montpellier SupAgro, Montpellier, France
- EPLEFPA de Nîmes-CFPPA du Gard, Rodilhan, France
| | - Christine Tollon
- AGAP, University Montpellier, CIRAD, INRA, Montpellier SupAgro, Montpellier, France
| | - Ronan Rivallan
- AGAP, University Montpellier, CIRAD, INRA, Montpellier SupAgro, Montpellier, France
| | - Pierre Mournet
- AGAP, University Montpellier, CIRAD, INRA, Montpellier SupAgro, Montpellier, France
| | | | - Hayat Zaher
- INRA, UR Amélioration des Plantes, Marrakech, Morocco
| | - Abderrahmane Mekkaoui
- INRA, UR Amélioration des Plantes et Conservation des Ressources Phyto-génétiques, Meknès, Morocco
| | - Amal Hadidou
- INRA, UR Amélioration des Plantes et Conservation des Ressources Phyto-génétiques, Meknès, Morocco
| | | | - Bouchaib Khadari
- AGAP, University Montpellier, CIRAD, INRA, Montpellier SupAgro, Montpellier, France
- Conservatoire Botanique National Méditerranéen de Porquerolles (CBNMed), UMR AGAP, Montpellier, France
| |
Collapse
|
8
|
Xia Y, Luo W, Yuan S, Zheng Y, Zeng X. Microsatellite development from genome skimming and transcriptome sequencing: comparison of strategies and lessons from frog species. BMC Genomics 2018; 19:886. [PMID: 30526480 PMCID: PMC6286531 DOI: 10.1186/s12864-018-5329-y] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2017] [Accepted: 11/28/2018] [Indexed: 11/14/2022] Open
Abstract
Background Even though microsatellite loci frequently have been isolated using recently developed next-generation sequencing (NGS) techniques, this task is still difficult because of the subsequent polymorphism screening requires a substantial amount of time. Selecting appropriate polymorphic microsatellites is a critical issue for ecological and evolutionary studies. However, the extent to which assembly strategy, read length, sequencing depth, and library layout produce a measurable effect on microsatellite marker development remains unclear. Here, we use six frog species for genome skimming and two frog species for transcriptome sequencing to develop microsatellite markers, and investigate the effect of different isolation strategies on the yield of microsatellites. Results The results revealed that the number of isolated microsatellites increases with increased data quantity and read length. Assembly strategy could influence the yield and the polymorphism of microsatellite development. Larger k-mer sizes produced fewer total number of microsatellite loci, but these loci had a longer repeat length, suggesting greater polymorphism. However, the proportion of each type of nucleotide repeats was not affected; dinucleotide repeats were always the dominant type. Finally, the transcriptomic microsatellites displayed lower levels of polymorphisms and were less abundant than genomic microsatellites, but more likely to be functionally linked loci. Conclusions These observations provide deep insight into the evolution and distribution of microsatellites and how different isolation strategies affect microsatellite development using NGS. Electronic supplementary material The online version of this article (10.1186/s12864-018-5329-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Yun Xia
- Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu, 610041, China
| | - Wei Luo
- Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu, 610041, China.,University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Siqi Yuan
- Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu, 610041, China.,College of Bioengineering, Sichuan University of Science & Engineering, Zigong, 643000, China
| | - Yuchi Zheng
- Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu, 610041, China
| | - Xiaomao Zeng
- Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu, 610041, China.
| |
Collapse
|
9
|
Qi WH, Jiang XM, Yan CC, Zhang WQ, Xiao GS, Yue BS, Zhou CQ. Distribution patterns and variation analysis of simple sequence repeats in different genomic regions of bovid genomes. Sci Rep 2018; 8:14407. [PMID: 30258087 PMCID: PMC6158176 DOI: 10.1038/s41598-018-32286-5] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2018] [Accepted: 09/04/2018] [Indexed: 01/23/2023] Open
Abstract
As the first examination of distribution, guanine-cytosine (GC) pattern, and variation analysis of microsatellites (SSRs) in different genomic regions of six bovid species, SSRs displayed nonrandomly distribution in different regions. SSR abundances are much higher in the introns, transposable elements (TEs), and intergenic regions compared to the 3′-untranslated regions (3′UTRs), 5′UTRs and coding regions. Trinucleotide perfect SSRs (P-SSRs) were the most frequent in the coding regions, whereas, mononucleotide P-SSRs were the most in the introns, 3′UTRs, TEs, and intergenic regions. Trifold P-SSRs had more GC-contents in the 5′UTRs and coding regions than that in the introns, 3′UTRs, TEs, and intergenic regions, whereas mononucleotide P-SSRs had the least GC-contents in all genomic regions. The repeat copy numbers (RCN) of the same mono- to hexanucleotide P-SSRs showed significantly different distributions in different regions (P < 0.01). Except for the coding regions, mononucleotide P-SSRs had the most RCNs, followed by the pattern: di- > tri- > tetra- > penta- > hexanucleotide P-SSRs in the same regions. The analysis of coefficient of variability (CV) of SSRs showed that the CV variations of RCN of the same mono- to hexanucleotide SSRs were relative higher in the intronic and intergenic regions, followed by the CV variation of RCN in the TEs, and the relative lower was in the 5′UTRs, 3′UTRs, and coding regions. Wide SSR analysis of different genomic regions has helped to reveal biological significances of their distributions.
Collapse
Affiliation(s)
- Wen-Hua Qi
- College of Biology and Food Engineering, Chongqing Three Gorges University, Chongqing, 404100, P. R. China
| | - Xue-Mei Jiang
- College of Environmental and Chemistry Engineering, Chongqing Three Gorges University, Chongqing, 404100, P. R. China
| | - Chao-Chao Yan
- Key Laboratory of Bio-resources and Eco-environment (Ministry of Education), College of Life Sciences, Sichuan University, Chengdu, 610064, P. R. China
| | - Wan-Qing Zhang
- College of Life Sciences, Sichuan Agricultural University, Ya'an, Sichuan Province, 625014, P. R. China
| | - Guo-Sheng Xiao
- College of Biology and Food Engineering, Chongqing Three Gorges University, Chongqing, 404100, P. R. China
| | - Bi-Song Yue
- Key Laboratory of Bio-resources and Eco-environment (Ministry of Education), College of Life Sciences, Sichuan University, Chengdu, 610064, P. R. China
| | - Cai-Quan Zhou
- Key Laboratory of Southwest China Wildlife Resources Conservation (Ministry of Education), China West Normal University, Nanchong, 637009, P. R. China.
| |
Collapse
|
10
|
Aci MM, Lupini A, Mauceri A, Morsli A, Khelifi L, Sunseri F. Genetic variation and structure of maize populations from Saoura and Gourara oasis in Algerian Sahara. BMC Genet 2018; 19:51. [PMID: 30068292 PMCID: PMC6090932 DOI: 10.1186/s12863-018-0655-2] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2017] [Accepted: 07/12/2018] [Indexed: 11/14/2022] Open
Abstract
Background The ability of maize populations/landraces to tolerate drastically extreme environments over the past four centuries in Algeria leads to characterize these genetic resources for germplasm management as well as the identification of the best landraces useful for genetic improvement. Thus, the aim of the present work was a fingerprinting of an Algerian maize collection (47 landraces) from Saharan oasis by using 24 agro-morphological traits and18 Simple Sequence Repeats to evaluate genetic diversity and population structure. Results Phenotypic traits showed large significant variation in which earliness, plant size, ear and kernel features and crop yield appeared the most discriminant traits among landraces by using principal component analysis (PCA). One hundred ninety-seven different alleles were detected with a high mean number of allele per locus (10.9). The selected SSR were highly informative with PIC values > 0.65 as well as an overall genetic diversity (0.47) highlighting a broad genetic variability in the analyzed landraces. Genetic structure analysis revealed a high genetic differentiation among the 47 maize landraces with an overall Fst value (0.33). Cluster analysis for morphological traits as well as for SSR markers grouped the 47 Algerian populations regardless their geographic origin. Conclusions Maize from Algerian desert harbors a wide genetic diversity offering a source of novel/unique alleles useful for maize breeding programs to face the ongoing and future major challenge, the climate changes. Electronic supplementary material The online version of this article (10.1186/s12863-018-0655-2) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Miyassa Meriem Aci
- LRGB, École Nationale Supérieure Agronomique (ENSA : ES1603), Avenue Pasteur, Hassan Badi, 16200, Algiers, El Harrach, Algeria
| | - Antonio Lupini
- Dipartimento AGRARIA, Università Mediterranea di Reggio Calabria, Località Feo di Vito snc, 89121, Reggio Calabria, Italy
| | - Antonio Mauceri
- Dipartimento AGRARIA, Università Mediterranea di Reggio Calabria, Località Feo di Vito snc, 89121, Reggio Calabria, Italy
| | - Abdelkader Morsli
- LRGB, École Nationale Supérieure Agronomique (ENSA : ES1603), Avenue Pasteur, Hassan Badi, 16200, Algiers, El Harrach, Algeria
| | - Lakhdar Khelifi
- LRGB, École Nationale Supérieure Agronomique (ENSA : ES1603), Avenue Pasteur, Hassan Badi, 16200, Algiers, El Harrach, Algeria.
| | - Francesco Sunseri
- Dipartimento AGRARIA, Università Mediterranea di Reggio Calabria, Località Feo di Vito snc, 89121, Reggio Calabria, Italy.
| |
Collapse
|
11
|
Yashima AS, Innan H. varver: a database of microsatellite variation in vertebrates. Mol Ecol Resour 2016; 17:824-833. [PMID: 27796069 DOI: 10.1111/1755-0998.12625] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2015] [Revised: 09/24/2016] [Accepted: 10/04/2016] [Indexed: 01/16/2023]
Abstract
Understanding how genetic variation is maintained within a species is important in ecology, evolution, conservation and population genetics. Tremendous efforts have been made to evaluate the patterns of genetic variation in natural populations of various species. For this purpose, microsatellites have played a major role since the 1990s. Here we describe a comprehensive database, varver (Variation in Vertebrates) that provides complete information regarding microsatellite variation in natural populations of vertebrates. For each species, varver includes basic information of the species, a list of publications reporting the microsatellite variation, and tables of genetic variation within and between populations (heterozygosity and FST ). The geographic location and rough sampling range are also shown for each sampled population. The database should be useful for researchers interested in not only specific species but also comparing multiple species. We discuss the utility of microsatellite data, particularly for meta-analyses that involve multiple microsatellite loci from various species. We show that in such analyses, it is extremely important to correct for biases caused by differences in mutation rate, mainly due to repeat unit and number.
Collapse
Affiliation(s)
- Akiko Sato Yashima
- Department of Evolutionary Studies of Biosystems, Graduate University for Advanced Studies (SOKENDAI), Hayama, Kanagawa, 240-0193, Japan.,Department of Mathematical Engineering, Musashino University, 3-3-3 Ariake, Koto-ku, Tokyo, 135-8181, Japan
| | - Hideki Innan
- Department of Evolutionary Studies of Biosystems, Graduate University for Advanced Studies (SOKENDAI), Hayama, Kanagawa, 240-0193, Japan
| |
Collapse
|
12
|
Abstract
Lynch syndrome, an autosomal dominant inherited disorder, is caused by inactivating mutations involving DNA mismatch repair (MMR) genes. This leads to profound genetic instability, including microsatellite instability (MSI) and increased risk for cancer development, particularly colon and endometrial malignancies. Clinical testing of tumor tissues for the presence of MMR gene deficiency is standard practice in clinical oncology, with immunohistochemistry and PCR-based microsatellite instability analysis used as screening tests to identify potential Lynch syndrome families. The ultimate diagnosis of Lynch syndrome requires documentation of mutation within one of the four MMR genes (MLH1, PMS2, MSH2 and MSH6) or EPCAM, currently achieved by comprehensive sequencing analysis of germline DNA. In this review, the genetic basis of Lynch syndrome, methodologies of MMR deficiency testing, and current diagnostic algorithms in the clinical management of Lynch syndrome, are discussed.
Collapse
Affiliation(s)
- Natalia Buza
- a Department of Pathology, School of Medicine , Yale University , New Haven , CT , USA
| | - James Ziai
- b Genentech Inc ., San Francisco , CA , USA
| | - Pei Hui
- a Department of Pathology, School of Medicine , Yale University , New Haven , CT , USA
| |
Collapse
|
13
|
Chapuis MP, Plantamp C, Streiff R, Blondin L, Piou C. Microsatellite evolutionary rate and pattern in Schistocerca gregaria inferred from direct observation of germline mutations. Mol Ecol 2015; 24:6107-19. [PMID: 26562076 DOI: 10.1111/mec.13465] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2015] [Revised: 11/05/2015] [Accepted: 11/06/2015] [Indexed: 01/21/2023]
Abstract
Unravelling variation among taxonomic orders regarding the rate of evolution in microsatellites is crucial for evolutionary biology and population genetics research. The mean mutation rate of microsatellites tends to be lower in arthropods than in vertebrates, but data are scarce and mostly concern accumulation of mutations in model species. Based on parent-offspring segregations and a hierarchical Bayesian model, the mean rate of mutation in the orthopteran insect Schistocerca gregaria was estimated at 2.1e(-4) per generation per untranscribed dinucleotide locus. This is close to vertebrate estimates and one order of magnitude higher than estimates from species of other arthropod orders, such as Drosophila melanogaster and Daphnia pulex. We also found evidence of a directional bias towards expansions even for long alleles and exceptionally large ranges of allele sizes. Finally, at transcribed microsatellites, the mean rate of mutation was half the rate found at untranscribed loci and the mutational model deviated from that usually considered, with most mutations involving multistep changes that avoid disrupting the reading frame. Our direct estimates of mutation rate were discussed in the light of peculiar biological and genomic features of S. gregaria, including specificities in mismatch repair and the dependence of its activity to allele length. Shedding new light on the mutational dynamics of grasshopper microsatellites is of critical importance for a number of research fields. As an illustration, we showed how our findings improve microsatellite application in population genetics, by obtaining a more precise estimation of S. gregaria effective population size from a published data set based on the same microsatellites.
Collapse
Affiliation(s)
- M-P Chapuis
- CIRAD, UMR CBGP, Montpellier, F-34398, France
| | - C Plantamp
- Laboratoire de Biométrie et Biologie Evolutive, CNRS, UMR 5558, Université Lyon 1, Villeurbanne, 69622, France
| | - R Streiff
- INRA, UMR CBGP, Montpellier, F-34398, France.,INRA, UMR DGIMI, Montpellier, F-34000, France
| | - L Blondin
- CIRAD, UPR B-AMR, Montpellier, F-34398, France
| | - C Piou
- CIRAD, UMR CBGP, Montpellier, F-34398, France
| |
Collapse
|
14
|
Lombal AJ, Wenner TJ, Burridge CP. Assessment of high-resolution melting (HRM) profiles as predictors of microsatellite variation: an example in Providence Petrel (Pterodroma solandri). Genes Genomics 2015. [DOI: 10.1007/s13258-015-0327-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
15
|
Kwong M, Pemberton TJ. Sequence differences at orthologous microsatellites inflate estimates of human-chimpanzee differentiation. BMC Genomics 2014; 15:990. [PMID: 25407736 PMCID: PMC4253012 DOI: 10.1186/1471-2164-15-990] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2014] [Accepted: 10/30/2014] [Indexed: 02/06/2023] Open
Abstract
Background Microsatellites---contiguous arrays of 2–6 base-pair motifs---have formed the cornerstone of population-genetic studies for over two decades. Their genotype data typically takes the form of PCR fragment lengths obtained using locus-specific primer pairs to amplify the genomic region encompassing the microsatellite. Recently, we reported a dataset of 5,795 human and 84 chimpanzee individuals with genotypes at 246 human-derived autosomal microsatellites as a resource to facilitate interspecies comparisons. A major assumption underlying this dataset is that PCR amplicons at orthologous microsatellites are commensurable between species. Results We find this assumption to be frequently incorrect owing to discordance in microsatellite organization and variability, as well as nontrivial length imbalances caused by small species-specific indels in microsatellite flanking sequences. Converting PCR fragment lengths into the repeat numbers they represent at 138 microsatellites whose organization and variability was found to be highly similar in both species, we show that interspecies incommensurability among PCR amplicons can inflate FST and DPS estimates by up to 10.6%. Separate investigations of determinants of microsatellite variability in humans and chimpanzees uncover similar patterns with mean and maximum numbers of repeats, as well as numbers and ranges of distinct alleles, all important factors in predicting heterozygosity. In contrast, across microsatellites, numbers of repeats were significantly smaller in chimpanzees than in humans, while numbers and ranges of distinct alleles were instead larger. Conclusions Our findings have fundamental implications for interspecies comparisons using microsatellites and offer new opportunities for more accurate comparisons of patterns of human and chimpanzee genetic variation in numerous areas of application. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-15-990) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | - Trevor J Pemberton
- Department of Biochemistry and Medical Genetics, University of Manitoba, Winnipeg, Manitoba, Canada.
| |
Collapse
|
16
|
Putman AI, Carbone I. Challenges in analysis and interpretation of microsatellite data for population genetic studies. Ecol Evol 2014; 4:4399-428. [PMID: 25540699 PMCID: PMC4267876 DOI: 10.1002/ece3.1305] [Citation(s) in RCA: 237] [Impact Index Per Article: 23.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2014] [Revised: 10/02/2014] [Accepted: 10/03/2014] [Indexed: 12/14/2022] Open
Abstract
Advancing technologies have facilitated the ever-widening application of genetic markers such as microsatellites into new systems and research questions in biology. In light of the data and experience accumulated from several years of using microsatellites, we present here a literature review that synthesizes the limitations of microsatellites in population genetic studies. With a focus on population structure, we review the widely used fixation (F ST) statistics and Bayesian clustering algorithms and find that the former can be confusing and problematic for microsatellites and that the latter may be confounded by complex population models and lack power in certain cases. Clustering, multivariate analyses, and diversity-based statistics are increasingly being applied to infer population structure, but in some instances these methods lack formalization with microsatellites. Migration-specific methods perform well only under narrow constraints. We also examine the use of microsatellites for inferring effective population size, changes in population size, and deeper demographic history, and find that these methods are untested and/or highly context-dependent. Overall, each method possesses important weaknesses for use with microsatellites, and there are significant constraints on inferences commonly made using microsatellite markers in the areas of population structure, admixture, and effective population size. To ameliorate and better understand these constraints, researchers are encouraged to analyze simulated datasets both prior to and following data collection and analysis, the latter of which is formalized within the approximate Bayesian computation framework. We also examine trends in the literature and show that microsatellites continue to be widely used, especially in non-human subject areas. This review assists with study design and molecular marker selection, facilitates sound interpretation of microsatellite data while fostering respect for their practical limitations, and identifies lessons that could be applied toward emerging markers and high-throughput technologies in population genetics.
Collapse
Affiliation(s)
- Alexander I Putman
- Department of Plant Pathology, North Carolina State University Raleigh, North Carolina, 27695-7616
| | - Ignazio Carbone
- Department of Plant Pathology, North Carolina State University Raleigh, North Carolina, 27695-7616
| |
Collapse
|
17
|
Willems T, Gymrek M, Highnam G, Mittelman D, Erlich Y. The landscape of human STR variation. Genome Res 2014; 24:1894-904. [PMID: 25135957 PMCID: PMC4216929 DOI: 10.1101/gr.177774.114] [Citation(s) in RCA: 176] [Impact Index Per Article: 17.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2014] [Accepted: 08/15/2014] [Indexed: 02/06/2023]
Abstract
Short tandem repeats are among the most polymorphic loci in the human genome. These loci play a role in the etiology of a range of genetic diseases and have been frequently utilized in forensics, population genetics, and genetic genealogy. Despite this plethora of applications, little is known about the variation of most STRs in the human population. Here, we report the largest-scale analysis of human STR variation to date. We collected information for nearly 700,000 STR loci across more than 1000 individuals in Phase 1 of the 1000 Genomes Project. Extensive quality controls show that reliable allelic spectra can be obtained for close to 90% of the STR loci in the genome. We utilize this call set to analyze determinants of STR variation, assess the human reference genome's representation of STR alleles, find STR loci with common loss-of-function alleles, and obtain initial estimates of the linkage disequilibrium between STRs and common SNPs. Overall, these analyses further elucidate the scale of genetic variation beyond classical point mutations.
Collapse
Affiliation(s)
- Thomas Willems
- Whitehead Institute for Biomedical Research, Cambridge, Massachusetts 02142, USA; Computational and Systems Biology Program, MIT, Cambridge, Massachusetts 02139, USA
| | - Melissa Gymrek
- Whitehead Institute for Biomedical Research, Cambridge, Massachusetts 02142, USA; Harvard-MIT Division of Health Sciences and Technology, MIT, Cambridge, Massachusetts 02139, USA; Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA; Department of Molecular Biology and Diabetes Unit, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
| | - Gareth Highnam
- Virginia Bioinformatics Institute and Department of Biological Sciences, Virginia Tech, Blacksburg, Virginia 24061, USA
| | - David Mittelman
- Virginia Bioinformatics Institute and Department of Biological Sciences, Virginia Tech, Blacksburg, Virginia 24061, USA; Gene by Gene, Ltd., Houston, Texas 77008, USA
| | - Yaniv Erlich
- Whitehead Institute for Biomedical Research, Cambridge, Massachusetts 02142, USA;
| |
Collapse
|
18
|
Brittain A, Stroebele E, Erives A. Microsatellite repeat instability fuels evolution of embryonic enhancers in Hawaiian Drosophila. PLoS One 2014; 9:e101177. [PMID: 24978198 PMCID: PMC4076327 DOI: 10.1371/journal.pone.0101177] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2014] [Accepted: 06/03/2014] [Indexed: 12/16/2022] Open
Abstract
For ∼30 million years, the eggs of Hawaiian Drosophila were laid in ever-changing environments caused by high rates of island formation. The associated diversification of the size and developmental rate of the syncytial fly embryo would have altered morphogenic gradients, thus necessitating frequent evolutionary compensation of transcriptional responses. We investigate the consequences these radiations had on transcriptional enhancers patterning the embryo to see whether their pattern of molecular evolution is different from non-Hawaiian species. We identify and functionally assay in transgenic D. melanogaster the Neurogenic Ectoderm Enhancers from two different Hawaiian Drosophila groups: (i) the picture wing group, and (ii) the modified mouthparts group. We find that the binding sites in this set of well-characterized enhancers are footprinted by diverse microsatellite repeat (MSR) sequences. We further show that Hawaiian embryonic enhancers in general are enriched in MSR relative to both Hawaiian non-embryonic enhancers and non-Hawaiian embryonic enhancers. We propose embryonic enhancers are sensitive to Activator spacing because they often serve as assembly scaffolds for the aggregation of transcription factor activator complexes. Furthermore, as most indels are produced by microsatellite repeat slippage, enhancers from Hawaiian Drosophila lineages, which experience dynamic evolutionary pressures, would become grossly enriched in MSR content.
Collapse
Affiliation(s)
- Andrew Brittain
- Department of Biology, University of Iowa, Iowa City, Iowa, United States of America
| | - Elizabeth Stroebele
- Department of Biology, University of Iowa, Iowa City, Iowa, United States of America
| | - Albert Erives
- Department of Biology, University of Iowa, Iowa City, Iowa, United States of America
- * E-mail:
| |
Collapse
|
19
|
Aketarawong N, Isasawin S, Thanaphum S. Evidence of weak genetic structure and recent gene flow between Bactrocera dorsalis s.s. and B. papayae, across Southern Thailand and West Malaysia, supporting a single target pest for SIT applications. BMC Genet 2014; 15:70. [PMID: 24929425 PMCID: PMC4071329 DOI: 10.1186/1471-2156-15-70] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2014] [Accepted: 06/10/2014] [Indexed: 11/16/2022] Open
Abstract
Background Bactrocera dorsalis s.s. (Hendel) and B. papayae Drew & Hancock, are invasive pests belonging to the B. dorsalis complex. Their species status, based on morphology, is sometimes arguable. Consequently, the existence of cryptic species and/or population isolation may decrease the effectiveness of the sterile insect technique (SIT) due to an unknown degree of sexual isolation between released sterile flies and wild counterparts. To evaluate the genetic relationship and current demography in wild populations for guiding the application of area-wide integrated pest management using SIT, seven microsatellite-derived markers from B. dorsalis s.s. and another five from B. papayae were used for surveying intra- and inter-specific variation, population structure, and recent migration among sympatric and allopatric populations of the two morphological forms across Southern Thailand and West Malaysia. Results Basic genetic variations were not significantly different among forms, populations, and geographical areas (P > 0.05). Nonetheless, two sets of microsatellite markers showed significantly different levels of polymorphisms. Genetic differentiation between intra- and inter-specific differences was significant, but low. Seventeen populations revealed three hypothetical genetic clusters (K = 3) regardless of forms and geographical areas. The genetic structure of sympatric populations slightly changed during the different years of collection. Recent gene flow (m ≥ 0.10) was frequently detected whether samples were sympatric or allopatric. Ninety-five of 379 individuals distributed across the given area were designated as recent migrants or of admixed ancestry. As a consequence of substantial migration, no significant correlation between genetic and geographic distances was detected (R2 = 0.056, P = 0.650). Conclusions According to the 12 microsatellite variations, weak population structure and recent gene flow suggest that there is no status for cryptic species between B. dorsalis s.s. and B. papayae forms in Southern Thailand and West Malaysia. Both forms can be treated as a single target pest for the SIT program in an area-wide sense. Additionally, the result of species identification based on molecular data and morphological character are not congruent. The use of independent, multiple approaches in the characterization of the target population may ensure the effectiveness and feasibility of SIT-based control in the target area.
Collapse
Affiliation(s)
- Nidchaya Aketarawong
- Department of Biotechnology, Faculty of Science, Mahidol University, Rama VI Road, Bangkok 10400, Thailand.
| | | | | |
Collapse
|
20
|
Sutton PL. A call to arms: on refining Plasmodium vivax microsatellite marker panels for comparing global diversity. Malar J 2013; 12:447. [PMID: 24330329 PMCID: PMC3878832 DOI: 10.1186/1475-2875-12-447] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2013] [Accepted: 12/06/2013] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Microsatellite (MS) markers have become an important tool for studying the population diversity, evolutionary history and multiplicity of infection (MOI) of malaria parasite infections. MS are typically selected on the basis of being highly polymorphic. However, it is known that the polymorphic potential (mutability) of each marker can vary as much as two orders of magnitude, which radically changes how diversity is represented in the genome from one marker to the next. Over the past decade, approximately 240 Plasmodium vivax MS have been published, comprising nine major panels of markers. Inconsistent usage of each panel has resulted in a surfeit of descriptive genetic diversity data that are largely incomparable between populations. The objective of this study was to statistically evaluate the quality of individual MS markers in order to validate a refined panel of markers that will provide a balanced picture of P. vivax population diversity. METHODS All previously published data, including genetic diversity indices, MS parameters, and population parameters, were assembled from 18 different global studies into a flat file to facilitate statistical analysis and modelling using JMP® Genomics 6.0 (SAS Institute Inc, Cary, NC, USA). Statistical modeling was employed to down-select markers with extreme variation among the mean number of alleles, expected heterozygosity, maximum repeat length and/or chromosomal location of the repeat. Individual MS were analysed by step-down whole model linear regression and standard least squares fit models, both stratified by annual parasite incidence to identify MS markers with values significantly different from the mean. RESULTS Of the 42 MS under evaluation in this study, 18 (nine high priority) were identified as ideal candidates for measuring population diversity between global regions, while five (two high priority) additional markers were identified as candidates for MOI studies. CONCLUSIONS MS diversity was found to be a function of endemicity and motif structure. Evaluation of individual MS permitted the assembly of a refined panel of markers that can be reliably utilized in the field to compare population structures between global regions.
Collapse
Affiliation(s)
- Patrick L Sutton
- Center for Genomics and Systems Biology, Department of Biology, New York University, 12 Waverly Place, New York, NY 10003, USA.
| |
Collapse
|
21
|
Abstract
Microsatellites are DNA sequences of tandem repeats of one to six nucleotides, which are highly polymorphic, and thus the molecular markers of choice in many kinship, population genetic, and conservation studies. There have been significant technical improvements since the early methods for microsatellite isolation were developed, and today the most common procedures take advantage of the hybrid capture methods of enriched-targeted microsatellite DNA. Furthermore, recent advents in sequencing technologies (i.e., next-generation sequencing, NGS) have fostered the mining of microsatellite markers in non-model organisms, affording a cost-effective way of obtaining a large amount of sequence data potentially useful for loci characterization. The rapid improvements of NGS platforms together with the increase in available microsatellite information open new avenues to the understanding of the evolutionary forces that shape genetic structuring in wild populations. Here, we provide detailed methodological procedures for microsatellite isolation based on the screening of GT microsatellite-enriched libraries, either by cloning and Sanger sequencing of positive clones or by direct NGS. Guides for designing new species-specific primers and basic genotyping are also given.
Collapse
Affiliation(s)
- Elena G Gonzalez
- Departamento de Biodiversidad y Biología Evolutiva, Museo Nacional de Ciencias Naturales, MNCN-CSIC, Madrid, Spain
| | | |
Collapse
|
22
|
Xiong LW, Wang Q, Qiu GF. Large-scale isolation of microsatellites from Chinese Mitten Crab Eriocheir sinensis via a Solexa Genomic Survey. Int J Mol Sci 2012. [PMID: 23208373 PMCID: PMC3546693 DOI: 10.3390/ijms131216333] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open
Abstract
Microsatellites are simple sequence repeats with a high degree of polymorphism in the genome; they are used as DNA markers in many molecular genetic studies. Using traditional methods such as the magnetic beads enrichment method, only a few microsatellite markers have been isolated from the Chinese mitten crab Eriocheir sinensis, as the crab genome sequence information is unavailable. Here, we have identified a large number of microsatellites from the Chinese mitten crab by taking advantage of Solexa genomic surveying. A total of 141,737 SSR (simple sequence repeats) motifs were identified via analysis of 883 Mb of the crab genomic DNA information, including mono-, di-, tri-, tetra-, penta- and hexa-nucleotide repeat motifs. The number of di-nucleotide repeat motifs was 82,979, making this the most abundant type of repeat motif (58.54%); the second most abundant were the tri-nucleotide repeats (42,657, 30.11%). Among di-nucleotide repeats, the most frequent repeats were AC motifs, accounting for 67.55% of the total number. AGG motifs were the most frequent (59.32%) of the tri-nucleotide motifs. A total of 15,125 microsatellite loci had a flanking sequence suitable for setting the primer of a polymerase chain reaction (PCR). To verify the identified SSRs, a subset of 100 primer pairs was randomly selected for PCR. Eighty two primer sets (82%) produced strong PCR products matching expected sizes, and 78% were polymorphic. In an analysis of 30 wild individuals from the Yangtze River with 20 primer sets, the number of alleles per locus ranged from 2–14 and the mean allelic richness was 7.4. No linkage disequilibrium was found between any pair of loci, indicating that the markers were independent. The Hardy-Weinberg equilibrium test showed significant deviation in four of the 20 microsatellite loci after sequential Bonferroni corrections. This method is cost- and time-effective in comparison to traditional approaches for the isolation of microsatellites.
Collapse
Affiliation(s)
- Liang-Wei Xiong
- Key laboratory of Freshwater Aquatic Genetic Resources Certificated by Ministry of Agriculture, College of Life Science, Shanghai Ocean University, 999 Hucheng Huan Road, Shanghai 201306, China; E-Mail:
- Jiangsu Animal Husbandry & Veterinary College, Jiangsu 225300, China
| | - Qun Wang
- College of Life Science, East China Normal University, Shanghai 200062, China; E-Mail:
| | - Gao-Feng Qiu
- Key laboratory of Freshwater Aquatic Genetic Resources Certificated by Ministry of Agriculture, College of Life Science, Shanghai Ocean University, 999 Hucheng Huan Road, Shanghai 201306, China; E-Mail:
- Author to whom correspondence should be addressed; E-Mail: ; Tel./Fax: +86-21-6190-0436
| |
Collapse
|
23
|
Chapuis MP, Streiff R, Sword GA. Long microsatellites and unusually high levels of genetic diversity in the Orthoptera. INSECT MOLECULAR BIOLOGY 2012; 21:181-186. [PMID: 22211932 DOI: 10.1111/j.1365-2583.2011.01124.x] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
Much remains to be learned about the mutational processes governing the evolution of microsatellite repeat regions and the associated levels of genetic diversity observed at microsatellite markers across populations or species. An extensive survey of microsatellite variation in 210 insect species from six major orders revealed that within Orthopterans, which are characterized by giant genomes, levels of genetic diversity were ~20% higher and microsatellite repeat arrays were longer than in any other group. Because of the mutation dependence on repeat length, this result suggests a higher microsatellite loci mutation rate in the Orthoptera. We deem it plausible that differences among insect orders, either in mismatch repair systems or in abundance of transposable element-derived microsatellites, can shape the size distribution of both genomes and microsatellite repeat regions. Our findings emphasise that observed levels of genetic diversity can greatly vary across species (orders at least) because of molecular differences in the mechanisms that determine microsatellite size, and are therefore critical to conservation and population genetics studies, where microsatellite repeat variability is primarily interpreted in terms of population demography and history.
Collapse
|
24
|
Comparative use of InDel and SSR markers in deciphering the interspecific structure of cultivated citrus genetic diversity: a perspective for genetic association studies. Mol Genet Genomics 2011; 287:77-94. [PMID: 22160318 DOI: 10.1007/s00438-011-0658-4] [Citation(s) in RCA: 71] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2011] [Accepted: 11/03/2011] [Indexed: 10/14/2022]
Abstract
Genetic stratification associated with domestication history is a key parameter for estimating the pertinence of genetic association study within a gene pool. Previous molecular and phenotypic studies have shown that most of the diversity of cultivated citrus results from recombination between three main species: C. medica (citron), C. reticulata (mandarin) and C. maxima (pummelo). However, the precise contribution of each of these basic species to the genomes of secondary cultivated species, such as C. sinensis (sweet orange), C. limon (lemon), C. aurantium (sour orange), C. paradisi (grapefruit) and recent hybrids is unknown. Our study focused on: (1) the development of insertion-deletion (InDel) markers and their comparison with SSR markers for use in genetic diversity and phylogenetic studies; (2) the analysis of the contributions of basic taxa to the genomes of secondary species and modern cultivars and (3) the description of the organisation of the Citrus gene pool, to evaluate how genetic association studies should be done at the cultivated Citrus gene pool level. InDel markers appear to be better phylogenetic markers for tracing the contributions of the three ancestral species, whereas SSR markers are more useful for intraspecific diversity analysis. Most of the genetic organisation of the Citrus gene pool is related to the differentiation between C. reticulata, C. maxima and C. medica. High and generalised LD was observed, probably due to the initial differentiation between the basic species and a limited number of interspecific recombinations. This structure precludes association genetic studies at the genus level without developing additional recombinant populations from interspecific hybrids. Association genetic studies should also be affordable at intraspecific level in a less structured pool such as C. reticulata.
Collapse
|
25
|
Gardner MG, Fitch AJ, Bertozzi T, Lowe AJ. Rise of the machines--recommendations for ecologists when using next generation sequencing for microsatellite development. Mol Ecol Resour 2011; 11:1093-101. [PMID: 21679314 DOI: 10.1111/j.1755-0998.2011.03037.x] [Citation(s) in RCA: 180] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Next generation sequencing is revolutionizing molecular ecology by simplifying the development of molecular genetic markers, including microsatellites. Here, we summarize the results of the large-scale development of microsatellites for 54 nonmodel species using next generation sequencing and show that there are clear differences amongst plants, invertebrates and vertebrates for the number and proportion of motif types recovered that are able to be utilized as markers. We highlight that the heterogeneity within each group is very large. Despite this variation, we provide an indication of what number of sequences and consequent proportion of a 454 run are required for the development of 40 designable, unique microsatellite loci for a typical molecular ecological study. Finally, to address the challenges of choosing loci from the vast array of microsatellite loci typically available from partial genome runs (average for this study, 2341 loci), we provide a microsatellite development flowchart as a procedural guide for application once the results of a partial genome run are obtained.
Collapse
Affiliation(s)
- Michael G Gardner
- School of Biological Sciences, Flinders University, GPO Box 2100, Adelaide, SA 5001, Australia.
| | | | | | | |
Collapse
|
26
|
Sung W, Tucker A, Bergeron RD, Lynch M, Thomas WK. Simple sequence repeat variation in the Daphnia pulex genome. BMC Genomics 2010; 11:691. [PMID: 21129182 PMCID: PMC3017760 DOI: 10.1186/1471-2164-11-691] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2010] [Accepted: 12/03/2010] [Indexed: 11/24/2022] Open
Abstract
Background Simple sequence repeats (SSRs) are highly variable features of all genomes. Their rapid evolution makes them useful for tracing the evolutionary history of populations and investigating patterns of selection and mutation across gnomes. The recently sequenced Daphnia pulex genome provides us with a valuable data set to study the mode and tempo of SSR evolution, without the inherent biases that accompany marker selection. Results Here we catalogue SSR loci in the Daphnia pulex genome with repeated motif sizes of 1-100 nucleotides with a minimum of 3 perfect repeats. We then used whole genome shotgun reads to determine the average heterozygosity of each SSR type and the relationship that it has to repeat number, motif size, motif sequence, and distribution of SSR loci. We find that SSR heterozygosity is motif specific, and positively correlated with repeat number as well as motif size. For non-repeat unit polymorphisms, we identify a motif-dependent end-nucleotide polymorphism bias that may contribute to the patterns of abundance for specific homopolymers, dimers, and trimers. Our observations confirm the high frequency of multiple unit variation (multistep) at large microsatellite loci, and further show that the occurrence of multiple unit variation is dependent on both repeat number and motif size. Using the Daphnia pulex genetic map, we show a positive correlation between dimer and trimer frequency and recombination. Conclusions This genome-wide analysis of SSR variation in Daphnia pulex indicates that several aspects of SSR variation are motif dependent and suggests that a combination of unit length variation and end repeat biased base substitution contribute to the unique spectrum of SSR repeat loci.
Collapse
Affiliation(s)
- Way Sung
- Hubbard Center for Genome Studies, University of New Hampshire, Durham, NH 03824, USA.
| | | | | | | | | |
Collapse
|
27
|
Nunes MDS, Wengel POT, Kreissl M, Schlötterer C. Multiple hybridization events between Drosophila simulans and Drosophila mauritiana are supported by mtDNA introgression. Mol Ecol 2010; 19:4695-707. [PMID: 20958812 PMCID: PMC3035818 DOI: 10.1111/j.1365-294x.2010.04838.x] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2010] [Revised: 06/17/2010] [Accepted: 06/19/2010] [Indexed: 11/30/2022]
Abstract
The study of speciation has advanced considerably in the last decades because of the increased application of molecular tools. In particular, the quantification of gene flow between recently diverged species could be addressed. Drosophila simulans and Drosophila mauritiana diverged, probably allopatrically, from a common ancestor approximately 250,000 years ago. However, these species share one mitochondrial DNA (mtDNA) haplotype indicative of a recent episode of introgression. To study the extent of gene flow between these species, we took advantage of a large sample of D. mauritiana and employed a range of different markers, i.e. nuclear and mitochondrial sequences, and microsatellites. This allowed us to detect two new mtDNA haplotypes (MAU3 and MAU4). These haplotypes diverged quite recently from haplotypes of the siII group present in cosmopolitan populations of D. simulans. The mean divergence time of the most diverged haplotype (MAU4) is approximately 127,000 years, which is more than 100,000 years before the assumed speciation time. Interestingly, we also found some evidence for gene flow at the nuclear level because an excess of putatively neutral loci shows significantly reduced differentiation between D. simulans and D. mauritiana. Our results suggest that these species are exchanging genes more frequently than previously thought.
Collapse
Affiliation(s)
- Maria D S Nunes
- Institut für Populationsgenetik, Veterinärmedizinische Universität Wien, Vienna, Austria
| | | | | | | |
Collapse
|
28
|
An experimental evaluation with Drosophila melanogaster of a novel dynamic system for the management of subdivided populations in conservation programs. Heredity (Edinb) 2010; 106:765-74. [PMID: 20823908 DOI: 10.1038/hdy.2010.117] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
Abstract
A dynamic method (DM) recently proposed for the management of captive subdivided populations was evaluated using the pilot species Drosophila melanogaster. By accounting for the particular genetic population structure, the DM determines the optimal mating pairs, their contributions to progeny and the migration pattern that minimize the overall co-ancestry in the population with a control of inbreeding levels. After a pre-management period such that one of the four subpopulations had higher inbreeding and differentiation than the others, three management methods were compared for 10 generations over three replicates: (1) isolated subpopulations (IS), (2) one-migrant-per-generation rule (OMPG), (3) DM aimed to produce the same or lower inbreeding coefficient than OMPG. The DM produced the lowest co-ancestry and equal or lower inbreeding than the OMPG method throughout the experiment. The initially lower fitness and lower variation for nine microsatellite loci of the highly inbred subpopulation were restored more quickly with the DM than with the OMPG method. We provide, therefore, an empirical illustration of the usefulness of the DM as a conservation protocol for captive subdivided populations when pedigree information is available (or can be deduced) and manipulation of breeding pairs is possible.
Collapse
|
29
|
Hadziabdic D, Fitzpatrick BM, Wang X, Wadl PA, Rinehart TA, Ownley BH, Windham MT, Trigiano RN. Analysis of genetic diversity in flowering dogwood natural stands using microsatellites: the effects of dogwood anthracnose. Genetica 2010; 138:1047-57. [PMID: 20820882 DOI: 10.1007/s10709-010-9490-8] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2009] [Accepted: 08/21/2010] [Indexed: 11/25/2022]
Abstract
Flowering dogwood (Cornus florida L.) populations recently have experienced severe declines caused by dogwood anthracnose. Mortality has ranged from 48 to 98%, raising the concern that genetic diversity has been reduced significantly. Microsatellite data were used to evaluate the level and distribution of genetic variation throughout much of the native range of the tree. Genetic variation in areas affected by anthracnose was as high as or higher than areas without die-offs. We found evidence of four widespread, spatially contiguous genetic clusters. However, there was little relationship between geographic distance and genetic difference. These observations suggest that high dispersal rates and large effective population sizes have so far prevented rapid loss of genetic diversity. The effects of anthracnose on demography and community structure are likely to be far more consequential than short-term genetic effects.
Collapse
Affiliation(s)
- D Hadziabdic
- Department of Entomology and Plant Pathology, The University of Tennessee, 2431 Joe Johnson Dr, Knoxville, TN, 37996-4560, USA
| | | | | | | | | | | | | | | |
Collapse
|
30
|
Pemberton TJ, Sandefur CI, Jakobsson M, Rosenberg NA. Sequence determinants of human microsatellite variability. BMC Genomics 2009; 10:612. [PMID: 20015383 PMCID: PMC2806349 DOI: 10.1186/1471-2164-10-612] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2009] [Accepted: 12/16/2009] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Microsatellite loci are frequently used in genomic studies of DNA sequence repeats and in population studies of genetic variability. To investigate the effect of sequence properties of microsatellites on their level of variability we have analyzed genotypes at 627 microsatellite loci in 1,048 worldwide individuals from the HGDP-CEPH cell line panel together with the DNA sequences of these microsatellites in the human RefSeq database. RESULTS Calibrating PCR fragment lengths in individual genotypes by using the RefSeq sequence enabled us to infer repeat number in the HGDP-CEPH dataset and to calculate the mean number of repeats (as opposed to the mean PCR fragment length), under the assumption that differences in PCR fragment length reflect differences in the numbers of repeats in the embedded repeat sequences. We find the mean and maximum numbers of repeats across individuals to be positively correlated with heterozygosity. The size and composition of the repeat unit of a microsatellite are also important factors in predicting heterozygosity, with tetra-nucleotide repeat units high in G/C content leading to higher heterozygosity. Finally, we find that microsatellites containing more separate sets of repeated motifs generally have higher heterozygosity. CONCLUSIONS These results suggest that sequence properties of microsatellites have a significant impact in determining the features of human microsatellite variability.
Collapse
Affiliation(s)
- Trevor J Pemberton
- Department of Human Genetics, University of Michigan, Ann Arbor, Michigan 48109, USA.
| | | | | | | |
Collapse
|
31
|
|
32
|
Marriage TN, Hudman S, Mort ME, Orive ME, Shaw RG, Kelly JK. Direct estimation of the mutation rate at dinucleotide microsatellite loci in Arabidopsis thaliana (Brassicaceae). Heredity (Edinb) 2009; 103:310-7. [PMID: 19513093 PMCID: PMC2749907 DOI: 10.1038/hdy.2009.67] [Citation(s) in RCA: 60] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022] Open
Abstract
The mutation rate at 54 perfect (uninterrupted) dinucleotide microsatellite loci is estimated by direct genotyping of 96 Arabidopsis thaliana mutation accumulation lines. The estimated rate differs significantly among motif types with the highest rate for AT repeats (2.03 x 10(-3) per allele per generation), intermediate for CT (3.31 x 10(-4)), and lowest for CA (4.96 x 10(-5)). The average mutation rate per generation for this sample of loci is 8.87 x 10(-4) (s.e.=2.57 x 10(-4)). There is a strong effect of initial repeat number, particularly for AT repeats, with mutation rate increasing with the length of the microsatellite locus in the progenitor line. Controlling for motif and initial repeat number, chromosome 4 exhibited an elevated mutation rate relative to other chromosomes. The great majority of mutations were gains or losses of a single repeat. Generally, the data are consistent with the stepwise mutation model of microsatellite evolution. Several lines exhibited multiple step changes from the progenitor sequence, but it is unclear whether these are multi-step mutations or multiple single-step mutations. A survey of dinucleotide repeats across the entire Arabidopsis genome indicates that AT repeats are most abundant, followed by CT, and CA.
Collapse
Affiliation(s)
- Tara N. Marriage
- Department of Ecology and Evolutionary Biology University of Kansas, Lawrence, KS
| | - Stephen Hudman
- Department of Ecology and Evolutionary Biology University of Kansas, Lawrence, KS
- Department of Ecology, Evolution, and Behavior, University of Minnesota
| | - Mark E. Mort
- Department of Ecology and Evolutionary Biology University of Kansas, Lawrence, KS
| | - Maria E. Orive
- Department of Ecology and Evolutionary Biology University of Kansas, Lawrence, KS
| | - Ruth G. Shaw
- Department of Ecology, Evolution, and Behavior, University of Minnesota
| | - John K. Kelly
- Department of Ecology and Evolutionary Biology University of Kansas, Lawrence, KS
| |
Collapse
|
33
|
Tang Z, Fu S, Ren Z, Zou Y. Rapid Evolution of Simple Sequence Repeat Induced by Allopolyploidization. J Mol Evol 2009; 69:217-28. [DOI: 10.1007/s00239-009-9261-2] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2008] [Revised: 05/21/2009] [Accepted: 06/29/2009] [Indexed: 01/24/2023]
|
34
|
Li CY, Liu L, Yang J, Li JB, Su Y, Zhang Y, Wang YY, Zhu YY. Genome-wide analysis of microsatellite sequence in seven filamentous fungi. Interdiscip Sci 2009; 1:141-50. [PMID: 20640828 DOI: 10.1007/s12539-009-0014-5] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2008] [Revised: 09/17/2008] [Accepted: 09/17/2008] [Indexed: 10/20/2022]
Abstract
Abundance of microsatellites with repeated unit lengths of 1-6 base pairs in seven fungi: Aspergillus nidulans, Coprinus cinereus, Cryptococcus neoformans (serotype A), Fusarium graminearum, Magnaporthe grisea, Neurospora crassa and Ustilago maydis were investigated on genomic scale. The results showed that each species has its specific profile for different types and different motifs of SSR loci. Ascomycetes fungi M. grisea, N. crassa and basidiomycete fungus U. maydis adopt much more microsatellites than other fungi examined. Total amount of 15,751, 14,788 and 6,854 SSR loci were observed respectively, average density is 406, 389 and 347 per Mbp sequence; overall length of SSR sequence was 0.82%, 0.95% and 0.79% of genomic sequence respectively. While ascomycetes fungus F. graminearum and A. nidulans contains the least SSRs in the genomic DNA, only 4,679 and 4,837 tracts were observed in 36 Mb and 30 Mb genomic sequence respectively. Microsatellite repeats in protein coding regions are investigated in Aspergillus nidulans, Magnaporthe grisea, and Neurospora crassa also, the results show that the difference of different types and motifs among three fungi is very little than that in whole genomic sequence. For trinucleotide repeats, overrepresent (comparing to the total base pair of protein coding region) of AGC, GGC, AGG, ACG and ACC was observed in coding region, frequencies of AAC and AAG were not difference between coding and non-coding region, AAT, AGT and ATG were underrepresent in coding region excepted for A. nidulans, in which ATG was overrepresentative.
Collapse
Affiliation(s)
- Cheng-Yun Li
- Key Laboratory for Agricultural Biodiversity and Pest Management of China Education Ministry, Yunnan Agricultural University, Heilongtan, Kunming, Yunnan Province, 650201, PR China.
| | | | | | | | | | | | | | | |
Collapse
|
35
|
Fast, cost-effective development of species-specific microsatellite markers by genomic sequencing. Biotechniques 2009; 46:185-92. [PMID: 19317661 DOI: 10.2144/000113084] [Citation(s) in RCA: 204] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Microsatellites are the genetic markers of choice for many population genetic studies, but must be isolated de novo using recombinant approaches where prior genetic data are lacking. Here we utilized high-throughput genomic sequencing technology to produce millions of base pairs of short fragment reads, which were screened with bioinformatics toolsets to identify primers that amplify polymorphic microsatellite loci. Using this approach we isolated 13 polymorphic microsatellites for the blue duck (Hymenolaimus malacorhynchos), a species for which limited genetic data were available. Our genomic approach eliminates recombinant genetic steps, significantly reducing the time and cost requirements of marker development compared with traditional approaches. While this application of genomic sequencing may seem obvious to many, this study is, to the best of our knowledge, the first attempt to describe the use of genomic sequencing for the development of microsatellite markers in a non-model organism or indeed any organism.
Collapse
|
36
|
Yin TM, Zhang XY, Gunter LE, Li SX, Wullschleger SD, Huang MR, Tuskan GA. Microsatellite primer resource for Populus developed from the mapped sequence scaffolds of the Nisqually-1 genome. THE NEW PHYTOLOGIST 2009; 181:498-503. [PMID: 19121044 DOI: 10.1111/j.1469-8137.2008.02663.x] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]
Abstract
In this study, 148 428 simple sequence repeat (SSR) primer pairs were designed from the unambiguously mapped sequence scaffolds of the Nisqually-1 genome. The physical position of the priming sites were identified along each of the 19 Populus chromosomes, and it was specified whether the priming sequences belong to intronic, intergenic, exonic or UTR regions. A subset of 150 SSR loci were amplified and a high amplification success rate (72%) was obtained in P. tremuloides, which belongs to a divergent subgenus of Populus relative to Nisqually-1. PCR reactions showed that the amplification success rate of exonic primer pairs was much higher than that of the intronic/intergenic primer pairs. Applying ANOVA and regression analyses to the flanking sequences of microsatellites, the repeat lengths, the GC contents of the repeats, the repeat motif numbers, the repeat motif length and the base composition of the repeat motif, it was determined that only the base composition of the repeat motif and the repeat motif length significantly affect the microsatellite variability in P. tremuloides samples. The SSR primer resource developed in this study provides a database for selecting highly transferable SSR markers with known physical position in the Populus genome and provides a comprehensive genetic tool to extend the genome sequence of Nisqually-1 to genetic studies in different Populus species.
Collapse
Affiliation(s)
- T M Yin
- Environmental Sciences Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831-6422, USA;The Key Lab of Forest Genetics and Gene Engineering, Nanjing Forestry University, Nanjing, China
| | - X Y Zhang
- Environmental Sciences Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831-6422, USA;The Key Lab of Forest Genetics and Gene Engineering, Nanjing Forestry University, Nanjing, China
| | - L E Gunter
- Environmental Sciences Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831-6422, USA;The Key Lab of Forest Genetics and Gene Engineering, Nanjing Forestry University, Nanjing, China
| | - S X Li
- Environmental Sciences Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831-6422, USA;The Key Lab of Forest Genetics and Gene Engineering, Nanjing Forestry University, Nanjing, China
| | - S D Wullschleger
- Environmental Sciences Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831-6422, USA;The Key Lab of Forest Genetics and Gene Engineering, Nanjing Forestry University, Nanjing, China
| | - M R Huang
- Environmental Sciences Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831-6422, USA;The Key Lab of Forest Genetics and Gene Engineering, Nanjing Forestry University, Nanjing, China
| | - G A Tuskan
- Environmental Sciences Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831-6422, USA;The Key Lab of Forest Genetics and Gene Engineering, Nanjing Forestry University, Nanjing, China
| |
Collapse
|
37
|
BOLNICK DANIELI, CALDERA ERICJ, MATTHEWS BLAKE. Evidence for asymmetric migration load in a pair of ecologically divergent stickleback populations. Biol J Linn Soc Lond 2008. [DOI: 10.1111/j.1095-8312.2008.00978.x] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
38
|
Xu Z, Gutierrez L, Hitchens M, Scherer S, Sater AK, Wells DE. Distribution of polymorphic and non-polymorphic microsatellite repeats in Xenopus tropicalis. Bioinform Biol Insights 2008; 2:157-69. [PMID: 19812773 PMCID: PMC2735965 DOI: 10.4137/bbi.s561] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
The results of our bioinformatics analysis have found over 91,000 di-, tri-, and tetranucleotide microsatellites in our survey of 25% of the X. tropicalis genome, suggesting there may be over 360,000 within the entire genome. Within the X. tropicalis genome, dinucleotide (78.7%) microsatellites vastly out numbered tri- and tetranucleotide microsatellites. Similarly, AT-rich repeats are overwhelmingly dominant. The four AT-only motifs (AT, AAT, AAAT, and AATT) account for 51,858 out of 91,304 microsatellites found. Individually, AT microsatellites were the most common repeat found, representing over half of all di-, tri-, and tetranucleotide microsatellites. This contrasts with data from other studies, which show that AC is the most frequent microsatellite in vertebrate genomes (Toth et al. 2000). In addition, we have determined the rate of polymorphism for 5,128 non-redundant microsatellites, embedded in unique sequences. Interestingly, this subgroup of microsatellites was determined to have significantly longer repeats than genomic microsatellites as a whole. In addition, microsatellite loci with tandem repeat lengths more than 30 bp exhibited a significantly higher degree of polymorphism than other loci. Pairwise comparisons show that tetranucleotide microsatellites have the highest polymorphic rates. In addition, AAT and ATC showed significant higher polymorphism than other trinucleotide microsatellites, while AGAT and AAAG were significantly more polymorphic than other tetranucleotide microsatellites.
Collapse
Affiliation(s)
- Zhenkang Xu
- Department of Biology and Biochemistry, University of Houston, Houston, TX 77204, USA
| | | | | | | | | | | |
Collapse
|
39
|
Li S, Yin T. Map and analysis of microsatellites in the genome of Populus: the first sequenced perennial plant. ACTA ACUST UNITED AC 2008; 50:690-9. [PMID: 17879069 DOI: 10.1007/s11427-007-0073-6] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2006] [Accepted: 03/24/2007] [Indexed: 11/28/2022]
Abstract
We mapped and analyzed the microsatellites throughout 284295605 base pairs of the unambiguously assembled sequence scaffolds along 19 chromosomes of the haploid poplar genome. Totally, we found 150985 SSRs with repeat unit lengths between 2 and 5 bp. The established microsatellite physical map demonstrated that SSRs were distributed relatively evenly across the genome of Populus. On average, These SSRs occurred every 1883 bp within the poplar genome and the SSR densities in intergenic regions, introns, exons and UTRs were 85.4%, 10.7%, 2.7% and 1.2%, respectively. We took di-, tri-, tetra-and pentamers as the four classes of repeat units and found that the density of each class of SSRs decreased with the repeat unit lengths except for the tetranucleotide repeats. It was noteworthy that the length diversification of microsatellite sequences was negatively correlated with their repeat unit length and the SSRs with shorter repeat units gained repeats faster than the SSRs with longer repeat units. We also found that the GC content of poplar sequence significantly correlated with densities of SSRs with uneven repeat unit lengths (tri-and penta-), but had no significant correlation with densities of SSRs with even repeat unit lengths (di-and tetra-). In poplar genome, there were evidences that the occurrence of different microsatellites was under selection and the GC content in SSR sequences was found to significantly relate to the functional importance of microsatellites.
Collapse
Affiliation(s)
- ShuXian Li
- College of Environment and Forest Resources, Nanjing Forestry University, Nanjing 210037, China
| | | |
Collapse
|
40
|
Kelkar YD, Tyekucheva S, Chiaromonte F, Makova KD. The genome-wide determinants of human and chimpanzee microsatellite evolution. Genome Res 2007; 18:30-8. [PMID: 18032720 DOI: 10.1101/gr.7113408] [Citation(s) in RCA: 176] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
Mutation rates of microsatellites vary greatly among loci. The causes of this heterogeneity remain largely enigmatic yet are crucial for understanding numerous human neurological diseases and genetic instability in cancer. In this first genome-wide study, the relative contributions of intrinsic features and regional genomic factors to the variation in mutability among orthologous human-chimpanzee microsatellites are investigated with resampling and regression techniques. As a result, we uncover the intricacies of microsatellite mutagenesis as follows. First, intrinsic features (repeat number, length, and motif size), which all influence the probability and rate of slippage, are the strongest predictors of mutability. Second, mutability increases nonuniformly with length, suggesting that processes additional to slippage, such as faulty repair, contribute to mutations. Third, mutability varies among microsatellites with different motif composition likely due to dissimilarities in secondary DNA structure formed by their slippage intermediates. Fourth, mutability of mononucleotide microsatellites is impacted by their location on sex chromosomes vs. autosomes and inside vs. outside of Alu repeats, the former confirming the importance of replication and the latter suggesting a role for gene conversion. Fifth, transcription status and location in a particular isochore do not influence microsatellite mutability. Sixth, compared with intrinsic features, regional genomic factors have only minor effects. Finally, our regression models explain approximately 90% of variation in microsatellite mutability and can generate useful predictions for the studies of human diseases, forensics, and conservation genetics.
Collapse
Affiliation(s)
- Yogeshwar D Kelkar
- Department of Biology, Penn State University, University Park, Pennsylvania 16802, USA
| | | | | | | |
Collapse
|
41
|
Jensen JD, Bauer DuMont VL, Ashmore AB, Gutierrez A, Aquadro CF. Patterns of sequence variability and divergence at the diminutive gene region of Drosophila melanogaster: complex patterns suggest an ancestral selective sweep. Genetics 2007; 177:1071-85. [PMID: 17720938 PMCID: PMC2034614 DOI: 10.1534/genetics.106.069468] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2006] [Accepted: 08/19/2007] [Indexed: 11/18/2022] Open
Abstract
To identify putatively swept regions of the Drosophila melanogaster genome, we performed a microsatellite screen spanning a 260-kb region of the X chromosome in populations from Zimbabwe, Ecuador, the United States, and China. Among the regions identified by this screen as showing a complex pattern of reduced heterozygosity and a skewed frequency spectrum was the gene diminutive (dm). To investigate the microsatellite findings, nucleotide sequence polymorphism data were generated in populations from both China and Zimbabwe spanning a 25-kb region and encompassing dm. Analysis of the sequence data reveals strongly reduced nucleotide variation across the entire gene region in both the non-African and the African populations, an extended haplotype pattern, and structured linkage disequilibrium, as well as a rejection of neutrality in favor of selection using a composite likelihood-ratio test. Additionally, unusual patterns of synonymous site evolution were observed at the second exon of this locus. On the basis of simulation studies as well as recently proposed methods for distinguishing between selection and nonequilibrium demography, we find that this "footprint" is best explained by a selective sweep in the ancestral population, the signal of which has been somewhat blurred via founder effects in the non-African samples.
Collapse
Affiliation(s)
- Jeffrey D Jensen
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York 14853, USA.
| | | | | | | | | |
Collapse
|
42
|
Buschiazzo E, Gemmell NJ. The rise, fall and renaissance of microsatellites in eukaryotic genomes. Bioessays 2006; 28:1040-50. [PMID: 16998838 DOI: 10.1002/bies.20470] [Citation(s) in RCA: 190] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Microsatellites are among the most versatile of genetic markers, being used in an impressive number of biological applications. However, the evolutionary dynamics of these markers remain a source of contention. Almost 20 years after the discovery of these ubiquitous simple sequences, new genomic data are clarifying our understanding of the structure, distribution and variability of microsatellites in genomes, especially for the eukaryotes. While these new data provide a great deal of descriptive information about the nature and abundance of microsatellite sequences within eukaryotic genomes, there have been few attempts to synthesise this information to develop a global concept of evolution. This review provides an up-to-date account of the mutational processes, biases and constraints believed to be involved in the evolution of microsatellites, particularly with respect to the creation and degeneration of microsatellites, which we assert may be broadly viewed as a life cycle. In addition, we identify areas of contention that require further research and propose some possible directions for future investigation.
Collapse
Affiliation(s)
- Emmanuel Buschiazzo
- School of Biological Sciences, University of Canterbury, Christchurch, New Zealand.
| | | |
Collapse
|
43
|
Abstract
Divergence and speciation may occur by various means, depending on the particular history, selective environments, and genetic composition of populations. In Drosophila mojavensis, a good model of incipient speciation, understanding the population genetic structure within this group facilitates our ability to understand the context in which reproductive isolation among populations is developing. Here we report the genetic structure and relationships of D. mojavensis populations at nuclear loci. We surveyed 29 populations throughout the distribution of D. mojavensis for four microsatellite loci to differentiation among populations of this species. These loci reveal four distinct geographical regions of D. mojavensis populations in the south-western United States and north-western Mexico--(i) Baja California peninsula (Baja), (ii) Sonora, Mexico-southern Arizona, United States (Sonora), (iii) Mojave Desert and Grand Canyon (Mojave), and (iv) Santa Catalina Island (Catalina). While all regions show strong isolation, Mojave and Catalina are highly diverged from other regions. Within any region, populations are largely homogenous over broad geographical distances. Based on the population structure, we find clear geographical barriers to gene flow appear to have a strong effect in isolating populations across regions for this species.
Collapse
Affiliation(s)
- C L Ross
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721, USA
| | | |
Collapse
|
44
|
Wiehe T, Nolte V, Zivkovic D, Schlötterer C. Identification of selective sweeps using a dynamically adjusted number of linked microsatellites. Genetics 2006; 175:207-18. [PMID: 17057237 PMCID: PMC1775015 DOI: 10.1534/genetics.106.063677] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
There is currently large interest in distinguishing the signatures of genetic variation produced by demographic events from those produced by natural selection. We propose a simple multilocus statistical test to identify candidate sites of selective sweeps with high power. The test is based on the variability profile measured in an array of linked microsatellites. We also show that the analysis of flanking markers drastically reduces the number of false positives among the candidates that are identified in a genomewide survey of unlinked loci and find that this property is maintained in many population-bottleneck scenarios. However, for a certain range of intermediately severe population bottlenecks we find genomic signatures that are very similar to those produced by a selective sweep. While in these worst-case scenarios the power of the proposed test remains high, the false-positive rate reaches values close to 50%. Hence, selective sweeps may be hard to identify even if multiple linked loci are analyzed. Nevertheless, the integration of information from multiple linked loci always leads to a considerable reduction of the false-positive rate compared to a genome scan of unlinked loci. We discuss the application of this test to experimental data from Drosophila melanogaster.
Collapse
Affiliation(s)
- Thomas Wiehe
- Institut für Genetik, Universität zu Köln, Köln, Germany
| | | | | | | |
Collapse
|
45
|
Weetman D, Hauser L, Carvalho GR. Heterogeneous evolution of microsatellites revealed by reconstruction of recent mutation history in an invasive apomictic snail, Potamopyrgus antipodarum. Genetica 2006; 127:285-93. [PMID: 16850232 DOI: 10.1007/s10709-005-4847-0] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2005] [Accepted: 11/07/2005] [Indexed: 10/24/2022]
Abstract
Heterogeneous patterns of microsatellite evolution present a major challenge for the development of mutation models, and an improved understanding of the determinants of variation in mutation rates and patterns among loci, alleles and taxa is required. A 19th Century bottleneck associated with the introduction of clones of the snail Potamopyrgus antipodarum to Britain presented an opportunity to reconstruct recent microsatellite evolution within the most common apomictic lineage. There was significant variation in both the number and step size of mutations among the seven loci studied. Patterns of mutability were consistent with higher mutation rates for di- than trinucleotides and for longer alleles at a locus. Mutation size was influenced in a more complex way, decreasing with relative allele length much more strongly for tri-, than dinucleotides. We found support for this latter, highly novel result in the literature via reanalysis of data in a recent genome-scan study of human microsatellites, which showed a similarly disparate pattern of length-dependence between di- and trinucleotides. In spite of the apomictic form of reproduction and an unusually strong excess of microsatellite contractions in P. antipodarum, there were notable similarities with mutation processes of human microsatellites, supporting the wider taxonomic generality of such evolutionary mechanisms.
Collapse
Affiliation(s)
- David Weetman
- Department of Biological Sciences, University of Hull, HU6 7RX, Hull, UK.
| | | | | |
Collapse
|
46
|
DuMont VB, Aquadro CF. Multiple signatures of positive selection downstream of notch on the X chromosome in Drosophila melanogaster. Genetics 2005; 171:639-53. [PMID: 16020794 PMCID: PMC1456778 DOI: 10.1534/genetics.104.038851] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
To identify genomic regions affected by the rapid fixation of beneficial mutations (selective sweeps), we performed a scan of microsatellite variability across the Notch locus region of Drosophila melanogaster. Nine microsatellites spanning 60 kb of the X chromosome were surveyed for variation in one African and three non-African populations of this species. The microsatellites identified an approximately 14-kb window for which we observed relatively low levels of variability and/or a skew in the frequency spectrum toward rare alleles, patterns predicted at regions linked to a selective sweep. DNA sequence polymorphism data were subsequently collected within this 14-kb region for three of the D. melanogaster populations. The sequence data strongly support the initial microsatellite findings; in the non-African populations there is evidence of a recent selective sweep downstream of the Notch locus near or within the open reading frames CG18508 and Fcp3C. In addition, we observe a significant McDonald-Kreitman test result suggesting too many amino acid fixations species wide, presumably due to positive selection, at the unannotated open reading frame CG18508. Thus, we observe within this small genomic region evidence for both recent (skew toward rare alleles in non-African populations) and recurring (amino acid evolution at CG18508) episodes of positive selection.
Collapse
Affiliation(s)
- Vanessa Bauer DuMont
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA
| | | |
Collapse
|
47
|
Petit RJ, Deguilloux MF, Chat J, Grivet D, Garnier-Géré P, Vendramin GG. Standardizing for microsatellite length in comparisons of genetic diversity. Mol Ecol 2005; 14:885-90. [PMID: 15723680 DOI: 10.1111/j.1365-294x.2005.02446.x] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Mutation rates at microsatellites tend to increase with the number of repeats of the motif, leading to higher levels of polymorphism at long microsatellites. To standardize levels of diversity when microsatellites differ in size, we investigate the relationship between tract length and variation and provide a formula to adjust allelic richness to a fixed mean number of repeats in the specific case of chloroplast microsatellites. A comparison between 39 loci from eight species of conifers (where chloroplast DNA is paternally inherited) and 64 loci from 12 species of angiosperms (where chloroplast DNA is generally predominantly maternally inherited) indicates that the greater allelic richness found in conifers remains significant after controlling for number of repeats. The approach stresses the advantage of reporting variation in number of repeats instead of relative fragment sizes.
Collapse
Affiliation(s)
- Rémy J Petit
- UMR Biodiversity, Genes & Ecosystems, Institut National de la Recherche Agronomique, 69 route d Arcachon, F-33612 Cestas cedex, France.
| | | | | | | | | | | |
Collapse
|
48
|
Abstract
Mutation is the ultimate source of genetic variation, and mutation rate is thus an important parameter governing the extent of genetic variation. Microsatellites are highly informative genetic markers that have been widely used in genetic studies. While previous studies showed that the mutation rate differs in di-, tri-, and tetranucleotide repeats, how mutation rate distributes within each class of repeat is poorly understood. This study first revealed the pattern of the mutation rate variation within the dinucleotide repeats. Two data sets were used. The first is the allele frequency data from 115 microsatellites with dinucleotide repeats distributed along the human genome in 10 worldwide populations. The second data set is much larger, consisting of the allele frequency of 5252 dinucleotide repeats from the Genome Database. Mutation rate for each locus is estimated through a new homozygosity-based estimator, which has been shown to be unbiased and highly efficient and is reasonably robust against deviations from the single-step model. The mutation rates among loci can be approximated well by a gamma distribution and its shape parameter can be accurately estimated with this approach. This result provides the basic guidelines for analyzing the large-scale genomic data from microsatellite loci.
Collapse
Affiliation(s)
- Hongyan Xu
- Computational Genomics Section, Human Genetics Center, University of Texas, Houston, 77030, USA.
| | | | | |
Collapse
|
49
|
Affiliation(s)
- Hans Ellegren
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Norbyvägen 18D, SE-752 36 Uppsala, Sweden.
| |
Collapse
|
50
|
Symonds VV, Lloyd AM. An Analysis of Microsatellite Loci in Arabidopsis thaliana: Mutational Dynamics and Application. Genetics 2003; 165:1475-88. [PMID: 14668396 PMCID: PMC1462854 DOI: 10.1093/genetics/165.3.1475] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Abstract
Microsatellite loci are among the most commonly used molecular markers. These loci typically exhibit variation for allele frequency distribution within a species. However, the factors contributing to this variation are not well understood. To expand on the current knowledge of microsatellite evolution, 20 microsatellite loci were examined for 126 accessions of the flowering plant, Arabidopsis thaliana. Substantial variability in mutation pattern among loci was found, most of which cannot be explained by the assumptions of the traditional stepwise mutation model or infinite alleles model. Here it is shown that the degree of locus diversity is strongly correlated with the number of contiguous repeats, more so than with the total number of repeats. These findings support a strong role for repeat disruptions in stabilizing microsatellite loci by reducing the substrate for polymerase slippage and recombination. Results of cluster analyses are also presented, demonstrating the potential of microsatellite loci for resolving relationships among accessions of A. thaliana.
Collapse
Affiliation(s)
- V Vaughan Symonds
- Section of Molecular, Cell, and Developmental Biology and Institute for Cellular and Molecular Biology, University of Texas, Austin, Texas 78712, USA
| | | |
Collapse
|