1
|
Bei L, He C, Liu J, Han C, Zhou H, Zhaorigetu, Siqintuya, Li J, Su X, Wang Y, Chen Q, Nashun, Daolema, Meng H. Genome-wide identification and characterization of microsatellite markers in Bactrian Camel. Genomics 2023; 115:110726. [PMID: 37832857 DOI: 10.1016/j.ygeno.2023.110726] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2023] [Revised: 10/05/2023] [Accepted: 10/08/2023] [Indexed: 10/15/2023]
Abstract
Simple sequence repeats (SSRs) have been widely used for parentage testing, marker-assisted selection, and evolutionary studies. The insufficient availability of SSR markers in Bactrian camels partially accounts for the lack of systematic breeding. Therefore, we aimed to establish a comprehensive SSR dataset for the Bactrian camel. Our approach involved genome searching to locate every SSR in the genome, SSR-enriched sequencing to acquire polymorphism information, and literature research to collect published data. The resulting dataset contains 213,711 SSRs and details their characteristics, including genome coordinates, motifs, lengths, annotations, PCR primers, and polymorphism information. The dataset reveals a biased distribution of SSRs in the Bactrian camel genome, reflecting the mutation mechanism and complex evolution of SSRs. In practice, we successfully demonstrated the utility of the dataset through parentage testing using 15 randomly selected SSRs. This comprehensive dataset can facilitate systematic breeding and enable QTL mapping and GWAS of the Bactrian camel.
Collapse
Affiliation(s)
- Lanxin Bei
- Shanghai Key Laboratory of Veterinary Biotechnology, School of Agriculture and Biology, Shanghai Jiao Tong University, 200240 Shanghai, China
| | - Chuan He
- Shanghai Key Laboratory of Veterinary Biotechnology, School of Agriculture and Biology, Shanghai Jiao Tong University, 200240 Shanghai, China
| | - Jiajia Liu
- Shanghai Key Laboratory of Veterinary Biotechnology, School of Agriculture and Biology, Shanghai Jiao Tong University, 200240 Shanghai, China
| | - Chengxiao Han
- Shanghai Key Laboratory of Veterinary Biotechnology, School of Agriculture and Biology, Shanghai Jiao Tong University, 200240 Shanghai, China
| | - Hao Zhou
- Shanghai Key Laboratory of Veterinary Biotechnology, School of Agriculture and Biology, Shanghai Jiao Tong University, 200240 Shanghai, China
| | - Zhaorigetu
- Animal Husbandry Institute of Alxa League, 750306, Inner Mongolia, China
| | - Siqintuya
- Animal Husbandry Institute of Alxa League, 750306, Inner Mongolia, China
| | - Jing Li
- Animal Husbandry Institute of Alxa League, 750306, Inner Mongolia, China
| | - Xue Su
- Animal Husbandry Institute of Alxa League, 750306, Inner Mongolia, China
| | - Yunfei Wang
- Bayannur Institute of Agriculture & Animal Husbandry Science, 015000, Inner Mongolia, China
| | - Qiujv Chen
- Bayannur Institute of Agriculture & Animal Husbandry Science, 015000, Inner Mongolia, China
| | - Nashun
- Alxa Left Banner Agriculture and Animal Husbandry Comprehensive Administrative Law Enforcement Brigade, 735499, Inner Mongolia, China
| | - Daolema
- Animal Husbandry Institute of Alxa League, 750306, Inner Mongolia, China.
| | - He Meng
- Shanghai Key Laboratory of Veterinary Biotechnology, School of Agriculture and Biology, Shanghai Jiao Tong University, 200240 Shanghai, China.
| |
Collapse
|
2
|
Descorps-Declère S, Richard GF. Megasatellite formation and evolution in vertebrate genes. Cell Rep 2022; 40:111347. [PMID: 36103826 DOI: 10.1016/j.celrep.2022.111347] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2021] [Revised: 04/28/2022] [Accepted: 08/23/2022] [Indexed: 11/03/2022] Open
Abstract
Since formation of the first proto-eukaryotes, gene repertoire and genome complexity have significantly increased. Among genetic elements responsible for this increase are tandem repeats. Here we describe a genome-wide analysis of large tandem repeats, called megasatellites, in 58 vertebrate genomes. Two bursts occurred, one after the radiation between Agnatha and Gnathostomata fishes and the second one in therian mammals. Megasatellites are enriched in subtelomeric regions and frequently encoded in genes involved in transcription regulation, intracellular trafficking, and cell membrane metabolism, reminiscent of what is observed in fungus genomes. The presence of many introns within young megasatellites suggests that an exon-intron DNA segment is first duplicated and amplified before accumulation of mutations in intronic parts partially erases the megasatellite in such a way that it becomes detectable only in exons. Our results suggest that megasatellite formation and evolution is a dynamic and still ongoing process in vertebrate genomes.
Collapse
Affiliation(s)
- Stéphane Descorps-Declère
- Institut Pasteur, Université Paris Cité, Bioinformatics and Biostatistics Hub, 25 rue du Dr Roux, 75015 Paris, France.
| | - Guy-Franck Richard
- Institut Pasteur, Université Paris Cité, CNRS UMR3525, Natural & Synthetic Genome Instabilities, 25 rue du Dr Roux, 75015 Paris, France.
| |
Collapse
|
3
|
Jackson B, Charlesworth B. Evidence for a force favoring GC over AT at short intronic sites in Drosophila simulans and Drosophila melanogaster. G3 GENES|GENOMES|GENETICS 2021; 11:6321237. [PMID: 34544137 PMCID: PMC8496279 DOI: 10.1093/g3journal/jkab240] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/13/2021] [Accepted: 07/06/2021] [Indexed: 11/13/2022]
Abstract
Population genetics studies often make use of a class of nucleotide site free from selective pressures, in order to make inferences about population size changes or natural selection at other sites. If such neutral sites can be identified, they offer the opportunity to avoid any confounding effects of selection. Here, we investigate evolution at putatively neutrally evolving short intronic sites in natural populations of Drosophila melanogaster and Drosophila simulans, in order to understand the properties of spontaneous mutations and the extent of GC-biased gene conversion in these species. Use of data on the genetics of natural populations is advantageous because it integrates information from large numbers of individuals over long timescales. In agreement with direct evidence from observations of spontaneous mutations in Drosophila, we find a bias in the spectrum of mutations toward AT basepairs. In addition, we find that this bias is stronger in the D. melanogaster lineage than in the D. simulans lineage. The evidence for GC-biased gene conversion in Drosophila has been equivocal. Here, we provide evidence for a weak force favoring GC in both species, which is correlated with the GC content of introns and is stronger in D. simulans than in D. melanogaster.
Collapse
Affiliation(s)
- Ben Jackson
- School of Biological Sciences, Institute of Evolutionary Biology, University of Edinburgh, Edinburgh EH9 3FL, UK
| | - Brian Charlesworth
- School of Biological Sciences, Institute of Evolutionary Biology, University of Edinburgh, Edinburgh EH9 3FL, UK
| |
Collapse
|
4
|
Mode and Tempo of Microsatellite Evolution across 300 Million Years of Insect Evolution. Genes (Basel) 2020; 11:genes11080945. [PMID: 32824315 PMCID: PMC7464534 DOI: 10.3390/genes11080945] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2020] [Revised: 08/11/2020] [Accepted: 08/14/2020] [Indexed: 01/02/2023] Open
Abstract
Microsatellites are short, repetitive DNA sequences that can rapidly expand and contract due to slippage during DNA replication. Despite their impacts on transcription, genome structure, and disease, relatively little is known about the evolutionary dynamics of these short sequences across long evolutionary periods. To address this gap in our knowledge, we performed comparative analyses of 304 available insect genomes. We investigated the impact of sequence assembly methods and assembly quality on the inference of microsatellite content, and we explored the influence of chromosome type and number on the tempo and mode of microsatellite evolution across one of the most speciose clades on the planet. Diploid chromosome number had no impact on the rate of microsatellite evolution or the amount of microsatellite content in genomes. We found that centromere type (holocentric or monocentric) is not associated with a difference in the amount of microsatellite content; however, in those species with monocentric chromosomes, microsatellite content tends to evolve faster than in species with holocentric chromosomes.
Collapse
|
5
|
Ranathunge C, Wheeler GL, Chimahusky ME, Perkins AD, Pramod S, Welch ME. Transcribed microsatellite allele lengths are often correlated with gene expression in natural sunflower populations. Mol Ecol 2020; 29:1704-1716. [PMID: 32285554 DOI: 10.1111/mec.15440] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2018] [Revised: 03/15/2020] [Accepted: 04/02/2020] [Indexed: 12/23/2022]
Abstract
Microsatellites are common in genomes of most eukaryotic species. Due to their high mutability, an adaptive role for microsatellites has been considered. However, little is known concerning the contribution of microsatellites towards phenotypic variation. We used populations of the common sunflower (Helianthus annuus) at two latitudes to quantify the effect of microsatellite allele length on phenotype at the level of gene expression. We conducted a common garden experiment with seed collected from sunflower populations in Kansas and Oklahoma followed by an RNA-Seq experiment on 95 individuals. The effect of microsatellite allele length on gene expression was assessed across 3,325 microsatellites that could be consistently scored. Our study revealed 479 microsatellites at which allele length significantly correlates with gene expression (eSTRs). When irregular allele sizes not conforming to the motif length were removed, the number of eSTRs rose to 2,379. The percentage of variation in gene expression explained by eSTRs ranged from 1%-86% when controlling for population and allele-by-population interaction effects at the 479 eSTRs. Of these eSTRs, 70.4% are in untranslated regions (UTRs). A gene ontology (GO) analysis revealed that eSTRs are significantly enriched for GO terms associated with cis- and trans-regulatory processes. Our findings suggest that a substantial number of transcribed microsatellites can influence gene expression.
Collapse
Affiliation(s)
- Chathurani Ranathunge
- Department of Biological Sciences, Mississippi State University, Starkville, MS, USA
| | - Gregory L Wheeler
- Department of Biological Sciences, Mississippi State University, Starkville, MS, USA
| | - Melody E Chimahusky
- Department of Biological Sciences, Mississippi State University, Starkville, MS, USA
| | - Andy D Perkins
- Department of Computer Science and Engineering, Mississippi State University, Starkville, MS, USA
| | - Sreepriya Pramod
- Department of Biological Sciences, Mississippi State University, Starkville, MS, USA
| | - Mark E Welch
- Department of Biological Sciences, Mississippi State University, Starkville, MS, USA
| |
Collapse
|
6
|
Rodríguez ME, Molina B, Merlo MA, Arias-Pérez A, Portela-Bens S, García-Angulo A, Cross I, Liehr T, Rebordinos L. Evolution of the Proto Sex-Chromosome in Solea senegalensis. Int J Mol Sci 2019; 20:ijms20205111. [PMID: 31618912 PMCID: PMC6829477 DOI: 10.3390/ijms20205111] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2019] [Revised: 10/11/2019] [Accepted: 10/13/2019] [Indexed: 01/17/2023] Open
Abstract
Solea senegalensis is a flatfish belonging to the Soleidae family within the Pleuronectiformes order. It has a karyotype of 2n = 42 (FN = 60; 6M + 4 SM + 8 St + 24 T) and a XX/XY system. The first pair of metacentric chromosomes has been proposed as a proto sex-chromosome originated by a Robertsonian fusion between acrocentric chromosomes. In order to elucidate a possible evolutionary origin of this chromosome 1, studies of genomic synteny were carried out with eight fish species. A total of 88 genes annotated within of 14 BACs located in the chromosome 1 of S. senegalensis were used to elaborate syntenic maps. Six BACs (BAC5K5, BAC52C17, BAC53B20, BAC84K7, BAC56H24, and BAC48P7) were distributed in, at least, 5 chromosomes in the species studied, and a group of four genes from BAC53B20 (grsf1, rufy3, slc4a4 and npffr2) and genes from BAC48K7 (dmrt2, dmrt3, dmrt1, c9orf117, kank1 and fbp1) formed a conserved cluster in all species. The analysis of repetitive sequences showed that the number of retroelements and simple repeat per BAC showed its highest value in the subcentromeric region where 53B20, 16E16 and 48K7 BACs were localized. This region contains all the dmrt genes, which are associated with sex determination in some species. In addition, the presence of a satellite “chromosome Y” (motif length: 860 bp) was detected in this region. These findings allowed to trace an evolutionary trend for the large metacentric chromosome of S. senegalensis, throughout different rearrangements, which could be at an initial phase of differentiation as sex chromosome.
Collapse
Affiliation(s)
- María Esther Rodríguez
- Área de Genética, Facultad de Ciencias del Mar y Ambientales, INMAR, Universidad de Cádiz, 11510 Cádiz, Spain.
| | - Belén Molina
- Área de Genética, Facultad de Ciencias del Mar y Ambientales, INMAR, Universidad de Cádiz, 11510 Cádiz, Spain.
| | - Manuel Alejandro Merlo
- Área de Genética, Facultad de Ciencias del Mar y Ambientales, INMAR, Universidad de Cádiz, 11510 Cádiz, Spain.
| | - Alberto Arias-Pérez
- Área de Genética, Facultad de Ciencias del Mar y Ambientales, INMAR, Universidad de Cádiz, 11510 Cádiz, Spain.
| | - Silvia Portela-Bens
- Área de Genética, Facultad de Ciencias del Mar y Ambientales, INMAR, Universidad de Cádiz, 11510 Cádiz, Spain.
| | - Aglaya García-Angulo
- Área de Genética, Facultad de Ciencias del Mar y Ambientales, INMAR, Universidad de Cádiz, 11510 Cádiz, Spain.
| | - Ismael Cross
- Área de Genética, Facultad de Ciencias del Mar y Ambientales, INMAR, Universidad de Cádiz, 11510 Cádiz, Spain.
| | - Thomas Liehr
- University Clinic Jena Institute of Human Genetics, 07747 Jena, Germany.
| | - Laureana Rebordinos
- Área de Genética, Facultad de Ciencias del Mar y Ambientales, INMAR, Universidad de Cádiz, 11510 Cádiz, Spain.
| |
Collapse
|
7
|
Stewart NB, Rogers RL. Chromosomal rearrangements as a source of new gene formation in Drosophila yakuba. PLoS Genet 2019; 15:e1008314. [PMID: 31545792 PMCID: PMC6776367 DOI: 10.1371/journal.pgen.1008314] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2019] [Revised: 10/03/2019] [Accepted: 07/17/2019] [Indexed: 11/19/2022] Open
Abstract
The origins of new genes are among the most fundamental questions in evolutionary biology. Our understanding of the ways that new genetic material appears and how that genetic material shapes population variation remains incomplete. De novo genes and duplicate genes are a key source of new genetic material on which selection acts. To better understand the origins of these new gene sequences, we explored the ways that structural variation might alter expression patterns and form novel transcripts. We provide evidence that chromosomal rearrangements are a source of novel genetic variation that facilitates the formation of de novo exons in Drosophila. We identify 51 cases of de novo exon formation created by chromosomal rearrangements in 14 strains of D. yakuba. These new genes inherit transcription start signals and open reading frames when the 5' end of existing genes are combined with previously untranscribed regions. Such new genes would appear with novel peptide sequences, without the necessity for secondary transitions from non-coding RNA to protein. This mechanism of new peptide formations contrasts with canonical theory of de novo gene progression requiring non-coding intermediaries that must acquire new mutations prior to loss via pseudogenization. Hence, these mutations offer a means to de novo gene creation and protein sequence formation in a single mutational step, answering a long standing open question concerning new gene formation. We further identify gene expression changes to 134 existing genes, indicating that these mutations can alter gene regulation. Population variability for chromosomal rearrangements is considerable, with 2368 rearrangements observed across 14 inbred lines. More rearrangements were identified on the X chromosome than any of the autosomes, suggesting the X is more susceptible to chromosome alterations. Together, these results suggest that chromosomal rearrangements are a source of variation in populations that is likely to be important to explain genetic and therefore phenotypic diversity.
Collapse
Affiliation(s)
- Nicholas B. Stewart
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, North Carolina, United States of America
- Department of Biological Sciences, Ft Hays State University, Ft Hays, Kansas, United States of America
| | - Rebekah L. Rogers
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, North Carolina, United States of America
- * E-mail:
| |
Collapse
|
8
|
Qi WH, Jiang XM, Yan CC, Zhang WQ, Xiao GS, Yue BS, Zhou CQ. Distribution patterns and variation analysis of simple sequence repeats in different genomic regions of bovid genomes. Sci Rep 2018; 8:14407. [PMID: 30258087 PMCID: PMC6158176 DOI: 10.1038/s41598-018-32286-5] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2018] [Accepted: 09/04/2018] [Indexed: 01/23/2023] Open
Abstract
As the first examination of distribution, guanine-cytosine (GC) pattern, and variation analysis of microsatellites (SSRs) in different genomic regions of six bovid species, SSRs displayed nonrandomly distribution in different regions. SSR abundances are much higher in the introns, transposable elements (TEs), and intergenic regions compared to the 3′-untranslated regions (3′UTRs), 5′UTRs and coding regions. Trinucleotide perfect SSRs (P-SSRs) were the most frequent in the coding regions, whereas, mononucleotide P-SSRs were the most in the introns, 3′UTRs, TEs, and intergenic regions. Trifold P-SSRs had more GC-contents in the 5′UTRs and coding regions than that in the introns, 3′UTRs, TEs, and intergenic regions, whereas mononucleotide P-SSRs had the least GC-contents in all genomic regions. The repeat copy numbers (RCN) of the same mono- to hexanucleotide P-SSRs showed significantly different distributions in different regions (P < 0.01). Except for the coding regions, mononucleotide P-SSRs had the most RCNs, followed by the pattern: di- > tri- > tetra- > penta- > hexanucleotide P-SSRs in the same regions. The analysis of coefficient of variability (CV) of SSRs showed that the CV variations of RCN of the same mono- to hexanucleotide SSRs were relative higher in the intronic and intergenic regions, followed by the CV variation of RCN in the TEs, and the relative lower was in the 5′UTRs, 3′UTRs, and coding regions. Wide SSR analysis of different genomic regions has helped to reveal biological significances of their distributions.
Collapse
Affiliation(s)
- Wen-Hua Qi
- College of Biology and Food Engineering, Chongqing Three Gorges University, Chongqing, 404100, P. R. China
| | - Xue-Mei Jiang
- College of Environmental and Chemistry Engineering, Chongqing Three Gorges University, Chongqing, 404100, P. R. China
| | - Chao-Chao Yan
- Key Laboratory of Bio-resources and Eco-environment (Ministry of Education), College of Life Sciences, Sichuan University, Chengdu, 610064, P. R. China
| | - Wan-Qing Zhang
- College of Life Sciences, Sichuan Agricultural University, Ya'an, Sichuan Province, 625014, P. R. China
| | - Guo-Sheng Xiao
- College of Biology and Food Engineering, Chongqing Three Gorges University, Chongqing, 404100, P. R. China
| | - Bi-Song Yue
- Key Laboratory of Bio-resources and Eco-environment (Ministry of Education), College of Life Sciences, Sichuan University, Chengdu, 610064, P. R. China
| | - Cai-Quan Zhou
- Key Laboratory of Southwest China Wildlife Resources Conservation (Ministry of Education), China West Normal University, Nanchong, 637009, P. R. China.
| |
Collapse
|
9
|
Yuan Z, Zhou T, Bao L, Liu S, Shi H, Yang Y, Gao D, Dunham R, Waldbieser G, Liu Z. The annotation of repetitive elements in the genome of channel catfish (Ictalurus punctatus). PLoS One 2018; 13:e0197371. [PMID: 29763462 PMCID: PMC5953449 DOI: 10.1371/journal.pone.0197371] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2017] [Accepted: 05/01/2018] [Indexed: 11/18/2022] Open
Abstract
Channel catfish (Ictalurus punctatus) is a highly adaptive species and has been used as a research model for comparative immunology, physiology, and toxicology among ectothermic vertebrates. It is also economically important for aquaculture. As such, its reference genome was generated and annotated with protein coding genes. However, the repetitive elements in the catfish genome are less well understood. In this study, over 417.8 Megabase (MB) of repetitive elements were identified and characterized in the channel catfish genome. Among them, the DNA/TcMar-Tc1 transposons are the most abundant type, making up ~20% of the total repetitive elements, followed by the microsatellites (14%). The prevalence of repetitive elements, especially the mobile elements, may have provided a driving force for the evolution of the catfish genome. A number of catfish-specific repetitive elements were identified including the previously reported Xba elements whose divergence rate was relatively low, slower than that in untranslated regions of genes but faster than the protein coding sequences, suggesting its evolutionary restrictions.
Collapse
Affiliation(s)
- Zihao Yuan
- School of Fisheries, Aquaculture and Aquatic Sciences, Auburn University, Auburn, Alabama, United States of America
| | - Tao Zhou
- School of Fisheries, Aquaculture and Aquatic Sciences, Auburn University, Auburn, Alabama, United States of America
| | - Lisui Bao
- School of Fisheries, Aquaculture and Aquatic Sciences, Auburn University, Auburn, Alabama, United States of America
| | - Shikai Liu
- School of Fisheries, Aquaculture and Aquatic Sciences, Auburn University, Auburn, Alabama, United States of America
| | - Huitong Shi
- School of Fisheries, Aquaculture and Aquatic Sciences, Auburn University, Auburn, Alabama, United States of America
| | - Yujia Yang
- School of Fisheries, Aquaculture and Aquatic Sciences, Auburn University, Auburn, Alabama, United States of America
| | - Dongya Gao
- School of Fisheries, Aquaculture and Aquatic Sciences, Auburn University, Auburn, Alabama, United States of America
| | - Rex Dunham
- School of Fisheries, Aquaculture and Aquatic Sciences, Auburn University, Auburn, Alabama, United States of America
| | - Geoff Waldbieser
- USDA-ARS Warmwater Aquaculture Research Unit, Stoneville, Mississippi, United States of America
| | - Zhanjiang Liu
- Department of Biology, Syracuse University, Syracuse, New York, United States of America
- * E-mail:
| |
Collapse
|
10
|
Distinct patterns of simple sequence repeats and GC distribution in intragenic and intergenic regions of primate genomes. Aging (Albany NY) 2017; 8:2635-2654. [PMID: 27644032 PMCID: PMC5191860 DOI: 10.18632/aging.101025] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2016] [Accepted: 08/22/2016] [Indexed: 01/23/2023]
Abstract
As the first systematic examination of simple sequence repeats (SSRs) and guanine-cytosine (GC) distribution in intragenic and intergenic regions of ten primates, our study showed that SSRs and GC displayed nonrandom distribution for both intragenic and intergenic regions, suggesting that they have potential roles in transcriptional or translational regulation. Our results suggest that the majority of SSRs are distributed in non-coding regions, such as the introns, TEs, and intergenic regions. In these primates, trinucleotide perfect (P) SSRs were the most abundant repeats type in the 5'UTRs and CDSs, whereas, mononucleotide P-SSRs were the most in the intron, 3'UTRs, TEs, and intergenic regions. The GC-contents varied greatly among different intragenic and intergenic regions: 5'UTRs > CDSs > 3'UTRs > TEs > introns > intergenic regions, and high GC-content was frequently distributed in exon-rich regions. Our results also showed that in the same intragenic and intergenic regions, the distribution of GC-contents were great similarity in the different primates. Tri- and hexanucleotide P-SSRs had the most GC-contents in the 5'UTRs and CDSs, whereas mononucleotide P-SSRs had the least GC-contents in the six genomic regions of these primates. The most frequent motifs for different length varied obviously with the different genomic regions.
Collapse
|
11
|
Li Z, Chen F, Huang C, Zheng W, Yu C, Cheng H, Zhou R. Genome-wide mapping and characterization of microsatellites in the swamp eel genome. Sci Rep 2017; 7:3157. [PMID: 28600492 PMCID: PMC5466649 DOI: 10.1038/s41598-017-03330-7] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2017] [Accepted: 04/26/2017] [Indexed: 11/09/2022] Open
Abstract
We described genome-wide screening and characterization of microsatellites in the swamp eel genome. A total of 99,293 microsatellite loci were identified in the genome with an overall density of 179 microsatellites per megabase of genomic sequences. The dinucleotide microsatellites were the most abundant type representing 71% of the total microsatellite loci and the AC-rich motifs were the most recurrent in all repeat types. Microsatellite frequency decreased as numbers of repeat units increased, which was more obvious in long than short microsatellite motifs. Most of microsatellites were located in non-coding regions, whereas only approximately 1% of the microsatellites were detected in coding regions. Trinucleotide repeats were most abundant microsatellites in the coding regions, which represented amino acid repeats in proteins. There was a chromosome-biased distribution of microsatellites in non-coding regions, with the highest density of 203.95/Mb on chromosome 8 and the least on chromosome 7 (164.06/Mb). The most abundant dinucleotides (AC)n was mainly located on chromosome 8. Notably, genomic mapping showed that there was a chromosome-biased association of genomic distributions between microsatellites and transposon elements. Thus, the novel dataset of microsatellites in swamp eel provides a valuable resource for further studies on QTL-based selection breeding, genetic resource conservation and evolutionary genetics.
Collapse
Affiliation(s)
- Zhigang Li
- Hubei Key Laboratory of Cell Homeostasis, Laboratory of Molecular and Developmental Genetics, College of Life Sciences, Wuhan University, Wuhan, 430072, P. R. China
| | - Feng Chen
- Hubei Key Laboratory of Cell Homeostasis, Laboratory of Molecular and Developmental Genetics, College of Life Sciences, Wuhan University, Wuhan, 430072, P. R. China
| | - Chunhua Huang
- Hubei Key Laboratory of Cell Homeostasis, Laboratory of Molecular and Developmental Genetics, College of Life Sciences, Wuhan University, Wuhan, 430072, P. R. China
| | - Weixin Zheng
- Hubei Key Laboratory of Cell Homeostasis, Laboratory of Molecular and Developmental Genetics, College of Life Sciences, Wuhan University, Wuhan, 430072, P. R. China
| | - Chunlai Yu
- Hubei Key Laboratory of Cell Homeostasis, Laboratory of Molecular and Developmental Genetics, College of Life Sciences, Wuhan University, Wuhan, 430072, P. R. China
| | - Hanhua Cheng
- Hubei Key Laboratory of Cell Homeostasis, Laboratory of Molecular and Developmental Genetics, College of Life Sciences, Wuhan University, Wuhan, 430072, P. R. China.
| | - Rongjia Zhou
- Hubei Key Laboratory of Cell Homeostasis, Laboratory of Molecular and Developmental Genetics, College of Life Sciences, Wuhan University, Wuhan, 430072, P. R. China.
| |
Collapse
|
12
|
Mahfooz S, Singh SP, Rakh R, Bhattacharya A, Mishra N, Singh PC, Chauhan PS, Nautiyal CS, Mishra A. A Comprehensive Characterization of Simple Sequence Repeats in the Sequenced Trichoderma Genomes Provides Valuable Resources for Marker Development. Front Microbiol 2016; 7:575. [PMID: 27199911 PMCID: PMC4846858 DOI: 10.3389/fmicb.2016.00575] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2016] [Accepted: 04/07/2016] [Indexed: 11/13/2022] Open
Abstract
Members of genus Trichoderma are known worldwide for mycoparasitism. To gain a better insight into the organization and evolution of their genomes, we used an in silico approach to compare the occurrence, relative abundance and density of SSRs in Trichoderma atroviride, T. harzianum, T. reesei, and T. virens. Our analysis revealed that in all the four genome sequences studied, the occurrence, relative abundance, and density of microsatellites varied and was not influenced by genome sizes. The relative abundance and density of SSRs positively correlated with the G + C content of their genomes. The maximum frequency of SSRs was observed in the smallest genome of T. reesei whereas it was least in second smallest genome of T. atroviride. Among different classes of repeats, the tri-nucleotide repeats were abundant in all the genomes and accounts for ∼38%, whereas hexa-nuceotide repeats were the least (∼10.2%). Further evaluation of the conservation of motifs in the transcript sequences shows a 49.5% conservation among all the motifs. In order to study polymorphism in Trichoderma isolates, 12 polymorphic SSR markers were developed. Of the 12 markers, 6 markers are from T. atroviride and remaining 6 belong to T. harzianum. SSR markers were found to be more polymorphic from T. atroviride with an average polymorphism information content value of 0.745 in comparison with T. harzianum (0.615). Twelve polymorphic markers obtained in this study clearly demonstrate the utility of newly developed SSR markers in establishing genetic relationships among different isolates of Trichoderma.
Collapse
Affiliation(s)
- Sahil Mahfooz
- Division of Plant Microbe Interaction, Council of Scientific and Industrial Research-National Botanical Research Institute Lucknow, India
| | - Satyendra P Singh
- Division of Plant Microbe Interaction, Council of Scientific and Industrial Research-National Botanical Research Institute Lucknow, India
| | - Ramraje Rakh
- Maharashtra Institute of Medical Sciences and Research Medical College Latur, India
| | - Arpita Bhattacharya
- Division of Plant Microbe Interaction, Council of Scientific and Industrial Research-National Botanical Research Institute Lucknow, India
| | - Nishtha Mishra
- Division of Plant Microbe Interaction, Council of Scientific and Industrial Research-National Botanical Research Institute Lucknow, India
| | - Poonam C Singh
- Division of Plant Microbe Interaction, Council of Scientific and Industrial Research-National Botanical Research Institute Lucknow, India
| | - Puneet S Chauhan
- Division of Plant Microbe Interaction, Council of Scientific and Industrial Research-National Botanical Research Institute Lucknow, India
| | - Chandra S Nautiyal
- Division of Plant Microbe Interaction, Council of Scientific and Industrial Research-National Botanical Research Institute Lucknow, India
| | - Aradhana Mishra
- Division of Plant Microbe Interaction, Council of Scientific and Industrial Research-National Botanical Research Institute Lucknow, India
| |
Collapse
|
13
|
Yano CF, Bertollo LAC, Liehr T, Troy WP, Cioffi MDB. W Chromosome Dynamics in Triportheus Species (Characiformes, Triportheidae): An Ongoing Process Narrated by Repetitive Sequences. J Hered 2016; 107:342-8. [PMID: 27036509 DOI: 10.1093/jhered/esw021] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2015] [Accepted: 03/26/2016] [Indexed: 11/13/2022] Open
Abstract
Characterizing the abundance and genomic distribution of repetitive DNAs provides information on genome evolution, especially regarding the origin and differentiation of sex chromosomes. Triportheus fishes offer a useful model to explore the evolution of sex chromosomes, since they represent a monophyletic group in which all species share a ZZ/ZW sex chromosome system. In this study, we analyzed the distribution of 13 classes of repetitive DNA sequences by FISH, including microsatellites, rDNAs, and transposable elements in 6 Triportheus species, in order to investigate the fate of the sex-specific chromosome among them. These findings show the dynamic differentiation process of the W chromosome concerning changes in the repetitive DNA fraction of the heterochromatin. The differential accumulation of the same class of repeats on this chromosome, in both nearby and distant species, reflects the inherent dynamism of the microsatellites, as well as the plasticity that shapes the evolutionary history of the sex chromosomes, even among closely related species sharing a same sex chromosome system.
Collapse
Affiliation(s)
- Cassia Fernanda Yano
- Departamento de Genética e Evolução, Universidade Federal de São Carlos, São Carlos, SP, Brazil (Yano and Cioffi); Jena University Hospital, Friedrich Schiller University, Institute of Human Genetics, Kollegiengasse 10, D-07743 Jena, Germany (Liehr); Departamento de Ciências Biológicas, Universidade do Estado de Mato Grosso, Tangará da Serra, Brazil (Troy); Departamento de Genetica e Evolução, Universidade Federal de São Carlos, São Carlos, SP, Brazil (Bertollo); CAPES Foundation, Ministry of Education of Brazil, Brasília, DF 70040-020, Brazil (Yano)
| | - Luiz Antônio Carlos Bertollo
- Departamento de Genética e Evolução, Universidade Federal de São Carlos, São Carlos, SP, Brazil (Yano and Cioffi); Jena University Hospital, Friedrich Schiller University, Institute of Human Genetics, Kollegiengasse 10, D-07743 Jena, Germany (Liehr); Departamento de Ciências Biológicas, Universidade do Estado de Mato Grosso, Tangará da Serra, Brazil (Troy); Departamento de Genetica e Evolução, Universidade Federal de São Carlos, São Carlos, SP, Brazil (Bertollo); CAPES Foundation, Ministry of Education of Brazil, Brasília, DF 70040-020, Brazil (Yano)
| | - Thomas Liehr
- Departamento de Genética e Evolução, Universidade Federal de São Carlos, São Carlos, SP, Brazil (Yano and Cioffi); Jena University Hospital, Friedrich Schiller University, Institute of Human Genetics, Kollegiengasse 10, D-07743 Jena, Germany (Liehr); Departamento de Ciências Biológicas, Universidade do Estado de Mato Grosso, Tangará da Serra, Brazil (Troy); Departamento de Genetica e Evolução, Universidade Federal de São Carlos, São Carlos, SP, Brazil (Bertollo); CAPES Foundation, Ministry of Education of Brazil, Brasília, DF 70040-020, Brazil (Yano)
| | - Waldo Pinheiro Troy
- Departamento de Genética e Evolução, Universidade Federal de São Carlos, São Carlos, SP, Brazil (Yano and Cioffi); Jena University Hospital, Friedrich Schiller University, Institute of Human Genetics, Kollegiengasse 10, D-07743 Jena, Germany (Liehr); Departamento de Ciências Biológicas, Universidade do Estado de Mato Grosso, Tangará da Serra, Brazil (Troy); Departamento de Genetica e Evolução, Universidade Federal de São Carlos, São Carlos, SP, Brazil (Bertollo); CAPES Foundation, Ministry of Education of Brazil, Brasília, DF 70040-020, Brazil (Yano)
| | - Marcelo de Bello Cioffi
- Departamento de Genética e Evolução, Universidade Federal de São Carlos, São Carlos, SP, Brazil (Yano and Cioffi); Jena University Hospital, Friedrich Schiller University, Institute of Human Genetics, Kollegiengasse 10, D-07743 Jena, Germany (Liehr); Departamento de Ciências Biológicas, Universidade do Estado de Mato Grosso, Tangará da Serra, Brazil (Troy); Departamento de Genetica e Evolução, Universidade Federal de São Carlos, São Carlos, SP, Brazil (Bertollo); CAPES Foundation, Ministry of Education of Brazil, Brasília, DF 70040-020, Brazil (Yano).
| |
Collapse
|
14
|
Li S, Zhang G, Li X, Wang L, Yuan J, Deng C, Gao W. Genome-wide identification and validation of simple sequence repeats (SSRs) from Asparagus officinalis. Mol Cell Probes 2016; 30:153-60. [PMID: 26987412 DOI: 10.1016/j.mcp.2016.03.003] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2016] [Revised: 02/26/2016] [Accepted: 03/08/2016] [Indexed: 11/29/2022]
Abstract
Garden asparagus (Asparagus officinalis), an important vegetable cultivated worldwide, can also serve as a model dioecious plant species in the study of sex determination and sex chromosome evolution. However, limited DNA marker resources have been developed and used for this species. To expand these resources, we examined the DNA sequences for simple sequence repeats (SSRs) in 163,406 scaffolds representing approximately 400 Mbp of the A. officinalis genome. A total of 87,576 SSRs were identified in 59,565 scaffolds. The most abundant SSR repeats were trinucleotide and tetranucleotide, accounting for 29.2 and 29.1% of the total SSRs, respectively, followed by di-, penta-, hexa-, hepta-, and octanucleotides. The AG motif was most common among dinucleotides and was also the most frequent motif in the entire A. officinalis genome, representing 14.7% of all SSRs. A total of 41,917 SSR primers pairs were designed to amplify SSRs. Twenty-two genomic SSR markers were tested in 39 asparagus accessions belonging to ten cultivars and one accession of Asparagus setaceus for determination of genetic diversity. The intra-species polymorphism information content (PIC) values of the 22 genomic SSR markers were intermediate, with an average of 0.41. The genetic diversity between the ten A. officinalis cultivars was low, and the UPGMA dendrogram was largely unrelated to cultivars. It is here suggested that the sex of individuals is an important factor influencing the clustering results. The information reported here provides new information about the organization of the microsatellites in A. officinalis genome and lays a foundation for further genetic studies and breeding applications of A. officinalis and related species.
Collapse
Affiliation(s)
- Shufen Li
- College of Life Sciences, Henan Normal University, Xinxiang 453007, Henan, PR China
| | - Guojun Zhang
- School of Basic Medical Sciences, Xinxiang Medical University, Xinxiang 453003, Henan, PR China
| | - Xu Li
- College of Life Sciences, Henan Normal University, Xinxiang 453007, Henan, PR China
| | - Lianjun Wang
- Institute of Food Corps, Hubei Academy of Agricultural Sciences, Wuhan 430064, Hubei, PR China
| | - Jinhong Yuan
- College of Life Sciences, Henan Normal University, Xinxiang 453007, Henan, PR China
| | - Chuanliang Deng
- College of Life Sciences, Henan Normal University, Xinxiang 453007, Henan, PR China
| | - Wujun Gao
- College of Life Sciences, Henan Normal University, Xinxiang 453007, Henan, PR China.
| |
Collapse
|
15
|
Adams RH, Blackmon H, Reyes-Velasco J, Schield DR, Card DC, Andrew AL, Waynewood N, Castoe TA. Microsatellite landscape evolutionary dynamics across 450 million years of vertebrate genome evolution. Genome 2016; 59:295-310. [PMID: 27064176 DOI: 10.1139/gen-2015-0124] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
The evolutionary dynamics of simple sequence repeats (SSRs or microsatellites) across the vertebrate tree of life remain largely undocumented and poorly understood. In this study, we analyzed patterns of genomic microsatellite abundance and evolution across 71 vertebrate genomes. The highest abundances of microsatellites exist in the genomes of ray-finned fishes, squamate reptiles, and mammals, while crocodilian, turtle, and avian genomes exhibit reduced microsatellite landscapes. We used comparative methods to infer evolutionary rates of change in microsatellite abundance across vertebrates and to highlight particular lineages that have experienced unusually high or low rates of change in genomic microsatellite abundance. Overall, most variation in microsatellite content, abundance, and evolutionary rate is observed among major lineages of reptiles, yet we found that several deeply divergent clades (i.e., squamate reptiles and mammals) contained relatively similar genomic microsatellite compositions. Archosauromorph reptiles (turtles, crocodilians, and birds) exhibit reduced genomic microsatellite content and the slowest rates of microsatellite evolution, in contrast to squamate reptile genomes that have among the highest rates of microsatellite evolution. Substantial branch-specific shifts in SSR content in primates, monotremes, rodents, snakes, and fish are also evident. Collectively, our results support multiple major shifts in microsatellite genomic landscapes among vertebrates.
Collapse
Affiliation(s)
- Richard H Adams
- a Department of Biology, 501 S. Nedderman Dr., University of Texas at Arlington, TX 76019, USA
| | - Heath Blackmon
- b Department of Ecology, Evolution & Behavior, 1987 Upper Buford Cir., University of Minnesota, Saint Paul, MN 55108-6097, USA
| | - Jacobo Reyes-Velasco
- a Department of Biology, 501 S. Nedderman Dr., University of Texas at Arlington, TX 76019, USA
| | - Drew R Schield
- a Department of Biology, 501 S. Nedderman Dr., University of Texas at Arlington, TX 76019, USA
| | - Daren C Card
- a Department of Biology, 501 S. Nedderman Dr., University of Texas at Arlington, TX 76019, USA
| | - Audra L Andrew
- a Department of Biology, 501 S. Nedderman Dr., University of Texas at Arlington, TX 76019, USA
| | - Nyimah Waynewood
- a Department of Biology, 501 S. Nedderman Dr., University of Texas at Arlington, TX 76019, USA
| | - Todd A Castoe
- a Department of Biology, 501 S. Nedderman Dr., University of Texas at Arlington, TX 76019, USA
| |
Collapse
|
16
|
Saeed AF, Wang R, Wang S. Microsatellites in Pursuit of Microbial Genome Evolution. Front Microbiol 2016; 6:1462. [PMID: 26779133 PMCID: PMC4700210 DOI: 10.3389/fmicb.2015.01462] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2015] [Accepted: 12/07/2015] [Indexed: 12/27/2022] Open
Abstract
Microsatellites or short sequence repeats are widespread genetic markers which are hypermutable 1-6 bp long short nucleotide motifs. Significantly, their applications in genetics are extensive due to their ceaseless mutational degree, widespread length variations and hypermutability skills. These features make them useful in determining the driving forces of evolution by using powerful molecular techniques. Consequently, revealing important questions, for example, what is the significance of these abundant sequences in DNA, what are their roles in genomic evolution? The answers of these important questions are hidden in the ways these short motifs contributed in altering the microbial genomes since the origin of life. Even though their size ranges from 1 -to- 6 bases, these repeats are becoming one of the most popular genetic probes in determining their associations and phylogenetic relationships in closely related genomes. Currently, they have been widely used in molecular genetics, biotechnology and evolutionary biology. However, due to limited knowledge; there is a significant gap in research and lack of information concerning hypermutational mechanisms. These mechanisms play a key role in microsatellite loci point mutations and phase variations. This review will extend the understandings of impacts and contributions of microsatellite in genomic evolution and their universal applications in microbiology.
Collapse
Affiliation(s)
- Abdullah F. Saeed
- Key Laboratory of Biopesticide and Chemical Biology of Education Ministry, School of Life Sciences, Fujian Agriculture and Forestry UniversityFuzhou, China
| | | | | |
Collapse
|
17
|
Qi WH, Jiang XM, Du LM, Xiao GS, Hu TZ, Yue BS, Quan QM. Genome-Wide Survey and Analysis of Microsatellite Sequences in Bovid Species. PLoS One 2015. [PMID: 26196922 PMCID: PMC4510479 DOI: 10.1371/journal.pone.0133667] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
Abstract
Microsatellites or simple sequence repeats (SSRs) have become the most popular source of genetic markers, which are ubiquitously distributed in many eukaryotic and prokaryotic genomes. This is the first study examining and comparing SSRs in completely sequenced genomes of the Bovidae. We analyzed and compared the number of SSRs, relative abundance, relative density, guanine-cytosine (GC) content and proportion of SSRs in six taxonomically different bovid species: Bos taurus, Bubalus bubalis, Bos mutus, Ovis aries, Capra hircus, and Pantholops hodgsonii. Our analysis revealed that, based on our search criteria, the total number of perfect SSRs found ranged from 663,079 to 806,907 and covered from 0.44% to 0.48% of the bovid genomes. Relative abundance and density of SSRs in these Bovinae genomes were non-significantly correlated with genome size (Pearson, r < 0.420, p > 0.05). Perfect mononucleotide SSRs were the most abundant, followed by the pattern: perfect di- > tri- > penta- > tetra- > hexanucleotide SSRs. Generally, the number of SSRs, relative abundance, and relative density of SSRs decreased as the motif repeat length increased in each species of Bovidae. The most GC-content was in trinucleotide SSRs and the least was in the mononucleotide SSRs in the six bovid genomes. The GC-contents of tri- and pentanucleotide SSRs showed a great deal of similarity among different chromosomes of B. taurus, O. aries, and C. hircus. SSR number of all chromosomes in the B. taurus, O.aries, and C. hircus is closely positively correlated with chromosome sequence size (Pearson, r > 0.980, p < 0.01) and significantly negatively correlated with GC-content (Pearson, r < -0.638, p < 0.01). Relative abundance and density of SSRs in all chromosomes of the three species were significantly negatively correlated with GC-content (Pearson, r < -0.333, P < 0.05) but not significantly correlated with chromosome sequence size (Pearson, r < -0.185, P > 0.05). Relative abundances of the same nucleotide SSR type showed great similarity among different chromosomes of B. taurus, O. aries, and C. hircus.
Collapse
Affiliation(s)
- Wen-Hua Qi
- College of Life Science and Engineering, Chongqing Three Gorges University, Chongqing, 404100, China
- * E-mail:
| | - Xue-Mei Jiang
- College of Environmental and Chemistry Engineering, Chongqing Three Gorges University, Chongqing, 404100, China
| | - Lian-Ming Du
- Key Laboratory of Bio-resources and Eco-environment (Ministry of Education), College of Life Sciences, Sichuan University, Chengdu, 610064, China
| | - Guo-Sheng Xiao
- College of Life Science and Engineering, Chongqing Three Gorges University, Chongqing, 404100, China
| | - Ting-Zhang Hu
- College of Life Science and Engineering, Chongqing Three Gorges University, Chongqing, 404100, China
| | - Bi-Song Yue
- Key Laboratory of Bio-resources and Eco-environment (Ministry of Education), College of Life Sciences, Sichuan University, Chengdu, 610064, China
| | - Qiu-Mei Quan
- School of Life Sciences, China West Normal University, Nanchong, 637009, China
| |
Collapse
|
18
|
Figueiredo MLA, Kim M, Philip P, Allgardsson A, Stenberg P, Larsson J. Non-coding roX RNAs prevent the binding of the MSL-complex to heterochromatic regions. PLoS Genet 2014; 10:e1004865. [PMID: 25501352 PMCID: PMC4263465 DOI: 10.1371/journal.pgen.1004865] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2014] [Accepted: 10/30/2014] [Indexed: 12/29/2022] Open
Abstract
Long non-coding RNAs contribute to dosage compensation in both mammals and Drosophila by inducing changes in the chromatin structure of the X-chromosome. In Drosophila melanogaster, roX1 and roX2 are long non-coding RNAs that together with proteins form the male-specific lethal (MSL) complex, which coats the entire male X-chromosome and mediates dosage compensation by increasing its transcriptional output. Studies on polytene chromosomes have demonstrated that when both roX1 and roX2 are absent, the MSL-complex becomes less abundant on the male X-chromosome and is relocated to the chromocenter and the 4th chromosome. Here we address the role of roX RNAs in MSL-complex targeting and the evolution of dosage compensation in Drosophila. We performed ChIP-seq experiments which showed that MSL-complex recruitment to high affinity sites (HAS) on the X-chromosome is independent of roX and that the HAS sequence motif is conserved in D. simulans. Additionally, a complete and enzymatically active MSL-complex is recruited to six specific genes on the 4th chromosome. Interestingly, our sequence analysis showed that in the absence of roX RNAs, the MSL-complex has an affinity for regions enriched in Hoppel transposable elements and repeats in general. We hypothesize that roX mutants reveal the ancient targeting of the MSL-complex and propose that the role of roX RNAs is to prevent the binding of the MSL-complex to heterochromatin. In both fruit flies and humans, males and females have different sets of sex chromosomes. This generates differences in gene dosage that must be compensated for by adjusting the transcriptional output of most genes located on the X-chromosome. The specific recognition and targeting of the X-chromosome is essential for such dosage compensation. In fruit flies, dosage compensation is mediated by the male-specific lethal (MSL) complex, which upregulates gene transcription on the male X-chromosome. The MSL-complex consists of five proteins and two non-coding RNAs, roX1 and roX2. While non-coding RNAs are known to be critical for dosage compensation in both flies and mammals, their precise functions remain elusive. Here we present a study on the targeting and function of the MSL-complex in the absence of roX RNAs. The results obtained suggest that the dosage compensating MSL-complex has an intrinsic tendency to target repeat-rich regions and that the function of roX RNAs is to prevent its binding to such targets. Our findings reveal an ancient targeting and regulatory function of the MSL-complex that has been adapted for use in dosage compensation and modified by the rapidly evolving noncoding roX RNAs.
Collapse
Affiliation(s)
| | - Maria Kim
- Department of Molecular Biology, Umeå University, Umeå, Sweden
| | - Philge Philip
- Department of Molecular Biology, Umeå University, Umeå, Sweden
- Computational Life Science Cluster (CLiC), Umeå University, Umeå, Sweden
| | | | - Per Stenberg
- Department of Molecular Biology, Umeå University, Umeå, Sweden
- Computational Life Science Cluster (CLiC), Umeå University, Umeå, Sweden
| | - Jan Larsson
- Department of Molecular Biology, Umeå University, Umeå, Sweden
- * E-mail:
| |
Collapse
|
19
|
Jiang Q, Li Q, Yu H, Kong L. Genome-wide analysis of simple sequence repeats in marine animals-a comparative approach. MARINE BIOTECHNOLOGY (NEW YORK, N.Y.) 2014; 16:604-619. [PMID: 24939717 DOI: 10.1007/s10126-014-9580-1] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/01/2014] [Accepted: 05/22/2014] [Indexed: 06/03/2023]
Abstract
Tandem simple sequence repeats (SSRs) are one of the most popular molecular markers in genetic analysis owing to their ubiquitous occurrence,high reproducibility, multiallelic nature, and codominant mode. High mutability makes SSRs play a role in genome evolution and correspondingly show different patterns. Comparative analysis of genomic SSRs in different taxonomic groups usually focuses on land species, while marine animals have been neglected. This study examined the abundance of genomic SSRs with repeated unit lengths of 1-6 bp in 30 marine animals including nine taxonomic groups and further compared with the land species. More than thousands of SSRs were discovered in every organism which provided a huge resource for the development of molecular markers. Thirty marine animals showed profound differences in SSR characteristics, but some group-specific trends were also found. Both similarities and differences of repeat patterns were discovered between the land and marine species. Two taxon-specific SSR types were discovered: the pentanucleotides motif AGAGG in Euteleostei and the hexanucleotide repeats of ATGTAC in Porifera and Echinodermata. Gene ontology (GO) enrichment analysis of two representative species (Amphimedon queenslandica for Porifera and Strongylocentrotus purpuratus for Echinodermata) revealed functional preference of the ATGTAC motif associated genes, and this might hint at evolutionary significance.
Collapse
Affiliation(s)
- Qun Jiang
- The Key Laboratory of Mariculture, Ministry of Education, Ocean University of China, 266003, Qingdao, China
| | | | | | | |
Collapse
|
20
|
Garrigan D, Kingan SB, Geneva AJ, Vedanayagam JP, Presgraves DC. Genome diversity and divergence in Drosophila mauritiana: multiple signatures of faster X evolution. Genome Biol Evol 2014; 6:2444-58. [PMID: 25193308 PMCID: PMC4202334 DOI: 10.1093/gbe/evu198] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
Drosophila mauritiana is an Indian Ocean island endemic species that diverged from its two sister species, Drosophila simulans and Drosophila sechellia, approximately 240,000 years ago. Multiple forms of incomplete reproductive isolation have evolved among these species, including sexual, gametic, ecological, and intrinsic postzygotic barriers, with crosses among all three species conforming to Haldane’s rule: F1 hybrid males are sterile and F1 hybrid females are fertile. Extensive genetic resources and the fertility of hybrid females have made D. mauritiana, in particular, an important model for speciation genetics. Analyses between D. mauritiana and both of its siblings have shown that the X chromosome makes a disproportionate contribution to hybrid male sterility. But why the X plays a special role in the evolution of hybrid sterility in these, and other, species remains an unsolved problem. To complement functional genetic analyses, we have investigated the population genomics of D. mauritiana, giving special attention to differences between the X and the autosomes. We present a de novo genome assembly of D. mauritiana annotated with RNAseq data and a whole-genome analysis of polymorphism and divergence from ten individuals. Our analyses show that, relative to the autosomes, the X chromosome has reduced nucleotide diversity but elevated nucleotide divergence; an excess of recurrent adaptive evolution at its protein-coding genes; an excess of recent, strong selective sweeps; and a large excess of satellite DNA. Interestingly, one of two centimorgan-scale selective sweeps on the D. mauritiana X chromosome spans a region containing two sex-ratio meiotic drive elements and a high concentration of satellite DNA. Furthermore, genes with roles in reproduction and chromosome biology are enriched among genes that have histories of recurrent adaptive protein evolution. Together, these genome-wide analyses suggest that genetic conflict and frequent positive natural selection on the X chromosome have shaped the molecular evolutionary history of D. mauritiana, refining our understanding of the possible causes of the large X-effect in speciation.
Collapse
|
21
|
Brittain A, Stroebele E, Erives A. Microsatellite repeat instability fuels evolution of embryonic enhancers in Hawaiian Drosophila. PLoS One 2014; 9:e101177. [PMID: 24978198 PMCID: PMC4076327 DOI: 10.1371/journal.pone.0101177] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2014] [Accepted: 06/03/2014] [Indexed: 12/16/2022] Open
Abstract
For ∼30 million years, the eggs of Hawaiian Drosophila were laid in ever-changing environments caused by high rates of island formation. The associated diversification of the size and developmental rate of the syncytial fly embryo would have altered morphogenic gradients, thus necessitating frequent evolutionary compensation of transcriptional responses. We investigate the consequences these radiations had on transcriptional enhancers patterning the embryo to see whether their pattern of molecular evolution is different from non-Hawaiian species. We identify and functionally assay in transgenic D. melanogaster the Neurogenic Ectoderm Enhancers from two different Hawaiian Drosophila groups: (i) the picture wing group, and (ii) the modified mouthparts group. We find that the binding sites in this set of well-characterized enhancers are footprinted by diverse microsatellite repeat (MSR) sequences. We further show that Hawaiian embryonic enhancers in general are enriched in MSR relative to both Hawaiian non-embryonic enhancers and non-Hawaiian embryonic enhancers. We propose embryonic enhancers are sensitive to Activator spacing because they often serve as assembly scaffolds for the aggregation of transcription factor activator complexes. Furthermore, as most indels are produced by microsatellite repeat slippage, enhancers from Hawaiian Drosophila lineages, which experience dynamic evolutionary pressures, would become grossly enriched in MSR content.
Collapse
Affiliation(s)
- Andrew Brittain
- Department of Biology, University of Iowa, Iowa City, Iowa, United States of America
| | - Elizabeth Stroebele
- Department of Biology, University of Iowa, Iowa City, Iowa, United States of America
| | - Albert Erives
- Department of Biology, University of Iowa, Iowa City, Iowa, United States of America
- * E-mail:
| |
Collapse
|
22
|
Ren QP, Fan Z, Zhou XM, Jiang GF, Wang YT, Liu YX. Identification and characterization of anonymous nuclear markers for the double-striped cockroach, Blattella bisignata. BULLETIN OF ENTOMOLOGICAL RESEARCH 2013; 103:29-35. [PMID: 22697889 DOI: 10.1017/s0007485312000375] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]
Abstract
During the last decade, multilocus analysis has gradually become a powerful tool for the studies of population genetics and phylogeography. The double-striped cockroach, Blattella bisignata, is endemic to southeast Asia, and there is currently little genetic information available for the species. We chose it as the target species to investigate a biodiversity hotspot in southwest China. Here, we report the identification and characterization of 11 single-copy anonymous nuclear markers with an average length of 378bp. These loci, isolated from a genomic library of B. bisignata, can amplify in two additional Blattella species (B. germanica and B. lituricollis). While testing these markers in representative species of Blattellidae, Blattidae and Epilampridae, some of them can cross-amplify successfully. After sequencing 30 individuals collected from southern China per locus, we found relatively high variability (approximately 3.6 SNPs per 100bp). Finally, a small-scale study was also performed to show that these markers do indeed fulfill the expectations as phylogeographic markers.
Collapse
Affiliation(s)
- Q-P Ren
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing, China
| | | | | | | | | | | |
Collapse
|
23
|
Kejnovský E, Michalovova M, Steflova P, Kejnovska I, Manzano S, Hobza R, Kubat Z, Kovarik J, Jamilena M, Vyskot B. Expansion of microsatellites on evolutionary young Y chromosome. PLoS One 2013; 8:e45519. [PMID: 23341866 PMCID: PMC3547029 DOI: 10.1371/journal.pone.0045519] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2012] [Accepted: 08/20/2012] [Indexed: 01/30/2023] Open
Abstract
Sex chromosomes are an ideal system to study processes connected with suppressed recombination. We found evidence of microsatellite expansion, on the relatively young Y chromosome of the dioecious plant sorrel (Rumex acetosa, XY1Y2 system), but no such expansion on the more ancient Y chromosomes of liverwort (Marchantia polymorpha) and human. The most expanding motifs were AC and AAC, which also showed periodicity of array length, indicating the importance of beginnings and ends of arrays. Our data indicate that abundance of microsatellites in genomes depends on the inherent expansion potential of specific motifs, which could be related to their stability and ability to adopt unusual DNA conformations. We also found that the abundance of microsatellites is higher in the neighborhood of transposable elements (TEs) suggesting that microsatellites are probably targets for TE insertions. This evidence suggests that microsatellite expansion is an early event shaping the Y chromosome where this process is not opposed by recombination, while accumulation of TEs and chromosome shrinkage predominate later.
Collapse
Affiliation(s)
- Eduard Kejnovský
- Department of Plant Developmental Genetics, Institute of Biophysics, Academy of Sciences of the Czech Republic, Brno, Czech Republic.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
24
|
Mahfooz S, Singh P, Maurya DK, Yadav MC, Tahoor A, Sahay H, Srivastava A, Prakash A. Microsatellite repeat dynamics in mitochondrial genomes of phytopathogenic fungi: frequency and distribution in the genic and intergenic regions. Bioinformation 2012; 8:1171-5. [PMID: 23275715 PMCID: PMC3530887 DOI: 10.6026/97320630081171] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2012] [Accepted: 11/05/2012] [Indexed: 11/23/2022] Open
Abstract
The frequency and distribution of microsatellites were analyzed in the 19 mitogenomes of phytopathogenic fungi covering five phyla. Our analysis revealed that in all the mitogenomes studied, the frequency and relative abundance varied, and it was neither influenced by genome size nor by GC content. SSRs were found to be differential distributed in genic and intergenic regions. An average of 5.14 (23.6%) SSRs were present in genic sequences and 21.7 (76.4%) SSRs were located in the intergenic sequences. Relative abundance of SSRs in mitogenomes was the highest in Aspergillus tubigensis, whereas, it was the least in Phaeosphaeria nodurum, the average being 0.45. Trinucleotide repeats were the most abundant motifs in the genic and intergenic regions of the mitogenomes of the phytopathogenic fungi. Among the genes, cox1 harbors the maximum SSRs, whereas cox3 and nad 7 contain the least. Based on the presence of SSRs in a particular gene, genetic relationships among individual organisms were also established.
Collapse
Affiliation(s)
- Sahil Mahfooz
- Department of Biotechnology and Bioinformatics, Barkatullah University, Bhopal 462 026, Madhya Pradesh, India
| | - Pallavi Singh
- National Bureau of Agriculturally Important Microorganisms, Mau 275 101, Uttar Pradesh, India
| | - Deepak K Maurya
- National Bureau of Agriculturally Important Microorganisms, Mau 275 101, Uttar Pradesh, India
| | - Mahesh C Yadav
- National Research Centre on DNA Fingerprinting, National Bureau of Plant Genetic Resources, Pusa Campus, New Delhi 110 012, India
| | - Azram Tahoor
- Department of Wildlife Science, Aligarh Muslim University, Aligarh 202002, Uttar Pradesh, India
| | - Harmesh Sahay
- National Bureau of Agriculturally Important Microorganisms, Mau 275 101, Uttar Pradesh, India
| | - Arpita Srivastava
- National Bureau of Agriculturally Important Microorganisms, Mau 275 101, Uttar Pradesh, India
| | - Anil Prakash
- Department of Biotechnology and Bioinformatics, Barkatullah University, Bhopal 462 026, Madhya Pradesh, India
| |
Collapse
|
25
|
Scala C, Tian X, Mehdiabadi NJ, Smith MH, Saxer G, Stephens K, Buzombo P, Strassmann JE, Queller DC. Amino acid repeats cause extraordinary coding sequence variation in the social amoeba Dictyostelium discoideum. PLoS One 2012; 7:e46150. [PMID: 23029418 PMCID: PMC3460934 DOI: 10.1371/journal.pone.0046150] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2012] [Accepted: 08/28/2012] [Indexed: 12/19/2022] Open
Abstract
Protein sequences are normally the most conserved elements of genomes owing to purifying selection to maintain their functions. We document an extraordinary amount of within-species protein sequence variation in the model eukaryote Dictyostelium discoideum stemming from triplet DNA repeats coding for long strings of single amino acids. D. discoideum has a very large number of such strings, many of which are polyglutamine repeats, the same sequence that causes various human neurological disorders in humans, like Huntington’s disease. We show here that D. discoideum coding repeat loci are highly variable among individuals, making D. discoideum a candidate for the most variable proteome. The coding repeat loci are not significantly less variable than similar non-coding triplet repeats. This pattern is consistent with these amino-acid repeats being largely non-functional sequences evolving primarily by mutation and drift.
Collapse
Affiliation(s)
- Clea Scala
- Department of Ecology and Evolutionary Biology, Rice University, Houston, Texas, United States of America
| | - Xiangjun Tian
- Department of Biology, Washington University in St. Louis, St. Louis, Missouri, United States of America
| | - Natasha J. Mehdiabadi
- Department of Ecology and Evolutionary Biology, Rice University, Houston, Texas, United States of America
| | - Margaret H. Smith
- Department of Ecology and Evolutionary Biology, Rice University, Houston, Texas, United States of America
| | - Gerda Saxer
- Department of Biochemistry and Cell Biology, Rice University, Houston, Texas, United States of America
| | - Katie Stephens
- Department of Ecology and Evolutionary Biology, Rice University, Houston, Texas, United States of America
| | - Prince Buzombo
- Department of Ecology and Evolutionary Biology, Rice University, Houston, Texas, United States of America
| | - Joan E. Strassmann
- Department of Biology, Washington University in St. Louis, St. Louis, Missouri, United States of America
| | - David C. Queller
- Department of Biology, Washington University in St. Louis, St. Louis, Missouri, United States of America
- * E-mail:
| |
Collapse
|
26
|
Differential distribution and occurrence of simple sequence repeats in diverse geminivirus genomes. Virus Genes 2012; 45:556-66. [DOI: 10.1007/s11262-012-0802-1] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2012] [Accepted: 07/31/2012] [Indexed: 01/13/2023]
|
27
|
Gemayel R, Cho J, Boeynaems S, Verstrepen KJ. Beyond junk-variable tandem repeats as facilitators of rapid evolution of regulatory and coding sequences. Genes (Basel) 2012; 3:461-80. [PMID: 24704980 PMCID: PMC3899988 DOI: 10.3390/genes3030461] [Citation(s) in RCA: 81] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2012] [Revised: 07/19/2012] [Accepted: 07/21/2012] [Indexed: 01/19/2023] Open
Abstract
Copy Number Variations (CNVs) and Single Nucleotide Polymorphisms (SNPs) have been the major focus of most large-scale comparative genomics studies to date. Here, we discuss a third, largely ignored, type of genetic variation, namely changes in tandem repeat number. Historically, tandem repeats have been designated as non functional “junk” DNA, mostly as a result of their highly unstable nature. With the exception of tandem repeats involved in human neurodegenerative diseases, repeat variation was often believed to be neutral with no phenotypic consequences. Recent studies, however, have shown that as many as 10% to 20% of coding and regulatory sequences in eukaryotes contain an unstable repeat tract. Contrary to initial suggestions, tandem repeat variation can have useful phenotypic consequences. Examples include rapid variation in microbial cell surface, tuning of internal molecular clocks in flies and the dynamic morphological plasticity in mammals. As such, tandem repeats can be useful functional elements that facilitate evolvability and rapid adaptation.
Collapse
Affiliation(s)
- Rita Gemayel
- Laboratory for Systems Biology, VIB, Gaston Geenslaan 1, B-3001 Heverlee, Belgium.
| | - Janice Cho
- Laboratory for Systems Biology, VIB, Gaston Geenslaan 1, B-3001 Heverlee, Belgium.
| | - Steven Boeynaems
- Laboratory for Systems Biology, VIB, Gaston Geenslaan 1, B-3001 Heverlee, Belgium.
| | - Kevin J Verstrepen
- Laboratory for Systems Biology, VIB, Gaston Geenslaan 1, B-3001 Heverlee, Belgium.
| |
Collapse
|
28
|
Kuntal H, Sharma V. In silico analysis of SSRs in mitochondrial genomes of plants. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2011; 15:783-9. [PMID: 22011339 DOI: 10.1089/omi.2011.0074] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
Simple sequence repeats (SSRs) or microsatellites constitute a countable portion of genomes. However, the significance of SSRs in organelle genomes has not been completely understood. The availability of organelle genome sequences allows us to understand the organization of SSRs in their genic and intergenic regions. In the current study we surveyed the patterns of SSRs in mitochondrial genomes of different taxa of plants. A total of 16 mitochondrial genomes, from algae to angiosperms, have been considered to analyze the pattern of simple sequence repeats present in them. Based on study, the mononucleotide repeats of A/T were found to be more prevalent in mitochondrial genomes over other repeat types. The dinucleotides repeats, TA/AT, were the second most numerous, whereas tri-, tetra-, and pentanucleotide repeats were in less number and present in intronic or intergenic portions only. Mononucleotide repeats prevailed in protein-coding exonic portions of all organisms. These results indicates that microsatellite pattern in mitochondrial genomes is different from nuclear genomes and also focuses on organization and diversity at SSR locuses in mitochondrial genomes. This is the novel report of microsatellite polymorphism in plant mitochondrion on whole genome level.
Collapse
Affiliation(s)
- Himani Kuntal
- Department of Biosciences and Biotechnology, Banasthali University, Banasthali, Rajasthan, India.
| | | |
Collapse
|
29
|
Kelkar YD, Eckert KA, Chiaromonte F, Makova KD. A matter of life or death: how microsatellites emerge in and vanish from the human genome. Genome Res 2011; 21:2038-48. [PMID: 21994250 DOI: 10.1101/gr.122937.111] [Citation(s) in RCA: 59] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Microsatellites--tandem repeats of short DNA motifs--are abundant in the human genome and have high mutation rates. While microsatellite instability is implicated in numerous genetic diseases, the molecular processes involved in their emergence and disappearance are still not well understood. Microsatellites are hypothesized to follow a life cycle, wherein they are born and expand into adulthood, until their degradation and death. Here we identified microsatellite births/deaths in human, chimpanzee, and orangutan genomes, using macaque and marmoset as outgroups. We inferred mutations causing births/deaths based on parsimony, and investigated local genomic environments affecting them. We also studied birth/death patterns within transposable elements (Alus and L1s), coding regions, and disease-associated loci. We observed that substitutions were the predominant cause for births of short microsatellites, while insertions and deletions were important for births of longer microsatellites. Substitutions were the cause for deaths of microsatellites of virtually all lengths. AT-rich L1 sequences exhibited elevated frequency of births/deaths over their entire length, while GC-rich Alus only in their 3' poly(A) tails and middle A-stretches, with differences depending on transposable element integration timing. Births/deaths were strongly selected against in coding regions. Births/deaths occurred in genomic regions with high substitution rates, protomicrosatellite content, and L1 density, but low GC content and Alu density. The majority of the 17 disease-associated microsatellites examined are evolutionarily ancient (were acquired by the common ancestor of simians). Our genome-wide investigation of microsatellite life cycle has fundamental applications for predicting the susceptibility of birth/death of microsatellites, including many disease-causing loci.
Collapse
Affiliation(s)
- Yogeshwar D Kelkar
- Department of Biology, Penn State University, University Park, Pennsylvania 16802, USA
| | | | | | | |
Collapse
|
30
|
Cuadrado A, Jouve N. Novel simple sequence repeats (SSRs) detected by ND-FISH in heterochromatin of Drosophila melanogaster. BMC Genomics 2011; 12:205. [PMID: 21521504 PMCID: PMC3114746 DOI: 10.1186/1471-2164-12-205] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2011] [Accepted: 04/26/2011] [Indexed: 12/02/2022] Open
Abstract
Background In recent years, substantial progress has been made in understanding the organization of sequences in heterochromatin regions containing single-copy genes and transposable elements. However, the sequence and organization of tandem repeat DNA sequences, which are by far the majority fraction of D. melanogaster heterochromatin, are little understood. Results This paper reports that the heterochromatin, as well as containing long tandem arrays of pentanucleotide satellites (AAGAG, AAGAC, AATAT, AATAC and AACAC), is also enriched in other simple sequence repeats (SSRs) such as A, AC, AG, AAG, ACT, GATA and GACA. Non-denaturing FISH (ND-FISH) showed these SSRs to localize to the chromocentre of polytene chromosomes, and was used to map them on mitotic chromosomes. Different distributions were detected ranging from single heterochromatic clusters to complex combinations on different chromosomes. ND-FISH performed on extended DNA fibres, along with Southern blotting, showed the complex organization of these heterochromatin sequences in long tracts, and revealed subclusters of SSRs (several kilobase in length) flanked by other DNA sequences. The chromosomal characterization of C, AAC, AGG, AAT, CCG, ACG, AGC, ATC and ACC provided further detailed information on the SSR content of D. melanogaster at the whole genome level. Conclusion These data clearly show the variation in the abundance of different SSR motifs and reveal their non-random distribution within and between chromosomes. The greater representation of certain SSRs in D. melanogaster heterochromatin suggests that its complexity may be greater than previously thought.
Collapse
Affiliation(s)
- Angeles Cuadrado
- Department of Cell Biology and Genetics, University of Alcalá de Henares, 28871 Alcalá de Henares, Madrid, Spain.
| | | |
Collapse
|
31
|
Tian X, Strassmann JE, Queller DC. Genome nucleotide composition shapes variation in simple sequence repeats. Mol Biol Evol 2010; 28:899-909. [PMID: 20943830 DOI: 10.1093/molbev/msq266] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
Simple sequence repeats (SSRs) or microsatellites are a common component of genomes but vary greatly across species in their abundance. We tested the hypothesis that this variation is due in part to AT/GC content of genomes, with genomes biased toward either high AT or high CG generating more short random repeats that are long enough to enhance expansion through slippage during replication. To test this hypothesis, we identified repeats with perfect tandem iterations of 1-6 bp from 25 protists with complete or near-complete genome sequences. As expected, the density and the frequency are highly related to genome AT content, with excellent fits to quadratic regressions with minima near a 50% AT content and rising toward both extremes. Within species, the same trends hold, except the limited variation in AT content within each species places each mainly on the descending (GC rich), middle, or ascending (AT rich) part of the curve. The base usages of repeat motifs are also significantly correlated with genome nucleotide compositions: Percentages of AT-rich motifs rise with the increase of genome AT content but vice versa for GC-rich subgroups. Amino acid homopolymer repeats also show the expected quadratic relationship, with higher abundance in species with AT content biased in either direction. Our results show that genome nucleotide composition explains up to half of the variance in the abundance and motif constitution of SSRs.
Collapse
Affiliation(s)
- Xiangjun Tian
- Department of Ecology and Evolutionary Biology, Rice University, USA
| | | | | |
Collapse
|
32
|
Ellison CK, Shaw KL. Mining non-model genomic libraries for microsatellites: BAC versus EST libraries and the generation of allelic richness. BMC Genomics 2010; 11:428. [PMID: 20624300 PMCID: PMC2996956 DOI: 10.1186/1471-2164-11-428] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2010] [Accepted: 07/12/2010] [Indexed: 11/24/2022] Open
Abstract
BACKGROUND Simple sequence repeats (SSRs) are tandemly repeated sequence motifs common in genomic nucleotide sequence that often harbor significant variation in repeat number. Frequently used as molecular markers, SSRs are increasingly identified via in silico approaches. Two common classes of genomic resources that can be mined are bacterial artificial chromosome (BAC) libraries and expressed sequence tag (EST) libraries. RESULTS 288 SSR loci were screened in the rapidly radiating Hawaiian swordtail cricket genus Laupala. SSRs were more densely distributed and contained longer repeat structures in BAC library-derived sequence than in EST library-derived sequence, although neither repeat density nor length was exceptionally elevated despite the relatively large genome size of Laupala. A non-random distribution favoring AT-rich SSRs was observed. Allelic diversity of SSRs was positively correlated with repeat length and was generally higher in AT-rich repeat motifs. CONCLUSION The first large-scale survey of Orthopteran SSR allelic diversity is presented. Selection contributes more strongly to the size and density distributions of SSR loci derived from EST library sequence than from BAC library sequence, although all SSRs likely are subject to similar physical and structural constraints, such as slippage of DNA replication machinery, that may generate increased allelic diversity in AT-rich sequence motifs. Although in silico approaches work well for SSR locus identification in both EST and BAC libraries, BAC library sequence and AT-rich repeat motifs are generally superior SSR development resources for most applications.
Collapse
Affiliation(s)
| | - Kerry L Shaw
- Department of Neurobiology and Behavior, Cornell University, Ithaca, NY 14850, USA
| |
Collapse
|
33
|
Datta SR, McQuillin A, Rizig M, Blaveri E, Thirumalai S, Kalsi G, Lawrence J, Bass NJ, Puri V, Choudhury K, Pimm J, Crombie C, Fraser G, Walker N, Curtis D, Zvelebil M, Pereira A, Kandaswamy R, St Clair D, Gurling HMD. A threonine to isoleucine missense mutation in the pericentriolar material 1 gene is strongly associated with schizophrenia. Mol Psychiatry 2010; 15:615-28. [PMID: 19048012 DOI: 10.1038/mp.2008.128] [Citation(s) in RCA: 44] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
Abstract
Markers at the pericentriolar material 1 gene (PCM1) have shown genetic association with schizophrenia in both a University College London (UCL) and a USA-based case-control sample. In this paper we report a statistically significant replication of the PCM1 association in a large Scottish case-control sample from Aberdeen. Resequencing of the genomic DNA from research volunteers who had inherited haplotypes associated with schizophrenia showed a threonine to isoleucine missense mutation in exon 24 which was likely to change the structure and function of PCM1 (rs370429). This mutation was found only as a heterozygote in 98 schizophrenic research subjects and controls out of 2246 case and control research subjects. Among the 98 carriers of rs370429, 67 were affected with schizophrenia. The same alleles and haplotypes were associated with schizophrenia in both the London and Aberdeen samples. Another potential aetiological base pair change in PCM1 was rs445422, which altered a splice site signal. A further mutation, rs208747, was shown by electrophoretic mobility shift assays to create or destroy a promoter transcription factor site. Five further non-synonymous changes in exons were also found. Genotyping of the new variants discovered in the UCL case-control sample strengthened the evidence for allelic and haplotypic association (P=0.02-0.0002). Given the number and identity of the haplotypes associated with schizophrenia, further aetiological base pair changes must exist within and around the PCM1 gene. PCM1 protein has been shown to interact directly with the disrupted-in-schizophrenia 1 (DISC1) protein, Bardet-Biedl syndrome 4, and Huntingtin-associated protein 1, and is important in neuronal cell growth. In a separate study we found that clozapine but not haloperidol downregulated PCM1 expression in the mouse brain. We hypothesize that mutant PCM1 may be responsible for causing a subtype of schizophrenia through abnormal cell division and abnormal regeneration in dividing cells in the central nervous system. This is supported by our previous finding of orbitofrontal volumetric deficits in PCM1-associated schizophrenia patients as opposed to temporal pole deficits in non-PCM1-associated schizophrenia patients. Caution needs to be exercised in interpreting the actual biological effects of the mutations we have found without further cell biology. However, the DNA changes we have found deserve widespread genotyping in multiple case-control populations.
Collapse
Affiliation(s)
- S R Datta
- Molecular Psychiatry Laboratory, Research Department of Mental Health Sciences, University College London Medical School, Windeyer Institute of Medical Sciences, London, UK
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
34
|
Shikano T, Ramadevi J, Shimada Y, Merilä J. Utility of sequenced genomes for microsatellite marker development in non-model organisms: a case study of functionally important genes in nine-spined sticklebacks (Pungitius pungitius). BMC Genomics 2010; 11:334. [PMID: 20507571 PMCID: PMC2891615 DOI: 10.1186/1471-2164-11-334] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2010] [Accepted: 05/27/2010] [Indexed: 12/04/2022] Open
Abstract
Background Identification of genes involved in adaptation and speciation by targeting specific genes of interest has become a plausible strategy also for non-model organisms. We investigated the potential utility of available sequenced fish genomes to develop microsatellite (cf. simple sequence repeat, SSR) markers for functionally important genes in nine-spined sticklebacks (Pungitius pungitius), as well as cross-species transferability of SSR primers from three-spined (Gasterosteus aculeatus) to nine-spined sticklebacks. In addition, we examined the patterns and degree of SSR conservation between these species using their aligned sequences. Results Cross-species amplification success was lower for SSR markers located in or around functionally important genes (27 out of 158) than for those randomly derived from genomic (35 out of 101) and cDNA (35 out of 87) libraries. Polymorphism was observed at a large proportion (65%) of the cross-amplified loci independently of SSR type. To develop SSR markers for functionally important genes in nine-spined sticklebacks, SSR locations were surveyed in or around 67 target genes based on the three-spined stickleback genome and these regions were sequenced with primers designed from conserved sequences in sequenced fish genomes. Out of the 81 SSRs identified in the sequenced regions (44,084 bp), 57 exhibited the same motifs at the same locations as in the three-spined stickleback. Di- and trinucleotide SSRs appeared to be highly conserved whereas mononucleotide SSRs were less so. Species-specific primers were designed to amplify 58 SSRs using the sequences of nine-spined sticklebacks. Conclusions Our results demonstrated that a large proportion of SSRs are conserved in the species that have diverged more than 10 million years ago. Therefore, the three-spined stickleback genome can be used to predict SSR locations in the nine-spined stickleback genome. While cross-species utility of SSR primers is limited due to low amplification success, SSR markers can be developed for target genes and genomic regions using our approach, which should be also applicable to other non-model organisms. The SSR markers developed in this study should be useful for identification of genes responsible for phenotypic variation and adaptive divergence of nine-spined stickleback populations, as well as for constructing comparative gene maps of nine-spined and three-spined sticklebacks.
Collapse
Affiliation(s)
- Takahito Shikano
- Ecological Genetics Research Unit, Department of Biosciences, University of Helsinki, P,O, Box 65, FI-00014, Helsinki, Finland.
| | | | | | | |
Collapse
|
35
|
Santos J, Serra L, Solé E, Pascual M. FISH mapping of microsatellite loci from Drosophila subobscura and its comparison to related species. Chromosome Res 2010; 18:213-26. [PMID: 20198419 DOI: 10.1007/s10577-010-9112-4] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2009] [Revised: 01/06/2010] [Accepted: 01/08/2010] [Indexed: 12/24/2022]
Abstract
Microsatellites are highly polymorphic markers that are distributed through all the genome being more abundant in non-coding regions. Whether they are neutral or under selection, these markers if localized can be used as co-dominant molecular markers to explore the dynamics of the evolutionary processes. Their cytological localization can allow identifying genes under selection, inferring recombination from a genomic point of view, or screening for the genomic reorganizations occurring during the evolution of a lineage, among others. In this paper, we report for the first time the localization of microsatellite loci by fluorescent in situ hybridization on Drosophila polytene chromosomes. In Drosophila subobscura, 72 dinucleotide microsatellite loci were localized by fluorescent in situ hybridization yielding unique hybridization signals. In the sex chromosome, microsatellite distribution was not uniform and its density was higher than in autosomes. We identified homologous segments to the sequence flanking the microsatellite loci by browsing the genome sequence of Drosophila pseudoobscura and Drosophila melanogaster. Their localization supports the conservation of Muller's chromosomal elements among Drosophila species and the existence of multiple intrachromosomal rearrangements within each evolutionary lineage. Finally, the lack of microsatellite repeats in the homologous D. melanogaster sequences suggests convergent evolution for high microsatellite density in the distal part of the X chromosome.
Collapse
Affiliation(s)
- Josiane Santos
- Departament de Genètica, Universitat de Barcelona, Spain
| | | | | | | |
Collapse
|
36
|
|
37
|
|
38
|
Identifying cis-regulatory sequences by word profile similarity. PLoS One 2009; 4:e6901. [PMID: 19730735 PMCID: PMC2731932 DOI: 10.1371/journal.pone.0006901] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2008] [Accepted: 08/07/2009] [Indexed: 12/13/2022] Open
Abstract
Background Recognizing regulatory sequences in genomes is a continuing challenge, despite a wealth of available genomic data and a growing number of experimentally validated examples. Methodology/Principal Findings We discuss here a simple approach to search for regulatory sequences based on the compositional similarity of genomic regions and known cis-regulatory sequences. This method, which is not limited to searching for predefined motifs, recovers sequences known to be under similar regulatory control. The words shared by the recovered sequences often correspond to known binding sites. Furthermore, we show that although local word profile clustering is predictive for the regulatory sequences involved in blastoderm segmentation, local dissimilarity is a more universal feature of known regulatory sequences in Drosophila. Conclusions/Significance Our method leverages sequence motifs within a known regulatory sequence to identify co-regulated sequences without explicitly defining binding sites. We also show that regulatory sequences can be distinguished from surrounding sequences by local sequence dissimilarity, a novel feature in identifying regulatory sequences across a genome. Source code for WPH-finder is available for download at http://rana.lbl.gov/downloads/wph.tar.gz.
Collapse
|
39
|
Zhan S, Huang J, Guo Q, Zhao Y, Li W, Miao X, Goldsmith MR, Li M, Huang Y. An integrated genetic linkage map for silkworms with three parental combinations and its application to the mapping of single genes and QTL. BMC Genomics 2009; 10:389. [PMID: 19698097 PMCID: PMC2741490 DOI: 10.1186/1471-2164-10-389] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2009] [Accepted: 08/21/2009] [Indexed: 12/17/2022] Open
Abstract
Background Bombyx mori, the domesticated silkworm, is a well-studied model insect with great economic and scientific significance. Although more than 400 mutations have been described in silkworms, most have not been identified, especially those affecting economically-important traits. Simple sequence repeats (SSRs) are effective and economical tools for mapping traits and genetic improvement. The current SSR linkage map is of low density and contains few polymorphisms. The purpose of this work was to develop a dense and informative linkage map that would assist in the preliminary mapping and dissection of quantitative trait loci (QTL) in a variety of silkworm strains. Results Through an analysis of > 50,000 genotypes across new mapping populations, we constructed two new linkage maps covering 27 assigned chromosomes and merged the data with previously reported data sets. The integrated consensus map contains 692 unique SSR sites, improving the density from 6.3 cM in the previous map to 4.8 cM. We also developed 497 confirmed neighboring markers for corresponding low-polymorphism sites, with 244 having polymorphisms. Large-scale statistics on the SSR type were suggestive of highly efficient markers, based upon which we searched 16,462 available genomic scaffolds for SSR loci. With the newly constructed map, we mapped single-gene traits, the QTL of filaments, and a number of ribosomal protein genes. Conclusion The integrated map produced in this study is a highly efficient genetic tool for the high-throughput mapping of single genes and QTL. Compared to previous maps, the current map offers a greater number of markers and polymorphisms; thus, it may be used as a resource for marker-assisted breeding.
Collapse
Affiliation(s)
- Shuai Zhan
- Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, 200032, PR China.
| | | | | | | | | | | | | | | | | |
Collapse
|
40
|
Stratikopoulos EE, Augustinos AA, Pavlopoulos ID, Economou KP, Mintzas A, Mathiopoulos KD, Zacharopoulou A. Isolation and characterization of microsatellite markers from the Mediterranean fruit fly, Ceratitis capitata: cross-species amplification in other Tephritidae species reveals a varying degree of transferability. Mol Genet Genomics 2009; 282:283-306. [PMID: 19544072 DOI: 10.1007/s00438-009-0465-3] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2008] [Accepted: 06/05/2009] [Indexed: 11/25/2022]
Abstract
The Mediterranean fruit fly, Ceratitis capitata, is a pest of major economic importance and has become a model for the development of SIT control programs for insect pests. Significant information has been accumulated on classical and population genetics of this species during the past 2 decades. However, the availability of molecular markers is limited. Here, we present the isolation and characterization of 159 microsatellite clones and the development of 108 polymorphic microsatellite markers for this insect pest. Mapping by in situ hybridization to polytene chromosomes of 21 microsatellite clones enriched the cytogenetic map that was previously constructed by our group. The enriched map provides a large number of STSs for future genome mapping projects. Cross-species amplification of these microsatellite loci in 12 Tephritidae species and sequence analysis of several amplification products indicated a varying degree of transferability and their possible usefulness as molecular and genetic markers in these species where genetic and molecular tools are limited.
Collapse
|
41
|
Guo WJ, Ling J, Li P. Consensus features of microsatellite distribution: Microsatellite contents are universally correlated with recombination rates and are preferentially depressed by centromeres in multicellular eukaryotic genomes. Genomics 2009; 93:323-31. [DOI: 10.1016/j.ygeno.2008.12.009] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2008] [Revised: 12/14/2008] [Accepted: 12/16/2008] [Indexed: 10/21/2022]
|
42
|
Augustinos AA, Stratikopoulos EE, Drosopoulou E, Kakani EG, Mavragani-Tsipidou P, Zacharopoulou A, Mathiopoulos KD. Isolation and characterization of microsatellite markers from the olive fly, Bactrocera oleae, and their cross-species amplification in the Tephritidae family. BMC Genomics 2008; 9:618. [PMID: 19099577 PMCID: PMC2635384 DOI: 10.1186/1471-2164-9-618] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2008] [Accepted: 12/19/2008] [Indexed: 11/10/2022] Open
Abstract
Background The Tephritidae family of insects includes the most important agricultural pests of fruits and vegetables, belonging mainly to four genera (Bactrocera, Ceratitis, Anastrepha and Rhagoletis). The olive fruit fly, Bactrocera oleae, is the major pest of the olive fruit. Currently, its control is based on chemical insecticides. Environmentally friendlier methods have been attempted in the past (Sterile Insect Technique), albeit with limited success. This was mainly attributed to the lack of knowledge on the insect's behaviour, ecology and genetic structure of natural populations. The development of molecular markers could facilitate the access in the genome and contribute to the solution of the aforementioned problems. We chose to focus on microsatellite markers due to their abundance in the genome, high degree of polymorphism and easiness of isolation. Results Fifty-eight microsatellite-containing clones were isolated from the olive fly, Bactrocera oleae, bearing a total of sixty-two discrete microsatellite motifs. Forty-two primer pairs were designed on the unique sequences flanking the microsatellite motif and thirty-one of them amplified a PCR product of the expected size. The level of polymorphism was evaluated against wild and laboratory flies and the majority of the markers (93.5%) proved highly polymorphic. Thirteen of them presented a unique position on the olive fly polytene chromosomes by in situ hybridization, which can serve as anchors to correlate future genetic and cytological maps of the species, as well as entry points to the genome. Cross-species amplification of these markers to eleven Tephritidae species and sequencing of thirty-one of the amplified products revealed a varying degree of conservation that declines outside the Bactrocera genus. Conclusion Microsatellite markers are very powerful tools for genetic and population analyses, particularly in species deprived of any other means of genetic analysis. The presented set of microsatellite markers possesses all features that would render them useful in such analyses. This could also prove helpful for species where SIT is a desired outcome, since the development of effective SIT can be aided by detailed knowledge at the genetic and molecular level. Furthermore, their presented efficacy in several other species of the Tephritidae family not only makes them useful for their analysis but also provides tools for phylogenetic comparisons among them.
Collapse
Affiliation(s)
- Antonios A Augustinos
- Department of Biochemistry and Biotechnology, University of Thessaly, Thessaly, Greece.
| | | | | | | | | | | | | |
Collapse
|
43
|
Kofler R, Schlötterer C, Luschützky E, Lelley T. Survey of microsatellite clustering in eight fully sequenced species sheds light on the origin of compound microsatellites. BMC Genomics 2008; 9:612. [PMID: 19091106 PMCID: PMC2644718 DOI: 10.1186/1471-2164-9-612] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2008] [Accepted: 12/17/2008] [Indexed: 01/24/2023] Open
Abstract
Background Compound microsatellites are a special variation of microsatellites in which two or more individual microsatellites are found directly adjacent to each other. Until now, such composite microsatellites have not been investigated in a comprehensive manner. Results Our in silico survey of microsatellite clustering in genomes of Homo sapiens, Maccaca mulatta, Mus musculus, Rattus norvegicus, Ornithorhynchus anatinus, Gallus gallus, Danio rerio and Drosophila melanogaster revealed an unexpected high abundance of compound microsatellites. About 4 – 25% of all microsatellites could be categorized as compound microsatellites. Compound microsatellites are approximately 15 times more frequent than expected under the assumption of a random distribution of microsatellites. Interestingly, microsatellites do not only tend to cluster but the adjacent repeat types of compound microsatellites have very similar motifs: in most cases (>90%) these motifs differ only by a single mutation (base substitution or indel). We propose that the majority of the compound microsatellites originates by duplication of imperfections in a microsatellite tract. This process occurs mostly at the end of a microsatellite, leading to a new repeat type and a potential microsatellite repeat track. Conclusion Our findings suggest a more dynamic picture of microsatellite evolution than previously believed. Imperfections within microsatellites might not only cause the "death" of microsatellites they might also result in their "birth".
Collapse
Affiliation(s)
- Robert Kofler
- University of Natural Resources and Applied Life Sciences, Department for Agrobiotechnology IFA-Tulln, Institute of Biotechnology in Plant Production, Tulln, Austria.
| | | | | | | |
Collapse
|
44
|
Richard GF, Kerrest A, Dujon B. Comparative genomics and molecular dynamics of DNA repeats in eukaryotes. Microbiol Mol Biol Rev 2008; 72:686-727. [PMID: 19052325 PMCID: PMC2593564 DOI: 10.1128/mmbr.00011-08] [Citation(s) in RCA: 334] [Impact Index Per Article: 20.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Repeated elements can be widely abundant in eukaryotic genomes, composing more than 50% of the human genome, for example. It is possible to classify repeated sequences into two large families, "tandem repeats" and "dispersed repeats." Each of these two families can be itself divided into subfamilies. Dispersed repeats contain transposons, tRNA genes, and gene paralogues, whereas tandem repeats contain gene tandems, ribosomal DNA repeat arrays, and satellite DNA, itself subdivided into satellites, minisatellites, and microsatellites. Remarkably, the molecular mechanisms that create and propagate dispersed and tandem repeats are specific to each class and usually do not overlap. In the present review, we have chosen in the first section to describe the nature and distribution of dispersed and tandem repeats in eukaryotic genomes in the light of complete (or nearly complete) available genome sequences. In the second part, we focus on the molecular mechanisms responsible for the fast evolution of two specific classes of tandem repeats: minisatellites and microsatellites. Given that a growing number of human neurological disorders involve the expansion of a particular class of microsatellites, called trinucleotide repeats, a large part of the recent experimental work on microsatellites has focused on these particular repeats, and thus we also review the current knowledge in this area. Finally, we propose a unified definition for mini- and microsatellites that takes into account their biological properties and try to point out new directions that should be explored in a near future on our road to understanding the genetics of repeated sequences.
Collapse
Affiliation(s)
- Guy-Franck Richard
- Institut Pasteur, Unité de Génétique Moléculaire des Levures, CNRS, URA2171, Université Pierre et Marie Curie, UFR927, 25 rue du Dr. Roux, F-75015, Paris, France.
| | | | | |
Collapse
|
45
|
Wang S, Lorenzen MD, Beeman RW, Brown SJ. Analysis of repetitive DNA distribution patterns in the Tribolium castaneum genome. Genome Biol 2008; 9:R61. [PMID: 18366801 PMCID: PMC2397513 DOI: 10.1186/gb-2008-9-3-r61] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2007] [Revised: 01/19/2008] [Accepted: 03/26/2008] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Insect genomes vary widely in size, a large fraction of which is often devoted to repetitive DNA. Re-association kinetics indicate that up to 42% of the genome of the red flour beetle, Tribolium castaneum, is repetitive. Analysis of the abundance and distribution of repetitive DNA in the recently sequenced genome of T. castaneum is important for understanding the structure and function of its genome. RESULTS Using TRF, TEpipe and RepeatScout we found that approximately 30% of the T. castaneum assembled genome is composed of repetitive DNA. Of this, 17% is found in tandem arrays and the remaining 83% is dispersed, including transposable elements, which in themselves constitute 5-6% of the genome. RepeatScout identified 31 highly repetitive DNA elements with repeat units longer than 100 bp, which constitute 7% of the genome; 65% of these highly repetitive elements and 74% of transposable elements accumulate in regions representing 40% of the assembled genome that is anchored to chromosomes. These regions tend to occur near one end of each chromosome, similar to previously described blocks of pericentric heterochromatin. They contain fewer genes with longer introns, and often correspond with regions of low recombination in the genetic map. CONCLUSION Our study found that transposable elements and other repetitive DNA accumulate in certain regions in the assembled T. castaneum genome. Several lines of evidence suggest these regions are derived from the large blocks of pericentric heterochromatin in T. castaneum chromosomes.
Collapse
Affiliation(s)
- Suzhi Wang
- Department of Biology, Kansas State University, Manhattan, KS 66506, USA.
| | | | | | | |
Collapse
|
46
|
Xu Z, Gutierrez L, Hitchens M, Scherer S, Sater AK, Wells DE. Distribution of polymorphic and non-polymorphic microsatellite repeats in Xenopus tropicalis. Bioinform Biol Insights 2008; 2:157-69. [PMID: 19812773 PMCID: PMC2735965 DOI: 10.4137/bbi.s561] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
The results of our bioinformatics analysis have found over 91,000 di-, tri-, and tetranucleotide microsatellites in our survey of 25% of the X. tropicalis genome, suggesting there may be over 360,000 within the entire genome. Within the X. tropicalis genome, dinucleotide (78.7%) microsatellites vastly out numbered tri- and tetranucleotide microsatellites. Similarly, AT-rich repeats are overwhelmingly dominant. The four AT-only motifs (AT, AAT, AAAT, and AATT) account for 51,858 out of 91,304 microsatellites found. Individually, AT microsatellites were the most common repeat found, representing over half of all di-, tri-, and tetranucleotide microsatellites. This contrasts with data from other studies, which show that AC is the most frequent microsatellite in vertebrate genomes (Toth et al. 2000). In addition, we have determined the rate of polymorphism for 5,128 non-redundant microsatellites, embedded in unique sequences. Interestingly, this subgroup of microsatellites was determined to have significantly longer repeats than genomic microsatellites as a whole. In addition, microsatellite loci with tandem repeat lengths more than 30 bp exhibited a significantly higher degree of polymorphism than other loci. Pairwise comparisons show that tetranucleotide microsatellites have the highest polymorphic rates. In addition, AAT and ATC showed significant higher polymorphism than other trinucleotide microsatellites, while AGAT and AAAG were significantly more polymorphic than other tetranucleotide microsatellites.
Collapse
Affiliation(s)
- Zhenkang Xu
- Department of Biology and Biochemistry, University of Houston, Houston, TX 77204, USA
| | | | | | | | | | | |
Collapse
|
47
|
Kim TS, Booth JG, Gauch HG, Sun Q, Park J, Lee YH, Lee K. Simple sequence repeats in Neurospora crassa: distribution, polymorphism and evolutionary inference. BMC Genomics 2008; 9:31. [PMID: 18215294 PMCID: PMC2257937 DOI: 10.1186/1471-2164-9-31] [Citation(s) in RCA: 64] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2007] [Accepted: 01/23/2008] [Indexed: 11/10/2022] Open
Abstract
Background Simple sequence repeats (SSRs) have been successfully used for various genetic and evolutionary studies in eukaryotic systems. The eukaryotic model organism Neurospora crassa is an excellent system to study evolution and biological function of SSRs. Results We identified and characterized 2749 SSRs of 963 SSR types in the genome of N. crassa. The distribution of tri-nucleotide (nt) SSRs, the most common SSRs in N. crassa, was significantly biased in exons. We further characterized the distribution of 19 abundant SSR types (AST), which account for 71% of total SSRs in the N. crassa genome, using a Poisson log-linear model. We also characterized the size variation of SSRs among natural accessions using Polymorphic Index Content (PIC) and ANOVA analyses and found that there are genome-wide, chromosome-dependent and local-specific variations. Using polymorphic SSRs, we have built linkage maps from three line-cross populations. Conclusion Taking our computational, statistical and experimental data together, we conclude that 1) the distributions of the SSRs in the sequenced N. crassa genome differ systematically between chromosomes as well as between SSR types, 2) the size variation of tri-nt SSRs in exons might be an important mechanism in generating functional variation of proteins in N. crassa, 3) there are different levels of evolutionary forces in variation of amino acid repeats, and 4) SSRs are stable molecular markers for genetic studies in N. crassa.
Collapse
Affiliation(s)
- Tae-Sung Kim
- Department of Plant Pathology and Plant-Microbe Biology, Cornell University, Ithaca, NY 14853 USA.
| | | | | | | | | | | | | |
Collapse
|
48
|
Gallach M, Arnau V, Marín I. Global patterns of sequence evolution in Drosophila. BMC Genomics 2007; 8:408. [PMID: 17996078 PMCID: PMC2180185 DOI: 10.1186/1471-2164-8-408] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2007] [Accepted: 11/09/2007] [Indexed: 01/30/2023] Open
Abstract
Background Sequencing of the genomes of several Drosophila allows for the first precise analyses of how global sequence patterns change among multiple, closely related animal species. A basic question is whether there are characteristic features that differentiate chromosomes within a species or between different species. Results We explored the euchromatin of the chromosomes of seven Drosophila species to establish their global patterns of DNA sequence diversity. Between species, differences in the types and amounts of simple sequence repeats were found. Within each species, the autosomes have almost identical oligonucleotide profiles. However, X chromosomes and autosomes have, in all species, a qualitatively different composition. The X chromosomes are less complex than the autosomes, containing both a higher amount of simple DNA sequences and, in several cases, chromosome-specific repetitive sequences. Moreover, we show that the right arm of the X chromosome of Drosophila pseudoobscura, which evolved from an autosome 10 – 18 millions of years ago, has a composition which is identical to that of the original, left arm of the X chromosome. Conclusion The consistent differences among species, differences among X chromosomes and autosomes and the convergent evolution of X and neo-X chromosomes demonstrate that strong forces are acting on drosophilid genomes to generate peculiar chromosomal landscapes. We discuss the relationships of the patterns observed with differential recombination and mutation rates and with the process of dosage compensation.
Collapse
Affiliation(s)
- Miguel Gallach
- Instituto de Biomedicina de Valencia, Consejo Superior de Investigaciones Científicas (IBV-CSIC), Valencia, Spain.
| | | | | |
Collapse
|
49
|
Sharma PC, Grover A, Kahl G. Mining microsatellites in eukaryotic genomes. Trends Biotechnol 2007; 25:490-8. [PMID: 17945369 DOI: 10.1016/j.tibtech.2007.07.013] [Citation(s) in RCA: 170] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2007] [Revised: 07/12/2007] [Accepted: 07/31/2007] [Indexed: 12/13/2022]
Abstract
During recent decades, microsatellites have become the most popular source of genetic markers. More recently, the availability of enormous sequence data for a large number of eukaryotic genomes has accelerated research aimed at understanding the origin and functions of microsatellites and searching for new applications. This review presents recent developments of in silico mining of microsatellites to reveal various facets of the distribution and dynamics of microsatellites in eukaryotic genomes. Two aspects of microsatellite search strategies--using a suitable search tool and accessing a relevant microsatellite database--have been explored. Judicious microsatellite mining not only helps in addressing biological questions but also facilitates better exploitation of microsatellites for diverse applications.
Collapse
Affiliation(s)
- Prakash C Sharma
- University School of Biotechnology, Guru Gobind Singh Indraprastha University, Kashmere Gate, Delhi 110 006, India.
| | | | | |
Collapse
|
50
|
Shekar M, Karunasagar I, Karunasagar I. Abundance, composition and distribution of simple sequence repeats and dinucleotide compositional bias within WSSV genomes. J Genet 2007; 86:69-73. [PMID: 17656852 DOI: 10.1007/s12041-007-0010-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Malathi Shekar
- Department of Fishery Microbiology, UNESCO Centre for Marine Biotechnology, Karnataka Veterinary, Animal and Fishery Sciences University, College of Fisheries, Mangalore 575 002, India
| | | | | |
Collapse
|