1
|
Kristmundsdottir S, Jonsson H, Hardarson MT, Palsson G, Beyter D, Eggertsson HP, Gylfason A, Sveinbjornsson G, Holley G, Stefansson OA, Halldorsson GH, Olafsson S, Arnadottir GA, Olason PI, Eiriksson O, Masson G, Thorsteinsdottir U, Rafnar T, Sulem P, Helgason A, Gudbjartsson DF, Halldorsson BV, Stefansson K. Sequence variants affecting the genome-wide rate of germline microsatellite mutations. Nat Commun 2023; 14:3855. [PMID: 37386006 PMCID: PMC10310707 DOI: 10.1038/s41467-023-39547-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2022] [Accepted: 06/16/2023] [Indexed: 07/01/2023] Open
Abstract
Microsatellites are polymorphic tracts of short tandem repeats with one to six base-pair (bp) motifs and are some of the most polymorphic variants in the genome. Using 6084 Icelandic parent-offspring trios we estimate 63.7 (95% CI: 61.9-65.4) microsatellite de novo mutations (mDNMs) per offspring per generation, excluding one bp repeats motifs (homopolymers) the estimate is 48.2 mDNMs (95% CI: 46.7-49.6). Paternal mDNMs occur at longer repeats than maternal ones, which are in turn larger with a mean size of 3.4 bp vs 3.1 bp for paternal ones. mDNMs increase by 0.97 (95% CI: 0.90-1.04) and 0.31 (95% CI: 0.25-0.37) per year of father's and mother's age at conception, respectively. Here, we find two independent coding variants that associate with the number of mDNMs transmitted to offspring; The minor allele of a missense variant (allele frequency (AF) = 1.9%) in MSH2, a mismatch repair gene, increases transmitted mDNMs from both parents (effect: 13.1 paternal and 7.8 maternal mDNMs). A synonymous variant (AF = 20.3%) in NEIL2, a DNA damage repair gene, increases paternally transmitted mDNMs (effect: 4.4 mDNMs). Thus, the microsatellite mutation rate in humans is in part under genetic control.
Collapse
Affiliation(s)
- Snaedis Kristmundsdottir
- deCODE genetics / Amgen Inc., Reykjavik, Iceland
- School of Technology, Reykjavik University, Reykjavik, Iceland
| | | | - Marteinn T Hardarson
- deCODE genetics / Amgen Inc., Reykjavik, Iceland
- School of Technology, Reykjavik University, Reykjavik, Iceland
| | | | - Doruk Beyter
- deCODE genetics / Amgen Inc., Reykjavik, Iceland
| | | | | | | | | | | | - Gisli H Halldorsson
- deCODE genetics / Amgen Inc., Reykjavik, Iceland
- School of Engineering and Natural Sciences, University of Iceland, Reykjavik, Iceland
| | | | - Gudny A Arnadottir
- deCODE genetics / Amgen Inc., Reykjavik, Iceland
- Faculty of Medicine, School of Health Sciences, University of Iceland, Reykjavik, Iceland
| | | | | | - Gisli Masson
- deCODE genetics / Amgen Inc., Reykjavik, Iceland
| | - Unnur Thorsteinsdottir
- deCODE genetics / Amgen Inc., Reykjavik, Iceland
- Faculty of Medicine, School of Health Sciences, University of Iceland, Reykjavik, Iceland
| | | | | | - Agnar Helgason
- deCODE genetics / Amgen Inc., Reykjavik, Iceland
- Department of Anthropology, University of Iceland, Reykjavik, Iceland
| | - Daniel F Gudbjartsson
- deCODE genetics / Amgen Inc., Reykjavik, Iceland
- School of Engineering and Natural Sciences, University of Iceland, Reykjavik, Iceland
| | - Bjarni V Halldorsson
- deCODE genetics / Amgen Inc., Reykjavik, Iceland.
- School of Technology, Reykjavik University, Reykjavik, Iceland.
| | | |
Collapse
|
2
|
Nasrin T, Hoque M, Ali S. Systems biology of the genomes' microsatellite signature of Orthopoxvirus including the Monkeypox virus. Comp Immunol Microbiol Infect Dis 2023; 98:102002. [PMID: 37329681 DOI: 10.1016/j.cimid.2023.102002] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Revised: 05/07/2023] [Accepted: 05/30/2023] [Indexed: 06/19/2023]
Abstract
This study is an attempt to extract and analyse the microsatellites or simple sequence repeats (SSRs) from the genomes of eight species of the genus Orthopoxvirus. The average size of genomes included in the study was 205 kb while the GC% was 33% for all but one. A total of 10,584 SSRs and 854 cSSRs were observed. POX2 with the largest genome of 224.499 kb had maximum of 1493 SSRs and 121 cSSRs (compound SSR) while POX7 with the smallest genome of 185.578 kb had minimum incident SSRs and cSSRs at 1181 and 96, respectively. There was significant correlation between genome size and SSR incidence. Di-nucleotide repeats were the most prevalent (57.47%) followed by mono- at 33% and tri- at 8.6%. Mono-nucleotide SSRs were predominantly T (51%) and A (48.4%). A majority of 80.32% SSRs were in the coding region. The three most similar genomes as per heat map POX1, POX7 and POX5 (93% similarity) are adjacent to one another in the phylogenetic tree. Ankyrin/Ankyrin like protein and Kelch protein which are associated with host determination and divergence have the highest SSR density in almost all studied viruses. Thus, SSRs are involved in genome evolution and host determination of viruses.
Collapse
Affiliation(s)
- Taslima Nasrin
- Clinical and Applied Genomics (CAG) Laboratory Department of Biological Sciences, Aliah University, Kolkata, India
| | - Mehboob Hoque
- Applied Biochemistry Laboratory, Department of Biological Sciences, Aliah University, Kolkata, India
| | - Safdar Ali
- Clinical and Applied Genomics (CAG) Laboratory Department of Biological Sciences, Aliah University, Kolkata, India.
| |
Collapse
|
3
|
Swaerts K, Dedeurwaerdere F, De Smet D, De Jaeger P, Martens GA. DeltaMSI: artificial intelligence-based modeling of microsatellite instability scoring on next-generation sequencing data. BMC Bioinformatics 2023; 24:73. [PMID: 36859168 PMCID: PMC9976396 DOI: 10.1186/s12859-023-05186-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2022] [Accepted: 02/14/2023] [Indexed: 03/03/2023] Open
Abstract
BACKGROUND DNA mismatch repair deficiency (dMMR) testing is crucial for detection of microsatellite unstable (MSI) tumors. MSI is detected by aberrant indel length distributions of microsatellite markers, either by visual inspection of PCR-fragment length profiles or by automated bioinformatic scoring on next-generation sequencing (NGS) data. The former is time-consuming and low-throughput while the latter typically relies on simplified binary scoring of a single parameter of the indel distribution. The purpose of this study was to use machine learning to process the full complexity of indel distributions and integrate it into a robust script for screening of dMMR on small gene panel-based NGS data of clinical tumor samples without paired normal tissue. METHODS Scikit-learn was used to train 7 models on normalized read depth data of 36 microsatellite loci in a cohort of 133 MMR proficient (pMMR) and 46 dMMR tumor samples, taking loss of MLH1/MSH2/PMS2/MSH6 protein expression as reference method. After selection of the optimal model and microsatellite panel the two top-performing models per locus (logistic regression and support vector machine) were integrated into a novel script (DeltaMSI) for combined prediction of MSI status on 28 marker loci at sample level. Diagnostic performance of DeltaMSI was compared to that of mSINGS, a widely used script for MSI detection on unpaired tumor samples. The robustness of DeltaMSI was evaluated on 1072 unselected, consecutive solid tumor samples in a real-world setting sequenced using capture chemistry, and 116 solid tumor samples sequenced by amplicon chemistry. Likelihood ratios were used to select result intervals with clinical validity. RESULTS DeltaMSI achieved higher robustness at equal diagnostic power (AUC = 0.950; 95% CI 0.910-0.975) as compared to mSINGS (AUC = 0.876; 95% CI 0.823-0.918). Its sensitivity of 90% at 100% specificity indicated its clinical potential for high-throughput MSI screening in all tumor types. Clinical Trial Number/IRB B1172020000040, Ethical Committee, AZ Delta General Hospital.
Collapse
Affiliation(s)
- Koen Swaerts
- grid.478056.80000 0004 0439 8570Department of Laboratory Medicine, AZ Delta General Hospital, Deltalaan 1, 8800 Roeselare, Belgium ,grid.478056.80000 0004 0439 8570RADar Innovation Center, AZ Delta General Hospital, Roeselare, Belgium
| | - Franceska Dedeurwaerdere
- grid.478056.80000 0004 0439 8570Department of Pathology, AZ Delta General Hospital, Roeselare, Belgium
| | - Dieter De Smet
- grid.478056.80000 0004 0439 8570Department of Laboratory Medicine, AZ Delta General Hospital, Deltalaan 1, 8800 Roeselare, Belgium ,grid.478056.80000 0004 0439 8570RADar Innovation Center, AZ Delta General Hospital, Roeselare, Belgium
| | - Peter De Jaeger
- grid.478056.80000 0004 0439 8570RADar Innovation Center, AZ Delta General Hospital, Roeselare, Belgium
| | - Geert A. Martens
- grid.478056.80000 0004 0439 8570Department of Laboratory Medicine, AZ Delta General Hospital, Deltalaan 1, 8800 Roeselare, Belgium ,grid.478056.80000 0004 0439 8570RADar Innovation Center, AZ Delta General Hospital, Roeselare, Belgium ,grid.5342.00000 0001 2069 7798Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
| |
Collapse
|
4
|
Wieland J, Buchan S, Sen Gupta S, Mantzouratou A. Genomic instability and the link to infertility: A focus on microsatellites and genomic instability syndromes. Eur J Obstet Gynecol Reprod Biol 2022; 274:229-237. [PMID: 35671666 DOI: 10.1016/j.ejogrb.2022.06.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2022] [Revised: 05/25/2022] [Accepted: 06/01/2022] [Indexed: 12/01/2022]
Abstract
Infertility is associated to multiple types of different genomic instabilities and is a genetic feature of genomic instability syndromes. While the mismatch repair machinery contributes to the maintenance of genome integrity, surprisingly its potential role in infertility is overlooked. Defects in mismatch repair mechanisms contribute to microsatellite instability and genomic instability syndromes, due to the inability to repair newly replicated DNA. This article reviews the literature to date to elucidate the contribution of microsatellite instability to genomic instability syndromes and infertility. The key findings presented reveal microsatellite instability is poorly researched in genomic instability syndromes and infertility.
Collapse
Affiliation(s)
- Jack Wieland
- Department of Life and Environmental Sciences, Faculty of Science and Technology, Bournemouth University, Poole BH12 5BB, UK.
| | - Sarah Buchan
- Department of Life and Environmental Sciences, Faculty of Science and Technology, Bournemouth University, Poole BH12 5BB, UK.
| | - Sioban Sen Gupta
- Institute for Women's Health, 86-96 Chenies Mews, University College London, London WC1E 6HX, UK.
| | - Anna Mantzouratou
- Department of Life and Environmental Sciences, Faculty of Science and Technology, Bournemouth University, Poole BH12 5BB, UK.
| |
Collapse
|
5
|
Kasai S, Nishizawa D, Hasegawa J, Fukuda KI, Ichinohe T, Nagashima M, Hayashida M, Ikeda K. Short Tandem Repeat Variation in the CNR1 Gene Associated With Analgesic Requirements of Opioids in Postoperative Pain Management. Front Genet 2022; 13:815089. [PMID: 35360861 PMCID: PMC8963810 DOI: 10.3389/fgene.2022.815089] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2021] [Accepted: 02/02/2022] [Indexed: 11/25/2022] Open
Abstract
Short tandem repeats (STRs) and variable number of tandem repeats (VNTRs) that have been identified at approximately 0.7 and 0.5 million loci in the human genome, respectively, are highly multi-allelic variations rather than single-nucleotide polymorphisms. The number of repeats of more than a few thousand STRs was associated with the expression of nearby genes, indicating that STRs are influential genetic variations in human traits. Analgesics act on the central nervous system via their intrinsic receptors to produce analgesic effects. In the present study, we focused on STRs and VNTRs in the CNR1, GRIN2A, PENK, and PDYN genes and analyzed two peripheral pain sensation-related traits and seven analgesia-related traits in postoperative pain management. A total of 192 volunteers who underwent the peripheral pain sensation tests and 139 and 252 patients who underwent open abdominal and orthognathic cosmetic surgeries, respectively, were included in the study. None of the four STRs or VNTRs were associated with peripheral pain sensation. Short tandem repeats in the CNR1, GRIN2A, and PENK genes were associated with the frequency of fentanyl use, fentanyl dose, and visual analog scale pain scores 3 h after orthognathic cosmetic surgery (Spearman's rank correlation coefficient ρ = 0.199, p = 0.002, ρ = 0.174, p = 0.006, and ρ = 0.135, p = 0.033, respectively), analgesic dose, including epidural analgesics after open abdominal surgery (ρ = -0.200, p = 0.018), and visual analog scale pain scores 24 h after orthognathic cosmetic surgery (ρ = 0.143, p = 0.023), respectively. The associations between STRs in the CNR1 gene and the frequency of fentanyl use and fentanyl dose after orthognathic cosmetic surgery were confirmed by Holm's multiple-testing correction. These findings indicate that STRs in the CNR1 gene influence analgesia in the orofacial region.
Collapse
Affiliation(s)
- Shinya Kasai
- Addictive Substance Project, Department of Psychiatry and Behavioral Sciences, Tokyo Metropolitan Institute of Medical Science, Tokyo, Japan
| | - Daisuke Nishizawa
- Addictive Substance Project, Department of Psychiatry and Behavioral Sciences, Tokyo Metropolitan Institute of Medical Science, Tokyo, Japan
| | - Junko Hasegawa
- Addictive Substance Project, Department of Psychiatry and Behavioral Sciences, Tokyo Metropolitan Institute of Medical Science, Tokyo, Japan
| | - Ken-ichi Fukuda
- Department of Oral Health Science, Tokyo Dental College, Tokyo, Japan
| | - Tatsuya Ichinohe
- Department of Dental Anesthesiology, Tokyo Dental College, Tokyo, Japan
| | - Makoto Nagashima
- Department of Surgery, Toho University Sakura Medical Center, Sakura, Japan
| | - Masakazu Hayashida
- Department of Anesthesiology and Pain Medicine, Juntendo University School of Medicine, Tokyo, Japan
| | - Kazutaka Ikeda
- Addictive Substance Project, Department of Psychiatry and Behavioral Sciences, Tokyo Metropolitan Institute of Medical Science, Tokyo, Japan
| |
Collapse
|
6
|
Hudson CJ, Groh AMR, Macciardi F, Hudson RC. Genetic polymorphism data support a relationship between schizophrenia and microsatellite variability in PLA2G4A in Northern Europeans not Han Chinese. Am J Med Genet B Neuropsychiatr Genet 2022; 189:3-5. [PMID: 34889025 PMCID: PMC9300063 DOI: 10.1002/ajmg.b.32881] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/19/2021] [Accepted: 11/22/2021] [Indexed: 11/10/2022]
Affiliation(s)
| | - Adam M. R. Groh
- Montreal Neurological Institute‐HospitalMcGill UniversityMontréalQuébecCanada
| | - Fabio Macciardi
- Department of PsychiatryUniversity of California, Irvine (UCI)IrvineCaliforniaUSA
| | | |
Collapse
|
7
|
Ferragut Cardoso AP, Banerjee M, Nail AN, Lykoudi A, States JC. miRNA dysregulation is an emerging modulator of genomic instability. Semin Cancer Biol 2021; 76:120-131. [PMID: 33979676 PMCID: PMC8576067 DOI: 10.1016/j.semcancer.2021.05.004] [Citation(s) in RCA: 72] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2021] [Revised: 04/30/2021] [Accepted: 05/03/2021] [Indexed: 12/12/2022]
Abstract
Genomic instability consists of a range of genetic alterations within the genome that contributes to tumor heterogeneity and drug resistance. It is a well-established characteristic of most cancer cells. Genome instability induction results from defects in DNA damage surveillance mechanisms, mitotic checkpoints and DNA repair machinery. Accumulation of genetic alterations ultimately sets cells towards malignant transformation. Recent studies suggest that miRNAs are key players in mediating genome instability. miRNAs are a class of small RNAs expressed in most somatic tissues and are part of the epigenome. Importantly, in many cancers, miRNA expression is dysregulated. Consequently, this review examines the role of miRNA dysregulation as a causal step for induction of genome instability and subsequent carcinogenesis. We focus specifically on mechanistic studies assessing miRNA(s) and specific subtypes of genome instability or known modes of genome instability. In addition, we provide insight on the existing knowledge gaps within the field and possible ways to address them.
Collapse
Affiliation(s)
- Ana P Ferragut Cardoso
- Department of Pharmacology and Toxicology, University of Louisville, Louisville, KY, 40202, USA
| | - Mayukh Banerjee
- Department of Pharmacology and Toxicology, University of Louisville, Louisville, KY, 40202, USA
| | - Alexandra N Nail
- Department of Pharmacology and Toxicology, University of Louisville, Louisville, KY, 40202, USA
| | - Angeliki Lykoudi
- Department of Pharmacology and Toxicology, University of Louisville, Louisville, KY, 40202, USA
| | - J Christopher States
- Department of Pharmacology and Toxicology, University of Louisville, Louisville, KY, 40202, USA.
| |
Collapse
|
8
|
Hudson CJ, Zhu JX, Durocher AM. Re-analysis of genetic polymorphism data supports a relationship between schizophrenia and microsatellite variability in PLA2G4A. Psychiatr Genet 2021; 31:102-105. [PMID: 33853093 PMCID: PMC8115738 DOI: 10.1097/ypg.0000000000000280] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2020] [Accepted: 03/08/2021] [Indexed: 11/25/2022]
Affiliation(s)
| | | | - Alexandra M. Durocher
- Schulich School of Medicine and Dentistry, Western University, London, Ontario, Canada
| |
Collapse
|
9
|
Jing M, Yang H, Li K, Huang L. Characterization of three new mitochondrial genomes of Coraciiformes (Megaceryle lugubris, Alcedo atthis, Halcyon smyrnensis) and insights into their phylogenetics. Genet Mol Biol 2020; 43:e20190392. [PMID: 33026411 PMCID: PMC7539371 DOI: 10.1590/1678-4685-gmb-2019-0392] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2019] [Accepted: 07/13/2020] [Indexed: 12/04/2022] Open
Abstract
Coraciiformes contains more than 200 species with great differences on external
morphology and life-style. The evolutionary relationships within Coraciiformes
and the phylogenetic placement of Coraciiformes in Aves are still questioned.
Mitochondrial genome (mitogenome) sequences are popular markers in molecular
phylogenetic studies of birds. This study presented the genome characteristics
of three new mitogenomes in Coraciiformes and explored the phylogenetic
relationships among Coraciiformes and other five related orders with mitogenome
data of 30 species. The sizes of three mitogenomes were 17,383 bp
(Alcedo atthis), 17,892 bp (Halcyon
smyrnensis) and 17,223 bp (Megaceryle lugubris).
Each mitogenome contained one control region and 37 genes that were common in
vertebrate mitogenomes. The organization of three mitogenomes was identical to
the putative ancestral gene order in Aves. Among 13 available Coraciiform
mitogenomes, 12 protein coding genes showed indications of negative selection,
while the MT-ND6 presented sign of positive selection or relaxed purifying
selection. The phylogenetic results supported that Upupidae and Bucerotidae
should be separated from Coraciiformes, and that Coraciiformes is more closely
related to Piciformes than to Strigiformes, Trogoniformes and Cuculiformes. Our
study provide valuable data for further phylogenetic investigation of
Coraciiformes.
Collapse
Affiliation(s)
- Meidong Jing
- Nantong University, School of Life Sciences, Nantong, Jiangsu, P. R. China
| | - Huanhuan Yang
- Ludong University, School of Life Sciences, Yantai, Shandong, P. R. China
| | - Kai Li
- Nantong Xingdong International Airport, Nantong, Jiangsu, P. R. China
| | - Ling Huang
- Nantong University, School of Life Sciences, Nantong, Jiangsu, P. R. China
| |
Collapse
|
10
|
Nelakurti DD, Pappula AL, Rajasekaran S, Miles WO, Petreaca RC. Comprehensive Analysis of MEN1 Mutations and Their Role in Cancer. Cancers (Basel) 2020; 12:cancers12092616. [PMID: 32937789 PMCID: PMC7565326 DOI: 10.3390/cancers12092616] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2020] [Revised: 09/04/2020] [Accepted: 09/10/2020] [Indexed: 12/24/2022] Open
Abstract
Simple Summary Cancers are characterized by accumulation of genetic mutations in key cell cycle regulators that alter or disable the function of these genes. Such mutations can be inherited or arise spontaneously during the life of the individual. The MEN1 gene prevents uncontrolled cell division and it is considered a tumor suppressor. Inherited MEN1 mutations are associated with certain parathyroid and pancreatic syndromes while spontaneous mutations have been detected in cancer cells. We investigated whether inherited mutations appear in cancer cells which would suggest that patients with parathyroid and pancreatic syndromes have a predisposition to develop cancer. We find a weak correlation between the spectrum of inherited mutations and those appearing spontaneously. Thus, inherited MEN1 mutations may not be a good predictor of tumorigenesis. Abstract MENIN is a scaffold protein encoded by the MEN1 gene that functions in multiple biological processes, including cell proliferation, migration, gene expression, and DNA damage repair. MEN1 is a tumor suppressor gene, and mutations that disrupts MEN1 function are common to many tumor types. Mutations within MEN1 may also be inherited (germline). Many of these inherited mutations are associated with a number of pathogenic syndromes of the parathyroid and pancreas, and some also predispose patients to hyperplasia. In this study, we cataloged the reported germline mutations from the ClinVar database and compared them with the somatic mutations detected in cancers from the Catalogue of Somatic Mutations in Cancer (COSMIC) database. We then used statistical software to determine the probability of mutations being pathogenic or driver. Our data show that many confirmed germline mutations do not appear in tumor samples. Thus, most mutations that disable MEN1 function in tumors are somatic in nature. Furthermore, of the germline mutations that do appear in tumors, only a fraction has the potential to be pathogenic or driver mutations.
Collapse
Affiliation(s)
- Devi D. Nelakurti
- Biomedical Science Undergraduate Program, The Ohio State University Medical School, Columbus, OH 43210, USA;
| | - Amrit L. Pappula
- Computer Science and Engineering Undergraduate Program, The Ohio State University, Columbus, OH 43210, USA;
| | - Swetha Rajasekaran
- Department of Molecular Genetics, The Ohio State University, Columbus, OH 43210, USA;
| | - Wayne O. Miles
- Department of Cancer Biology and Genetics, The Ohio State University Medical School, Columbus, OH 43210, USA;
| | - Ruben C. Petreaca
- Department of Molecular Genetics, The Ohio State University, Marion, OH 43302, USA
- Correspondence:
| |
Collapse
|
11
|
Zhang H, Li D, Zhao X, Pan S, Wu X, Peng S, Huang H, Shi R, Tan Z. Relatively semi-conservative replication and a folded slippage model for short tandem repeats. BMC Genomics 2020; 21:563. [PMID: 32807079 PMCID: PMC7430839 DOI: 10.1186/s12864-020-06949-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2020] [Accepted: 07/27/2020] [Indexed: 12/11/2022] Open
Abstract
Background The ubiquitous presence of short tandem repeats (STRs) in virtually all genomes implicates their functional relevance, while a widely-accepted definition of STR is yet to be established. Previous studies majorly focus on relatively longer STRs, while shorter repeats were generally excluded. Herein, we have adopted a more generous criteria to define shorter repeats, which has led to the definition of a much larger number of STRs that lack prior analysis. Using this definition, we analyzed the short repeats in 55 randomly selected segments in 55 randomly selected genomic sequences from a fairly wide range of species covering animals, plants, fungi, protozoa, bacteria, archaea and viruses. Results Our analysis reveals a high percentage of short repeats in all 55 randomly selected segments, indicating that the universal presence of high-content short repeats could be a common characteristic of genomes across all biological kingdoms. Therefore, it is reasonable to assume a mechanism for continuous production of repeats that can make the replicating process relatively semi-conservative. We have proposed a folded replication slippage model that considers the geometric space of nucleotides and hydrogen bond stability to explain the mechanism more explicitly, with improving the existing straight-line slippage model. The folded slippage model can explain the expansion and contraction of mono- to hexa- nucleotide repeats with proper folding angles. Analysis of external forces in the folding template strands also suggests that expansion exists more commonly than contraction in the short tandem repeats. Conclusion The folded replication slippage model provides a reasonable explanation for the continuous occurrences of simple sequence repeats in genomes. This model also contributes to the explanation of STR-to-genome evolution and is an alternative model that complements semi-conservative replication.
Collapse
Affiliation(s)
- Hongxi Zhang
- Bioinformatics Center, College of Biology, Hunan University, Changsha, 410082, China
| | - Douyue Li
- Bioinformatics Center, College of Biology, Hunan University, Changsha, 410082, China
| | - Xiangyan Zhao
- Bioinformatics Center, College of Biology, Hunan University, Changsha, 410082, China
| | - Saichao Pan
- Bioinformatics Center, College of Biology, Hunan University, Changsha, 410082, China
| | - Xiaolong Wu
- Bioinformatics Center, College of Biology, Hunan University, Changsha, 410082, China
| | - Shan Peng
- Bioinformatics Center, College of Biology, Hunan University, Changsha, 410082, China
| | - Hanrou Huang
- Bioinformatics Center, College of Biology, Hunan University, Changsha, 410082, China
| | - Ruixue Shi
- Bioinformatics Center, College of Biology, Hunan University, Changsha, 410082, China
| | - Zhongyang Tan
- Bioinformatics Center, College of Biology, Hunan University, Changsha, 410082, China.
| |
Collapse
|
12
|
Wang Q, Pierce-Hoffman E, Cummings BB, Alföldi J, Francioli LC, Gauthier LD, Hill AJ, O'Donnell-Luria AH, Karczewski KJ, MacArthur DG. Landscape of multi-nucleotide variants in 125,748 human exomes and 15,708 genomes. Nat Commun 2020; 11:2539. [PMID: 32461613 PMCID: PMC7253413 DOI: 10.1038/s41467-019-12438-5] [Citation(s) in RCA: 84] [Impact Index Per Article: 16.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2019] [Accepted: 09/09/2019] [Indexed: 12/31/2022] Open
Abstract
Multi-nucleotide variants (MNVs), defined as two or more nearby variants existing on the same haplotype in an individual, are a clinically and biologically important class of genetic variation. However, existing tools typically do not accurately classify MNVs, and understanding of their mutational origins remains limited. Here, we systematically survey MNVs in 125,748 whole exomes and 15,708 whole genomes from the Genome Aggregation Database (gnomAD). We identify 1,792,248 MNVs across the genome with constituent variants falling within 2 bp distance of one another, including 18,756 variants with a novel combined effect on protein sequence. Finally, we estimate the relative impact of known mutational mechanisms - CpG deamination, replication error by polymerase zeta, and polymerase slippage at repeat junctions - on the generation of MNVs. Our results demonstrate the value of haplotype-aware variant annotation, and refine our understanding of genome-wide mutational mechanisms of MNVs.
Collapse
Affiliation(s)
- Qingbo Wang
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA
- Program in Bioinformatics and Integrative Genomics, Harvard Medical School, Boston, MA, 02115, USA
| | - Emma Pierce-Hoffman
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | - Beryl B Cummings
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA
- Program in Biomedical and Biological Sciences, Harvard Medical School, Boston, MA, 02115, USA
| | - Jessica Alföldi
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA
| | - Laurent C Francioli
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA
| | - Laura D Gauthier
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
- Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | - Andrew J Hill
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
- Department of Genome Sciences, University of Washington, Seattle, WA, 98195, USA
| | - Anne H O'Donnell-Luria
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA
| | - Konrad J Karczewski
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA
| | - Daniel G MacArthur
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA.
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA.
- Centre for Population Genomics, Garvan Institute of Medical Research, and UNSW Sydney, Sydney, Australia.
- Centre for Population Genomics, Murdoch Children's Research Institute, Melbourne, Australia.
| |
Collapse
|
13
|
Jia P, Yang X, Guo L, Liu B, Lin J, Liang H, Sun J, Zhang C, Ye K. MSIsensor-pro: Fast, Accurate, and Matched-normal-sample-free Detection of Microsatellite Instability. GENOMICS PROTEOMICS & BIOINFORMATICS 2020; 18:65-71. [PMID: 32171661 PMCID: PMC7393535 DOI: 10.1016/j.gpb.2020.02.001] [Citation(s) in RCA: 56] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/03/2020] [Revised: 02/22/2020] [Accepted: 02/25/2020] [Indexed: 02/06/2023]
Abstract
Microsatellite instability (MSI) is a key biomarker for cancer therapy and prognosis. Traditional experimental assays are laborious and time-consuming, and next-generation sequencing-based computational methods do not work on leukemia samples, paraffin-embedded samples, or patient-derived xenografts/organoids, due to the requirement of matched normal samples. Herein, we developed MSIsensor-pro, an open-source single sample MSI scoring method for research and clinical applications. MSIsensor-pro introduces a multinomial distribution model to quantify polymerase slippages for each tumor sample and a discriminative site selection method to enable MSI detection without matched normal samples. We demonstrate that MSIsensor-pro is an ultrafast, accurate, and robust MSI calling method. Using samples with various sequencing depths and tumor purities, MSIsensor-pro significantly outperformed the current leading methods in both accuracy and computational cost. MSIsensor-pro is available at https://github.com/xjtu-omics/msisensor-pro and free for non-commercial use, while a commercial license is provided upon request.
Collapse
Affiliation(s)
- Peng Jia
- School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an 710049, China; MOE Key Laboratory for Intelligent Networks & Networks Security, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an 710049, China
| | - Xiaofei Yang
- MOE Key Laboratory for Intelligent Networks & Networks Security, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an 710049, China; School of Computer Science and Technology, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an 710049, China
| | - Li Guo
- School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an 710049, China; MOE Key Laboratory for Intelligent Networks & Networks Security, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an 710049, China; School of Life Science and Technology, Xi'an Jiaotong University, Xi'an 710049, China
| | - Bowen Liu
- School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an 710049, China; MOE Key Laboratory for Intelligent Networks & Networks Security, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an 710049, China
| | - Jiadong Lin
- School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an 710049, China; MOE Key Laboratory for Intelligent Networks & Networks Security, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an 710049, China; Leiden Institute of Advanced Computer Science, Leiden University, Leiden 2311 ZE, Netherlands
| | - Hao Liang
- School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an 710049, China; MOE Key Laboratory for Intelligent Networks & Networks Security, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an 710049, China
| | - Jianyong Sun
- School of Mathematics and Statistics, Xi'an Jiaotong University, Xi'an 710049, China
| | - Chengsheng Zhang
- Precision Medicine Center, the First Affiliated Hospital of Xi'an Jiaotong University, Xi'an 710061, China; Jackson Laboratory for Genomic Medicine, Farmington, CT 06032, USA
| | - Kai Ye
- School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an 710049, China; MOE Key Laboratory for Intelligent Networks & Networks Security, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an 710049, China; School of Life Science and Technology, Xi'an Jiaotong University, Xi'an 710049, China; Genome Institute, the First Affiliated Hospital of Xi'an Jiaotong University, Xi'an 710061, China.
| |
Collapse
|
14
|
Two new mitogenomes of Picidae (Aves, Piciformes): Sequence, structure and phylogenetic analyses. Int J Biol Macromol 2019; 133:683-692. [DOI: 10.1016/j.ijbiomac.2019.04.157] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2019] [Revised: 04/21/2019] [Accepted: 04/22/2019] [Indexed: 02/06/2023]
|
15
|
MHC haplotype diversity in Icelandic horses determined by polymorphic microsatellites. Genes Immun 2019; 20:660-670. [PMID: 31068686 DOI: 10.1038/s41435-019-0075-y] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2018] [Revised: 03/07/2019] [Accepted: 03/18/2019] [Indexed: 01/31/2023]
Abstract
The Icelandic horse has been maintained as a closed population in its eponymous homeland for many generations, with no recorded introductions of new horses of any breed since the year 1000 CE. Here we determined the diversity of major histocompatibility complex (MHC) haplotypes in 156 Icelandic horses from two groups, based on a panel of 12 polymorphic intra-MHC microsatellites tested in families of various composition. We identified a total of 79 MHC haplotypes in these two groups, including one documented intra-MHC recombination event from a total of 147 observed meioses. None of these MHC haplotypes have been previously described in any other horse breed. Only one MHC homozygote was found in the entire population studied. These results indicate a very high level of MHC heterozygosity and haplotype diversity in the Icelandic horse. The environment in Iceland is remarkable for its lack of common agents of equine infectious disease, including equine herpesvirus type 1, influenza virus, and streptococcus equi. The driving forces for maintenance of MHC heterozygosity in Icelandic horses must thus be sought outside of these major horse pathogens. Based on our results, we propose that intra-MHC recombination may play a major role in the generation of novel haplotypes.
Collapse
|
16
|
Park B, Ha BS, Lee SH, Kim MK, Choi JI, Ryu JS. Variable-number tandem repeat loci-discriminating Pleurotus ostreatus cultivars. MYCOSCIENCE 2019. [DOI: 10.1016/j.myc.2018.09.004] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
17
|
Zavodna M, Bagshaw A, Brauning R, Gemmell NJ. The effects of transcription and recombination on mutational dynamics of short tandem repeats. Nucleic Acids Res 2018; 46:1321-1330. [PMID: 29300948 PMCID: PMC5814968 DOI: 10.1093/nar/gkx1253] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2017] [Revised: 11/27/2017] [Accepted: 12/27/2017] [Indexed: 01/07/2023] Open
Abstract
Short tandem repeats (STR) are ubiquitous components of the genomic architecture of most living organisms. Recent work has highlighted the widespread functional significance of such repeats, particularly around gene regulation, but the mutational processes underlying the evolution of these highly abundant and highly variable sequences are not fully understood. Traditional models assume that strand misalignment during replication is the predominant mechanism, but empirical data suggest the involvement of other processes including recombination and transcription. Despite this evidence, the relative influences of these processes have not previously been tested experimentally on a genome-wide scale. Using deep sequencing, we identify mutations at >200 microsatellites, across 700 generations in replicated populations of two otherwise identical sexual and asexual Saccharomyces cerevisiae strains. Using generalized linear models, we investigate correlates of STR mutability including the nature of the mutation, STR composition and contextual factors including recombination, transcription and replication origins. Sexual capability was not a significant predictor of microsatellite mutability, but, intriguingly, we identify transcription as a significant positive predictor. We also find that STR density is substantially increased in regions neighboring, but not within, recombination hotspots.
Collapse
Affiliation(s)
- Monika Zavodna
- Department of Anatomy, University of Otago, Dunedin 9054, New Zealand
| | - Andrew Bagshaw
- Department of Pathology, University of Otago, Christchurch 8140, New Zealand
| | - Rudiger Brauning
- AgResearch Limited, Invermay Agricultural Centre, Mosgiel, New Zealand
| | - Neil J Gemmell
- Department of Anatomy, University of Otago, Dunedin 9054, New Zealand
- Allan Wilson Centre for Molecular Ecology and Evolution, University of Otago, Dunedin 9054, New Zealand
| |
Collapse
|
18
|
Awasthi P, Singh A, Sheikh G, Mahajan V, Gupta AP, Gupta S, Bedi YS, Gandhi SG. Mining and characterization of EST-SSR markers for Zingiber officinale Roscoe with transferability to other species of Zingiberaceae. PHYSIOLOGY AND MOLECULAR BIOLOGY OF PLANTS : AN INTERNATIONAL JOURNAL OF FUNCTIONAL PLANT BIOLOGY 2017; 23:925-931. [PMID: 29158639 PMCID: PMC5671454 DOI: 10.1007/s12298-017-0472-5] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/13/2016] [Revised: 07/13/2017] [Accepted: 09/19/2017] [Indexed: 05/11/2023]
Abstract
Zingiber officinale is a model spice herb, well known for its medicinal value. It is primarily a vegetatively propagated commercial crop. However, considerable diversity in its morphology, fiber content and chemoprofiles has been reported. The present study explores the utility of EST-derived markers in studying genetic diversity in different accessions of Z. officinale and their cross transferability within the Zingiberaceae family. A total of 38,115 ESTs sequences were assembled to generate 7850 contigs and 10,762 singletons. SSRs were searched in the unigenes and 515 SSR-containing ESTs were identified with a frequency of 1 SSR per 25.21 kb of the genome. These ESTs were also annotated using BLAST2GO. Primers were designed for 349 EST-SSRs and 25 primer pairs were randomly picked for EST SSR study. Out of these, 16 primer pairs could be optimized for amplification in different accessions of Z. officinale as well as other species belonging to Zingiberaceae. GES454, GES466, GES480 and GES486 markers were found to exhibit 100% cross-transferability among different members of Zingiberaceae.
Collapse
Affiliation(s)
- Praveen Awasthi
- Indian Institute of Integrative Medicine (CSIR-IIIM), Council of Scientific and Industrial Research, Canal Road, Jammu Tawi, 180001 India
| | - Ashish Singh
- Indian Institute of Integrative Medicine (CSIR-IIIM), Council of Scientific and Industrial Research, Canal Road, Jammu Tawi, 180001 India
| | - Gulfam Sheikh
- Indian Institute of Integrative Medicine (CSIR-IIIM), Council of Scientific and Industrial Research, Canal Road, Jammu Tawi, 180001 India
- Division of Biosciences, Faculty of Sciences, Academy of Scientific and Innovative Research, Kolkata, India
| | - Vidushi Mahajan
- Indian Institute of Integrative Medicine (CSIR-IIIM), Council of Scientific and Industrial Research, Canal Road, Jammu Tawi, 180001 India
- Division of Biosciences, Faculty of Sciences, Academy of Scientific and Innovative Research, Kolkata, India
| | - Ajai Prakash Gupta
- Indian Institute of Integrative Medicine (CSIR-IIIM), Council of Scientific and Industrial Research, Canal Road, Jammu Tawi, 180001 India
| | - Suphla Gupta
- Indian Institute of Integrative Medicine (CSIR-IIIM), Council of Scientific and Industrial Research, Canal Road, Jammu Tawi, 180001 India
- Division of Biosciences, Faculty of Sciences, Academy of Scientific and Innovative Research, Kolkata, India
| | - Yashbir S. Bedi
- Indian Institute of Integrative Medicine (CSIR-IIIM), Council of Scientific and Industrial Research, Canal Road, Jammu Tawi, 180001 India
- Division of Biosciences, Faculty of Sciences, Academy of Scientific and Innovative Research, Kolkata, India
| | - Sumit G. Gandhi
- Indian Institute of Integrative Medicine (CSIR-IIIM), Council of Scientific and Industrial Research, Canal Road, Jammu Tawi, 180001 India
- Division of Biosciences, Faculty of Sciences, Academy of Scientific and Innovative Research, Kolkata, India
| |
Collapse
|
19
|
Bagshaw AT. Functional Mechanisms of Microsatellite DNA in Eukaryotic Genomes. Genome Biol Evol 2017; 9:2428-2443. [PMID: 28957459 PMCID: PMC5622345 DOI: 10.1093/gbe/evx164] [Citation(s) in RCA: 74] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/23/2017] [Indexed: 02/06/2023] Open
Abstract
Microsatellite repeat DNA is best known for its length mutability, which is implicated in several neurological diseases and cancers, and often exploited as a genetic marker. Less well-known is the body of work exploring the widespread and surprisingly diverse functional roles of microsatellites. Recently, emerging evidence includes the finding that normal microsatellite polymorphism contributes substantially to the heritability of human gene expression on a genome-wide scale, calling attention to the task of elucidating the mechanisms involved. At present, these are underexplored, but several themes have emerged. I review evidence demonstrating roles for microsatellites in modulation of transcription factor binding, spacing between promoter elements, enhancers, cytosine methylation, alternative splicing, mRNA stability, selection of transcription start and termination sites, unusual structural conformations, nucleosome positioning and modification, higher order chromatin structure, noncoding RNA, and meiotic recombination hot spots.
Collapse
|
20
|
Parity-dependent hairpin configurations of repetitive DNA sequence promote slippage associated with DNA expansion. Proc Natl Acad Sci U S A 2017; 114:9535-9540. [PMID: 28827328 DOI: 10.1073/pnas.1708691114] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Repetitive DNA sequences are ubiquitous in life, and changes in the number of repeats often have various physiological and pathological implications. DNA repeats are capable of interchanging between different noncanonical and canonical conformations in a dynamic fashion, causing configurational slippage that often leads to repeat expansion associated with neurological diseases. In this report, we used single-molecule spectroscopy together with biophysical analyses to demonstrate the parity-dependent hairpin structural polymorphism of TGGAA repeat DNA. We found that the DNA adopted two configurations depending on the repeat number parity (even or odd). Transitions between these two configurations were also observed for longer repeats. In addition, the ability to modulate this transition was found to be enhanced by divalent ions. Based on the atomic structure, we propose a local seeding model where the kinked GGA motifs in the stem region of TGGAA repeat DNA act as hot spots to facilitate the transition between the two configurations, which may give rise to disease-associated repeat expansion.
Collapse
|
21
|
Abstract
The instability of microsatellite DNA repeats is responsible for at least 40 neurodegenerative diseases. Recently, Mirkin and co-workers presented a novel mechanism for microsatellite expansions based on break-induced replication (BIR) at sites of microsatellite-induced replication stalling and fork collapse. The BIR model aims to explain single-step, large expansions of CAG/CTG trinucleotide repeats in dividing cells. BIR has been characterized extensively in Saccharomyces cerevisiae as a mechanism to repair broken DNA replication forks (single-ended DSBs) and degraded telomeric DNA. However, the structural footprints of BIR-like DSB repair have been recognized in human genomic instability and tied to the etiology of diverse developmental diseases; thus, the implications of the paper by Kim et al. (Kim JC, Harris ST, Dinter T, Shah KA, et al., Nat Struct Mol Biol 24: 55-60) extend beyond trinucleotide repeat expansion in yeast and microsatellite instability in human neurological disorders. Significantly, insight into BIR-like repair can explain certain pathways of complex genome rearrangements (CGRs) initiated at non-B form microsatellite DNA in human cancers.
Collapse
Affiliation(s)
- Michael Leffak
- Department of Biochemistry and Molecular Biology, Boonshoft School of Medicine, Wright State University, Dayton, OH, USA
| |
Collapse
|
22
|
The Role of Microsatellites in Streptophyta Gene Evolution. J Mol Evol 2017; 84:144-148. [PMID: 28116472 DOI: 10.1007/s00239-016-9778-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2016] [Accepted: 12/19/2016] [Indexed: 10/20/2022]
Abstract
Microsatellites form hotspot regions for recombination. In this research, we investigated whether genic microsatellites can be responsible for generating new genes by enhancing crossover between gene containing microsatellites and other genomic regions. We tested our hypothesis on 33,531 UniGene entries containing microsatellites. Each sequence was divided into microsatellites upstream and downstream fragments, and each pair of sequences was compared to study the microsatellites effect. The candidate pairs of genes are supposed to share a high similar fragment in one side of the microsatellites, while the other fragments should be completely different. This in silico approach detected 448 valid pairs of sequences in which both of them showed semi-resemblance nature. The synteny analysis for the detected sequences against 55 plant genomes indicated low representation of them across plant kingdom. Our results will add a body of knowledge toward understanding the role of microsatellites in gene evolution.
Collapse
|
23
|
Zumaraga MP, Medina PJ, Recto JM, Abrahan L, Azurin E, Tanchoco CC, Jimeno CA, Palmes-Saloma C. Targeted next generation sequencing of the entire vitamin D receptor gene reveals polymorphisms correlated with vitamin D deficiency among older Filipino women with and without fragility fracture. J Nutr Biochem 2016; 41:98-108. [PMID: 28068558 DOI: 10.1016/j.jnutbio.2016.12.003] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2016] [Revised: 11/08/2016] [Accepted: 12/12/2016] [Indexed: 12/16/2022]
Abstract
This study aimed to discover genetic variants in the entire 101 kB vitamin D receptor (VDR) gene for vitamin D deficiency in a group of postmenopausal Filipino women using targeted next generation sequencing (TNGS) approach in a case-control study design. A total of 50 women with and without osteoporotic fracture seen at the Philippine Orthopedic Center were included. Blood samples were collected for determination of serum vitamin D, calcium, phosphorus, glucose, blood urea nitrogen, creatinine, aspartate aminotransferase, alanine aminotransferase and as primary source for targeted VDR gene sequencing using the Ion Torrent Personal Genome Machine. The variant calling was based on the GATK best practice workflow and annotated using Annovar tool. A total of 1496 unique variants in the whole 101-kb VDR gene were identified. Novel sequence variations not registered in the dbSNP database were found among cases and controls at a rate of 23.1% and 16.6% of total discovered variants, respectively. One disease-associated enhancer showed statistically significant association to low serum 25-hydroxy vitamin D levels (Pearson chi-square P-value=0.009). The transcription factor binding site prediction program PROMO predicted the disruption of three transcription factor binding sites in this enhancer region. These findings show the power of TNGS in identifying sequence variations in a very large gene and the surprising results obtained in this study greatly expand the catalog of known VDR sequence variants that may represent an important clue in the emergence of vitamin D deficiency. Such information will also provide the additional guidance necessary toward a personalized nutritional advice to reach sufficient vitamin D status.
Collapse
Affiliation(s)
- Mark Pretzel Zumaraga
- National Institute of Molecular Biology and Biotechnology, University of the Philippines, Diliman, Quezon City, Philippines; Philippine Genome Center, Quezon City, Philippines; Food and Nutrition Research Institute, Department of Science and Technology, Taguig City, Philippines
| | - Paul Julius Medina
- Philippine Orthopedic Center, Banawe, Quezon City, Philippines; Mindanao State University - College of Medicine, Iligan City, Philippines
| | - Juan Miguel Recto
- National Institute of Molecular Biology and Biotechnology, University of the Philippines, Diliman, Quezon City, Philippines
| | - Lauro Abrahan
- Philippine Orthopedic Center, Banawe, Quezon City, Philippines
| | - Edelyn Azurin
- Philippine Orthopedic Center, Banawe, Quezon City, Philippines
| | - Celeste C Tanchoco
- Food and Nutrition Research Institute, Department of Science and Technology, Taguig City, Philippines
| | - Cecilia A Jimeno
- Philippine Society of Endocrinology, Diabetes and Metabolism, Pasig City, Philippines
| | - Cynthia Palmes-Saloma
- National Institute of Molecular Biology and Biotechnology, University of the Philippines, Diliman, Quezon City, Philippines; Philippine Genome Center, Quezon City, Philippines.
| |
Collapse
|
24
|
Rapid microfluidic analysis of a Y-STR multiplex for screening of forensic samples. Anal Bioanal Chem 2016; 409:939-947. [PMID: 27900418 DOI: 10.1007/s00216-016-9950-9] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2016] [Revised: 08/18/2016] [Accepted: 09/14/2016] [Indexed: 10/20/2022]
Abstract
In this paper, we demonstrate a rapid analysis procedure for use with a small set of rapidly mutating Y chromosomal short tandem repeat (Y-STR) loci that combines both rapid polymerase chain reaction (PCR) and microfluidic separation elements. The procedure involves a high-speed polymerase and a rapid cycling protocol to permit PCR amplification in 16 min. The resultant amplified sample is next analysed using a short 1.8-cm microfluidic electrophoresis system that permits a four-locus Y-STR genotype to be produced in 80 s. The entire procedure takes less than 25 min from sample collection to result. This paper describes the rapid amplification protocol as well as studies of the reproducibility and sensitivity of the procedure and its optimisation. The amplification process utilises a small high-speed thermocycler, microfluidic device and compact laptop, making it portable and potentially useful for rapid, inexpensive on-site genotyping. The four loci used for the multiplex were selected due to their rapid mutation rates and should proved useful in preliminary screening of samples and suspects. Overall, this technique provides a method for rapid sample screening of suspect and crime scene samples in forensic casework. Graphical abstract ᅟ.
Collapse
|
25
|
Massively parallel sequencing of 68 insertion/deletion markers identifies novel microhaplotypes for utility in human identity testing. Forensic Sci Int Genet 2016; 25:198-209. [PMID: 27685342 DOI: 10.1016/j.fsigen.2016.09.005] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2016] [Revised: 08/01/2016] [Accepted: 09/19/2016] [Indexed: 11/23/2022]
Abstract
Short tandem repeat (STR) loci are the traditional markers used for kinship, missing persons, and direct comparison human identity testing. These markers hold considerable value due to their highly polymorphic nature, amplicon size, and ability to be multiplexed. However, many STRs are still too large for use in analysis of highly degraded DNA. Small bi-allelic polymorphisms, such as insertions/deletions (INDELs), may be better suited for analyzing compromised samples, and their allele size differences are amenable to analysis by capillary electrophoresis. The INDEL marker allelic states range in size from 2 to 6 base pairs, enabling small amplicon size. In addition, heterozygote balance may be increased by minimizing preferential amplification of the smaller allele, as is more common with STR markers. Multiplexing a large number of INDELs allows for generating panels with high discrimination power. The Nextera™ Rapid Capture Custom Enrichment Kit (Illumina, Inc., San Diego, CA) and massively parallel sequencing (MPS) on the Illumina MiSeq were used to sequence 68 well-characterized INDELs in four major US population groups. In addition, the STR Allele Identification Tool: Razor (STRait Razor) was used in a novel way to analyze INDEL sequences and detect adjacent single nucleotide polymorphisms (SNPs) and other polymorphisms. This application enabled the discovery of unique allelic variants, which increased the discrimination power and decreased the single-locus random match probabilities (RMPs) of 22 of these well-characterized INDELs which can be considered as microhaplotypes. These findings suggest that additional microhaplotypes containing human identification (HID) INDELs may exist elsewhere in the genome.
Collapse
|
26
|
Abstract
Pre-mRNA splicing is a key post-transcriptional regulation process in which introns are excised and exons are ligated together. A novel class of structured intron was recently discovered in fish. Simple expansions of complementary AC and GT dimers at opposite boundaries of an intron were found to form a bridging structure, thereby enforcing correct splice site pairing across the intron. In some fish introns, the RNA structures are strong enough to bypass the need of regulatory protein factors for splicing. Here, we discuss the prevalence and potential functions of highly structured introns. In humans, structured introns usually arise through the co-occurrence of C and G-rich repeats at intron boundaries. We explore the potentially instructive example of the HLA receptor genes. In HLA pre-mRNA, structured introns flank the exons that encode the highly polymorphic β sheet cleft, making the processing of the transcript robust to variants that disrupt splicing factor binding. While selective forces that have shaped HLA receptor are fairly atypical, numerous other highly polymorphic genes that encode receptors contain structured introns. Finally, we discuss how the elevated mutation rate associated with the simple repeats that often compose structured intron can make structured introns themselves rapidly evolving elements.
Collapse
Affiliation(s)
- Chien-Ling Lin
- a Molecular Biology, Cell Biology and Biochemistry, Brown University , Providence , RI , USA
| | - Allison J Taggart
- a Molecular Biology, Cell Biology and Biochemistry, Brown University , Providence , RI , USA
| | - William G Fairbrother
- a Molecular Biology, Cell Biology and Biochemistry, Brown University , Providence , RI , USA.,b Center for Computational Molecular Biology, Brown University , Providence , RI , USA.,c Hassenfeld Child Health Innovation Institute of Brown University , Providence , RI , USA
| |
Collapse
|
27
|
Saeed AF, Wang R, Wang S. Microsatellites in Pursuit of Microbial Genome Evolution. Front Microbiol 2016; 6:1462. [PMID: 26779133 PMCID: PMC4700210 DOI: 10.3389/fmicb.2015.01462] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2015] [Accepted: 12/07/2015] [Indexed: 12/27/2022] Open
Abstract
Microsatellites or short sequence repeats are widespread genetic markers which are hypermutable 1-6 bp long short nucleotide motifs. Significantly, their applications in genetics are extensive due to their ceaseless mutational degree, widespread length variations and hypermutability skills. These features make them useful in determining the driving forces of evolution by using powerful molecular techniques. Consequently, revealing important questions, for example, what is the significance of these abundant sequences in DNA, what are their roles in genomic evolution? The answers of these important questions are hidden in the ways these short motifs contributed in altering the microbial genomes since the origin of life. Even though their size ranges from 1 -to- 6 bases, these repeats are becoming one of the most popular genetic probes in determining their associations and phylogenetic relationships in closely related genomes. Currently, they have been widely used in molecular genetics, biotechnology and evolutionary biology. However, due to limited knowledge; there is a significant gap in research and lack of information concerning hypermutational mechanisms. These mechanisms play a key role in microsatellite loci point mutations and phase variations. This review will extend the understandings of impacts and contributions of microsatellite in genomic evolution and their universal applications in microbiology.
Collapse
Affiliation(s)
- Abdullah F. Saeed
- Key Laboratory of Biopesticide and Chemical Biology of Education Ministry, School of Life Sciences, Fujian Agriculture and Forestry UniversityFuzhou, China
| | | | | |
Collapse
|
28
|
Wei W, Davis RE, Suo X, Zhao Y. Occurrence, distribution and possible functional roles of simple sequence repeats in phytoplasma genomes. Int J Syst Evol Microbiol 2015; 65:2748-2760. [DOI: 10.1099/ijs.0.000273] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023] Open
Abstract
Phytoplasmas are unculturable, cell-wall-less bacteria that parasitize plants and insects. This transkingdom life cycle requires rapid responses to vastly different environments, including transitions from plant phloem sieve elements to various insect tissues and alternations among diverse plant hosts. Features that enable such flexibility in other microbes include simple sequence repeats (SSRs) — mutation-prone, phase-variable short DNA tracts that function as ‘evolutionary rheostats’ and enhance rapid adaptations. To gain insights into the occurrence, distribution and potentially functional roles of SSRs in phytoplasmas, we performed computational analysis on the genomes of five completely sequenced phytoplasma strains, ‘Candidatus Phytoplasma asteris’-related strains OYM and AYWB, ‘Candidatus Phytoplasma australiense’-related strains CBWB and SLY and ‘Candidatus Phytoplasma mali’-related strain AP-AT. The overall density of SSRs in phytoplasma genomes was higher than in representative strains of other prokaryotes. While mono- and trinucleotide SSRs were significantly overrepresented in the phytoplasma genomes, dinucleotide SSRs and other higher-order SSRs were underrepresented. The occurrence and distribution of long SSRs in the prophage islands and phytoplasma-unique genetic loci indicated that SSRs played a role in compounding the complexity of sequence mosaics in individual genomes and in increasing allelic diversity among genomes. Findings from computational analyses were further complemented by an examination of SSRs in varied additional phytoplasma strains, with a focus on potential contingency genes. Some SSRs were located in regions that could profoundly alter the regulation of transcription and translation of affected genes and/or the composition of protein products.
Collapse
Affiliation(s)
- Wei Wei
- Molecular Plant Pathology Laboratory, USDA-Agricultural Research Service, Beltsville, MD, 20705, USA
| | - Robert E. Davis
- Molecular Plant Pathology Laboratory, USDA-Agricultural Research Service, Beltsville, MD, 20705, USA
| | - Xiaobing Suo
- Molecular Plant Pathology Laboratory, USDA-Agricultural Research Service, Beltsville, MD, 20705, USA
| | - Yan Zhao
- Molecular Plant Pathology Laboratory, USDA-Agricultural Research Service, Beltsville, MD, 20705, USA
| |
Collapse
|
29
|
Schaper E, Anisimova M. The evolution and function of protein tandem repeats in plants. THE NEW PHYTOLOGIST 2015; 206:397-410. [PMID: 25420631 DOI: 10.1111/nph.13184] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/22/2014] [Accepted: 10/18/2014] [Indexed: 05/27/2023]
Abstract
Sequence tandem repeats (TRs) are abundant in proteomes across all domains of life. For plants, little is known about their distribution or contribution to protein function. We exhaustively annotated TRs and studied the evolution of TR unit variations for all Ensembl plants. Using phylogenetic patterns of TR units, we detected conserved TRs with unit number and order preserved during evolution, and those TRs that have diverged via recent TR unit gains/losses. We correlated the mode of evolution of TRs to protein function. TR number was strongly correlated with proteome size, with about one-half of all TRs recognized as common protein domains. The majority of TRs have been highly conserved over long evolutionary distances, some since the separation of red algae and green plants c. 1.6 billion yr ago. Conversely, recurrent recent TR unit mutations were rare. Our results suggest that the first TRs by far predate the first plants, and that TR appearance is an ongoing process with similar rates across the plant kingdom. Interestingly, the few detected highly mutable TRs might provide a source of variation for rapid adaptation. In particular, such TRs are enriched in leucine-rich repeats (LRRs) commonly found in R genes, where TR unit gain/loss may facilitate resistance to emerging pathogens.
Collapse
Affiliation(s)
- Elke Schaper
- Department of Computer Science, ETH Zürich, Zürich, 8092, Switzerland
- Institute of Integrative Biology, ETH Zürich, Zürich, 8092, Switzerland
- Vital-IT Competency Center, Swiss Institute for Bioinformatics (SIB), Lausanne, 1015, Switzerland
| | - Maria Anisimova
- Institute of Applied Simulation (IAS), School of Life Sciences and Facility Management, Zürich University of Applied Sciences (ZHAW), Wädenswil, 8820, Switzerland
| |
Collapse
|
30
|
Schlick-Steiner BC, Arthofer W, Moder K, Steiner FM. Recent insertion/deletion (reINDEL) mutations: increasing awareness to boost molecular-based research in ecology and evolution. Ecol Evol 2014; 5:24-35. [PMID: 25628861 PMCID: PMC4298431 DOI: 10.1002/ece3.1330] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2014] [Revised: 10/27/2014] [Accepted: 10/30/2014] [Indexed: 01/18/2023] Open
Abstract
Today, the comparative analysis of DNA molecules mainly uses information inferred from nucleotide substitutions. Insertion/deletion (INDEL) mutations, in contrast, are largely considered uninformative and discarded, due to our lacking knowledge on their evolution. However, including rather than discarding INDELs would be relevant to any research area in ecology and evolution that uses molecular data. As a practical approach to better understanding INDEL evolution in general, we propose the study of recent INDEL (reINDEL) mutations – mutations where both ancestral and derived state are seen in the sample. The precondition for reINDEL identification is knowledge about the pedigree of the individuals sampled. Sound reINDEL knowledge will allow the improved modeling needed for including INDELs in the downstream analysis of molecular data. Both microsatellites, currently still the predominant marker system in the analysis of populations, and sequences generated by next-generation sequencing, a promising and rapidly developing range of technologies, offer the opportunity for reINDEL identification. However, a 2013 sample of animal microsatellite studies contained unexpectedly few reINDELs identified. As most likely explanation, we hypothesize that reINDELs are underreported rather than absent and that this underreporting stems from common reINDEL unawareness. If our hypothesis applies, increased reINDEL awareness should allow gathering data rapidly. We recommend the routine reporting of either the absence or presence of reINDELs together with standardized key information on the nature of mutations when they are detected and the use of the keyword “reINDEL” to increase visibility in both instances of successful and unsuccessful search.
Collapse
Affiliation(s)
- Birgit C Schlick-Steiner
- Molecular Ecology Group, Institute of Ecology, University of Innsbruck Technikerstr. 25, 6020, Innsbruck, Austria
| | - Wolfgang Arthofer
- Molecular Ecology Group, Institute of Ecology, University of Innsbruck Technikerstr. 25, 6020, Innsbruck, Austria
| | - Karl Moder
- Institute of Applied Statistics and Computing, University of Natural Resources and Life Sciences Peter Jordan-Str. 82, 1180, Vienna, Austria
| | - Florian M Steiner
- Molecular Ecology Group, Institute of Ecology, University of Innsbruck Technikerstr. 25, 6020, Innsbruck, Austria
| |
Collapse
|
31
|
Genome wide survey of microsatellites in ssDNA viruses infecting vertebrates. Gene 2014; 552:209-18. [DOI: 10.1016/j.gene.2014.09.032] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2014] [Revised: 08/15/2014] [Accepted: 09/15/2014] [Indexed: 01/26/2023]
|
32
|
Exome-wide somatic microsatellite variation is altered in cells with DNA repair deficiencies. PLoS One 2014; 9:e110263. [PMID: 25402475 PMCID: PMC4234249 DOI: 10.1371/journal.pone.0110263] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2014] [Accepted: 09/18/2014] [Indexed: 11/19/2022] Open
Abstract
Microsatellites (MST), tandem repeats of 1–6 nucleotide motifs, are mutational hot-spots with a bias for insertions and deletions (INDELs) rather than single nucleotide polymorphisms (SNPs). The majority of MST instability studies are limited to a small number of loci, the Bethesda markers, which are only informative for a subset of colorectal cancers. In this paper we evaluate non-haplotype alleles present within next-gen sequencing data to evaluate somatic MST variation (SMV) within DNA repair proficient and DNA repair defective cell lines. We confirm that alleles present within next-gen data that do not contribute to the haplotype can be reliably quantified and utilized to evaluate the SMV without requiring comparisons of matched samples. We observed that SMV patterns found in DNA repair proficient cell lines without DNA repair defects, MCF10A, HEK293 and PD20 RV:D2, had consistent patterns among samples. Further, we were able to confirm that changes in SMV patterns in cell lines lacking functional BRCA2, FANCD2 and mismatch repair were consistent with the different pathways perturbed. Using this new exome sequencing analysis approach we show that DNA instability can be identified in a sample and that patterns of instability vary depending on the impaired DNA repair mechanism, and that genes harboring minor alleles are strongly associated with cancer pathways. The MST Minor Allele Caller used for this study is available at https://github.com/zalmanv/MST_minor_allele_caller.
Collapse
|
33
|
Putman AI, Carbone I. Challenges in analysis and interpretation of microsatellite data for population genetic studies. Ecol Evol 2014; 4:4399-428. [PMID: 25540699 PMCID: PMC4267876 DOI: 10.1002/ece3.1305] [Citation(s) in RCA: 202] [Impact Index Per Article: 18.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2014] [Revised: 10/02/2014] [Accepted: 10/03/2014] [Indexed: 12/14/2022] Open
Abstract
Advancing technologies have facilitated the ever-widening application of genetic markers such as microsatellites into new systems and research questions in biology. In light of the data and experience accumulated from several years of using microsatellites, we present here a literature review that synthesizes the limitations of microsatellites in population genetic studies. With a focus on population structure, we review the widely used fixation (F ST) statistics and Bayesian clustering algorithms and find that the former can be confusing and problematic for microsatellites and that the latter may be confounded by complex population models and lack power in certain cases. Clustering, multivariate analyses, and diversity-based statistics are increasingly being applied to infer population structure, but in some instances these methods lack formalization with microsatellites. Migration-specific methods perform well only under narrow constraints. We also examine the use of microsatellites for inferring effective population size, changes in population size, and deeper demographic history, and find that these methods are untested and/or highly context-dependent. Overall, each method possesses important weaknesses for use with microsatellites, and there are significant constraints on inferences commonly made using microsatellite markers in the areas of population structure, admixture, and effective population size. To ameliorate and better understand these constraints, researchers are encouraged to analyze simulated datasets both prior to and following data collection and analysis, the latter of which is formalized within the approximate Bayesian computation framework. We also examine trends in the literature and show that microsatellites continue to be widely used, especially in non-human subject areas. This review assists with study design and molecular marker selection, facilitates sound interpretation of microsatellite data while fostering respect for their practical limitations, and identifies lessons that could be applied toward emerging markers and high-throughput technologies in population genetics.
Collapse
Affiliation(s)
- Alexander I Putman
- Department of Plant Pathology, North Carolina State University Raleigh, North Carolina, 27695-7616
| | - Ignazio Carbone
- Department of Plant Pathology, North Carolina State University Raleigh, North Carolina, 27695-7616
| |
Collapse
|
34
|
Butler IA, Siletti K, Oxley PR, Kronauer DJC. Conserved microsatellites in ants enable population genetic and colony pedigree studies across a wide range of species. PLoS One 2014; 9:e107334. [PMID: 25244681 PMCID: PMC4170976 DOI: 10.1371/journal.pone.0107334] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2014] [Accepted: 08/15/2014] [Indexed: 01/05/2023] Open
Abstract
Broadly applicable polymorphic genetic markers are essential tools for population genetics, and different types of markers have been developed for this purpose. Microsatellites have been employed as particularly polymorphic markers for over 20 years. However, PCR primers for microsatellite loci are often not useful outside the species for which they were designed. This implies that a new set of loci has to be identified and primers developed for every new study species. To overcome this constraint, we identified 45 conserved microsatellite loci based on the eight currently available ant genomes and designed primers for PCR amplification. Among these loci, we chose 24 for in-depth study in six species covering six different ant subfamilies. On average, 11.16 of these 24 loci were polymorphic and in Hardy-Weinberg equilibrium in any given species. The average number of alleles for these polymorphic loci within single populations of the different species was 4.59. This set of genetic markers will thus be useful for population genetic and colony pedigree studies across a wide range of ant species, supplementing the markers available for previously studied species and greatly facilitating the study of the many ant species lacking genetic markers. Our study shows that it is possible to develop microsatellite loci that are both conserved over a broad range of taxa, yet polymorphic within species. This should encourage researchers to develop similar tools for other large taxonomic groups.
Collapse
Affiliation(s)
- Ian A. Butler
- Laboratory of Insect Social Evolution, The Rockefeller University, New York, New York, United States of America
- * E-mail:
| | - Kimberly Siletti
- Laboratory of Insect Social Evolution, The Rockefeller University, New York, New York, United States of America
| | - Peter R. Oxley
- Laboratory of Insect Social Evolution, The Rockefeller University, New York, New York, United States of America
| | - Daniel J. C. Kronauer
- Laboratory of Insect Social Evolution, The Rockefeller University, New York, New York, United States of America
| |
Collapse
|
35
|
Tessereau C, Lesecque Y, Monnet N, Buisson M, Barjhoux L, Léoné M, Feng B, Goldgar DE, Sinilnikova OM, Mousset S, Duret L, Mazoyer S. Estimation of the RNU2 macrosatellite mutation rate by BRCA1 mutation tracing. Nucleic Acids Res 2014; 42:9121-30. [PMID: 25034697 PMCID: PMC4132748 DOI: 10.1093/nar/gku639] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022] Open
Abstract
Large tandem repeat sequences have been poorly investigated as severe technical limitations and their frequent absence from the genome reference hinder their analysis. Extensive allelotyping of this class of variation has not been possible until now and their mutational dynamics are still poorly known. In order to estimate the mutation rate of a macrosatellite, we analysed in detail the RNU2 locus, which displays at least 50 different alleles containing 5-82 copies of a 6.1 kb repeat unit. Mining data from the 1000 Genomes Project allowed us to precisely estimate copy numbers of the RNU2 repeat unit using read depth of coverage. This further revealed significantly different mean values in various recent modern human populations, favoring a scenario of fast evolution of this locus. Its proximity to a disease gene with numerous founder mutations, BRCA1, within the same linkage disequilibrium block, offered the unique opportunity to trace RNU2 arrays over a large timescale. Analysis of the transmission of RNU2 arrays associated with one ‘private’ mutation in an extended kindred and four founder mutations in multiple kindreds gave an estimation by maximum likelihood of 5 × 10−3 mutations per generation, which is close to that of microsatellites.
Collapse
Affiliation(s)
- Chloé Tessereau
- Genetics of Breast Cancer Team, Cancer Research Centre of Lyon, CNRS UMR5286, Inserm U1052, Université Lyon 1, Centre Léon Bérard, Lyon, France Genomic Vision, Bagneux, Paris, France
| | - Yann Lesecque
- Laboratoire de Biométrie et Biologie Evolutive, CNRS UMR5558, Université Lyon 1, France
| | - Nastasia Monnet
- Genetics of Breast Cancer Team, Cancer Research Centre of Lyon, CNRS UMR5286, Inserm U1052, Université Lyon 1, Centre Léon Bérard, Lyon, France
| | - Monique Buisson
- Genetics of Breast Cancer Team, Cancer Research Centre of Lyon, CNRS UMR5286, Inserm U1052, Université Lyon 1, Centre Léon Bérard, Lyon, France
| | - Laure Barjhoux
- Genetics of Breast Cancer Team, Cancer Research Centre of Lyon, CNRS UMR5286, Inserm U1052, Université Lyon 1, Centre Léon Bérard, Lyon, France
| | - Mélanie Léoné
- Unité Mixte de Génétique Constitutionnelle des Cancers Fréquents, Hospices Civils de Lyon/Centre Léon Bérard, Lyon, France
| | - Bingjian Feng
- Department of Dermatology and Huntsman Cancer Institute University of Utah School of Medicine, Salt Lake City, Utah, USA
| | - David E Goldgar
- Department of Dermatology and Huntsman Cancer Institute University of Utah School of Medicine, Salt Lake City, Utah, USA
| | - Olga M Sinilnikova
- Genetics of Breast Cancer Team, Cancer Research Centre of Lyon, CNRS UMR5286, Inserm U1052, Université Lyon 1, Centre Léon Bérard, Lyon, France Unité Mixte de Génétique Constitutionnelle des Cancers Fréquents, Hospices Civils de Lyon/Centre Léon Bérard, Lyon, France
| | - Sylvain Mousset
- Laboratoire de Biométrie et Biologie Evolutive, CNRS UMR5558, Université Lyon 1, France
| | - Laurent Duret
- Laboratoire de Biométrie et Biologie Evolutive, CNRS UMR5558, Université Lyon 1, France
| | - Sylvie Mazoyer
- Genetics of Breast Cancer Team, Cancer Research Centre of Lyon, CNRS UMR5286, Inserm U1052, Université Lyon 1, Centre Léon Bérard, Lyon, France
| |
Collapse
|
36
|
Murray ML, Yang M, Fauth C, Byers PH. FKBP14-related Ehlers-Danlos syndrome: Expansion of the phenotype to include vascular complications. Am J Med Genet A 2014; 164A:1750-5. [DOI: 10.1002/ajmg.a.36492] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2013] [Accepted: 01/22/2014] [Indexed: 11/07/2022]
Affiliation(s)
- Mitzi L. Murray
- Department of Pathology; University of Washington; Seattle Washington
- Department of Medicine (Medical Genetics); University of Washington; Seattle Washington
| | - Margaret Yang
- Department of Pathology; University of Washington; Seattle Washington
| | - Christine Fauth
- Division of Human Genetics; Innsbruck Medical University; Innsbruck Austria
| | - Peter H. Byers
- Department of Pathology; University of Washington; Seattle Washington
- Department of Medicine (Medical Genetics); University of Washington; Seattle Washington
| |
Collapse
|
37
|
Schaper E, Gascuel O, Anisimova M. Deep conservation of human protein tandem repeats within the eukaryotes. Mol Biol Evol 2014; 31:1132-48. [PMID: 24497029 PMCID: PMC3995336 DOI: 10.1093/molbev/msu062] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
Tandem repeats (TRs) are a major element of protein sequences in all domains of life. They are particularly abundant in mammals, where by conservative estimates one in three proteins contain a TR. High generation-scale duplication and deletion rates were reported for nucleic TR units. However, it is not known whether protein TR units can also be frequently lost or gained providing a source of variation for rapid adaptation of protein function, or alternatively, tend to have conserved TR unit configurations over long evolutionary times. To obtain a systematic picture, we performed a proteome-wide analysis of the mode of evolution for human protein TRs. For this purpose, we propose a novel method for the detection of orthologous TRs based on circular profile hidden Markov models. For all detected TRs, we reconstructed bispecies TR unit phylogenies across 61 eukaryotes ranging from human to yeast. Moreover, we performed additional analyses to correlate functional and structural annotations of human TRs with their mode of evolution. Surprisingly, we find that the vast majority of human TRs are ancient, with TR unit number and order preserved intact since distant speciation events. For example, ≥61% of all human TRs have been strongly conserved at least since the root of all mammals, approximately 300 Ma. Further, we find no human protein TR that shows evidence for strong recent duplications and deletions. The results are in contrast to the high generation-scale mutability of nucleic TRs. Presumably, most protein TRs fold into stable and conserved structures that are indispensable for the function of the TR-containing protein. All of our data and results are available for download from http://www.atgc-montpellier.fr/TRE.
Collapse
Affiliation(s)
- Elke Schaper
- Department of Computer Science, ETH Zürich, Zürich, Switzerland
| | | | | |
Collapse
|
38
|
LaRue BL, Lagacé R, Chang CW, Holt A, Hennessy L, Ge J, King JL, Chakraborty R, Budowle B. Characterization of 114 insertion/deletion (INDEL) polymorphisms, and selection for a global INDEL panel for human identification. Leg Med (Tokyo) 2014; 16:26-32. [DOI: 10.1016/j.legalmed.2013.10.006] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2013] [Revised: 08/19/2013] [Accepted: 10/22/2013] [Indexed: 11/15/2022]
|
39
|
Kvikstad EM, Duret L. Strong heterogeneity in mutation rate causes misleading hallmarks of natural selection on indel mutations in the human genome. Mol Biol Evol 2013; 31:23-36. [PMID: 24113537 PMCID: PMC3879449 DOI: 10.1093/molbev/mst185] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
Elucidating the mechanisms of mutation accumulation and fixation is critical to understand the nature of genetic variation and its contribution to genome evolution. Of particular interest is the effect of insertions and deletions (indels) on the evolution of genome landscapes. Recent population-scaled sequencing efforts provide unprecedented data for analyzing the relative impact of selection versus nonadaptive forces operating on indels. Here, we combined McDonald-Kreitman tests with the analysis of derived allele frequency spectra to investigate the dynamics of allele fixation of short (1-50 bp) indels in the human genome. Our analyses revealed apparently higher fixation probabilities for insertions than deletions. However, this fixation bias is not consistent with either selection or biased gene conversion and varies with local mutation rate, being particularly pronounced at indel hotspots. Furthermore, we identified an unprecedented number of loci with evidence for multiple indel events in the primate phylogeny. Even in nonrepetitive sequence contexts (a priori not prone to indel mutations), such loci are 60-fold more frequent than expected according to a model of uniform indel mutation rate. This provides evidence of as yet unidentified cryptic indel hotspots. We propose that indel homoplasy, at known and cryptic hotspots, produces systematic errors in determination of ancestral alleles via parsimony and advise caution interpreting classic selection tests given the strong heterogeneity in indel rates across the genome. These results will have great impact on studies seeking to infer evolutionary forces operating on indels observed in closely related species, because such mutations are traditionally presumed homoplasy-free.
Collapse
Affiliation(s)
- Erika M Kvikstad
- Laboratoire de Biométrie et Biologie Evolutive, UMR 5558, CNRS, Université Lyon 1, Villeurbanne, France
| | | |
Collapse
|
40
|
Loire E, Higuet D, Netter P, Achaz G. Evolution of coding microsatellites in primate genomes. Genome Biol Evol 2013; 5:283-95. [PMID: 23315383 PMCID: PMC3590770 DOI: 10.1093/gbe/evt003] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
Abstract
Microsatellites (SSRs) are highly susceptible to expansions and contractions. When located in a coding sequence, the insertion or the deletion of a single unit for a mono-, di-, tetra-, or penta(nucleotide)-SSR creates a frameshift. As a consequence, one would expect to find only very few of these SSRs in coding sequences because of their strong deleterious potential. Unexpectedly, genomes contain many coding SSRs of all types. Here, we report on a study of their evolution in a phylogenetic context using the genomes of four primates: human, chimpanzee, orangutan, and macaque. In a set of 5,015 orthologous genes unambiguously aligned among the four species, we show that, except for tri- and hexa-SSRs, for which insertions and deletions are frequently observed, SSRs in coding regions evolve mainly by substitutions. We show that the rate of substitution in all types of coding SSRs is typically two times higher than in the rest of coding sequences. Additionally, we observe that although numerous coding SSRs are created and lost by substitutions in the lineages, their numbers remain constant. This last observation suggests that the coding SSRs have reached equilibrium. We hypothesize that this equilibrium involves a combination of mutation, drift, and selection. We thus estimated the fitness cost of mono-SSRs and show that it increases with the number of units. We finally show that the cost of coding mono-SSRs greatly varies from function to function, suggesting that the strength of the selection that acts against them can be correlated to gene functions.
Collapse
Affiliation(s)
- Etienne Loire
- UMR 7138, Systématique, Adaptation, Evolution (UPMC, CNRS, MNHN, IRD), Paris, France
| | | | | | | |
Collapse
|
41
|
Ananda G, Walsh E, Jacob KD, Krasilnikova M, Eckert KA, Chiaromonte F, Makova KD. Distinct mutational behaviors differentiate short tandem repeats from microsatellites in the human genome. Genome Biol Evol 2013; 5:606-20. [PMID: 23241442 PMCID: PMC3622297 DOI: 10.1093/gbe/evs116] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
A tandem repeat's (TR) propensity to mutate increases with repeat number, and can become very pronounced beyond a critical boundary, transforming it into a microsatellite (MS). However, a clear understanding of the mutational behavior of different TR classes and motifs and related mechanisms is lacking, as is a consensus on the existence of a boundary separating short TRs (STRs) from MSs. This hinders our understanding of MSs' mutational properties and their effective use as genetic markers. Using indel calls for 179 individuals from 1000 Genomes Pilot-1 Project, we determined polymorphism incidence for four major TR classes, and formalized its varying relationship with repeat number using segmented regression. We observed a biphasic regime with a transition from a faster to a slower exponential growth at 9, 5, 4, and 4 repeats for mono-, di-, tri-, and tetranucleotide TRs, respectively. We used an in vitro mutagenesis assay to evaluate the contribution of strand slippage errors to mutability. STRs and MSs differ in their absolute polymorphism levels, but more importantly in their rates of mutability growth. Although strand slippage is a major factor driving mononucleotide polymorphism incidence, dinucleotide polymorphism incidence is greater than that expected due to strand slippage alone, indicating that additional cellular factors might be driving dinucleotide mutability in the human genome. Leveraging on hundreds of human genomes, we present the first comprehensive, genome-wide analysis of TR mutational behavior, encompassing several motif sizes and compositions.
Collapse
Affiliation(s)
- Guruprasad Ananda
- Integrative Biosciences, Bioinformatics and Genomics Option, Pennsylvania State University, PA, USA
| | | | | | | | | | | | | |
Collapse
|
42
|
Gao C, Ren X, Mason AS, Li J, Wang W, Xiao M, Fu D. Revisiting an important component of plant genomes: microsatellites. FUNCTIONAL PLANT BIOLOGY : FPB 2013; 40:645-661. [PMID: 32481138 DOI: 10.1071/fp12325] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/31/2012] [Accepted: 01/16/2013] [Indexed: 06/11/2023]
Abstract
Microsatellites are some of the most highly variable repetitive DNA tracts in genomes. Few studies focus on whether the characteristic instability of microsatellites is linked to phenotypic effects in plants. We summarise recent data to investigate how microsatellite variations affect gene expression and hence phenotype. We discuss how the basic characteristics of microsatellites may contribute to phenotypic effects. In summary, microsatellites in plants are universal and highly mutable, they coexist and coevolve with transposable elements, and are under selective pressure. The number of motif nucleotides, the type of motif and transposon activity all contribute to the nonrandom generation and decay of microsatellites, and to conservation and distribution biases. Although microsatellites are generated by accident, they mature through responses to environmental change before final decay. This process is mediated by organism adjustment mechanisms, which maintain a balance between birth versus death and growth versus decay in microsatellites. Close relationships also exist between the physical structure, variation and functionality of microsatellites: in most plant species, sequences containing microsatellites are associated with catalytic activity and binding functions, are expressed in the membrane and organelles, and participate in the developmental and metabolic processes. Microsatellites contribute to genome structure and functional plasticity, and may be considered to promote species evolution in plants in response to environmental changes. In conclusion, the generation, loss, functionality and evolution of microsatellites can be related to plant gene expression and functional alterations. The effect of microsatellites on phenotypic variation may be as significant in plants as it is in animals.
Collapse
Affiliation(s)
- Caihua Gao
- Engineering Research Center of South Upland Agriculture, Ministry of Education, College of Agronomy and Biotechnology, Southwest University, Chongqing 400715, China
| | - Xiaodong Ren
- Engineering Research Center of South Upland Agriculture, Ministry of Education, College of Agronomy and Biotechnology, Southwest University, Chongqing 400715, China
| | - Annaliese S Mason
- Centre for Integrative Legume Research and School of Agriculture and Food Sciences, The University of Queensland, Brisbane 4072, Qld, Australia
| | - Jiana Li
- Engineering Research Center of South Upland Agriculture, Ministry of Education, College of Agronomy and Biotechnology, Southwest University, Chongqing 400715, China
| | - Wei Wang
- Engineering Research Center of South Upland Agriculture, Ministry of Education, College of Agronomy and Biotechnology, Southwest University, Chongqing 400715, China
| | - Meili Xiao
- Engineering Research Center of South Upland Agriculture, Ministry of Education, College of Agronomy and Biotechnology, Southwest University, Chongqing 400715, China
| | - Donghui Fu
- Key Laboratory of Crop Physiology, Ecology and Genetic Breeding, Ministry of Education, Jiangxi Agricultural University, Nanchang, Jiangxi 330045, China
| |
Collapse
|
43
|
Pause-melting misalignment: a novel model for the birth and motif indel of tandem repeats in the mitochondrial genome. BMC Genomics 2013; 14:103. [PMID: 23414101 PMCID: PMC3599147 DOI: 10.1186/1471-2164-14-103] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2012] [Accepted: 02/11/2013] [Indexed: 01/11/2023] Open
Abstract
Background Tandem repeats (TRs) in the mitochondrial (mt) genome control region have been documented in a wide variety of vertebrate species. The mechanism by which repeated tracts originate and undergo duplication and deletion, however, remains unclear. Results We analyzed DNA sequences of mt genome TRs (mtTRs) in the ridged-eye flounder (Pleuronichthys cornutus), and characterized DNA sequences of mtTRs from other vertebrates using the data available in GenBank. Tandem repeats are concentrated in the control regions; however, we found approximately 16.6% of the TRs elsewhere in the mt genome. The flounder mtTRs possess three motif types with hypervariable characteristics at the 3′ end of the control region (CR). Conclusion Based on our analysis of this larger dataset of mtTR sequences, we propose a novel model of Pause Melting Misalignment (PMM) to describe the birth and motif indel of tandem repeats. PMM is activated during a pause event in mitochondrial replication in which a dynamic competition between the nascent (N) heavy strand and the displaced (D) heavy strand may lead to the melting of the N-strand from the template (T) light strand. When mispairing occurs during rebinding of the N-strand, one or several motifs can be inserted or deleted in both strands during the next round of mt-replication or repair. This model can explain the characteristics of TRs in available vertebrate mt genomes.
Collapse
|
44
|
Zarnack K, König J, Tajnik M, Martincorena I, Eustermann S, Stévant I, Reyes A, Anders S, Luscombe N, Ule J. Direct competition between hnRNP C and U2AF65 protects the transcriptome from the exonization of Alu elements. Cell 2013; 152:453-66. [PMID: 23374342 PMCID: PMC3629564 DOI: 10.1016/j.cell.2012.12.023] [Citation(s) in RCA: 346] [Impact Index Per Article: 28.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2012] [Revised: 09/22/2012] [Accepted: 12/12/2012] [Indexed: 11/26/2022]
Abstract
There are ~650,000 Alu elements in transcribed regions of the human genome. These elements contain cryptic splice sites, so they are in constant danger of aberrant incorporation into mature transcripts. Despite posing a major threat to transcriptome integrity, little is known about the molecular mechanisms preventing their inclusion. Here, we present a mechanism for protecting the human transcriptome from the aberrant exonization of transposable elements. Quantitative iCLIP data show that the RNA-binding protein hnRNP C competes with the splicing factor U2AF65 at many genuine and cryptic splice sites. Loss of hnRNP C leads to formation of previously suppressed Alu exons, which severely disrupt transcript function. Minigene experiments explain disease-associated mutations in Alu elements that hamper hnRNP C binding. Thus, by preventing U2AF65 binding to Alu elements, hnRNP C plays a critical role as a genome-wide sentinel protecting the transcriptome. The findings have important implications for human evolution and disease.
Collapse
Affiliation(s)
- Kathi Zarnack
- European Molecular Biology Laboratory (EMBL) European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Julian König
- MRC Laboratory of Molecular Biology, Hills Road, Cambridge CB2 0QH, UK
| | - Mojca Tajnik
- MRC Laboratory of Molecular Biology, Hills Road, Cambridge CB2 0QH, UK
- Faculty of Medicine, University of Ljubljana, Vrazov trg 2, SI-1104 Ljubljana, Slovenia
| | - Iñigo Martincorena
- European Molecular Biology Laboratory (EMBL) European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | | | - Isabelle Stévant
- European Molecular Biology Laboratory (EMBL) European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Alejandro Reyes
- EMBL, Genome Biology Unit, Meyerhofstraße 1, 69117 Heidelberg, Germany
| | - Simon Anders
- EMBL, Genome Biology Unit, Meyerhofstraße 1, 69117 Heidelberg, Germany
| | - Nicholas M. Luscombe
- European Molecular Biology Laboratory (EMBL) European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
- UCL Genetics Institute, Department of Genetics, Environment and Evolution, University College London, Gower Street, London WC1E 6BT, UK
- Cancer Research UK London Research Institute, 44 Lincoln’s Inn Fields, London WC2A 3LY, UK
- Okinawa Institute for Science and Technology Graduate University, 1919-1 Tancha, Onna-son, Kunigami-gun, Okinawa 904-0495, Japan
| | - Jernej Ule
- MRC Laboratory of Molecular Biology, Hills Road, Cambridge CB2 0QH, UK
| |
Collapse
|
45
|
Stolle E, Kidner JH, Moritz RFA. Patterns of evolutionary conservation of microsatellites (SSRs) suggest a faster rate of genome evolution in Hymenoptera than in Diptera. Genome Biol Evol 2013; 5:151-62. [PMID: 23292136 PMCID: PMC3595035 DOI: 10.1093/gbe/evs133] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/21/2012] [Indexed: 12/25/2022] Open
Abstract
Microsatellites, or simple sequence repeats (SSRs), are common and widespread DNA elements in genomes of many organisms. However, their dynamics in genome evolution is unclear, whereby they are thought to evolve neutrally. More available genome sequences along with dated phylogenies allowed for studying the evolution of these repetitive DNA elements along evolutionary time scales. This could be used to compare rates of genome evolution. We show that SSRs in insects can be retained for several hundred million years. Different types of microsatellites seem to be retained longer than others. By comparing Dipteran with Hymenopteran species, we found very similar patterns of SSR loss during their evolution, but both taxa differ profoundly in the rate. Relative to divergence time, Diptera lost SSRs twice as fast as Hymenoptera. The loss of SSRs on the Drosophila melanogaster X-chromosome was higher than on the other chromosomes. However, accounting for generation time, the Diptera show an 8.5-fold slower rate of SSR loss than the Hymenoptera, which, in contrast to previous studies, suggests a faster genome evolution in the latter. This shows that generation time differences can have a profound effect. A faster genome evolution in these insects could be facilitated by several factors very different to Diptera, which is discussed in light of our results on the haplodiploid D. melanogaster X-chromosome. Furthermore, large numbers of SSRs can be found to be in synteny and thus could be exploited as a tool to investigate genome structure and evolution.
Collapse
Affiliation(s)
- Eckart Stolle
- Department of Zoology, Institute of Biology, Martin-Luther-University Halle-Wittenberg, Halle (Saale), Germany.
| | | | | |
Collapse
|
46
|
Joukhadar R, Jighly A. Microsatellites grant more stable flanking genes. BMC Res Notes 2012; 5:556. [PMID: 23035963 PMCID: PMC3515467 DOI: 10.1186/1756-0500-5-556] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2012] [Accepted: 10/02/2012] [Indexed: 12/19/2022] Open
Abstract
Background Microsatellites, or simple sequence repeats (SSRs), are DNA sequences that include tandem copies of specific sequences no longer than six bases. SSRs are ubiquitous in all genomes and highly mutable. Presentation of the hypothesis Results from previous studies suggest that flanking regions of SSR are exhibit high stability in a wide range of organisms. We hypothesized that the SSRs ability to discard weak DNA polymerases could be responsible for this unusual stability. . When the weak polymerases are being decayed over SSRs, the flanking sequences would have higher opportunity to be replicated by more stable DNA polymerases. We present evidence of the molecular basis of our hypothesis. Testing the hypothesis The hypothesis could be tested by examining the activity of DNA polymerase during and after a number of PCRs. The PCR reactions should be run with the same SSR locus possessing differences in the SSR length. The hypothesis could also be tested by comparing the mutational rate of a transferred gene between two transformations. The first one has a naked T-DNA (transferred DNA), while the second one has the same T-DNA flanked with two SSRs. Implications of the hypothesis In any transformation experiment, flanking the T-DNA fragment with SSR sequences would result in more stably transferred genes. This process would decrease the unpredictable risks that may occur because of the mutational pressure on this foreign segment.
Collapse
|
47
|
Meglécz E, Nève G, Biffin E, Gardner MG. Breakdown of phylogenetic signal: a survey of microsatellite densities in 454 shotgun sequences from 154 non model eukaryote species. PLoS One 2012; 7:e40861. [PMID: 22815847 PMCID: PMC3397955 DOI: 10.1371/journal.pone.0040861] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2012] [Accepted: 06/14/2012] [Indexed: 11/19/2022] Open
Abstract
Microsatellites are ubiquitous in Eukaryotic genomes. A more complete understanding of their origin and spread can be gained from a comparison of their distribution within a phylogenetic context. Although information for model species is accumulating rapidly, it is insufficient due to a lack of species depth, thus intragroup variation is necessarily ignored. As such, apparent differences between groups may be overinflated and generalizations cannot be inferred until an analysis of the variation that exists within groups has been conducted. In this study, we examined microsatellite coverage and motif patterns from 454 shotgun sequences of 154 Eukaryote species from eight distantly related phyla (Cnidaria, Arthropoda, Onychophora, Bryozoa, Mollusca, Echinodermata, Chordata and Streptophyta) to test if a consistent phylogenetic pattern emerges from the microsatellite composition of these species. It is clear from our results that data from model species provide incomplete information regarding the existing microsatellite variability within the Eukaryotes. A very strong heterogeneity of microsatellite composition was found within most phyla, classes and even orders. Autocorrelation analyses indicated that while microsatellite contents of species within clades more recent than 200 Mya tend to be similar, the autocorrelation breaks down and becomes negative or non-significant with increasing divergence time. Therefore, the age of the taxon seems to be a primary factor in degrading the phylogenetic pattern present among related groups. The most recent classes or orders of Chordates still retain the pattern of their common ancestor. However, within older groups, such as classes of Arthropods, the phylogenetic pattern has been scrambled by the long independent evolution of the lineages.
Collapse
Affiliation(s)
- Emese Meglécz
- IMBE UMR 7263 CNRS IRD, Aix-Marseille University, Marseille, France.
| | | | | | | |
Collapse
|
48
|
Sawaya SM, Lennon D, Buschiazzo E, Gemmell N, Minin VN. Measuring microsatellite conservation in mammalian evolution with a phylogenetic birth-death model. Genome Biol Evol 2012; 4:636-47. [PMID: 22593552 PMCID: PMC3516246 DOI: 10.1093/gbe/evs050] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Microsatellites make up ∼3% of the human genome, and there is increasing evidence that some microsatellites can have important functions and can be conserved by selection. To investigate this conservation, we performed a genome-wide analysis of human microsatellites and measured their conservation using a binary character birth--death model on a mammalian phylogeny. Using a maximum likelihood method to estimate birth and death rates for different types of microsatellites, we show that the rates at which microsatellites are gained and lost in mammals depend on their sequence composition, length, and position in the genome. Additionally, we use a mixture model to account for unequal death rates among microsatellites across the human genome. We use this model to assign a probability-based conservation score to each microsatellite. We found that microsatellites near the transcription start sites of genes are often highly conserved, and that distance from a microsatellite to the nearest transcription start site is a good predictor of the microsatellite conservation score. An analysis of gene ontology terms for genes that contain microsatellites near their transcription start site reveals that regulatory genes involved in growth and development are highly enriched with conserved microsatellites.
Collapse
Affiliation(s)
- Sterling M Sawaya
- Centre for Reproduction and Genomics, Department of Anatomy and Structural Biology, University of Otago, Dunedin, New Zealand
| | | | | | | | | |
Collapse
|
49
|
Liu G, Leffak M. Instability of (CTG)n•(CAG)n trinucleotide repeats and DNA synthesis. Cell Biosci 2012; 2:7. [PMID: 22369689 PMCID: PMC3310812 DOI: 10.1186/2045-3701-2-7] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2011] [Accepted: 02/27/2012] [Indexed: 12/21/2022] Open
Abstract
Expansion of (CTG)n•(CAG)n trinucleotide repeat (TNR) microsatellite sequences is the cause of more than a dozen human neurodegenerative diseases. (CTG)n and (CAG)n repeats form imperfectly base paired hairpins that tend to expand in vivo in a length-dependent manner. Yeast, mouse and human models confirm that (CTG)n•(CAG)n instability increases with repeat number, and implicate both DNA replication and DNA damage response mechanisms in (CTG)n•(CAG)n TNR expansion and contraction. Mutation and knockdown models that abrogate the expression of individual genes might also mask more subtle, cumulative effects of multiple additional pathways on (CTG)n•(CAG)n instability in whole animals. The identification of second site genetic modifiers may help to explain the variability of (CTG)n•(CAG)n TNR instability patterns between tissues and individuals, and offer opportunities for prognosis and treatment.
Collapse
Affiliation(s)
- Guoqi Liu
- Department of Biochemistry and Molecular Biology, Boonshoft School of Medicine, Wright State University, Dayton, OH 45435, USA.
| | | |
Collapse
|
50
|
Meglécz E, Nève G, Biffin E, Gardner MG. Breakdown of phylogenetic signal: a survey of microsatellite densities in 454 shotgun sequences from 154 non model eukaryote species. PLoS One 2012. [PMID: 22815847 DOI: 10.1371/journal.pone.004086] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/11/2023] Open
Abstract
Microsatellites are ubiquitous in Eukaryotic genomes. A more complete understanding of their origin and spread can be gained from a comparison of their distribution within a phylogenetic context. Although information for model species is accumulating rapidly, it is insufficient due to a lack of species depth, thus intragroup variation is necessarily ignored. As such, apparent differences between groups may be overinflated and generalizations cannot be inferred until an analysis of the variation that exists within groups has been conducted. In this study, we examined microsatellite coverage and motif patterns from 454 shotgun sequences of 154 Eukaryote species from eight distantly related phyla (Cnidaria, Arthropoda, Onychophora, Bryozoa, Mollusca, Echinodermata, Chordata and Streptophyta) to test if a consistent phylogenetic pattern emerges from the microsatellite composition of these species. It is clear from our results that data from model species provide incomplete information regarding the existing microsatellite variability within the Eukaryotes. A very strong heterogeneity of microsatellite composition was found within most phyla, classes and even orders. Autocorrelation analyses indicated that while microsatellite contents of species within clades more recent than 200 Mya tend to be similar, the autocorrelation breaks down and becomes negative or non-significant with increasing divergence time. Therefore, the age of the taxon seems to be a primary factor in degrading the phylogenetic pattern present among related groups. The most recent classes or orders of Chordates still retain the pattern of their common ancestor. However, within older groups, such as classes of Arthropods, the phylogenetic pattern has been scrambled by the long independent evolution of the lineages.
Collapse
Affiliation(s)
- Emese Meglécz
- IMBE UMR 7263 CNRS IRD, Aix-Marseille University, Marseille, France.
| | | | | | | |
Collapse
|