1
|
Vegezzi E, Ishiura H, Bragg DC, Pellerin D, Magrinelli F, Currò R, Facchini S, Tucci A, Hardy J, Sharma N, Danzi MC, Zuchner S, Brais B, Reilly MM, Tsuji S, Houlden H, Cortese A. Neurological disorders caused by novel non-coding repeat expansions: clinical features and differential diagnosis. Lancet Neurol 2024; 23:725-739. [PMID: 38876750 DOI: 10.1016/s1474-4422(24)00167-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2024] [Revised: 04/04/2024] [Accepted: 04/09/2024] [Indexed: 06/16/2024]
Abstract
Nucleotide repeat expansions in the human genome are a well-known cause of neurological disease. In the past decade, advances in DNA sequencing technologies have led to a better understanding of the role of non-coding DNA, that is, the DNA that is not transcribed into proteins. These techniques have also enabled the identification of pathogenic non-coding repeat expansions that cause neurological disorders. Mounting evidence shows that adult patients with familial or sporadic presentations of epilepsy, cognitive dysfunction, myopathy, neuropathy, ataxia, or movement disorders can be carriers of non-coding repeat expansions. The description of the clinical, epidemiological, and molecular features of these recently identified non-coding repeat expansion disorders should guide clinicians in the diagnosis and management of these patients, and help in the genetic counselling for patients and their families.
Collapse
Affiliation(s)
| | - Hiroyuki Ishiura
- Department of Neurology, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan
| | - D Cristopher Bragg
- Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
| | - David Pellerin
- Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology and The National Hospital for Neurology and Neurosurgery, London, UK; Department of Neurology and Neurosurgery, Montreal Neurological Hospital and Institute, McGill University, Montreal, QC, Canada
| | - Francesca Magrinelli
- Department of Clinical and Movement Neurosciences, UCL Queen Square Institute of Neurology and The National Hospital for Neurology and Neurosurgery, London, UK
| | - Riccardo Currò
- Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology and The National Hospital for Neurology and Neurosurgery, London, UK; Department of Brain and Behavioral Sciences, University of Pavia, Pavia, Italy
| | - Stefano Facchini
- IRCCS Mondino Foundation, Pavia, Italy; Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology and The National Hospital for Neurology and Neurosurgery, London, UK
| | - Arianna Tucci
- Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology and The National Hospital for Neurology and Neurosurgery, London, UK; William Harvey Research Institute, Queen Mary University of London, London, UK
| | - John Hardy
- Department of Neurogedengerative Disease, UCL Queen Square Institute of Neurology and The National Hospital for Neurology and Neurosurgery, London, UK
| | - Nutan Sharma
- Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
| | - Matt C Danzi
- Department of Human Genetics and Hussman Institute for Human Genomics, University of Miami Miller School of Medicine, Miami, FL, USA
| | - Stephan Zuchner
- Department of Human Genetics and Hussman Institute for Human Genomics, University of Miami Miller School of Medicine, Miami, FL, USA
| | - Bernard Brais
- Department of Neurology and Neurosurgery, Montreal Neurological Hospital and Institute, McGill University, Montreal, QC, Canada
| | - Mary M Reilly
- Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology and The National Hospital for Neurology and Neurosurgery, London, UK
| | - Shoji Tsuji
- Department of Neurology, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan; Institute of Medical Genomics, International University of Health and Welfare, Chiba, Japan
| | - Henry Houlden
- Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology and The National Hospital for Neurology and Neurosurgery, London, UK
| | - Andrea Cortese
- Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology and The National Hospital for Neurology and Neurosurgery, London, UK; Department of Brain and Behavioral Sciences, University of Pavia, Pavia, Italy.
| |
Collapse
|
2
|
Dai J, Rubel T, Han Y, Molloy EK. Dollo-CDP: a polynomial-time algorithm for the clade-constrained large Dollo parsimony problem. Algorithms Mol Biol 2024; 19:2. [PMID: 38191515 PMCID: PMC10775561 DOI: 10.1186/s13015-023-00249-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2023] [Accepted: 12/10/2023] [Indexed: 01/10/2024] Open
Abstract
The last decade of phylogenetics has seen the development of many methods that leverage constraints plus dynamic programming. The goal of this algorithmic technique is to produce a phylogeny that is optimal with respect to some objective function and that lies within a constrained version of tree space. The popular species tree estimation method ASTRAL, for example, returns a tree that (1) maximizes the quartet score computed with respect to the input gene trees and that (2) draws its branches (bipartitions) from the input constraint set. This technique has yet to be used for parsimony problems where the input are binary characters, sometimes with missing values. Here, we introduce the clade-constrained character parsimony problem and present an algorithm that solves this problem for the Dollo criterion score in [Formula: see text] time, where n is the number of leaves, k is the number of characters, and [Formula: see text] is the set of clades used as constraints. Dollo parsimony, which requires traits/mutations to be gained at most once but allows them to be lost any number of times, is widely used for tumor phylogenetics as well as species phylogenetics, for example analyses of low-homoplasy retroelement insertions across the vertebrate tree of life. This motivated us to implement our algorithm in a software package, called Dollo-CDP, and evaluate its utility for analyzing retroelement insertion presence / absence patterns for bats, birds, toothed whales as well as simulated data. Our results show that Dollo-CDP can improve upon heuristic search from a single starting tree, often recovering a better scoring tree. Moreover, Dollo-CDP scales to data sets with much larger numbers of taxa than branch-and-bound while still having an optimality guarantee, albeit a more restricted one. Lastly, we show that our algorithm for Dollo parsimony can easily be adapted to Camin-Sokal parsimony but not Fitch parsimony.
Collapse
Affiliation(s)
- Junyan Dai
- Department of Computer Science, University of Maryland, College Park, MD, USA
| | - Tobias Rubel
- Department of Computer Science, University of Maryland, College Park, MD, USA
| | - Yunheng Han
- Department of Computer Science, University of Maryland, College Park, MD, USA
| | - Erin K Molloy
- Department of Computer Science, University of Maryland, College Park, MD, USA.
- University of Maryland Institute for Advanced Computer Studies, College Park, MD, USA.
| |
Collapse
|
3
|
Doronina L, Ogoniak L, Schmitz J. Homoplasy of Retrotransposon Insertions in Toothed Whales. Genes (Basel) 2023; 14:1830. [PMID: 37761970 PMCID: PMC10531181 DOI: 10.3390/genes14091830] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2023] [Revised: 09/19/2023] [Accepted: 09/20/2023] [Indexed: 09/29/2023] Open
Abstract
Retrotransposon insertion patterns facilitate a virtually homoplasy-free picture of phylogenetic history. Still, a few most likely random parallel insertions or deletions result in rare cases of homoplasy in primates. The following question arises: how frequent is retrotransposon homoplasy in other phylogenetic clades? Here, we derived genome insertion data of toothed whales to evaluate the extension of homoplasy in a representative laurasiatherian group. Among more than a thousand extracted and aligned retrotransposon loci, we detected 37 cases of precise parallel insertions in species that are separated by over more than 10 million years, a time frame which minimizes the effects of incomplete lineage sorting. We compared the phylogenetic signal of insertions with the flanking sequences of these loci to further exclude potential polymorphic loci derived by incomplete lineage sorting. We found that the phylogenetic signals of retrotransposon insertion patterns exhibiting true homoplasy differ from the signals of their flanking sequences. In toothed whales, precise parallel insertions account for around 0.18-0.29% of insertion cases, which is about 12.5 times the frequency of such insertions among Alus in primates. We also detected five specific deletions of retrotransposons on various lineages of toothed whale evolution, a frequency of 0.003%, which is slightly higher than such occurrences in primates. Overall, the level of retrotransposon homoplasy in toothed whales is still marginal compared to the phylogenetic diagnostic retrotransposon presence/absence signal.
Collapse
Affiliation(s)
- Liliya Doronina
- Institute of Experimental Pathology, ZMBE, University of Münster, 48149 Münster, Germany;
- Institute for Evolution and Biodiversity, University of Münster, 48149 Münster, Germany
| | - Lynn Ogoniak
- Institute of Experimental Pathology, ZMBE, University of Münster, 48149 Münster, Germany;
| | - Jürgen Schmitz
- Institute of Experimental Pathology, ZMBE, University of Münster, 48149 Münster, Germany;
| |
Collapse
|
4
|
Churakov G, Kuritzin A, Chukharev K, Zhang F, Wünnemann F, Ulyantsev V, Schmitz J. A 4-lineage Statistical Suite to Evaluate the Support of Large-Scale Retrotransposon Insertion Data to Reconstruct Evolutionary Trees. Syst Biol 2023; 72:649-661. [PMID: 36688484 DOI: 10.1093/sysbio/syac082] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2020] [Revised: 12/04/2022] [Accepted: 12/23/2022] [Indexed: 01/24/2023] Open
Abstract
Retrophylogenomics makes use of genome-wide retrotransposon presence/absence insertion patterns to resolve questions in phylogeny and population genetics. In the genomics era, evaluating high-throughput data requires the associated development of appropriately powerful statistical tools. The currently used KKSC 3-lineage statistical test for estimating the significance of retrophylogenomic data is limited by the number of possible tree topologies it can assess in one step. To improve on this, we have extended the analysis to simultaneously compare four lineages, enabling us to evaluate ten distinct presence/absence insertion patterns for 26 possible tree topologies plus 129 trees with different incidences of hybridization or introgression. The new tool provides statistics for cases involving multiple ancestral hybridizations/introgressions, ancestral incomplete lineage sorting, bifurcation, and polytomy. The test is embedded in a user-friendly web R application (http://retrogenomics.uni-muenster.de:3838/hammlet/) and is available for use by the scientific community. [ancestral hybridization/introgression; ancestral incomplete lineage sorting (ILS); empirical distribution; KKSC-statistics; 4-lineage (4-LIN) insertion polymorphism; polytomy; retrophylogenomics.].
Collapse
Affiliation(s)
- Gennady Churakov
- Institute of Experimental Pathology (ZMBE), University of Münster, Münster, Germany
- Department of Biochemistry, Institute of Experimental Medicine, St. Petersburg, Russia
| | - Andrej Kuritzin
- Department of System Analysis, Saint Petersburg State Institute of Technology, St. Petersburg, Russia
| | - Konstantin Chukharev
- Information Technologies, Mechanics and Optics, University Saint Petersburg, St. Petersburg, Russia
| | - Fengjun Zhang
- Institute of Experimental Pathology (ZMBE), University of Münster, Münster, Germany
| | - Florian Wünnemann
- Institute for Computational Biomedicine, University Heidelberg, Heidelberg, Germany
| | - Vladimir Ulyantsev
- Information Technologies, Mechanics and Optics, University Saint Petersburg, St. Petersburg, Russia
| | - Jürgen Schmitz
- Institute of Experimental Pathology (ZMBE), University of Münster, Münster, Germany
| |
Collapse
|
5
|
Storer JM, Walker JA, Rewerts LC, Brown MA, Beckstrom TO, Herke SW, Roos C, Batzer MA. Owl Monkey Alu Insertion Polymorphisms and Aotus Phylogenetics. Genes (Basel) 2022; 13:2069. [PMID: 36360306 PMCID: PMC9691001 DOI: 10.3390/genes13112069] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2022] [Revised: 10/25/2022] [Accepted: 11/04/2022] [Indexed: 07/30/2023] Open
Abstract
Owl monkeys (genus Aotus), or "night monkeys" are platyrrhine primates in the Aotidae family. Early taxonomy only recognized one species, Aotus trivirgatus, until 1983, when Hershkovitz proposed nine unique species designations, classified into red-necked and gray-necked species groups based predominately on pelage coloration. Recent studies questioned this conventional separation of the genus and proposed designations based on the geographical location of wild populations. Alu retrotransposons are a class of mobile element insertion (MEI) widely used to study primate phylogenetics. A scaffold-level genome assembly for one Aotus species, Aotus nancymaae [Anan_2.0], facilitated large-scale ascertainment of nearly 2000 young lineage-specific Alu insertions. This study provides candidate oligonucleotides for locus-specific PCR assays for over 1350 of these elements. For 314 Alu elements across four taxa with multiple specimens, PCR analyses identified 159 insertion polymorphisms, including 21 grouping A. nancymaae and Aotus azarae (red-necked species) as sister taxa, with Aotus vociferans and A. trivirgatus (gray-necked) being more basal. DNA sequencing identified five novel Alu elements from three different taxa. The Alu datasets reported in this study will assist in species identification and provide a valuable resource for Aotus phylogenetics, population genetics and conservation strategies when applied to wild populations.
Collapse
Affiliation(s)
- Jessica M. Storer
- Department of Biological Sciences, Louisiana State University, 202 Life Sciences Building, Baton Rouge, LA 70803, USA
- Institute for Systems Biology, Seattle, WA 98109, USA
| | - Jerilyn A. Walker
- Department of Biological Sciences, Louisiana State University, 202 Life Sciences Building, Baton Rouge, LA 70803, USA
| | - Lydia C. Rewerts
- Department of Biological Sciences, Louisiana State University, 202 Life Sciences Building, Baton Rouge, LA 70803, USA
| | - Morgan A. Brown
- Department of Biological Sciences, Louisiana State University, 202 Life Sciences Building, Baton Rouge, LA 70803, USA
| | - Thomas O. Beckstrom
- Department of Biological Sciences, Louisiana State University, 202 Life Sciences Building, Baton Rouge, LA 70803, USA
- Department of Oral and Maxillofacial Surgery, University of Washington, 1959 NE Pacific Street, Health Sciences Building B-241, Seattle, WA 98195, USA
| | - Scott W. Herke
- Department of Biological Sciences, Louisiana State University, 202 Life Sciences Building, Baton Rouge, LA 70803, USA
| | - Christian Roos
- Gene Bank of Primates and Primate Genetics Laboratory, German Primate Center, Leibniz Institute for Primate Research, 37077 Göttingen, Germany
| | - Mark A. Batzer
- Department of Biological Sciences, Louisiana State University, 202 Life Sciences Building, Baton Rouge, LA 70803, USA
| |
Collapse
|
6
|
Doronina L, Feigin CY, Schmitz J. Reunion of Australasian Possums by Shared SINE Insertions. Syst Biol 2022; 71:1045-1053. [PMID: 35289914 PMCID: PMC9366447 DOI: 10.1093/sysbio/syac025] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2021] [Revised: 03/09/2022] [Accepted: 03/11/2022] [Indexed: 11/29/2022] Open
Abstract
Although first posited to be of a single origin, the two superfamilies of phalangeriform marsupial possums (Phalangeroidea: brushtail possums and cuscuses and Petauroidea: possums and gliders) have long been considered, based on multiple sequencing studies, to have evolved from two separate origins. However, previous data from these sequence analyses suggested a variety of conflicting trees. Therefore, we reinvestigated these relationships by screening $\sim$200,000 orthologous short interspersed element (SINE) loci across the newly available whole-genome sequences of phalangeriform species and their relatives. Compared to sequence data, SINE presence/absence patterns are evolutionarily almost neutral molecular markers of the phylogenetic history of species. Their random and highly complex genomic insertion ensures their virtually homoplasy-free nature and enables one to compare hundreds of shared unique orthologous events to determine the true species tree. Here, we identify 106 highly reliable phylogenetic SINE markers whose presence/absence patterns within multiple Australasian possum genomes unexpectedly provide the first significant evidence for the reunification of Australasian possums into one monophyletic group. Together, our findings indicate that nucleotide homoplasy and ancestral incomplete lineage sorting have most likely driven the conflicting signal distributions seen in previous sequence-based studies. [Ancestral incomplete lineage sorting; possum genomes; possum monophyly; retrophylogenomics; SINE presence/absence.].
Collapse
Affiliation(s)
- Liliya Doronina
- Institute of Experimental Pathology (ZMBE), University of Münster, Von-Esmarch-Str. 56, D-48149 Münster, Germany
| | - Charles Y Feigin
- Department of Molecular Biology, Princeton University, 119 Lewis Thomas Laboratory, Washington Road, Princeton, NJ 08544-1014, USA
- School of BioSciences, The University of Melbourne, BioSciences 4, Royal Pde, Parkville, VIC 3010, Australia
| | - Jürgen Schmitz
- Institute of Experimental Pathology (ZMBE), University of Münster, Von-Esmarch-Str. 56, D-48149 Münster, Germany
| |
Collapse
|
7
|
Han S, Dias GB, Basting PJ, Nelson MG, Patel S, Marzo M, Bergman CM. Ongoing transposition in cell culture reveals the phylogeny of diverse Drosophila S2 sublines. Genetics 2022; 221:iyac077. [PMID: 35536183 PMCID: PMC9252272 DOI: 10.1093/genetics/iyac077] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2022] [Accepted: 04/28/2022] [Indexed: 11/13/2022] Open
Abstract
Cultured cells are widely used in molecular biology despite poor understanding of how cell line genomes change in vitro over time. Previous work has shown that Drosophila cultured cells have a higher transposable element content than whole flies, but whether this increase in transposable element content resulted from an initial burst of transposition during cell line establishment or ongoing transposition in cell culture remains unclear. Here, we sequenced the genomes of 25 sublines of Drosophila S2 cells and show that transposable element insertions provide abundant markers for the phylogenetic reconstruction of diverse sublines in a model animal cell culture system. DNA copy number evolution across S2 sublines revealed dramatically different patterns of genome organization that support the overall evolutionary history reconstructed using transposable element insertions. Analysis of transposable element insertion site occupancy and ancestral states support a model of ongoing transposition dominated by episodic activity of a small number of retrotransposon families. Our work demonstrates that substantial genome evolution occurs during long-term Drosophila cell culture, which may impact the reproducibility of experiments that do not control for subline identity.
Collapse
Affiliation(s)
- Shunhua Han
- Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA
| | - Guilherme B Dias
- Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA
- Department of Genetics, University of Georgia, Athens, GA 30602, USA
| | - Preston J Basting
- Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA
| | - Michael G Nelson
- Faculty of Life Sciences, University of Manchester, Manchester M13 9PT, UK
| | - Sanjai Patel
- Faculty of Life Sciences, University of Manchester, Manchester M13 9PT, UK
| | - Mar Marzo
- Faculty of Life Sciences, University of Manchester, Manchester M13 9PT, UK
| | - Casey M Bergman
- Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA
- Department of Genetics, University of Georgia, Athens, GA 30602, USA
| |
Collapse
|
8
|
SINEs as Credible Signs to Prove Common Ancestry in the Tree of Life: A Brief Review of Pioneering Case Studies in Retroposon Systematics. Genes (Basel) 2022; 13:genes13060989. [PMID: 35741751 PMCID: PMC9223172 DOI: 10.3390/genes13060989] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2022] [Revised: 05/28/2022] [Accepted: 05/28/2022] [Indexed: 12/31/2022] Open
Abstract
Currently, the insertions of SINEs (and other retrotransposed elements) are regarded as one of the most reliable synapomorphies in molecular systematics. The methodological mainstream of molecular systematics is the calculation of nucleotide (or amino acid) sequence divergences under a suitable substitution model. In contrast, SINE insertion analysis does not require any complex model because SINE insertions are unidirectional and irreversible. This straightforward methodology was named the “SINE method,” which resolved various taxonomic issues that could not be settled by sequence comparison alone. The SINE method has challenged several traditional hypotheses proposed based on the fossil record and anatomy, prompting constructive discussions in the Evo/Devo era. Here, we review our pioneering SINE studies on salmon, cichlids, cetaceans, Afrotherian mammals, and birds. We emphasize the power of the SINE method in detecting incomplete lineage sorting by tracing the genealogy of specific genomic loci with minimal noise. Finally, in the context of the whole-genome era, we discuss how the SINE method can be applied to further our understanding of the tree of life.
Collapse
|
9
|
A retrotransposon storm marks clinical phenoconversion to late-onset Alzheimer's disease. GeroScience 2022; 44:1525-1550. [PMID: 35585302 PMCID: PMC9213607 DOI: 10.1007/s11357-022-00580-w] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2021] [Accepted: 04/26/2022] [Indexed: 12/03/2022] Open
Abstract
Recent reports have suggested that the reactivation of otherwise transcriptionally silent transposable elements (TEs) might induce brain degeneration, either by dysregulating the expression of genes and pathways implicated in cognitive decline and dementia or through the induction of immune-mediated neuroinflammation resulting in the elimination of neural and glial cells. In the work we present here, we test the hypothesis that differentially expressed TEs in blood could be used as biomarkers of cognitive decline and development of AD. To this aim, we used a sample of aging subjects (age > 70) that developed late-onset Alzheimer’s disease (LOAD) over a relatively short period of time (12–48 months), for which blood was available before and after their phenoconversion, and a group of cognitive stable subjects as controls. We applied our developed and validated customized pipeline that allows the identification, characterization, and quantification of the differentially expressed (DE) TEs before and after the onset of manifest LOAD, through analyses of RNA-Seq data. We compared the level of DE TEs within more than 600,000 TE-mapping RNA transcripts from 25 individuals, whose specimens we obtained before and after their phenotypic conversion (phenoconversion) to LOAD, and discovered that 1790 TE transcripts showed significant expression differences between these two timepoints (logFC ± 1.5, logCMP > 5.3, nominal p value < 0.01). These DE transcripts mapped both over- and under-expressed TE elements. Occurring before the clinical phenoconversion, this TE storm features significant increases in DE transcripts of LINEs, LTRs, and SVAs, while those for SINEs are significantly depleted. These dysregulations end with signs of manifest LOAD. This set of highly DE transcripts generates a TE transcriptional profile that accurately discriminates the before and after phenoconversion states of these subjects. Our findings suggest that a storm of DE TEs occurs before phenoconversion from normal cognition to manifest LOAD in risk individuals compared to controls, and may provide useful blood-based biomarkers for heralding such a clinical transition, also suggesting that TEs can indeed participate in the complex process of neurodegeneration.
Collapse
|
10
|
Storer JM, Hubley R, Rosen J, Smit AFA. Methodologies for the De novo Discovery of Transposable Element Families. Genes (Basel) 2022; 13:709. [PMID: 35456515 PMCID: PMC9025800 DOI: 10.3390/genes13040709] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2022] [Revised: 04/14/2022] [Accepted: 04/15/2022] [Indexed: 02/07/2023] Open
Abstract
The discovery and characterization of transposable element (TE) families are crucial tasks in the process of genome annotation. Careful curation of TE libraries for each organism is necessary as each has been exposed to a unique and often complex set of TE families. De novo methods have been developed; however, a fully automated and accurate approach to the development of complete libraries remains elusive. In this review, we cover established methods and recent developments in de novo TE analysis. We also present various methodologies used to assess these tools and discuss opportunities for further advancement of the field.
Collapse
Affiliation(s)
| | | | | | - Arian F. A. Smit
- Institute for Systems Biology, Seattle, WA 98109, USA; (J.M.S.); (R.H.); (J.R.)
| |
Collapse
|
11
|
Recently Integrated Alu Elements in Capuchin Monkeys: A Resource for Cebus/ Sapajus Genomics. Genes (Basel) 2022; 13:genes13040572. [PMID: 35456378 PMCID: PMC9030454 DOI: 10.3390/genes13040572] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2022] [Revised: 03/21/2022] [Accepted: 03/22/2022] [Indexed: 11/16/2022] Open
Abstract
Capuchins are platyrrhines (monkeys found in the Americas) within the Cebidae family. For most of their taxonomic history, the two main morphological types of capuchins, gracile (untufted) and robust (tufted), were assigned to a single genus, Cebus. Further, all tufted capuchins were assigned to a single species, Cebus apella, despite broad geographic ranges spanning Central and northern South America. In 2012, tufted capuchins were assigned to their genus, Sapajus, with eight currently recognized species and five Cebus species, although these numbers are still under debate. Alu retrotransposons are a class of mobile element insertion (MEI) widely used to study primate phylogenetics. However, Alu elements have rarely been used to study capuchins. Recent genome-level assemblies for capuchins (Cebus imitator; [Cebus_imitator_1.0] and Sapajus apella [GSC_monkey_1.0]) facilitated large scale ascertainment of young lineage-specific Alu insertions. Reported here are 1607 capuchin specific and 678 Sapajus specific Alu insertions along with candidate oligonucleotides for locus-specific PCR assays for many elements. PCR analyses identified 104 genus level and 51 species level Alu insertion polymorphisms. The Alu datasets reported in this study provide a valuable resource that will assist in the classification of archival samples lacking phenotypic data and for the study of capuchin phylogenetic relationships.
Collapse
|
12
|
Li M, Larsen PA. Primate-specific retrotransposons and the evolution of circadian networks in the human brain. Neurosci Biobehav Rev 2021; 131:988-1004. [PMID: 34592258 DOI: 10.1016/j.neubiorev.2021.09.049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2021] [Revised: 08/03/2021] [Accepted: 09/26/2021] [Indexed: 11/26/2022]
Abstract
The circadian rhythm of the human brain is attuned to sleep-wake cycles that entail global alterations in neuronal excitability. This periodicity involves a highly coordinated regulation of gene expression. A growing number of studies are documenting a fascinating connection between primate-specific retrotransposons (Alu elements) and key epigenetic regulatory processes in the primate brain. Collectively, these studies indicate that Alu elements embedded in the human neuronal genome mediate post-transcriptional processes that unite human-specific neuroepigenetic landscapes and circadian rhythm. Here, we review evidence linking Alu retrotransposon-mediated posttranscriptional pathways to circadian gene expression. We hypothesize that Alu retrotransposons participate in the organization of circadian brain function through multidimensional neuroepigenetic pathways. We anticipate that these pathways are closely tied to the evolution of human cognition and their perturbation contributes to the manifestation of human-specific neurological diseases. Finally, we address current challenges and accompanying opportunities in studying primate- and human-specific transposable elements.
Collapse
Affiliation(s)
- Manci Li
- University of Minnesota, St. Paul, MN, 55108, United States
| | - Peter A Larsen
- University of Minnesota, St. Paul, MN, 55108, United States.
| |
Collapse
|
13
|
Hausmann F, Kurtz S. DeepGRP: engineering a software tool for predicting genomic repetitive elements using Recurrent Neural Networks with attention. Algorithms Mol Biol 2021; 16:20. [PMID: 34425870 PMCID: PMC8381506 DOI: 10.1186/s13015-021-00199-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2021] [Accepted: 08/03/2021] [Indexed: 12/30/2022] Open
Abstract
BACKGROUND Repetitive elements contribute a large part of eukaryotic genomes. For example, about 40 to 50% of human, mouse and rat genomes are repetitive. So identifying and classifying repeats is an important step in genome annotation. This annotation step is traditionally performed using alignment based methods, either in a de novo approach or by aligning the genome sequence to a species specific set of repetitive sequences. Recently, Li (Bioinformatics 35:4408-4410, 2019) developed a novel software tool dna-brnn to annotate repetitive sequences using a recurrent neural network trained on sample annotations of repetitive elements. RESULTS We have developed the methods of dna-brnn further and engineered a new software tool DeepGRP. This combines the basic concepts of Li (Bioinformatics 35:4408-4410, 2019) with current techniques developed for neural machine translation, the attention mechanism, for the task of nucleotide-level annotation of repetitive elements. An evaluation on the human genome shows a 20% improvement of the Matthews correlation coefficient for the predictions delivered by DeepGRP, when compared to dna-brnn. DeepGRP predicts two additional classes of repeats (compared to dna-brnn) and is able to transfer repeat annotations, using RepeatMasker-based training data to a different species (mouse). Additionally, we could show that DeepGRP predicts repeats annotated in the Dfam database, but not annotated by RepeatMasker. DeepGRP is highly scalable due to its implementation in the TensorFlow framework. For example, the GPU-accelerated version of DeepGRP is approx. 1.8 times faster than dna-brnn, approx. 8.6 times faster than RepeatMasker and over 100 times faster than HMMER searching for models of the Dfam database. CONCLUSIONS By incorporating methods from neural machine translation, DeepGRP achieves a consistent improvement of the quality of the predictions compared to dna-brnn. Improved running times are obtained by employing TensorFlow as implementation framework and the use of GPUs. By incorporating two additional classes of repeats, DeepGRP provides more complete annotations, which were evaluated against three state-of-the-art tools for repeat annotation.
Collapse
Affiliation(s)
- Fabian Hausmann
- Institute of Medical Systems Biology, University Medical Center Hamburg-Eppendorf, Falkenried 94, 20251 Hamburg, Germany
| | - Stefan Kurtz
- ZBH - Center for Bioinformatics, MIN-Fakultät, Universität Hamburg, Bundesstrasse 43, 20146 Hamburg, Germany
| |
Collapse
|
14
|
Santagostino M, Piras FM, Cappelletti E, Del Giudice S, Semino O, Nergadze SG, Giulotto E. Insertion of Telomeric Repeats in the Human and Horse Genomes: An Evolutionary Perspective. Int J Mol Sci 2020; 21:E2838. [PMID: 32325780 PMCID: PMC7215372 DOI: 10.3390/ijms21082838] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2020] [Revised: 04/15/2020] [Accepted: 04/16/2020] [Indexed: 01/06/2023] Open
Abstract
Interstitial telomeric sequences (ITSs) are short stretches of telomeric-like repeats (TTAGGG)n at nonterminal chromosomal sites. We previously demonstrated that, in the genomes of primates and rodents, ITSs were inserted during the repair of DNA double-strand breaks. These conclusions were derived from sequence comparisons of ITS-containing loci and ITS-less orthologous loci in different species. To our knowledge, insertion polymorphism of ITSs, i.e., the presence of an ITS-containing allele and an ITS-less allele in the same species, has not been described. In this work, we carried out a genome-wide analysis of 2504 human genomic sequences retrieved from the 1000 Genomes Project and a PCR-based analysis of 209 human DNA samples. In spite of the large number of individual genomes analyzed we did not find any evidence of insertion polymorphism in the human population. On the contrary, the analysis of ITS loci in the genome of a single horse individual, the reference genome, allowed us to identify five heterozygous ITS loci, suggesting that insertion polymorphism of ITSs is an important source of genetic variability in this species. Finally, following a comparative sequence analysis of horse ITSs and of their orthologous empty loci in other Perissodactyla, we propose models for the mechanism of ITS insertion during the evolution of this order.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Elena Giulotto
- Department of Biology and Biotechnology, University of Pavia, 27100 Pavia, Italy; (M.S.); (F.M.P.); (E.C.); (S.D.G.); (O.S.); (S.G.N.)
| |
Collapse
|
15
|
Walker JA, Jordan VE, Storer JM, Steely CJ, Gonzalez-Quiroga P, Beckstrom TO, Rewerts LC, St Romain CP, Rockwell CE, Rogers J, Jolly CJ, Konkel MK, Batzer MA. Alu insertion polymorphisms shared by Papio baboons and Theropithecus gelada reveal an intertwined common ancestry. Mob DNA 2019; 10:46. [PMID: 31788036 PMCID: PMC6880559 DOI: 10.1186/s13100-019-0187-y] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2019] [Accepted: 11/01/2019] [Indexed: 12/16/2022] Open
Abstract
Background Baboons (genus Papio) and geladas (Theropithecus gelada) are now generally recognized as close phylogenetic relatives, though morphologically quite distinct and generally classified in separate genera. Primate specific Alu retrotransposons are well-established genomic markers for the study of phylogenetic and population genetic relationships. We previously reported a computational reconstruction of Papio phylogeny using large-scale whole genome sequence (WGS) analysis of Alu insertion polymorphisms. Recently, high coverage WGS was generated for Theropithecus gelada. The objective of this study was to apply the high-throughput "poly-Detect" method to computationally determine the number of Alu insertion polymorphisms shared by T. gelada and Papio, and vice versa, by each individual Papio species and T. gelada. Secondly, we performed locus-specific polymerase chain reaction (PCR) assays on a diverse DNA panel to complement the computational data. Results We identified 27,700 Alu insertions from T. gelada WGS that were also present among six Papio species, with nearly half (12,956) remaining unfixed among 12 Papio individuals. Similarly, each of the six Papio species had species-indicative Alu insertions that were also present in T. gelada. In general, P. kindae shared more insertion polymorphisms with T. gelada than did any of the other five Papio species. PCR-based genotype data provided additional support for the computational findings. Conclusions Our discovery that several thousand Alu insertion polymorphisms are shared by T. gelada and Papio baboons suggests a much more permeable reproductive barrier between the two genera then previously suspected. Their intertwined evolution likely involves a long history of admixture, gene flow and incomplete lineage sorting.
Collapse
Affiliation(s)
- Jerilyn A Walker
- 1Department of Biological Sciences, Louisiana State University, 202 Life Sciences Building, Baton Rouge, Louisiana, 70803 USA
| | - Vallmer E Jordan
- 1Department of Biological Sciences, Louisiana State University, 202 Life Sciences Building, Baton Rouge, Louisiana, 70803 USA
| | - Jessica M Storer
- 1Department of Biological Sciences, Louisiana State University, 202 Life Sciences Building, Baton Rouge, Louisiana, 70803 USA
| | - Cody J Steely
- 1Department of Biological Sciences, Louisiana State University, 202 Life Sciences Building, Baton Rouge, Louisiana, 70803 USA
| | - Paulina Gonzalez-Quiroga
- 1Department of Biological Sciences, Louisiana State University, 202 Life Sciences Building, Baton Rouge, Louisiana, 70803 USA
| | - Thomas O Beckstrom
- 1Department of Biological Sciences, Louisiana State University, 202 Life Sciences Building, Baton Rouge, Louisiana, 70803 USA
| | - Lydia C Rewerts
- 1Department of Biological Sciences, Louisiana State University, 202 Life Sciences Building, Baton Rouge, Louisiana, 70803 USA
| | - Corey P St Romain
- 1Department of Biological Sciences, Louisiana State University, 202 Life Sciences Building, Baton Rouge, Louisiana, 70803 USA
| | - Catherine E Rockwell
- 1Department of Biological Sciences, Louisiana State University, 202 Life Sciences Building, Baton Rouge, Louisiana, 70803 USA
| | - Jeffrey Rogers
- 2Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030 USA.,3Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030 USA
| | - Clifford J Jolly
- 4Department of Anthropology, New York University, New York, NY 10003 USA
| | - Miriam K Konkel
- 1Department of Biological Sciences, Louisiana State University, 202 Life Sciences Building, Baton Rouge, Louisiana, 70803 USA.,Department of Genetics & Biochemistry, Clemson Center for Human Genetics, Clemson, SC 29634 USA
| | | | - Mark A Batzer
- 1Department of Biological Sciences, Louisiana State University, 202 Life Sciences Building, Baton Rouge, Louisiana, 70803 USA
| |
Collapse
|
16
|
Yaxley KJ, Foley RA. Reconstructing the ancestral phenotypes of great apes and humans (Homininae) using subspecies-level phylogenies. Biol J Linn Soc Lond 2019. [DOI: 10.1093/biolinnean/blz140] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
Abstract
Owing to their close affinity, the African great apes are of interest in the study of human evolution. Although numerous researchers have described the ancestors we share with these species with reference to extant great apes, few have done so with phylogenetic comparative methods. One obstacle to the application of these techniques is the within-species phenotypic variation found in this group. Here, we leverage this variation, modelling common ancestors using ancestral state reconstructions (ASRs) with reference to subspecies-level trait data. A subspecies-level phylogeny of the African great apes and humans was estimated from full-genome mitochondrial DNA sequences and used to implement ASRs for 14 continuous traits known to vary between great ape subspecies. Although the inclusion of within-species phenotypic variation increased the phylogenetic signal for our traits and improved the performance of our ASRs, whether this was done through the inclusion of subspecies phylogeny or through the use of existing methods made little difference. Our ASRs corroborate previous findings that the last common ancestor of humans, chimpanzees and bonobos was a chimp-like animal, but also suggest that the last common ancestor of humans, chimpanzees, bonobos and gorillas was an animal unlike any extant African great ape.
Collapse
Affiliation(s)
| | - Robert A Foley
- Leverhulme Centre for Human Evolutionary Studies, University of Cambridge, Cambridge, UK
| |
Collapse
|
17
|
Doronina L, Reising O, Clawson H, Ray DA, Schmitz J. True Homoplasy of Retrotransposon Insertions in Primates. Syst Biol 2019; 68:482-493. [PMID: 30445649 DOI: 10.1093/sysbio/syy076] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2018] [Revised: 11/05/2018] [Accepted: 11/13/2018] [Indexed: 01/24/2023] Open
Abstract
How reliable are the presence/absence insertion patterns of the supposedly homoplasy-free retrotransposons, which were randomly inserted in the quasi infinite genomic space? To systematically examine this question in an up-to-date, multigenome comparison, we screened millions of primate transposed Alu SINE elements for incidences of homoplasious precise insertions and deletions. In genome-wide analyses, we identified and manually verified nine cases of precise parallel Alu insertions of apparently identical elements at orthologous positions in two ape lineages and twelve incidences of precise deletions of previously established SINEs. Correspondingly, eight precise parallel insertions and no exact deletions were detected in a comparison of lemuriform primate and human insertions spanning the range of primate diversity. With an overall frequency of homoplasious Alu insertions of only 0.01% (for human-chimpanzee-rhesus macaque) and 0.02-0.04% (for human-bushbaby-lemurs) and precise Alu deletions of 0.001-0.002% (for human-chimpanzee-rhesus macaque), real homoplasy is not considered to be a quantitatively relevant source of evolutionary noise. Thus, presence/absence patterns of Alu retrotransposons and, presumably, all LINE1-mobilized elements represent indeed the virtually homoplasy-free markers they are considered to be. Therefore, ancestral incomplete lineage sorting and hybridization remain the only serious sources of conflicting presence/absence patterns of retrotransposon insertions, and as such are detectable and quantifiable. [Homoplasy; precise deletions; precise parallel insertions; primates; retrotransposons.].
Collapse
Affiliation(s)
- Liliya Doronina
- Institute of Experimental Pathology (ZMBE), University of Münster, Von-Esmarch-Str. 56, D-48149 Münster, Germany
| | - Olga Reising
- Institute of Experimental Pathology (ZMBE), University of Münster, Von-Esmarch-Str. 56, D-48149 Münster, Germany
| | - Hiram Clawson
- Department of Biomolecular Engineering, University of California, 1156 High Street, Santa Cruz, CA, USA
| | - David A Ray
- Department of Biological Sciences, Texas Tech University, 2901 Main Street, Lubbock, TX, USA
| | - Jürgen Schmitz
- Institute of Experimental Pathology (ZMBE), University of Münster, Von-Esmarch-Str. 56, D-48149 Münster, Germany
| |
Collapse
|
18
|
Jordan VE, Walker JA, Beckstrom TO, Steely CJ, McDaniel CL, St Romain CP, Worley KC, Phillips-Conroy J, Jolly CJ, Rogers J, Konkel MK, Batzer MA. A computational reconstruction of Papio phylogeny using Alu insertion polymorphisms. Mob DNA 2018; 9:13. [PMID: 29632618 PMCID: PMC5885306 DOI: 10.1186/s13100-018-0118-3] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2017] [Accepted: 03/26/2018] [Indexed: 12/17/2022] Open
Abstract
Background Since the completion of the human genome project, the diversity of genome sequencing data produced for non-human primates has increased exponentially. Papio baboons are well-established biological models for studying human biology and evolution. Despite substantial interest in the evolution of Papio, the systematics of these species has been widely debated, and the evolutionary history of Papio diversity is not fully understood. Alu elements are primate-specific transposable elements with a well-documented mutation/insertion mechanism and the capacity for resolving controversial phylogenetic relationships. In this study, we conducted a whole genome analysis of Alu insertion polymorphisms unique to the Papio lineage. To complete these analyses, we created a computational algorithm to identify novel Alu insertions in next-generation sequencing data. Results We identified 187,379 Alu insertions present in the Papio lineage, yet absent from M. mulatta [Mmul8.0.1]. These elements were characterized using genomic data sequenced from a panel of twelve Papio baboons: two from each of the six extant Papio species. These data were used to construct a whole genome Alu-based phylogeny of Papio baboons. The resulting cladogram fully-resolved relationships within Papio. Conclusions These data represent the most comprehensive Alu-based phylogenetic reconstruction reported to date. In addition, this study produces the first fully resolved Alu-based phylogeny of Papio baboons. Electronic supplementary material The online version of this article (10.1186/s13100-018-0118-3) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Vallmer E Jordan
- 1Department of Biological Sciences, Louisiana State University, 202 Life Sciences Building, Baton Rouge, LA 70803 USA
| | - Jerilyn A Walker
- 1Department of Biological Sciences, Louisiana State University, 202 Life Sciences Building, Baton Rouge, LA 70803 USA
| | - Thomas O Beckstrom
- 1Department of Biological Sciences, Louisiana State University, 202 Life Sciences Building, Baton Rouge, LA 70803 USA
| | - Cody J Steely
- 1Department of Biological Sciences, Louisiana State University, 202 Life Sciences Building, Baton Rouge, LA 70803 USA
| | - Cullen L McDaniel
- 1Department of Biological Sciences, Louisiana State University, 202 Life Sciences Building, Baton Rouge, LA 70803 USA
| | - Corey P St Romain
- 1Department of Biological Sciences, Louisiana State University, 202 Life Sciences Building, Baton Rouge, LA 70803 USA
| | | | - Kim C Worley
- 2Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030 USA.,3Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030 USA
| | - Jane Phillips-Conroy
- 4Department of Neuroscience, Washington University School of Medicine, St. Louis, MO 63110 USA
| | - Clifford J Jolly
- 5Department of Anthropology, New York University, New York, NY 10003 USA
| | - Jeffrey Rogers
- 2Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030 USA.,3Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030 USA
| | - Miriam K Konkel
- 1Department of Biological Sciences, Louisiana State University, 202 Life Sciences Building, Baton Rouge, LA 70803 USA.,6Department of Genetics & Biochemistry, Clemson University, Clemson, SC 29634 USA
| | - Mark A Batzer
- 1Department of Biological Sciences, Louisiana State University, 202 Life Sciences Building, Baton Rouge, LA 70803 USA
| |
Collapse
|
19
|
Steely CJ, Baker JN, Walker JA, Loupe CD, Batzer MA. Analysis of lineage-specific Alu subfamilies in the genome of the olive baboon, Papio anubis. Mob DNA 2018; 9:10. [PMID: 29560044 PMCID: PMC5858127 DOI: 10.1186/s13100-018-0115-6] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2017] [Accepted: 03/13/2018] [Indexed: 02/08/2023] Open
Abstract
Background Alu elements are primate-specific retroposons that mobilize using the enzymatic machinery of L1 s. The recently completed baboon genome project found that the mobilization rate of Alu elements is higher than in the genome of any other primate studied thus far. However, the Alu subfamily structure present in and specific to baboons had not been examined yet. Results Here we report 129 Alu subfamilies that are propagating in the genome of the olive baboon, with 127 of these subfamilies being new and specific to the baboon lineage. We analyzed 233 Alu insertions in the genome of the olive baboon using locus specific polymerase chain reaction assays, covering 113 of the 129 subfamilies. The allele frequency data from these insertions show that none of the nine groups of subfamilies are nearing fixation in the lineage. Conclusions Many subfamilies of Alu elements are actively mobilizing throughout the baboon lineage, with most being specific to the baboon lineage.
Collapse
Affiliation(s)
- Cody J Steely
- Department of Biological Sciences, Louisiana State University, 202 Life Sciences Bldg., Baton Rouge, LA 70803 USA
| | - Jasmine N Baker
- Department of Biological Sciences, Louisiana State University, 202 Life Sciences Bldg., Baton Rouge, LA 70803 USA
| | - Jerilyn A Walker
- Department of Biological Sciences, Louisiana State University, 202 Life Sciences Bldg., Baton Rouge, LA 70803 USA
| | - Charles D Loupe
- Department of Biological Sciences, Louisiana State University, 202 Life Sciences Bldg., Baton Rouge, LA 70803 USA
| | | | - Mark A Batzer
- Department of Biological Sciences, Louisiana State University, 202 Life Sciences Bldg., Baton Rouge, LA 70803 USA
| |
Collapse
|
20
|
Baker JN, Walker JA, Denham MW, Loupe CD, Batzer MA. Recently integrated Alu insertions in the squirrel monkey ( Saimiri) lineage and application for population analyses. Mob DNA 2018; 9:9. [PMID: 29449901 PMCID: PMC5808450 DOI: 10.1186/s13100-018-0114-7] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2017] [Accepted: 02/05/2018] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The evolution of Alu elements has been ongoing in primate lineages and Alu insertion polymorphisms are widely used in phylogenetic and population genetics studies. Alu subfamilies in the squirrel monkey (Saimiri), a New World Monkey (NWM), were recently reported. Squirrel monkeys are commonly used in biomedical research and often require species identification. The purpose of this study was two-fold: 1) Perform locus-specific PCR analyses on recently integrated Alu insertions in Saimiri to determine their amplification dynamics, and 2) Identify a subset of Alu insertion polymorphisms with species informative allele frequency distributions between the Saimiri sciureus and Saimiri boliviensis groups. RESULTS PCR analyses were performed on a DNA panel of 32 squirrel monkey individuals for 382 Alu insertion events ≤2% diverged from 46 different Alu subfamily consensus sequences, 25 Saimiri specific and 21 NWM specific Alu subfamilies. Of the 382 loci, 110 were polymorphic for presence / absence among squirrel monkey individuals, 35 elements from 14 different Saimiri specific Alu subfamilies and 75 elements from 19 different NWM specific Alu subfamilies (13 of 46 subfamilies analyzed did not contain polymorphic insertions). Of the 110 Alu insertion polymorphisms, 51 had species informative allele frequency distributions between Saimiri sciureus and Saimiri boliviensis groups. CONCLUSIONS This study confirms the evolution of Alu subfamilies in Saimiri and provides evidence for an ongoing and prolific expansion of these elements in Saimiri with many active subfamilies concurrently propagating. The subset of polymorphic Alu insertions with species informative allele frequency distribution between Saimiri sciureus and Saimiri boliviensis will be instructive for specimen identification and conservation biology.
Collapse
Affiliation(s)
- Jasmine N. Baker
- Department of Biological Sciences, Louisiana State University, 202 Life Sciences Bldg., Baton Rouge, LA 70803 USA
| | - Jerilyn A. Walker
- Department of Biological Sciences, Louisiana State University, 202 Life Sciences Bldg., Baton Rouge, LA 70803 USA
| | - Michael W. Denham
- Department of Biological Sciences, Louisiana State University, 202 Life Sciences Bldg., Baton Rouge, LA 70803 USA
| | - Charles D. Loupe
- Department of Biological Sciences, Louisiana State University, 202 Life Sciences Bldg., Baton Rouge, LA 70803 USA
| | - Mark A. Batzer
- Department of Biological Sciences, Louisiana State University, 202 Life Sciences Bldg., Baton Rouge, LA 70803 USA
| |
Collapse
|
21
|
Orr CM. Kinematics of the anthropoid os centrale and the functional consequences of scaphoid-centrale fusion in African apes and hominins. J Hum Evol 2017; 114:102-117. [PMID: 29447753 DOI: 10.1016/j.jhevol.2017.10.002] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2017] [Revised: 09/21/2017] [Accepted: 10/05/2017] [Indexed: 02/07/2023]
Abstract
In most primates, the os centrale is interposed between the scaphoid, trapezoid, trapezium, and head of the capitate, thus constituting a component of the wrist's midcarpal complex. Scaphoid-centrale fusion is among the clearest morphological synapomorphies of African apes and hominins. Although it might facilitate knuckle-walking by increasing the rigidity and stability of the radial side of the wrist, the exact functional significance of scaphoid-centrale fusion is unclear. If fusion acts to produce a more rigid radial wrist that stabilizes the hand and limits shearing stresses, then in taxa with a free centrale, it should anchor ligaments that check extension and radial deviation, but exhibit motion independent of the scaphoid. Moreover, because the centrale sits between the scaphoid and capitate (a major stabilizing articulation), scaphoid-centrale mobility should correlate with scaphocapitate mobility in extension and radial deviation. To test these hypotheses, the centrale's ligamentous binding was investigated via dissection in Pongo and Papio, and the kinematics of the centrale were quantified in a cadaveric sample of anthropoids (Pongo sp., Ateles geoffroyi, Colobus guereza, Macaca mulatta, and Papio anubis) using a computed-tomography-based method to track wrist-bone motion. Results indicate that the centrale rotates freely relative to the scaphoid in all taxa. However, centrale mobility is only correlated with scaphocapitate mobility during extension in Pongo-possibly due to differences in overall wrist configuration between apes and monkeys. If an extant ape-like wrist characterized early ancestors of African apes and hominins, then scaphoid-centrale fusion would have increased midcarpal rigidity in extension relative to the primitive condition. Although biomechanically consistent with a knuckle-walking hominin ancestor, this assumes that the trait evolved specifically for that biological role, which must be squared with contradictory interpretations of extant and fossil hominoid morphology. Regardless of its original adaptive significance, scaphoid-centrale fusion likely presented a constraint on early hominin midcarpal mobility.
Collapse
Affiliation(s)
- Caley M Orr
- Department of Cell and Developmental Biology, University of Colorado School of Medicine, Aurora, CO, USA; Department of Anthropology, University of Colorado Denver, Denver, CO, USA.
| |
Collapse
|
22
|
Garbino GST, Martins-Junior AMG. Phenotypic evolution in marmoset and tamarin monkeys (Cebidae, Callitrichinae) and a revised genus-level classification. Mol Phylogenet Evol 2017; 118:156-171. [PMID: 28989098 DOI: 10.1016/j.ympev.2017.10.002] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2017] [Revised: 08/15/2017] [Accepted: 10/03/2017] [Indexed: 11/16/2022]
Abstract
Marmosets and tamarins (Cebidae, Callitrichinae) constitute the most species-rich subfamily of New World monkeys and one of the most diverse phenotypically. Despite the profusion of molecular phylogenies of the group, the evolution of phenotypic characters under the rapidly-emerging consensual phylogeny of the subfamily has been little studied, resulting in taxonomic proposals that have limited support from other datasets. We examined the evolution of 18 phenotypic traits (5 continuous and 13 discrete), including pelage, skull, dentition, postcrania, life-history and vocalization variables in a robust molecular phylogeny of marmoset and tamarin monkeys, quantifying their phylogenetic signal and correlations among some of the traits. At the family level, our resulting topology supports owl monkeys (Aotinae) as sister group of Callitrichinae. The topology of the callitrichine tree was congruent with previous studies except for the position of the midas group of Saguinus tamarins, which placement as sister of the bicolor group did not receive significant statistical support in both Maximum Parsimony and Bayesian Inference analyses. Our results showed that the highest value of phylogenetic signal among continuous traits was displayed by the long call character and the lowest was exhibited in the home range, intermediate values were found in characters related to osteology and skull size. Among discrete traits, pelage and osteology had similar phylogenetic signal. Based on genetic, osteological, pelage and vocalization data, we present an updated genus-level taxonomy of Callitrichinae, which recognizes six genera in the subfamily: Callimico, Callithrix, Cebuella, Mico, Leontopithecus and Saguinus. To reflect their phenotypic distinctiveness and to avoid the use of the informal "species group", we subdivided Saguinus in the subgenera Leontocebus, Saguinus and Tamarinus (revalidated here).
Collapse
Affiliation(s)
- Guilherme S T Garbino
- PPG-Zoologia, Departamento de Zoologia, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil.
| | - Antonio M G Martins-Junior
- Laboratório de Genética e Evolução, Instituto Federal do Pará, Campus de Tucuruí, Brazil; Centro de Genômica e Biologia de Sistemas, Universidade Federal do Pará, Belém, Brazil
| |
Collapse
|
23
|
Steely CJ, Walker JA, Jordan VE, Beckstrom TO, McDaniel CL, St. Romain CP, Bennett EC, Robichaux A, Clement BN, Raveendran M, Worley KC, Phillips-Conroy J, Jolly CJ, Rogers J, Konkel MK, Batzer MA. Alu Insertion Polymorphisms as Evidence for Population Structure in Baboons. Genome Biol Evol 2017; 9:2418-2427. [PMID: 28957465 PMCID: PMC5622324 DOI: 10.1093/gbe/evx184] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/07/2017] [Indexed: 12/25/2022] Open
Abstract
Male dispersal from the natal group at or near maturity is a feature of most baboon (Papio) species. It potentially has profound effects upon population structure and evolutionary processes, but dispersal, especially for unusually long distances, is not readily documented by direct field observation. In this pilot study, we investigate the possibility of retrieving baboon population structure in yellow (Papio cynocephalus) and kinda (Papio kindae) baboons from the distribution of variation in a genome-wide set of 494 Alu insertion polymorphisms, made available via the recently completed Baboon Genome Analysis Consortium. Alu insertion variation in a mixed population derived from yellow and olive (Papio anubis) baboons identified each individual's proportion of heritage from either parental species. In an unmixed yellow baboon population, our analysis showed greater similarity between neighboring than between more distantly situated groups, suggesting structuring of the population by male dispersal distance. Finally (and very provisionally), an unexpectedly sharp difference in Alu insertion frequencies between members of neighboring social groups of kinda baboons suggests that intergroup migration may be more rare than predicted in this little known species.
Collapse
Affiliation(s)
- Cody J. Steely
- Department of Biological Sciences, Louisiana State University
| | | | | | | | | | | | | | - Arianna Robichaux
- Department of Biological Sciences, Louisiana State University
- Department of Biological and Physical Sciences, Northwestern State University of Louisiana
| | - Brooke N. Clement
- Department of Biological Sciences, Louisiana State University
- School of Veterinary Medicine, Louisiana State University
| | | | | | - Kim C. Worley
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas
| | | | | | - Jeff Rogers
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas
| | | | - Mark A. Batzer
- Department of Biological Sciences, Louisiana State University
| |
Collapse
|
24
|
Carneiro J, Rodrigues-Filho LFDS, Schneider H, Sampaio I. Molecular data highlight hybridization in squirrel monkeys (Saimiri, Cebidae). Genet Mol Biol 2016; 39:539-546. [PMID: 27801483 PMCID: PMC5127161 DOI: 10.1590/1678-4685-gmb-2016-0091] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2016] [Accepted: 08/11/2016] [Indexed: 11/30/2022] Open
Abstract
Hybridization has been reported increasingly frequently in recent years, fueling the debate on its role in the evolutionary history of species. Some studies have shown that hybridization is very common in captive New World primates, and hybrid offspring have phenotypes and physiological responses distinct from those of the "pure" parents, due to gene introgression. Here we used the TA15 Alu insertion to investigate hybridization in the genus Saimiri. Our results indicate the hybridization of Saimiri boliviensis peruviensis with S. sciureus macrodon, and S. b. boliviensis with S. ustus. Unexpectedly, some hybrids of both S. boliviensis peruviensis and S. b. boliviensis were homozygous for the absence of the insertion, which indicates that the hybrids were fertile.
Collapse
Affiliation(s)
- Jeferson Carneiro
- Universidade Federal do Pará, Campus Universitário de Bragança, PA,
Brazil
| | | | - Horacio Schneider
- Universidade Federal do Pará, Campus Universitário de Bragança, PA,
Brazil
| | - Iracilda Sampaio
- Universidade Federal do Pará, Campus Universitário de Bragança, PA,
Brazil
| |
Collapse
|
25
|
The phylogenetic system of primates—character evolution in the light of a consolidated tree. ORG DIVERS EVOL 2016. [DOI: 10.1007/s13127-016-0279-1] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
26
|
Slater GJ, Cui P, Forasiepi AM, Lenz D, Tsangaras K, Voirin B, de Moraes-Barros N, MacPhee RDE, Greenwood AD. Evolutionary Relationships among Extinct and Extant Sloths: The Evidence of Mitogenomes and Retroviruses. Genome Biol Evol 2016; 8:607-21. [PMID: 26878870 PMCID: PMC4824031 DOI: 10.1093/gbe/evw023] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Macroevolutionary trends exhibited by retroviruses are complex and not entirely understood. The sloth endogenized foamy-like retrovirus (SloEFV), which demonstrates incongruence in virus–host evolution among extant sloths (Order Folivora), has not been investigated heretofore in any extinct sloth lineages and its premodern history within folivorans is therefore unknown. Determining retroviral coevolutionary trends requires a robust phylogeny of the viral host, but the highly reduced modern sloth fauna (6 species in 2 genera) does not adequately represent what was once a highly diversified clade (∼100 genera) of placental mammals. At present, the amount of molecular data available for extinct sloth taxa is limited, and analytical results based on these data tend to conflict with phylogenetic inferences made on the basis of morphological studies. To augment the molecular data set, we applied hybridization capture and next-generation Illumina sequencing to two extinct and three extant sloth species to retrieve full mitochondrial genomes (mitogenomes) from the hosts and the polymerase gene of SloEFV. The results produced a fully resolved and well-supported phylogeny that supports dividing crown families into two major clades: 1) The three-toed sloth, Bradypus, and Nothrotheriidae and 2) Megalonychidae, including the two-toed sloth, Choloepus, and Mylodontidae. Our calibrated time tree indicates that the Miocene epoch (23.5 Ma), particularly its earlier part, was an important interval for folivoran diversification. Both extant and extinct sloths demonstrate multiple complex invasions of SloEFV into the ancestral sloth germline followed by subsequent introgressions across different sloth lineages. Thus, sloth mitogenome and SloEFV evolution occurred separately and in parallel among sloths.
Collapse
Affiliation(s)
- Graham J Slater
- Department of Paleobiology & Division of Mammals, National Museum of Natural History, Smithsonian Institution, Washington, DC Department of the Geophysical Sciences, University of Chicago
| | - Pin Cui
- Leibniz Institute for Zoo and Wildlife Research, Berlin, Germany
| | | | - Dorina Lenz
- Leibniz Institute for Zoo and Wildlife Research, Berlin, Germany
| | | | - Bryson Voirin
- Max Planck Institute for Ornithology, Seewiesen, Germany
| | - Nadia de Moraes-Barros
- Cibio/Inbio - Centro De Investigação Em Biodiversidade E Recursos Genéticos, Universidade Do Porto, Vairão, Portugal
| | - Ross D E MacPhee
- Department of Mammalogy and Division of Vertebrate Zoology, American Museum of Natural History, New York, NY
| | - Alex D Greenwood
- Leibniz Institute for Zoo and Wildlife Research, Berlin, Germany Department of Veterinary Medicine, Freie Universität Berlin, Berlin, Germany
| |
Collapse
|
27
|
Tocheri MW, Dommain R, McFarlin SC, Burnett SE, Troy Case D, Orr CM, Roach NT, Villmoare B, Eriksen AB, Kalthoff DC, Senck S, Assefa Z, Groves CP, Jungers WL. The evolutionary origin and population history of the grauer gorilla. AMERICAN JOURNAL OF PHYSICAL ANTHROPOLOGY 2016; 159:S4-S18. [DOI: 10.1002/ajpa.22900] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/22/2015] [Revised: 11/10/2015] [Accepted: 11/10/2015] [Indexed: 01/12/2023]
Affiliation(s)
- Matthew W. Tocheri
- Department of AnthropologyLakehead UniversityThunder Bay OntarioP7B 5E1 Canada
- Human Origins Program, National Museum of Natural History, Smithsonian InstitutionWashington, DC20013 USA
| | - René Dommain
- Human Origins Program, National Museum of Natural History, Smithsonian InstitutionWashington, DC20013 USA
| | - Shannon C. McFarlin
- Department of Anthropology and Center for the Advanced Study of Hominid PaleobiologyThe George Washington UniversityWashington, DC20052 USA
- Division of Mammals, National Museum of Natural HistorySmithsonian InstitutionWashington, DC20013 USA
| | - Scott E. Burnett
- Department of AnthropologyEckerd CollegeSt Petersburg FL33711 USA
| | - D. Troy Case
- Department of Sociology and AnthropologyNorth Carolina State UniversityRaleigh NC27695 USA
| | - Caley M. Orr
- Department of Cell and Developmental BiologyUniversity of Colorado School of MedicineAurora CO80045 USA
| | - Neil T. Roach
- Department of Human Evolutionary BiologyHarvard UniversityCambridge, MA02138
- Division of AnthropologyAmerican Museum of Natural HistoryNew York, NY10024 USA
| | - Brian Villmoare
- Department of AnthropologyUniversity of Nevada Las VegasLas Vegas NV89154 USA
- Department of AnthropologyUniversity College LondonLondonWC1H 0BW UK
| | - Amandine B. Eriksen
- Department of AnthropologyThe State University of New YorkBuffalo NY14260 USA
| | | | - Sascha Senck
- Fakultät für Technik und Umweltwissenschaften, University of Applied Sciences Upper AustriaWels4600 Austria
| | - Zelalem Assefa
- Human Origins Program, National Museum of Natural History, Smithsonian InstitutionWashington, DC20013 USA
| | - Colin P. Groves
- School of Archaeology and AnthropologyAustralian National UniversityCanberraACT 0200 Australia
| | - William L. Jungers
- Department of Anatomical SciencesStony Brook University Medical CenterStony Brook NY11794 USA
- Association VahatraBP3972 Madagascar
| |
Collapse
|
28
|
Roos C. Phylogeny and Classification of Gibbons (Hylobatidae). DEVELOPMENTS IN PRIMATOLOGY: PROGRESS AND PROSPECTS 2016. [DOI: 10.1007/978-1-4939-5614-2_7] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
|
29
|
Hénaff E, Zapata L, Casacuberta JM, Ossowski S. Jitterbug: somatic and germline transposon insertion detection at single-nucleotide resolution. BMC Genomics 2015; 16:768. [PMID: 26459856 PMCID: PMC4603299 DOI: 10.1186/s12864-015-1975-5] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2015] [Accepted: 10/02/2015] [Indexed: 11/20/2022] Open
Abstract
Background Transposable elements are major players in genome evolution. Transposon insertion polymorphisms can translate into phenotypic differences in plants and animals and are linked to different diseases including human cancer, making their characterization highly relevant to the study of genome evolution and genetic diseases. Results Here we present Jitterbug, a novel tool that identifies transposable element insertion sites at single-nucleotide resolution based on the pairedend mapping and clipped-read signatures produced by NGS alignments. Jitterbug can be easily integrated into existing NGS analysis pipelines, using the standard BAM format produced by frequently applied alignment tools (e.g. bwa, bowtie2), with no need to realign reads to a set of consensus transposon sequences. Jitterbug is highly sensitive and able to recall transposon insertions with a very high specificity, as demonstrated by benchmarks in the human and Arabidopsis genomes, and validation using long PacBio reads. In addition, Jitterbug estimates the zygosity of transposon insertions with high accuracy and can also identify somatic insertions. Conclusions We demonstrate that Jitterbug can identify mosaic somatic transposon movement using sequenced tumor-normal sample pairs and allows for estimating the cancer cell fraction of clones containing a somatic TE insertion. We suggest that the independent methods we use to evaluate performance are a step towards creating a gold standard dataset for benchmarking structural variant prediction tools. Electronic supplementary material The online version of this article (doi:10.1186/s12864-015-1975-5) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Elizabeth Hénaff
- Genomic and Epigenomic Variation in Disease Group, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, 08003, Barcelona, Spain. .,Center for Research in Agricultural Genomics, CRAG (CSIC-IRTA-UAB-UB), Barcelona, Spain. .,current address: Weill Cornell Medical College, Institute for Computational Biomedicine, 1305 York Avenue, New York, NY, 10021, USA.
| | - Luís Zapata
- Genomic and Epigenomic Variation in Disease Group, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, 08003, Barcelona, Spain. .,Universitat Pompeu Fabra (UPF), Barcelona, Spain.
| | - Josep M Casacuberta
- Center for Research in Agricultural Genomics, CRAG (CSIC-IRTA-UAB-UB), Barcelona, Spain.
| | - Stephan Ossowski
- Genomic and Epigenomic Variation in Disease Group, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, 08003, Barcelona, Spain. .,Universitat Pompeu Fabra (UPF), Barcelona, Spain.
| |
Collapse
|
30
|
Qian Y, Kehr B, Halldórsson BV. PopAlu: population-scale detection of Alu polymorphisms. PeerJ 2015; 3:e1269. [PMID: 26417547 PMCID: PMC4582951 DOI: 10.7717/peerj.1269] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2015] [Accepted: 09/04/2015] [Indexed: 11/20/2022] Open
Abstract
Alu elements are sequences of approximately 300 basepairs that together comprise more than 10% of the human genome. Due to their recent origin in primate evolution some Alu elements are polymorphic in humans, present in some individuals while absent in others. We present PopAlu, a tool to detect polymorphic Alu elements on a population scale from paired-end sequencing data. PopAlu uses read pair distance and orientation as well as split reads to identify the location and precise breakpoints of polymorphic Alus. Genotype calling enables us to differentiate between homozygous and heterozygous carriers, making the output of PopAlu suitable for use in downstream analyses such as genome-wide association studies (GWAS). We show on a simulated dataset that PopAlu calls Alu elements inserted and deleted with respect to a reference genome with high accuracy and high precision. Our analysis of real data of a human trio from the 1000 Genomes Project confirms that PopAlu is able to produce highly accurate genotype calls. To our knowledge, PopAlu is the first tool that identifies polymorphic Alu elements from multiple individuals simultaneously, pinpoints the precise breakpoints and calls genotypes with high accuracy.
Collapse
Affiliation(s)
- Yu Qian
- Bioinformatics Research Center, Aarhus University , Aarhus , Denmark
| | - Birte Kehr
- deCODE genetics/Amgen , Reykjavík , Iceland
| | - Bjarni V Halldórsson
- deCODE genetics/Amgen , Reykjavík , Iceland ; Institute of Biomedical and Neural Engineering, School of Science and Engineering, Reykjavik University , Reykjavík , Iceland
| |
Collapse
|
31
|
Nergadze SG, Lupotto M, Pellanda P, Santagostino M, Vitelli V, Giulotto E. Mitochondrial DNA insertions in the nuclear horse genome. Anim Genet 2015; 41 Suppl 2:176-85. [PMID: 21070293 DOI: 10.1111/j.1365-2052.2010.02130.x] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]
Abstract
The insertion of mitochondrial DNA in the nuclear genome generates numts, nuclear sequences of mitochondrial origin. In the horse reference genome, we identified 82 numts and showed that the entire horse mitochondrial DNA is represented as numts without gross bias. Numts were inserted in the horse nuclear genome at random sites and were probably generated during the repair of DNA double-strand breaks. We then analysed 12 numt loci in 20 unrelated horses and found that null alleles, lacking the mitochondrial DNA insertion, were present at six of these loci. At some loci, the null allele is prevalent in the sample analysed, suggesting that, in the horse population, the number of numt loci may be higher than 82 present in the reference genome. Contrary to humans, the insertion polymorphism of numts is extremely frequent in the horse population, supporting the hypothesis that the genome of this species is in a rapidly evolving state.
Collapse
Affiliation(s)
- S G Nergadze
- Dipartimento di Genetica e Microbiologia Adriano Buzzati-Traverso, Università di Pavia, Via Ferrata 1, 27100 Pavia, Italy
| | | | | | | | | | | |
Collapse
|
32
|
Kamath PL, Elleder D, Bao L, Cross PC, Powell JH, Poss M. The population history of endogenous retroviruses in mule deer (Odocoileus hemionus). J Hered 2013; 105:173-87. [PMID: 24336966 DOI: 10.1093/jhered/est088] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Mobile elements are powerful agents of genomic evolution and can be exceptionally informative markers for investigating species and population-level evolutionary history. While several studies have utilized retrotransposon-based insertional polymorphisms to resolve phylogenies, few population studies exist outside of humans. Endogenous retroviruses are LTR-retrotransposons derived from retroviruses that have become stably integrated in the host genome during past infections and transmitted vertically to subsequent generations. They offer valuable insight into host-virus co-evolution and a unique perspective on host evolutionary history because they integrate into the genome at a discrete point in time. We examined the evolutionary history of a cervid endogenous gammaretrovirus (CrERVγ) in mule deer (Odocoileus hemionus). We sequenced 14 CrERV proviruses (CrERV-in1 to -in14), and examined the prevalence and distribution of 13 proviruses in 262 deer among 15 populations from Montana, Wyoming, and Utah. CrERV absence in white-tailed deer (O. virginianus), identical 5' and 3' long terminal repeat (LTR) sequences, insertional polymorphism, and CrERV divergence time estimates indicated that most endogenization events occurred within the last 200000 years. Population structure inferred from CrERVs (F ST = 0.008) and microsatellites (θ = 0.01) was low, but significant, with Utah, northwestern Montana, and a Helena herd being particularly differentiated. Clustering analyses indicated regional structuring, and non-contiguous clustering could often be explained by known translocations. Cluster ensemble results indicated spatial localization of viruses, specifically in deer from northeastern and western Montana. This study demonstrates the utility of endogenous retroviruses to elucidate and provide novel insight into both ERV evolutionary history and the history of contemporary host populations.
Collapse
Affiliation(s)
- Pauline L Kamath
- the US Geological Survey, Northern Rocky Mountain Science Center, Bozeman, MT 59715
| | | | | | | | | | | |
Collapse
|
33
|
McLain AT, Carman GW, Fullerton ML, Beckstrom TO, Gensler W, Meyer TJ, Faulk C, Batzer MA. Analysis of western lowland gorilla (Gorilla gorilla gorilla) specific Alu repeats. Mob DNA 2013; 4:26. [PMID: 24262036 PMCID: PMC4177385 DOI: 10.1186/1759-8753-4-26] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2013] [Accepted: 10/23/2013] [Indexed: 02/07/2023] Open
Abstract
Background Research into great ape genomes has revealed widely divergent activity levels over time for Alu elements. However, the diversity of this mobile element family in the genome of the western lowland gorilla has previously been uncharacterized. Alu elements are primate-specific short interspersed elements that have been used as phylogenetic and population genetic markers for more than two decades. Alu elements are present at high copy number in the genomes of all primates surveyed thus far. The AluY subfamily and its derivatives have been recognized as the evolutionarily youngest Alu subfamily in the Old World primate lineage. Results Here we use a combination of computational and wet-bench laboratory methods to assess and catalog AluY subfamily activity level and composition in the western lowland gorilla genome (gorGor3.1). A total of 1,075 independent AluY insertions were identified and computationally divided into 10 subfamilies, with the largest number of gorilla-specific elements assigned to the canonical AluY subfamily. Conclusions The retrotransposition activity level appears to be significantly lower than that seen in the human and chimpanzee lineages, while higher than that seen in orangutan genomes, indicative of differential Alu amplification in the western lowland gorilla lineage as compared to other Homininae.
Collapse
Affiliation(s)
- Adam T McLain
- Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA.
| | | | | | | | | | | | | | | |
Collapse
|
34
|
Schneider H, Sampaio I. The systematics and evolution of New World primates - A review. Mol Phylogenet Evol 2013; 82 Pt B:348-57. [PMID: 24201058 DOI: 10.1016/j.ympev.2013.10.017] [Citation(s) in RCA: 63] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2013] [Revised: 08/22/2013] [Accepted: 10/14/2013] [Indexed: 11/19/2022]
Abstract
This paper provides an overview of the taxonomy of New World primates from proposals of the 1980's based on morphology to the great number of studies based on molecular data aiming for the elucidation of the phylogeny of New World monkeys. The innovations of the first molecular phylogeny presented by Schneider et al. (1993) positioned Callimico as a sister group of Callithrix and Cebuella; Callicebus as a member of the pitheciids; Brachyteles as sister to Lagothrix; and the night monkeys (Aotus), capuchins (Cebus) and squirrel monkeys (Saimiri) in the same clade with the small callitrichines. These results were subsequently confirmed by dozens of subsequent studies using data from DNA sequences. Some issues difficult to resolve with the phylogenetic analyses of DNA sequences, such as the diversification of the oldest lineages (pitheciids, atelids and cebids), and the confirmation of Aotus as a member of the Cebinae clade (together with Cebus/Saimiri), were clarified with new molecular approaches based on the presence or absence of Alu insertions as well as through the use of phylogenomics. At this time, all relationships at the intergeneric level had been deciphered, with the exception of the definition of the sister group of callitrichines (whether Aotus or Cebus/Saimiri are sister to callitrichines, or if Aotus, Saimiri and Cebus form a clade together). Future studies should prioritize the alpha taxonomy of most Neotropical primate groups, and the use of phylogenetic and geographic data, combined with reliable estimates of divergence times, to clarify the taxonomic status at species and genus level, as well as to help understand the evolutionary history of this remarkable and highly diversified group.
Collapse
Affiliation(s)
- Horacio Schneider
- Instituto de Estudos Costeiros, Universidade Federal do Pará, Campus de Bragança, Alameda Leandro Ribeiro s/n, Bragança, Pará, CEP 68600-000, Brazil.
| | - Iracilda Sampaio
- Instituto de Estudos Costeiros, Universidade Federal do Pará, Campus de Bragança, Alameda Leandro Ribeiro s/n, Bragança, Pará, CEP 68600-000, Brazil.
| |
Collapse
|
35
|
Ben-David S, Yaakov B, Kashkush K. Genome-wide analysis of short interspersed nuclear elements SINES revealed high sequence conservation, gene association and retrotranspositional activity in wheat. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2013; 76:201-10. [PMID: 23855320 PMCID: PMC4223381 DOI: 10.1111/tpj.12285] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/15/2013] [Revised: 06/04/2013] [Accepted: 07/03/2013] [Indexed: 05/02/2023]
Abstract
Short interspersed nuclear elements (SINEs) are non-autonomous non-LTR retroelements that are present in most eukaryotic species. While SINEs have been intensively investigated in humans and other animal systems, they are poorly studied in plants, especially in wheat (Triticum aestivum). We used quantitative PCR of various wheat species to determine the copy number of a wheat SINE family, termed Au SINE, combined with computer-assisted analyses of the publicly available 454 pyrosequencing database of T. aestivum. In addition, we utilized site-specific PCR on 57 Au SINE insertions, transposon methylation display and transposon display on newly formed wheat polyploids to assess retrotranspositional activity, epigenetic status and genetic rearrangements in Au SINE, respectively. We retrieved 3706 different insertions of Au SINE from the 454 pyrosequencing database of T. aestivum, and found that most of the elements are inserted in A/T-rich regions, while approximately 38% of the insertions are associated with transcribed regions, including known wheat genes. We observed typical retrotransposition of Au SINE in the second generation of a newly formed wheat allohexaploid, and massive hypermethylation in CCGG sites surrounding Au SINE in the third generation. Finally, we observed huge differences in the copy numbers in diploid Triticum and Aegilops species, and a significant increase in the copy numbers in natural wheat polyploids, but no significant increase in the copy number of Au SINE in the first four generations for two of three newly formed allopolyploid species used in this study. Our data indicate that SINEs may play a prominent role in the genomic evolution of wheat through stress-induced activation.
Collapse
|
36
|
Abstract
We analyzed 83 fully sequenced great ape genomes for mobile element insertions, predicting a total of 49,452 fixed and polymorphic Alu and long interspersed element 1 (L1) insertions not present in the human reference assembly and assigning each retrotransposition event to a different time point during great ape evolution. We used these homoplasy-free markers to construct a mobile element insertions-based phylogeny of humans and great apes and demonstrate their differential power to discern ape subspecies and populations. Within this context, we find a good correlation between L1 diversity and single-nucleotide polymorphism heterozygosity (r(2) = 0.65) in contrast to Alu repeats, which show little correlation (r(2) = 0.07). We estimate that the "rate" of Alu retrotransposition has differed by a factor of 15-fold in these lineages. Humans, chimpanzees, and bonobos show the highest rates of Alu accumulation--the latter two since divergence 1.5 Mya. The L1 insertion rate, in contrast, has remained relatively constant, with rates differing by less than a factor of three. We conclude that Alu retrotransposition has been the most variable form of genetic variation during recent human-great ape evolution, with increases and decreases occurring over very short periods of evolutionary time.
Collapse
|
37
|
Finstermeier K, Zinner D, Brameier M, Meyer M, Kreuz E, Hofreiter M, Roos C. A mitogenomic phylogeny of living primates. PLoS One 2013; 8:e69504. [PMID: 23874967 PMCID: PMC3713065 DOI: 10.1371/journal.pone.0069504] [Citation(s) in RCA: 132] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2013] [Accepted: 06/11/2013] [Indexed: 12/28/2022] Open
Abstract
Primates, the mammalian order including our own species, comprise 480 species in 78 genera. Thus, they represent the third largest of the 18 orders of eutherian mammals. Although recent phylogenetic studies on primates are increasingly built on molecular datasets, most of these studies have focused on taxonomic subgroups within the order. Complete mitochondrial (mt) genomes have proven to be extremely useful in deciphering within-order relationships even up to deep nodes. Using 454 sequencing, we sequenced 32 new complete mt genomes adding 20 previously not represented genera to the phylogenetic reconstruction of the primate tree. With 13 new sequences, the number of complete mt genomes within the parvorder Platyrrhini was widely extended, resulting in a largely resolved branching pattern among New World monkey families. We added 10 new Strepsirrhini mt genomes to the 15 previously available ones, thus almost doubling the number of mt genomes within this clade. Our data allow precise date estimates of all nodes and offer new insights into primate evolution. One major result is a relatively young date for the most recent common ancestor of all living primates which was estimated to 66-69 million years ago, suggesting that the divergence of extant primates started close to the K/T-boundary. Although some relationships remain unclear, the large number of mt genomes used allowed us to reconstruct a robust primate phylogeny which is largely in agreement with previous publications. Finally, we show that mt genomes are a useful tool for resolving primate phylogenetic relationships on various taxonomic levels.
Collapse
Affiliation(s)
- Knut Finstermeier
- Research Group Molecular Ecology, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Dietmar Zinner
- Cognitive Ethology Laboratory, German Primate Center, Leibniz Institute for Primate Research, Göttingen, Germany
| | - Markus Brameier
- Primate Genetics Laboratory, German Primate Center, Leibniz Institute for Primate Research, Göttingen, Germany
| | - Matthias Meyer
- Research Group Molecular Ecology, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Eva Kreuz
- Research Group Molecular Ecology, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Michael Hofreiter
- Research Group Molecular Ecology, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Christian Roos
- Primate Genetics Laboratory, German Primate Center, Leibniz Institute for Primate Research, Göttingen, Germany
- Gene Bank of Primates, German Primate Center, Leibniz Institute for Primate Research, Göttingen, Germany
- * E-mail:
| |
Collapse
|
38
|
A scalable and flexible approach for investigating the genomic landscapes of phylogenetic incongruence. Mol Phylogenet Evol 2013; 66:1067-74. [DOI: 10.1016/j.ympev.2012.11.023] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2012] [Revised: 11/16/2012] [Accepted: 11/25/2012] [Indexed: 11/19/2022]
|
39
|
Abstract
A major challenge in molecular biology is reverse-engineering the cis-regulatory logic that plays a major role in the control of gene expression. This program includes searching through DNA sequences to identify “motifs” that serve as the binding sites for transcription factors or, more generally, are predictive of gene expression across cellular conditions. Several approaches have been proposed for de novo motif discovery–searching sequences without prior knowledge of binding sites or nucleotide patterns. However, unbiased validation is not straightforward. We consider two approaches to unbiased validation of discovered motifs: testing the statistical significance of a motif using a DNA “background” sequence model to represent the null hypothesis and measuring performance in predicting membership in gene clusters. We demonstrate that the background models typically used are “too null,” resulting in overly optimistic assessments of significance, and argue that performance in predicting TF binding or expression patterns from DNA motifs should be assessed by held-out data, as in predictive learning. Applying this criterion to common motif discovery methods resulted in universally poor performance, although there is a marked improvement when motifs are statistically significant against real background sequences. Moreover, on synthetic data where “ground truth” is known, discriminative performance of all algorithms is far below the theoretical upper bound, with pronounced “over-fitting” in training. A key conclusion from this work is that the failure of de novo discovery approaches to accurately identify motifs is basically due to statistical intractability resulting from the fixed size of co-regulated gene clusters, and thus such failures do not necessarily provide evidence that unfound motifs are not active biologically. Consequently, the use of prior knowledge to enhance motif discovery is not just advantageous but necessary. An implementation of the LR and ALR algorithms is available at http://code.google.com/p/likelihood-ratio-motifs/.
Collapse
Affiliation(s)
- David Simcha
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, Maryland, United States of America.
| | | | | |
Collapse
|
40
|
McLain AT, Meyer TJ, Faulk C, Herke SW, Oldenburg JM, Bourgeois MG, Abshire CF, Roos C, Batzer MA. An alu-based phylogeny of lemurs (infraorder: Lemuriformes). PLoS One 2012; 7:e44035. [PMID: 22937148 PMCID: PMC3429421 DOI: 10.1371/journal.pone.0044035] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2012] [Accepted: 07/31/2012] [Indexed: 11/30/2022] Open
Abstract
LEMURS (INFRAORDER: Lemuriformes) are a radiation of strepsirrhine primates endemic to the island of Madagascar. As of 2012, 101 lemur species, divided among five families, have been described. Genetic and morphological evidence indicates all species are descended from a common ancestor that arrived in Madagascar ∼55-60 million years ago (mya). Phylogenetic relationships in this species-rich infraorder have been the subject of debate. Here we use Alu elements, a family of primate-specific Short INterspersed Elements (SINEs), to construct a phylogeny of infraorder Lemuriformes. Alu elements are particularly useful SINEs for the purpose of phylogeny reconstruction because they are identical by descent and confounding events between loci are easily resolved by sequencing. The genome of the grey mouse lemur (Microcebus murinus) was computationally assayed for synapomorphic Alu elements. Those that were identified as Lemuriformes-specific were analyzed against other available primate genomes for orthologous sequence in which to design primers for PCR (polymerase chain reaction) verification. A primate phylogenetic panel of 24 species, including 22 lemur species from all five families, was examined for the presence/absence of 138 Alu elements via PCR to establish relationships among species. Of these, 111 were phylogenetically informative. A phylogenetic tree was generated based on the results of this analysis. We demonstrate strong support for the monophyly of Lemuriformes to the exclusion of other primates, with Daubentoniidae, the aye-aye, as the basal lineage within the infraorder. Our results also suggest Lepilemuridae as a sister lineage to Cheirogaleidae, and Indriidae as sister to Lemuridae. Among the Cheirogaleidae, we show strong support for Microcebus and Mirza as sister genera, with Cheirogaleus the sister lineage to both. Our results also support the monophyly of the Lemuridae. Within Lemuridae we place Lemur and Hapalemur together to the exclusion of Eulemur and Varecia, with Varecia the sister lineage to the other three genera.
Collapse
Affiliation(s)
- Adam T McLain
- Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana, United States of America
| | | | | | | | | | | | | | | | | |
Collapse
|
41
|
Meyer TJ, McLain AT, Oldenburg JM, Faulk C, Bourgeois MG, Conlin EM, Mootnick AR, de Jong PJ, Roos C, Carbone L, Batzer MA. An Alu-based phylogeny of gibbons (hylobatidae). Mol Biol Evol 2012; 29:3441-50. [PMID: 22683814 DOI: 10.1093/molbev/mss149] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Gibbons (Hylobatidae) are small, arboreal apes indigenous to Southeast Asia that diverged from other apes ∼15-18 Ma. Extant lineages radiated rapidly 6-10 Ma and are organized into four genera (Hylobates, Hoolock, Symphalangus, and Nomascus) consisting of 12-19 species. The use of short interspersed elements (SINEs) as phylogenetic markers has seen recent popularity due to several desirable characteristics: the ancestral state of a locus is known to be the absence of an element, rare potentially homoplasious events are relatively easy to resolve, and samples can be quickly and inexpensively genotyped. During radiation of primates, one particular family of SINEs, the Alu family, has proliferated in primate genomes. Nomascus leucogenys (northern white-cheeked gibbon) sequences were analyzed for repetitive content with RepeatMasker using a custom library. The sequences containing Alu elements identified as members of a gibbon-specific subfamily were then compared with orthologous positions in other primate genomes. A primate phylogenetic panel consisting of 18 primate species, including 13 gibbon species representing all four extant genera, was assayed for all loci, and a total of 125 gibbon-specific Alu insertions were identified. The resulting amplification patterns were used to generate a phylogenetic tree. We demonstrate significant support for Symphalangus as the most basal lineage within the family. Our findings also place Nomascus as a derived lineage, sister to Hoolock, with the Nomascus-Hoolock clade sister to Hylobates. Further, our analysis groups N. leucogenys and Nomascus siki as sister taxa to the exclusion of the other Nomascus species assayed. This study represents the first use of SINEs to determine the genus level phylogenetic relationships within the family Hylobatidae. These relationships have been resolved with robust support at most internal nodes, demonstrating the utility of SINE-based phylogenetic analysis. We postulate that hybridization and rapid radiation may have contributed to the complex and contradictory findings of the previous studies. Our findings will aid in the conservation of these threatened primates and inform future studies of the biogeographical history and distribution of modern gibbon species.
Collapse
Affiliation(s)
- Thomas J Meyer
- Department of Biological Sciences, Louisiana State University
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
42
|
Yaakov B, Ceylan E, Domb K, Kashkush K. Marker utility of miniature inverted-repeat transposable elements for wheat biodiversity and evolution. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2012; 124:1365-73. [PMID: 22286503 DOI: 10.1007/s00122-012-1793-y] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/19/2011] [Accepted: 01/05/2012] [Indexed: 05/02/2023]
Abstract
Transposable elements (TEs) account for up to 80% of the wheat genome and are considered one of the main drivers of wheat genome evolution. However, the contribution of TEs to the divergence and evolution of wheat genomes is not fully understood. In this study, we have developed 55 miniature inverted-repeat transposable element (MITE) markers that are based on the presence/absence of an element, with over 60% of these 55 MITE insertions associated with wheat genes. We then applied these markers to assess genetic diversity among Triticum and Aegilops species, including diploid (AA, BB and DD genomes), tetraploid (BBAA genome) and hexaploid (BBAADD genome) species. While 18.2% of the MITE markers showed similar insertions in all species indicating that those are fossil insertions, 81.8% of the markers showed polymorphic insertions among species, subspecies, and accessions. Furthermore, a phylogenetic analysis based on MITE markers revealed that species were clustered based on genus, genome composition, and ploidy level, while 47.13% genetic divergence was observed between the two main clusters, diploids versus polyploids. In addition, we provide evidence for MITE dynamics in wild emmer populations. The use of MITEs as evolutionary markers might shed more light on the origin of the B-genome of polyploid wheat.
Collapse
Affiliation(s)
- Beery Yaakov
- Department of Life Sciences, Ben-Gurion University, 84105 Beer-Sheva, Israel
| | | | | | | |
Collapse
|
43
|
Abstract
Background Alu polymorphisms are some of the most common polymorphisms in the genome, yet few methods have been developed for their detection. Methods We present algorithms to discover Alu polymorphisms using paired-end high throughput sequencing data from multiple individuals. We consider the problem of identifying sites containing polymorphic Alu insertions. Results We give efficient and practical algorithms that detect polymorphic Alus, both those that are inserted with respect to the reference genome and those that are deleted. The algorithms have a linear time complexity and can be run on a standard desktop machine in a very short amount of time on top of the output of tools standard for sequencing analysis. Conclusions In our simulated dataset we are able to locate 98.1% of Alus inserted with respect to the reference and 97.7% of Alus deleted, our simulations also show an excellent correlations between the deletions detected in parents and children. We further run our algorithms on publicly available data from the 1000 genomes project and find several thousand Alu polymorphisms in each individual.
Collapse
|
44
|
Bire S, Rouleux-Bonnin F. Transposable elements as tools for reshaping the genome: it is a huge world after all! Methods Mol Biol 2012; 859:1-28. [PMID: 22367863 DOI: 10.1007/978-1-61779-603-6_1] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
Transposable elements (TEs) are discrete pieces of DNA that can move from one site to another within genomes and sometime between genomes. They are found in all major branches of life. Because of their wide distribution and considerable diversity, they are a considerable source of genomic variation and as such, they constitute powerful drivers of genome evolution. Moreover, it is becoming clear that the epigenetic regulation of certain genes is derived from defense mechanisms against the activity of ancestral transposable elements. TEs now tend to be viewed as natural molecular tools that can reshape the genome, which challenges the idea that TEs are natural tools used to answer biological questions. In the first part of this chapter, we review the classification and distribution of TEs, and look at how they have contributed to the structural and transcriptional reshaping of genomes. In the second part, we describe methodological innovations that have modified their contribution as molecular tools.
Collapse
Affiliation(s)
- Solenne Bire
- GICC, UMR CNRS 6239, Université François Rabelais, UFR des Sciences et Technques, Tours, France
| | | |
Collapse
|
45
|
Chen Z, Xu S, Zhou K, Yang G. Whale phylogeny and rapid radiation events revealed using novel retroposed elements and their flanking sequences. BMC Evol Biol 2011; 11:314. [PMID: 22029548 PMCID: PMC3219603 DOI: 10.1186/1471-2148-11-314] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2011] [Accepted: 10/27/2011] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND A diversity of hypotheses have been proposed based on both morphological and molecular data to reveal phylogenetic relationships within the order Cetacea (dolphins, porpoises, and whales), and great progress has been made in the past two decades. However, there is still some controversy concerning relationships among certain cetacean taxa such as river dolphins and delphinoid species, which needs to be further addressed with more markers in an effort to address unresolved portions of the phylogeny. RESULTS An analysis of additional SINE insertions and SINE-flanking sequences supported the monophyly of the order Cetacea as well as Odontocete, Delphinoidea (Delphinidae + Phocoenidae + Mondontidae), and Delphinidae. A sister relationship between Delphinidae and Phocoenidae + Mondontidae was supported, and members of classical river dolphins and the genera Tursiops and Stenella were found to be paraphyletic. Estimates of divergence times revealed rapid divergences of basal Odontocete lineages in the Oligocene and Early Miocene, and a recent rapid diversification of Delphinidae in the Middle-Late Miocene and Pliocene within a narrow time frame. CONCLUSIONS Several novel SINEs were found to differentiate Delphinidae from the other two families (Monodontidae and Phocoenidae), whereas the sister grouping of the latter two families with exclusion of Delphinidae was further revealed using the SINE-flanking sequences. Interestingly, some anomalous PCR amplification patterns of SINE insertions were detected, which can be explained as the result of potential ancestral SINE polymorphisms and incomplete lineage sorting. Although a few loci were potentially anomalous, this study demonstrated that the SINE-based approach is a powerful tool in phylogenetic studies. Identifying additional SINE elements that resolve the relationships in the superfamily Delphinoidea and family Delphinidae will be important steps forward in completely resolving cetacean phylogenetic relationships in the future.
Collapse
Affiliation(s)
- Zhuo Chen
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing 210046, China
| | - Shixia Xu
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing 210046, China
| | - Kaiya Zhou
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing 210046, China
| | - Guang Yang
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing 210046, China
| |
Collapse
|
46
|
Faulkner GJ. Retrotransposons: Mobile and mutagenic from conception to death. FEBS Lett 2011; 585:1589-94. [DOI: 10.1016/j.febslet.2011.03.061] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2011] [Revised: 03/25/2011] [Accepted: 03/28/2011] [Indexed: 01/13/2023]
|
47
|
Roos C, Zinner D, Kubatko LS, Schwarz C, Yang M, Meyer D, Nash SD, Xing J, Batzer MA, Brameier M, Leendertz FH, Ziegler T, Perwitasari-Farajallah D, Nadler T, Walter L, Osterholz M. Nuclear versus mitochondrial DNA: evidence for hybridization in colobine monkeys. BMC Evol Biol 2011; 11:77. [PMID: 21435245 PMCID: PMC3068967 DOI: 10.1186/1471-2148-11-77] [Citation(s) in RCA: 101] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2010] [Accepted: 03/24/2011] [Indexed: 12/22/2022] Open
Abstract
BACKGROUND Colobine monkeys constitute a diverse group of primates with major radiations in Africa and Asia. However, phylogenetic relationships among genera are under debate, and recent molecular studies with incomplete taxon-sampling revealed discordant gene trees. To solve the evolutionary history of colobine genera and to determine causes for possible gene tree incongruences, we combined presence/absence analysis of mobile elements with autosomal, X chromosomal, Y chromosomal and mitochondrial sequence data from all recognized colobine genera. RESULTS Gene tree topologies and divergence age estimates derived from different markers were similar, but differed in placing Piliocolobus/Procolobus and langur genera among colobines. Although insufficient data, homoplasy and incomplete lineage sorting might all have contributed to the discordance among gene trees, hybridization is favored as the main cause of the observed discordance. We propose that African colobines are paraphyletic, but might later have experienced female introgression from Piliocolobus/Procolobus into Colobus. In the late Miocene, colobines invaded Eurasia and diversified into several lineages. Among Asian colobines, Semnopithecus diverged first, indicating langur paraphyly. However, unidirectional gene flow from Semnopithecus into Trachypithecus via male introgression followed by nuclear swamping might have occurred until the earliest Pleistocene. CONCLUSIONS Overall, our study provides the most comprehensive view on colobine evolution to date and emphasizes that analyses of various molecular markers, such as mobile elements and sequence data from multiple loci, are crucial to better understand evolutionary relationships and to trace hybridization events. Our results also suggest that sex-specific dispersal patterns, promoted by a respective social organization of the species involved, can result in different hybridization scenarios.
Collapse
Affiliation(s)
- Christian Roos
- Primate Genetics Laboratory, German Primate Center, Göttingen, Germany.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
48
|
Yu L, Luan PT, Jin W, Ryder OA, Chemnick LG, Davis HA, Zhang YP. Phylogenetic Utility of Nuclear Introns in Interfamilial Relationships of Caniformia (Order Carnivora). Syst Biol 2011; 60:175-87. [DOI: 10.1093/sysbio/syq090] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Affiliation(s)
- Li Yu
- Laboratory for Conservation and Utilization of Bio-Resources and Key Laboratory for Microbial Resources of the Ministry of Education, Yunnan University, Kunming 650091, China
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Kunming 650223, China
| | - Peng-Tao Luan
- Laboratory for Conservation and Utilization of Bio-Resources and Key Laboratory for Microbial Resources of the Ministry of Education, Yunnan University, Kunming 650091, China
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Kunming 650223, China
| | - Wei Jin
- Laboratory for Conservation and Utilization of Bio-Resources and Key Laboratory for Microbial Resources of the Ministry of Education, Yunnan University, Kunming 650091, China
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Kunming 650223, China
| | - Oliver A. Ryder
- San Diego Zoo Conservation Research, PO Box 120551, San Diego, CA 92112, USA
| | - Leona G. Chemnick
- San Diego Zoo Conservation Research, PO Box 120551, San Diego, CA 92112, USA
| | - Heidi A. Davis
- San Diego Zoo Conservation Research, PO Box 120551, San Diego, CA 92112, USA
| | - Ya-ping Zhang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Kunming 650223, China
| |
Collapse
|
49
|
Affiliation(s)
- Miriam K Konkel
- Department of Biological Sciences, Louisiana State University, 202 Life Sciences Bldg., Baton Rouge, LA 70803, USA
| | - Jerilyn A Walker
- Department of Biological Sciences, Louisiana State University, 202 Life Sciences Bldg., Baton Rouge, LA 70803, USA
| | - Mark A Batzer
- Department of Biological Sciences, Louisiana State University, 202 Life Sciences Bldg., Baton Rouge, LA 70803, USA
| |
Collapse
|
50
|
Grechko VV, Kosushkin SA, Borodulina OR, Butaeva FG, Darevsky IS. Short interspersed elements (SINEs) of squamate reptiles (Squam1 and Squam2): structure and phylogenetic significance. JOURNAL OF EXPERIMENTAL ZOOLOGY PART B-MOLECULAR AND DEVELOPMENTAL EVOLUTION 2010; 316B:212-26. [PMID: 21462315 DOI: 10.1002/jez.b.21391] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/05/2010] [Revised: 11/05/2010] [Accepted: 11/07/2010] [Indexed: 11/08/2022]
Abstract
Short interspersed elements (SINEs) are important nuclear molecular markers of the evolution of many eukaryotes. However, the SINEs of squamate reptile genomes have been little studied. We first identified two families of SINEs, termed Squam1 and Squam2, in the DNA of meadow lizard Darevskia praticola (Lacertidae) by performing DNA hybridization and PCR. Later, the same families of retrotransposons were found using the same methods in members of another 25 lizard families (from Iguania, Scincomorpha, Gekkota, Varanoidea, and Diploglossa infraorders) and two snake families, but their abundances in these taxa varied greatly. Both SINEs were Squamata-specific and were absent from mammals, birds, crocodiles, turtles, amphibians, and fish. Squam1 possessed some characteristics common to tRNA-related SINEs from fish and mammals, while Squam2 belonged to the tRNA(Ala) group of SINEs and had a more unusual and divergent structure. Squam2-related sequences were found in several unannotated GenBank sequences of squamate reptiles. Squam1 abundance in the Polychrotidae, Agamidae, Leiolepididae, Chamaeleonidae, Scincidae, Lacertidae, Gekkonidae, Varanidae, Helodermatidae, and two snake families were 10(2) -10(4) times higher than those in other taxa (Corytophanidae, Iguanidae, Anguidae, Cordylidae, Gerrhosauridae, Pygopodidae, and Eublepharidae). A less dramatic degree of copy number variation was observed for Squam2 in different taxa. Several Squam1 copies from Lacertidae, Chamaeleonidae, Gekkonidae, Varanidae, and Colubridae were sequenced and found to have evident orthologous features, as well as taxa-specific autapomorphies. Squam1 from Lacertidae and Chamaeleonidae could be divided into several subgroups based on sequence differences. Possible applications of these SINEs as Squamata phylogeny markers are discussed.
Collapse
Affiliation(s)
- Vernata V Grechko
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, Russia.
| | | | | | | | | |
Collapse
|