1
|
Hu K, Ni P, Xu M, Zou Y, Chang J, Gao X, Li Y, Ruan J, Hu B, Wang J. HiTE: a fast and accurate dynamic boundary adjustment approach for full-length transposable element detection and annotation. Nat Commun 2024; 15:5573. [PMID: 38956036 PMCID: PMC11219922 DOI: 10.1038/s41467-024-49912-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2023] [Accepted: 06/25/2024] [Indexed: 07/04/2024] Open
Abstract
Recent advancements in genome assembly have greatly improved the prospects for comprehensive annotation of Transposable Elements (TEs). However, existing methods for TE annotation using genome assemblies suffer from limited accuracy and robustness, requiring extensive manual editing. In addition, the currently available gold-standard TE databases are not comprehensive, even for extensively studied species, highlighting the critical need for an automated TE detection method to supplement existing repositories. In this study, we introduce HiTE, a fast and accurate dynamic boundary adjustment approach designed to detect full-length TEs. The experimental results demonstrate that HiTE outperforms RepeatModeler2, the state-of-the-art tool, across various species. Furthermore, HiTE has identified numerous novel transposons with well-defined structures containing protein-coding domains, some of which are directly inserted within crucial genes, leading to direct alterations in gene expression. A Nextflow version of HiTE is also available, with enhanced parallelism, reproducibility, and portability.
Collapse
Affiliation(s)
- Kang Hu
- School of Computer Science and Engineering, Central South University, Changsha, 410083, China
- Xiangjiang Laboratory, Changsha, 410205, China
- Hunan Provincial Key Lab on Bioinformatics, Central South University, Changsha, 410083, China
| | - Peng Ni
- School of Computer Science and Engineering, Central South University, Changsha, 410083, China
- Xiangjiang Laboratory, Changsha, 410205, China
- Hunan Provincial Key Lab on Bioinformatics, Central South University, Changsha, 410083, China
| | - Minghua Xu
- School of Computer Science and Engineering, Central South University, Changsha, 410083, China
- Hunan Provincial Key Lab on Bioinformatics, Central South University, Changsha, 410083, China
| | - You Zou
- School of Computer Science and Engineering, Central South University, Changsha, 410083, China
- Hunan Provincial Key Lab on Bioinformatics, Central South University, Changsha, 410083, China
| | - Jianye Chang
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518000, China
| | - Xin Gao
- Computer Science Program, Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia
- Center of Excellence on Smart Health, King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia
| | - Yaohang Li
- Department of Computer Science, Old Dominion University, Norfolk, VA, 23529, USA
| | - Jue Ruan
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518000, China
| | - Bin Hu
- Key Laboratory of Brain Health Intelligent Evaluation and Intervention, Ministry of Education (Beijing Institute of Technology), Beijing, P. R. China.
- School of Medical Technology, Beijing Institute of Technology, Beijing, P. R. China.
| | - Jianxin Wang
- School of Computer Science and Engineering, Central South University, Changsha, 410083, China.
- Xiangjiang Laboratory, Changsha, 410205, China.
- Hunan Provincial Key Lab on Bioinformatics, Central South University, Changsha, 410083, China.
| |
Collapse
|
2
|
Modenini G, Abondio P, Guffanti G, Boattini A, Macciardi F. Evolutionarily recent retrotransposons contribute to schizophrenia. Transl Psychiatry 2023; 13:181. [PMID: 37244930 DOI: 10.1038/s41398-023-02472-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/13/2023] [Revised: 05/02/2023] [Accepted: 05/12/2023] [Indexed: 05/29/2023] Open
Abstract
Transposable elements (TEs) are mobile genetic elements that constitute half of the human genome. Recent studies suggest that polymorphic non-reference TEs (nrTEs) may contribute to cognitive diseases, such as schizophrenia, through a cis-regulatory effect. The aim of this work is to identify sets of nrTEs putatively linked to an increased risk of developing schizophrenia. To do so, we inspected the nrTE content of genomes from the dorsolateral prefrontal cortex of schizophrenic and control individuals and identified 38 nrTEs that possibly contribute to the emergence of this psychiatric disorder, two of them further confirmed with haplotype-based methods. We then performed in silico functional inferences and found that 9 of the 38 nrTEs act as expression/alternative splicing quantitative trait loci (eQTLs/sQTLs) in the brain, suggesting a possible role in shaping the human cognitive genome structure. To our knowledge, this is the first attempt at identifying polymorphic nrTEs that can contribute to the functionality of the brain. Finally, we suggest that a neurodevelopmental genetic mechanism, which involves evolutionarily young nrTEs, can be key to understanding the ethio-pathogenesis of this complex disorder.
Collapse
Affiliation(s)
| | - Paolo Abondio
- BiGeA Department, University of Bologna, Bologna, Italy
- Department of Cultural Heritage, University of Bologna, Ravenna, Italy
| | - Guia Guffanti
- Department of Psychiatry, McLean Hospital-Harvard Medical School, Belmont, MA, USA
| | | | - Fabio Macciardi
- Department of Medical Education (Neuroscience), CUSM, Colton, CA, USA.
| |
Collapse
|
3
|
Merenciano M, Coronado-Zamora M, González J. Experimental Validation of Transposable Element Insertions Using the Polymerase Chain Reaction (PCR). Methods Mol Biol 2023; 2607:95-114. [PMID: 36449160 DOI: 10.1007/978-1-0716-2883-6_6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]
Abstract
Transposable elements (TEs), also known as transposons, are repetitive DNA sequences, present in virtually all organisms, that can move from one genomic position to another. TEs can be a source of mutations with important consequences for organisms. Despite their interest, its repetitive nature has made their study very challenging. However, the emergence of new sequencing technologies that allow obtaining long-read sequences, has improved the in silico de novo detection and annotation of TEs. The de novo annotation of TEs has already been performed in several organisms including the fruit fly Drosophila melanogaster. Yet, experimental validation can be used to confirm the presence of TEs in specific D. melanogaster natural populations. Here, we present a step-by-step protocol to experimentally validate by polymerase chain reaction (PCR) the presence and/or absence of TEs in natural populations of D. melanogaster. This detailed protocol has been implemented in the participant high schools of the Citizen Fly Lab activity that is part of the international citizen science project Melanogaster: Catch the Fly! ( https://melanogaster.eu ). Specifically, the students collaborate with the scientists of the European Drosophila Population Genomics Consortium (DrosEU) in the experimental validation of new genetic variants, previously identified using bioinformatic techniques.
Collapse
Affiliation(s)
| | | | - Josefa González
- Institute of Evolutionary Biology, CSIC, UPF, Barcelona, Spain.
| |
Collapse
|
4
|
Modenini G, Abondio P, Boattini A. The coevolution between APOBEC3 and retrotransposons in primates. Mob DNA 2022; 13:27. [PMID: 36443831 PMCID: PMC9706992 DOI: 10.1186/s13100-022-00283-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2022] [Accepted: 10/31/2022] [Indexed: 12/02/2022] Open
Abstract
Retrotransposons are genetic elements with the ability to replicate in the genome using reverse transcriptase: they have been associated with the development of different biological structures, such as the Central Nervous System (CNS), and their high mutagenic potential has been linked to various diseases, including cancer and neurological disorders. Throughout evolution and over time, Primates and Homo had to cope with infections from viruses and bacteria, and also with endogenous retroelements. Therefore, host genomes have evolved numerous methods to counteract the activity of endogenous and exogenous pathogens, and the APOBEC3 family of mutators is a prime example of a defensive mechanism in this context.In most Primates, there are seven members of the APOBEC3 family of deaminase proteins: among their functions, there is the ability to inhibit the mobilization of retrotransposons and the functionality of viruses. The evolution of the APOBEC3 proteins found in Primates is correlated with the expansion of two major families of retrotransposons, i.e. ERV and LINE-1.In this review, we will discuss how the rapid expansion of the APOBEC3 family is linked to the evolution of retrotransposons, highlighting the strong evolutionary arms race that characterized the history of APOBEC3s and endogenous retroelements in Primates. Moreover, the possible role of this relationship will be assessed in the context of embryonic development and brain-associated diseases.
Collapse
Affiliation(s)
- Giorgia Modenini
- grid.6292.f0000 0004 1757 1758Department of Biological, Geological and Environmental Sciences, University of Bologna, Bologna, Italy
| | - Paolo Abondio
- grid.6292.f0000 0004 1757 1758Department of Biological, Geological and Environmental Sciences, University of Bologna, Bologna, Italy ,grid.6292.f0000 0004 1757 1758Department of Cultural Heritage, University of Bologna, Ravenna, Italy
| | - Alessio Boattini
- grid.6292.f0000 0004 1757 1758Department of Biological, Geological and Environmental Sciences, University of Bologna, Bologna, Italy
| |
Collapse
|
5
|
Orozco-Arias S, Candamil-Cortes MS, Jaimes PA, Valencia-Castrillon E, Tabares-Soto R, Isaza G, Guyot R. Automatic curation of LTR retrotransposon libraries from plant genomes through machine learning. J Integr Bioinform 2022; 19:jib-2021-0036. [PMID: 35822734 PMCID: PMC9521825 DOI: 10.1515/jib-2021-0036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2021] [Accepted: 06/10/2022] [Indexed: 11/19/2022] Open
Abstract
Transposable elements are mobile sequences that can move and insert themselves into chromosomes, activating under internal or external stimuli, giving the organism the ability to adapt to the environment. Annotating transposable elements in genomic data is currently considered a crucial task to understand key aspects of organisms such as phenotype variability, species evolution, and genome size, among others. Because of the way they replicate, LTR retrotransposons are the most common transposable elements in plants, accounting in some cases for up to 80% of all DNA information. To annotate these elements, a reference library is usually created, a curation process is performed, eliminating TE fragments and false positives and then annotated in the genome using the homology method. However, the curation process can take weeks, requires extensive manual work and the execution of multiple time-consuming bioinformatics software. Here, we propose a machine learning-based approach to perform this process automatically on plant genomes, obtaining up to 91.18% F1-score. This approach was tested with four plant species, obtaining up to 93.6% F1-score (Oryza granulata) in only 22.61 s, where bioinformatics methods took approximately 6 h. This acceleration demonstrates that the ML-based approach is efficient and could be used in massive sequencing projects.
Collapse
Affiliation(s)
- Simon Orozco-Arias
- Department of Computer Science, Universidad Autónoma de Manizales, Manizales, Colombia.,Department of Systems and Informatics, Universidad de Caldas, Manizales, Colombia
| | | | - Paula A Jaimes
- Department of Computer Science, Universidad Autónoma de Manizales, Manizales, Colombia
| | | | - Reinel Tabares-Soto
- Department of Electronics and Automation, Universidad Autónoma de Manizales, Manizales, Colombia
| | - Gustavo Isaza
- Department of Systems and Informatics, Universidad de Caldas, Manizales, Colombia
| | - Romain Guyot
- Department of Electronics and Automation, Universidad Autónoma de Manizales, Manizales, Colombia.,Institut de Recherche pour le Développement, CIRAD, Univ. Montpellier, Montpellier, France
| |
Collapse
|
6
|
Ben Amara W, Djebbi S, Ben Lazhar-Ajroud W, Naccache C, Mezghani MK. Insights on mauritiana-like Elements Diversity in Mayetiola destructor and M. hordei (Diptera: Cecidomyiidae). Genome 2021; 65:165-181. [PMID: 34780303 DOI: 10.1139/gen-2021-0020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Mariner-like elements (MLEs) are class II transposons belonging to the Tc1-mariner family, that have successfully invaded many insect genomes. In the current study, the availability of the Hessian fly Mayetiola destructor genome has enabled us to perform in silico analysis of MLEs using as query the previously described mariner element (Desmar1) belonging to mauritiana subfamily. Eighteen mauritiana-like elements were detected and were clustered into three main groups named Desmar1-like, MauCons1 and MauCons2. Subsequently, in vitro analysis was carried out to investigate mauritiana-like elements in M. destructor as well as in Mayetiola hordei using primers designed from TIRs of the previously identified MLEs. PCR amplifications were successful and a total of 12 and 17 mauritiana-like elements were discovered in M. destructor and M. hordei, respectively. Sequence analyses of mauritiana-like elements obtained in silico and in vitro have showed that MauCons1 and MauCons2 elements share low similarity with Desmar1 ranging from 50% to 55% suggesting different groups under mauritiana subfamily have invaded the genomes of M. destructor and M. hordei. These groups are likely inherited by vertical transmission that subsequently underwent different evolutionary histories. This work describes new mauritiana-like elements in M. destructor that are distinct from the previouslydiscovered Desmar1 and provides the first evidence of MLEs belonging to mauritiana subfamily in M. hordei.
Collapse
Affiliation(s)
- Wiem Ben Amara
- University of Tunis El Manar Faculty of Sciences of Tunis, 155529, Laboratory of Biochemistry and Biotechnology (LR01ES05), Tunis, Tunisia;
| | - Salma Djebbi
- University of Tunis El Manar Faculty of Sciences of Tunis, 155529, Laboratory of Biochemistry and Biotechnology (LR01ES05), Tunis, Tunisia;
| | - Wafa Ben Lazhar-Ajroud
- University of Tunis El Manar Faculty of Sciences of Tunis, 155529, Laboratory of Biochemistry and Biotechnology (LR01ES05), Tunis, Tunisia;
| | | | - Maha Khemakhem Mezghani
- University of Tunis El Manar Faculty of Sciences of Tunis, 155529, Laboratory of Biochemistry and Biotechnology (LR01ES05), Tunis, Tunisia;
| |
Collapse
|
7
|
Viviani A, Ventimiglia M, Fambrini M, Vangelisti A, Mascagni F, Pugliesi C, Usai G. Impact of transposable elements on the evolution of complex living systems and their epigenetic control. Biosystems 2021; 210:104566. [PMID: 34718084 DOI: 10.1016/j.biosystems.2021.104566] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2021] [Revised: 10/21/2021] [Accepted: 10/21/2021] [Indexed: 10/20/2022]
Abstract
Transposable elements (TEs) contribute to genomic innovations, as well as genome instability, across a wide variety of species. Popular designations such as 'selfish DNA' and 'junk DNA,' common in the 1980s, may be either inaccurate or misleading, while a more enlightened view of the TE-host relationship covers a range from parasitism to mutualism. Both plant and animal hosts have evolved epigenetic mechanisms to reduce the impact of TEs, both by directly silencing them and by reducing their ability to transpose in the genome. However, TEs have also been co-opted by both plant and animal genomes to perform a variety of physiological functions, ranging from TE-derived proteins acting directly in normal biological functions to innovations in transcription factor activity and also influencing gene expression. Their presence, in fact, can affect a range of features at genome, phenotype, and population levels. The impact TEs have had on evolution is multifaceted, and many aspects still remain unexplored. In this review, the epigenetic control of TEs is contextualized according to the evolution of complex living systems.
Collapse
Affiliation(s)
- Ambra Viviani
- Department of Agriculture, Food and Environment (DAFE), University of Pisa, Via del Borghetto, 80-56124, Pisa, Italy
| | - Maria Ventimiglia
- Department of Agriculture, Food and Environment (DAFE), University of Pisa, Via del Borghetto, 80-56124, Pisa, Italy
| | - Marco Fambrini
- Department of Agriculture, Food and Environment (DAFE), University of Pisa, Via del Borghetto, 80-56124, Pisa, Italy
| | - Alberto Vangelisti
- Department of Agriculture, Food and Environment (DAFE), University of Pisa, Via del Borghetto, 80-56124, Pisa, Italy
| | - Flavia Mascagni
- Department of Agriculture, Food and Environment (DAFE), University of Pisa, Via del Borghetto, 80-56124, Pisa, Italy
| | - Claudio Pugliesi
- Department of Agriculture, Food and Environment (DAFE), University of Pisa, Via del Borghetto, 80-56124, Pisa, Italy.
| | - Gabriele Usai
- Department of Agriculture, Food and Environment (DAFE), University of Pisa, Via del Borghetto, 80-56124, Pisa, Italy
| |
Collapse
|
8
|
Ben Amara W, Quesneville H, Khemakhem MM. A Genomic Survey of Mayetiola destructor Mobilome Provides New Insights into the Evolutionary History of Transposable Elements in the Cecidomyiid Midges. PLoS One 2021; 16:e0257996. [PMID: 34634072 PMCID: PMC8504770 DOI: 10.1371/journal.pone.0257996] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2020] [Accepted: 09/16/2021] [Indexed: 11/19/2022] Open
Abstract
The availability of the Whole-Genome Sequence of the wheat pest Mayetiola destructor offers the opportunity to investigate the Transposable Elements (TEs) content and their relationship with the genes involved in the insect virulence. In this study, de novo annotation carried out using REPET pipeline showed that TEs occupy approximately 16% of the genome and are represented by 1038 lineages. Class II elements were the most frequent and most TEs were inactive due to the deletions they have accumulated. The analyses of TEs ages revealed a first burst at 20% of divergence from present that mobilized many TE families including mostly Tc1/mariner and Gypsy superfamilies and a second burst at 2% of divergence, which involved mainly the class II elements suggesting new TEs invasions. Additionally, 86 TEs insertions involving recently transposed elements were identified. Among them, several MITEs and Gypsy retrotransposons were inserted in the vicinity of SSGP and chemosensory genes. The findings represent a valuable resource for more in-depth investigation of the TE impact onto M. destructor genome and their possible influence on the expression of the virulence and chemosensory genes and consequently the behavior of this pest towards its host plants.
Collapse
Affiliation(s)
- Wiem Ben Amara
- Laboratory of Biochemistry and Biotechnology (LR01ES05), Faculty of Sciences of Tunis, University of Tunis El Manar, Tunis, Tunisia
| | - Hadi Quesneville
- INRAE, URGI, Université Paris-Saclay, Versailles, France
- INRAE, BioinfOmics, Plant Bioinformatics Facility, Université Paris-Saclay, Versailles, France
| | - Maha Mezghani Khemakhem
- Laboratory of Biochemistry and Biotechnology (LR01ES05), Faculty of Sciences of Tunis, University of Tunis El Manar, Tunis, Tunisia
- * E-mail:
| |
Collapse
|
9
|
Segel M, Lash B, Song J, Ladha A, Liu CC, Jin X, Mekhedov SL, Macrae RK, Koonin EV, Zhang F. Mammalian retrovirus-like protein PEG10 packages its own mRNA and can be pseudotyped for mRNA delivery. Science 2021; 373:882-889. [PMID: 34413232 DOI: 10.1126/science.abg6155] [Citation(s) in RCA: 187] [Impact Index Per Article: 62.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2021] [Revised: 04/26/2021] [Accepted: 07/06/2021] [Indexed: 12/11/2022]
Abstract
Eukaryotic genomes contain domesticated genes from integrating viruses and mobile genetic elements. Among these are homologs of the capsid protein (known as Gag) of long terminal repeat (LTR) retrotransposons and retroviruses. We identified several mammalian Gag homologs that form virus-like particles and one LTR retrotransposon homolog, PEG10, that preferentially binds and facilitates vesicular secretion of its own messenger RNA (mRNA). We showed that the mRNA cargo of PEG10 can be reprogrammed by flanking genes of interest with Peg10's untranslated regions. Taking advantage of this reprogrammability, we developed selective endogenous encapsidation for cellular delivery (SEND) by engineering both mouse and human PEG10 to package, secrete, and deliver specific RNAs. Together, these results demonstrate that SEND is a modular platform suited for development as an efficient therapeutic delivery modality.
Collapse
Affiliation(s)
- Michael Segel
- Howard Hughes Medical Institute, Cambridge, MA 02139, USA.,Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA.,McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.,Department of Brain and Cognitive Science, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.,Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Blake Lash
- Howard Hughes Medical Institute, Cambridge, MA 02139, USA.,Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA.,McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.,Department of Brain and Cognitive Science, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.,Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Jingwei Song
- Howard Hughes Medical Institute, Cambridge, MA 02139, USA.,Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA.,McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.,Department of Brain and Cognitive Science, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.,Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Alim Ladha
- Howard Hughes Medical Institute, Cambridge, MA 02139, USA.,Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA.,McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.,Department of Brain and Cognitive Science, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.,Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Catherine C Liu
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.,Howard Hughes Medical Institute, Cambridge, MA 02139, USA.,Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA.,McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.,Department of Brain and Cognitive Science, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.,Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Xin Jin
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA.,McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.,Department of Brain and Cognitive Science, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.,Society of Fellows, Harvard University, Cambridge, MA 02138 USA
| | - Sergei L Mekhedov
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Rhiannon K Macrae
- Howard Hughes Medical Institute, Cambridge, MA 02139, USA.,Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA.,McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.,Department of Brain and Cognitive Science, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.,Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Feng Zhang
- Howard Hughes Medical Institute, Cambridge, MA 02139, USA.,Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA.,McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.,Department of Brain and Cognitive Science, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.,Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| |
Collapse
|
10
|
Fabian DK, Dönertaş HM, Fuentealba M, Partridge L, Thornton JM. Transposable Element Landscape in Drosophila Populations Selected for Longevity. Genome Biol Evol 2021; 13:6141024. [PMID: 33595657 PMCID: PMC8355499 DOI: 10.1093/gbe/evab031] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/11/2021] [Indexed: 12/11/2022] Open
Abstract
Transposable elements (TEs) inflict numerous negative effects on health and fitness as they replicate by integrating into new regions of the host genome. Even though organisms employ powerful mechanisms to demobilize TEs, transposons gradually lose repression during aging. The rising TE activity causes genomic instability and was implicated in age-dependent neurodegenerative diseases, inflammation, and the determination of lifespan. It is therefore conceivable that long-lived individuals have improved TE silencing mechanisms resulting in reduced TE expression relative to their shorter-lived counterparts and fewer genomic insertions. Here, we test this hypothesis by performing the first genome-wide analysis of TE insertions and expression in populations of Drosophila melanogaster selected for longevity through late-life reproduction for 50–170 generations from four independent studies. Contrary to our expectation, TE families were generally more abundant in long-lived populations compared with nonselected controls. Although simulations showed that this was not expected under neutrality, we found little evidence for selection driving TE abundance differences. Additional RNA-seq analysis revealed a tendency for reducing TE expression in selected populations, which might be more important for lifespan than regulating genomic insertions. We further find limited evidence of parallel selection on genes related to TE regulation and transposition. However, telomeric TEs were genomically and transcriptionally more abundant in long-lived flies, suggesting improved telomere maintenance as a promising TE-mediated mechanism for prolonging lifespan. Our results provide a novel viewpoint indicating that reproduction at old age increases the opportunity of TEs to be passed on to the next generation with little impact on longevity.
Collapse
Affiliation(s)
- Daniel K Fabian
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, United Kingdom
- Institute of Healthy Ageing, Department of Genetics, Evolution and Environment, University College London, United Kingdom
- Corresponding author: E-mail:
| | - Handan Melike Dönertaş
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, United Kingdom
| | - Matías Fuentealba
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, United Kingdom
- Institute of Healthy Ageing, Department of Genetics, Evolution and Environment, University College London, United Kingdom
| | - Linda Partridge
- Institute of Healthy Ageing, Department of Genetics, Evolution and Environment, University College London, United Kingdom
- Max Planck Institute for Biology of Ageing, Cologne, Germany
| | - Janet M Thornton
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, United Kingdom
| |
Collapse
|
11
|
Fambrini M, Usai G, Vangelisti A, Mascagni F, Pugliesi C. The plastic genome: The impact of transposable elements on gene functionality and genomic structural variations. Genesis 2020; 58:e23399. [DOI: 10.1002/dvg.23399] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2020] [Revised: 11/07/2020] [Accepted: 11/10/2020] [Indexed: 12/15/2022]
Affiliation(s)
- Marco Fambrini
- Department of Agriculture, Food and Environment (DAFE) University of Pisa Pisa Italy
| | - Gabriele Usai
- Department of Agriculture, Food and Environment (DAFE) University of Pisa Pisa Italy
| | - Alberto Vangelisti
- Department of Agriculture, Food and Environment (DAFE) University of Pisa Pisa Italy
| | - Flavia Mascagni
- Department of Agriculture, Food and Environment (DAFE) University of Pisa Pisa Italy
| | - Claudio Pugliesi
- Department of Agriculture, Food and Environment (DAFE) University of Pisa Pisa Italy
| |
Collapse
|
12
|
Existence of Bov-B LINE Retrotransposons in Snake Lineages Reveals Recent Multiple Horizontal Gene Transfers with Copy Number Variation. Genes (Basel) 2020; 11:genes11111241. [PMID: 33105659 PMCID: PMC7716205 DOI: 10.3390/genes11111241] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2020] [Revised: 10/10/2020] [Accepted: 10/20/2020] [Indexed: 01/09/2023] Open
Abstract
Transposable elements (TEs) are dynamic elements present in all eukaryotic genomes. They can “jump” and amplify within the genome and promote segmental genome rearrangements on both autosomes and sex chromosomes by disruption of gene structures. The Bovine-B long interspersed nuclear element (Bov-B LINE) is among the most abundant TE-retrotransposon families in vertebrates due to horizontal transfer (HT) among vertebrate lineages. Recent studies have shown multiple HTs or the presence of diverse Bov-B LINE groups in the snake lineage. It is hypothesized that Bov-B LINEs are highly dynamic and that the diversity reflects multiple HTs in snake lineages. Partial sequences of Bov-B LINE from 23 snake species were characterized. Phylogenetic analysis resolved at least two Bov-B LINE groups that might correspond to henophidian and caenophidian snakes; however, the tree topology differed from that based on functional nuclear and mitochondrial gene sequences. Several Bov-B LINEs of snakes showed greater than 80% similarity to sequences obtained from insects, whereas the two Bov-B LINE groups as well as sequences from the same snake species classified in different Bov-B LINE groups showed sequence similarities of less than 80%. Calculation of estimated divergence time and pairwise divergence between all individual Bov-B LINE copies suggest invasion times ranging from 79.19 to 98.8 million years ago in snakes. Accumulation of elements in a lineage-specific fashion ranged from 9 × 10−6% to 5.63 × 10−2% per genome. The genomic proportion of Bov-B LINEs varied among snake species but was not directly associated with genome size or invasion time. No differentiation in Bov-B LINE copy number between males and females was observed in any of the snake species examined. Incongruence in tree topology between Bov-B LINEs and other snake phylogenies may reflect past HT events. Sequence divergence of Bov-B LINEs between copies suggests that recent multiple HTs occurred within the same evolutionary timeframe in the snake lineage. The proportion of Bov-B LINEs varies among species, reflecting species specificity in TE invasion. The rapid speciation of snakes, coinciding with Bov-B LINE invasion in snake genomes, leads us to better understand the effect of Bov-B LINEs on snake genome evolution.
Collapse
|
13
|
Bogaerts-Márquez M, Barrón MG, Fiston-Lavier AS, Vendrell-Mir P, Castanera R, Casacuberta JM, González J. T-lex3: an accurate tool to genotype and estimate population frequencies of transposable elements using the latest short-read whole genome sequencing data. Bioinformatics 2020; 36:1191-1197. [PMID: 31580402 PMCID: PMC7703783 DOI: 10.1093/bioinformatics/btz727] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2019] [Revised: 09/16/2019] [Accepted: 09/25/2019] [Indexed: 12/22/2022] Open
Abstract
Motivation Transposable elements (TEs) constitute a significant proportion of the majority of genomes sequenced to date. TEs are responsible for a considerable fraction of the genetic variation within and among species. Accurate genotyping of TEs in genomes is therefore crucial for a complete identification of the genetic differences among individuals, populations and species. Results In this work, we present a new version of T-lex, a computational pipeline that accurately genotypes and estimates the population frequencies of reference TE insertions using short-read high-throughput sequencing data. In this new version, we have re-designed the T-lex algorithm to integrate the BWA-MEM short-read aligner, which is one of the most accurate short-read mappers and can be launched on longer short-reads (e.g. reads >150 bp). We have added new filtering steps to increase the accuracy of the genotyping, and new parameters that allow the user to control both the minimum and maximum number of reads, and the minimum number of strains to genotype a TE insertion. We also showed for the first time that T-lex3 provides accurate TE calls in a plant genome. Availability and implementation To test the accuracy of T-lex3, we called 1630 individual TE insertions in Drosophila melanogaster, 1600 individual TE insertions in humans, and 3067 individual TE insertions in the rice genome. We showed that this new version of T-lex is a broadly applicable and accurate tool for genotyping and estimating TE frequencies in organisms with different genome sizes and different TE contents. T-lex3 is available at Github: https://github.com/GonzalezLab/T-lex3. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- María Bogaerts-Márquez
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Paseo Maritimo Barceloneta 37-49, Barcelona, Spain
| | - Maite G Barrón
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Paseo Maritimo Barceloneta 37-49, Barcelona, Spain
| | - Anna-Sophie Fiston-Lavier
- Institut des Sciences de l'Evolution de Montpellier (UMR 5554, CNRS-UM-IRD-EPHE), 11 Université de Motpellier, Place Eugène Bataillon, Montpellier, France
| | - Pol Vendrell-Mir
- Center for Research in Agricultural Genomics, CRAG (CSIC-IRTA-UAB-UB), Campus UAB, Cerdanyola del Vallès, Barcelona, Spain
| | - Raúl Castanera
- Center for Research in Agricultural Genomics, CRAG (CSIC-IRTA-UAB-UB), Campus UAB, Cerdanyola del Vallès, Barcelona, Spain
| | - Josep M Casacuberta
- Center for Research in Agricultural Genomics, CRAG (CSIC-IRTA-UAB-UB), Campus UAB, Cerdanyola del Vallès, Barcelona, Spain
| | - Josefa González
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Paseo Maritimo Barceloneta 37-49, Barcelona, Spain
| |
Collapse
|
14
|
Lerat E, Goubert C, Guirao‐Rico S, Merenciano M, Dufour A, Vieira C, González J. Population-specific dynamics and selection patterns of transposable element insertions in European natural populations. Mol Ecol 2019; 28:1506-1522. [PMID: 30506554 PMCID: PMC6849870 DOI: 10.1111/mec.14963] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2018] [Revised: 10/30/2018] [Accepted: 11/05/2018] [Indexed: 01/02/2023]
Abstract
Transposable elements (TEs) are ubiquitous sequences in genomes of virtually all species. While TEs have been investigated for several decades, only recently we have the opportunity to study their genome-wide population dynamics. Most of the studies so far have been restricted either to the analysis of the insertions annotated in the reference genome or to the analysis of a limited number of populations. Taking advantage of the European Drosophila population genomics consortium (DrosEU) sequencing data set, we have identified and measured the dynamics of TEs in a large sample of European Drosophila melanogaster natural populations. We showed that the mobilome landscape is population-specific and highly diverse depending on the TE family. In contrast with previous studies based on SNP variants, no geographical structure was observed for TE abundance or TE divergence in European populations. We further identified de novo individual insertions using two available programs and, as expected, most of the insertions were present at low frequencies. Nevertheless, we identified a subset of TEs present at high frequencies and located in genomic regions with a high recombination rate. These TEs are candidates for being the target of positive selection, although neutral processes should be discarded before reaching any conclusion on the type of selection acting on them. Finally, parallel patterns of association between the frequency of TE insertions and several geographical and temporal variables were found between European and North American populations, suggesting that TEs can be potentially implicated in the adaptation of populations across continents.
Collapse
Affiliation(s)
- Emmanuelle Lerat
- Laboratoire de Biométrie et Biologie EvolutiveUMR 5558Université de Lyon, Université Lyon 1, CNRSVilleurbanneFrance
| | - Clément Goubert
- Molecular Biology and GeneticsCornell UniversityIthacaNew York
| | - Sara Guirao‐Rico
- Institute of Evolutionary Biology (CSIC‐Universitat Pompeu Fabra)BarcelonaSpain
| | - Miriam Merenciano
- Institute of Evolutionary Biology (CSIC‐Universitat Pompeu Fabra)BarcelonaSpain
| | - Anne‐Béatrice Dufour
- Laboratoire de Biométrie et Biologie EvolutiveUMR 5558Université de Lyon, Université Lyon 1, CNRSVilleurbanneFrance
| | - Cristina Vieira
- Laboratoire de Biométrie et Biologie EvolutiveUMR 5558Université de Lyon, Université Lyon 1, CNRSVilleurbanneFrance
| | - Josefa González
- Institute of Evolutionary Biology (CSIC‐Universitat Pompeu Fabra)BarcelonaSpain
| |
Collapse
|
15
|
Rech GE, Bogaerts-Márquez M, Barrón MG, Merenciano M, Villanueva-Cañas JL, Horváth V, Fiston-Lavier AS, Luyten I, Venkataram S, Quesneville H, Petrov DA, González J. Stress response, behavior, and development are shaped by transposable element-induced mutations in Drosophila. PLoS Genet 2019; 15:e1007900. [PMID: 30753202 PMCID: PMC6372155 DOI: 10.1371/journal.pgen.1007900] [Citation(s) in RCA: 45] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2018] [Accepted: 12/16/2018] [Indexed: 11/30/2022] Open
Abstract
Most of the current knowledge on the genetic basis of adaptive evolution is based on the analysis of single nucleotide polymorphisms (SNPs). Despite increasing evidence for their causal role, the contribution of structural variants to adaptive evolution remains largely unexplored. In this work, we analyzed the population frequencies of 1,615 Transposable Element (TE) insertions annotated in the reference genome of Drosophila melanogaster, in 91 samples from 60 worldwide natural populations. We identified a set of 300 polymorphic TEs that are present at high population frequencies, and located in genomic regions with high recombination rate, where the efficiency of natural selection is high. The age and the length of these 300 TEs are consistent with relatively young and long insertions reaching high frequencies due to the action of positive selection. Besides, we identified a set of 21 fixed TEs also likely to be adaptive. Indeed, we, and others, found evidence of selection for 84 of these reference TE insertions. The analysis of the genes located nearby these 84 candidate adaptive insertions suggested that the functional response to selection is related with the GO categories of response to stimulus, behavior, and development. We further showed that a subset of the candidate adaptive TEs affects expression of nearby genes, and five of them have already been linked to an ecologically relevant phenotypic effect. Our results provide a more complete understanding of the genetic variation and the fitness-related traits relevant for adaptive evolution. Similar studies should help uncover the importance of TE-induced adaptive mutations in other species as well.
Collapse
Affiliation(s)
- Gabriel E. Rech
- Institute of Evolutionary Biology (IBE), CSIC-Universitat Pompeu Fabra, Barcelona, Spain
| | - María Bogaerts-Márquez
- Institute of Evolutionary Biology (IBE), CSIC-Universitat Pompeu Fabra, Barcelona, Spain
| | - Maite G. Barrón
- Institute of Evolutionary Biology (IBE), CSIC-Universitat Pompeu Fabra, Barcelona, Spain
| | - Miriam Merenciano
- Institute of Evolutionary Biology (IBE), CSIC-Universitat Pompeu Fabra, Barcelona, Spain
| | | | - Vivien Horváth
- Institute of Evolutionary Biology (IBE), CSIC-Universitat Pompeu Fabra, Barcelona, Spain
| | - Anna-Sophie Fiston-Lavier
- Institut des Sciences de l'Evolution de Montpellier (UMR 5554, CNRS-UM-IRD-EPHE), Université de Montpellier, Place Eugène Bataillon, Montpellier, France
| | | | - Sandeep Venkataram
- Department of Biology, Stanford University, Stanford, CA, United States of America
| | | | - Dmitri A. Petrov
- Department of Biology, Stanford University, Stanford, CA, United States of America
| | - Josefa González
- Institute of Evolutionary Biology (IBE), CSIC-Universitat Pompeu Fabra, Barcelona, Spain
| |
Collapse
|