1
|
Petitpas M, Lapous R, Le Duc M, Lariagon C, Lemoine J, Langrume C, Manzanares-Dauleux MJ, Jubault M. Environmental conditions modulate the effect of epigenetic factors controlling the response of Arabidopsis thaliana to Plasmodiophora brassicae. FRONTIERS IN PLANT SCIENCE 2024; 15:1245545. [PMID: 38872892 PMCID: PMC11171141 DOI: 10.3389/fpls.2024.1245545] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/23/2023] [Accepted: 04/26/2024] [Indexed: 06/15/2024]
Abstract
The resistance of Arabidopsis thaliana to clubroot, a major disease of Brassicaceae caused by the obligate protist Plasmodiophora brassicae, is controlled in part by epigenetic factors. The detection of some of these epigenetic quantitative trait loci (QTLepi) has been shown to depend on experimental conditions. The aim of the present study was to assess whether and how temperature and/or soil water availability influenced both the detection and the extent of the effect of response QTLepi. The epigenetic recombinant inbred line (epiRIL) population, derived from the cross between ddm1-2 and Col-0 (partially resistant and susceptible to clubroot, respectively), was phenotyped for response to P. brassicae under four abiotic conditions including standard conditions, a 5°C temperature increase, drought, and flooding. The abiotic constraints tested had a significant impact on both the leaf growth of the epiRIL population and the outcome of the epiRIL-pathogen interaction. Linkage analysis led to the detection of a total of 31 QTLepi, 18 of which were specific to one abiotic condition and 13 common to at least two environments. EpiRIL showed significant plasticity under epigenetic control, which appeared to be specific to the traits evaluated and to the abiotic conditions. These results highlight that the environment can affect the epigenetic architecture of plant growth and immune responses and advance our understanding of the epigenetic factors underlying plasticity in response to climate change.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Mélanie Jubault
- IGEPP, Institut Agro Rennes-Angers – INRAE – Université de Rennes, Le Rheu, France
| |
Collapse
|
2
|
Riehl K, Riccio C, Miska EA, Hemberg M. TransposonUltimate: software for transposon classification, annotation and detection. Nucleic Acids Res 2022; 50:e64. [PMID: 35234904 PMCID: PMC9226531 DOI: 10.1093/nar/gkac136] [Citation(s) in RCA: 30] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2021] [Revised: 02/09/2022] [Accepted: 02/14/2022] [Indexed: 12/17/2022] Open
Abstract
Most genomes harbor a large number of transposons, and they play an important role in evolution and gene regulation. They are also of interest to clinicians as they are involved in several diseases, including cancer and neurodegeneration. Although several methods for transposon identification are available, they are often highly specialised towards specific tasks or classes of transposons, and they lack common standards such as a unified taxonomy scheme and output file format. We present TransposonUltimate, a powerful bundle of three modules for transposon classification, annotation, and detection of transposition events. TransposonUltimate comes as a Conda package under the GPL-3.0 licence, is well documented and it is easy to install through https://github.com/DerKevinRiehl/TransposonUltimate. We benchmark the classification module on the large TransposonDB covering 891,051 sequences to demonstrate that it outperforms the currently best existing solutions. The annotation and detection modules combine sixteen existing softwares, and we illustrate its use by annotating Caenorhabditis elegans, Rhizophagus irregularis and Oryza sativa subs. japonica genomes. Finally, we use the detection module to discover 29 554 transposition events in the genomes of 20 wild type strains of C. elegans. Databases, assemblies, annotations and further findings can be downloaded from (https://doi.org/10.5281/zenodo.5518085).
Collapse
Affiliation(s)
- Kevin Riehl
- Gurdon Institute, University of Cambridge, Cambridge CB2 1QN, UK
| | - Cristian Riccio
- Gurdon Institute, University of Cambridge, Cambridge CB2 1QN, UK
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, UK
| | - Eric A Miska
- Gurdon Institute, University of Cambridge, Cambridge CB2 1QN, UK
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, UK
- Department of Genetics, University of Cambridge, Downing Street, Cambridge CB2 3EH, UK
| | - Martin Hemberg
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, UK
- Evergrande Center for Immunologic Diseases, Harvard Medical School and Brigham and Women’s Hospital, 75 Francis Street, Boston, MA 02215, USA
| |
Collapse
|
3
|
Oberlin S, Rajeswaran R, Trasser M, Barragán-Borrero V, Schon MA, Plotnikova A, Loncsek L, Nodine MD, Marí-Ordóñez A, Voinnet O. Innate, translation-dependent silencing of an invasive transposon in Arabidopsis. EMBO Rep 2021; 23:e53400. [PMID: 34931432 PMCID: PMC8892269 DOI: 10.15252/embr.202153400] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2021] [Revised: 12/05/2021] [Accepted: 12/06/2021] [Indexed: 11/25/2022] Open
Abstract
Co‐evolution between hosts’ and parasites’ genomes shapes diverse pathways of acquired immunity based on silencing small (s)RNAs. In plants, sRNAs cause heterochromatinization, sequence degeneration, and, ultimately, loss of autonomy of most transposable elements (TEs). Recognition of newly invasive plant TEs, by contrast, involves an innate antiviral‐like silencing response. To investigate this response’s activation, we studied the single‐copy element EVADÉ (EVD), one of few representatives of the large Ty1/Copia family able to proliferate in Arabidopsis when epigenetically reactivated. In Ty1/Copia elements, a short subgenomic mRNA (shGAG) provides the necessary excess of structural GAG protein over the catalytic components encoded by the full‐length genomic flGAG‐POL. We show here that the predominant cytosolic distribution of shGAG strongly favors its translation over mostly nuclear flGAG‐POL. During this process, an unusually intense ribosomal stalling event coincides with mRNA breakage yielding unconventional 5’OH RNA fragments that evade RNA quality control. The starting point of sRNA production by RNA‐DEPENDENT‐RNA‐POLYMERASE‐6 (RDR6), exclusively on shGAG, occurs precisely at this breakage point. This hitherto‐unrecognized “translation‐dependent silencing” (TdS) is independent of codon usage or GC content and is not observed on TE remnants populating the Arabidopsis genome, consistent with their poor association, if any, with polysomes. We propose that TdS forms a primal defense against EVD de novo invasions that underlies its associated sRNA pattern.
Collapse
Affiliation(s)
- Stefan Oberlin
- Department of Biology, Swiss Federal Institute of Technology (ETH), Zurich, Switzerland
| | - Rajendran Rajeswaran
- Department of Biology, Swiss Federal Institute of Technology (ETH), Zurich, Switzerland
| | - Marieke Trasser
- Gregor Mendel Institute of Molecular Plant Biology (GMI) of the Austrian Academy of Sciences, Vienna, Austria.,Vienna BioCenter PhD Program, Doctoral School of the University of Vienna and Medical University of Vienna, Vienna, Austria
| | - Verónica Barragán-Borrero
- Department of Biology, Swiss Federal Institute of Technology (ETH), Zurich, Switzerland.,Gregor Mendel Institute of Molecular Plant Biology (GMI) of the Austrian Academy of Sciences, Vienna, Austria
| | - Michael A Schon
- Gregor Mendel Institute of Molecular Plant Biology (GMI) of the Austrian Academy of Sciences, Vienna, Austria
| | - Alexandra Plotnikova
- Gregor Mendel Institute of Molecular Plant Biology (GMI) of the Austrian Academy of Sciences, Vienna, Austria
| | - Lukas Loncsek
- Gregor Mendel Institute of Molecular Plant Biology (GMI) of the Austrian Academy of Sciences, Vienna, Austria
| | - Michael D Nodine
- Gregor Mendel Institute of Molecular Plant Biology (GMI) of the Austrian Academy of Sciences, Vienna, Austria.,Laboratory of Molecular Biology, Wageningen University, Wageningen, The Netherlands
| | - Arturo Marí-Ordóñez
- Department of Biology, Swiss Federal Institute of Technology (ETH), Zurich, Switzerland.,Gregor Mendel Institute of Molecular Plant Biology (GMI) of the Austrian Academy of Sciences, Vienna, Austria
| | - Olivier Voinnet
- Department of Biology, Swiss Federal Institute of Technology (ETH), Zurich, Switzerland
| |
Collapse
|
4
|
Oggenfuss U, Badet T, Wicker T, Hartmann FE, Singh NK, Abraham L, Karisto P, Vonlanthen T, Mundt C, McDonald BA, Croll D. A population-level invasion by transposable elements triggers genome expansion in a fungal pathogen. eLife 2021; 10:e69249. [PMID: 34528512 PMCID: PMC8445621 DOI: 10.7554/elife.69249] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2021] [Accepted: 08/28/2021] [Indexed: 12/16/2022] Open
Abstract
Genome evolution is driven by the activity of transposable elements (TEs). The spread of TEs can have deleterious effects including the destabilization of genome integrity and expansions. However, the precise triggers of genome expansions remain poorly understood because genome size evolution is typically investigated only among deeply divergent lineages. Here, we use a large population genomics dataset of 284 individuals from populations across the globe of Zymoseptoria tritici, a major fungal wheat pathogen. We built a robust map of genome-wide TE insertions and deletions to track a total of 2456 polymorphic loci within the species. We show that purifying selection substantially depressed TE frequencies in most populations, but some rare TEs have recently risen in frequency and likely confer benefits. We found that specific TE families have undergone a substantial genome-wide expansion from the pathogen's center of origin to more recently founded populations. The most dramatic increase in TE insertions occurred between a pair of North American populations collected in the same field at an interval of 25 years. We find that both genome-wide counts of TE insertions and genome size have increased with colonization bottlenecks. Hence, the demographic history likely played a major role in shaping genome evolution within the species. We show that both the activation of specific TEs and relaxed purifying selection underpin this incipient expansion of the genome. Our study establishes a model to recapitulate TE-driven genome evolution over deeper evolutionary timescales.
Collapse
Affiliation(s)
- Ursula Oggenfuss
- Laboratory of Evolutionary Genetics, Institute of Biology, University of NeuchâtelNeuchatelSwitzerland
| | - Thomas Badet
- Laboratory of Evolutionary Genetics, Institute of Biology, University of NeuchâtelNeuchatelSwitzerland
| | - Thomas Wicker
- Institute for Plant and Microbial Biology, University of ZurichZurichSwitzerland
| | - Fanny E Hartmann
- Ecologie Systématique Evolution, Bâtiment 360, Univ. Paris-Sud, AgroParisTech, CNRS, Université Paris-SaclayOrsayFrance
- Plant Pathology, Institute of Integrative Biology, ETH ZurichZurichSwitzerland
| | - Nikhil Kumar Singh
- Laboratory of Evolutionary Genetics, Institute of Biology, University of NeuchâtelNeuchatelSwitzerland
| | - Leen Abraham
- Laboratory of Evolutionary Genetics, Institute of Biology, University of NeuchâtelNeuchatelSwitzerland
| | - Petteri Karisto
- Plant Pathology, Institute of Integrative Biology, ETH ZurichZurichSwitzerland
| | - Tiziana Vonlanthen
- Plant Pathology, Institute of Integrative Biology, ETH ZurichZurichSwitzerland
| | - Christopher Mundt
- Department of Botany and Plant Pathology, Oregon State UniversityCorvallisUnited States
| | - Bruce A McDonald
- Plant Pathology, Institute of Integrative Biology, ETH ZurichZurichSwitzerland
| | - Daniel Croll
- Laboratory of Evolutionary Genetics, Institute of Biology, University of NeuchâtelNeuchatelSwitzerland
| |
Collapse
|
5
|
Quesneville H. Twenty years of transposable element analysis in the Arabidopsis thaliana genome. Mob DNA 2020; 11:28. [PMID: 32742313 PMCID: PMC7385966 DOI: 10.1186/s13100-020-00223-x] [Citation(s) in RCA: 49] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2020] [Accepted: 07/16/2020] [Indexed: 11/10/2022] Open
Abstract
Transposable elements (TEs) are mobile repetitive DNA sequences shown to be major drivers of genome evolution. As the first plant to have its genome sequenced and analyzed at the genomic scale, Arabidopsis thaliana has largely contributed to our TE knowledge. The present report describes 20 years of accumulated TE knowledge gained through the study of the Arabidopsis genome and covers the known TE families, their relative abundance, and their genomic distribution. It presents our knowledge of the different TE family activities, mobility, population and long-term evolutionary dynamics. Finally, the role of TE as substrates for new genes and their impact on gene expression is illustrated through a few selected demonstrative cases. Promising future directions for TE studies in this species conclude the review.
Collapse
|
6
|
Catoni M, Jonesman T, Cerruti E, Paszkowski J. Mobilization of Pack-CACTA transposons in Arabidopsis suggests the mechanism of gene shuffling. Nucleic Acids Res 2019; 47:1311-1320. [PMID: 30476196 PMCID: PMC6379663 DOI: 10.1093/nar/gky1196] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2018] [Revised: 11/08/2018] [Accepted: 11/16/2018] [Indexed: 11/21/2022] Open
Abstract
Pack-TYPE transposons are a unique class of potentially mobile non-autonomous elements that can capture, merge and relocate fragments of chromosomal DNA. It has been postulated that their activity accelerates the evolution of host genes. However, this important presumption is based only on the sequences of currently inactive Pack-TYPE transposons and the acquisition of chromosomal DNA has not been recorded in real time. Analysing the DNA copy number variation in hypomethylated Arabidopsis lines, we have now for the first time witnessed the mobilization of novel Pack-TYPE elements related to the CACTA transposon family, over several plant generations. Remarkably, these elements can insert into genes as closely spaced direct repeats and they frequently undergo incomplete excisions, resulting in the deletion of one of the end sequences. These properties suggest a mechanism of efficient acquisition of genic DNA residing between neighbouring Pack-TYPE transposons and its subsequent mobilization. Our work documents crucial steps in the formation of in vivo novel Pack-TYPE transposons, and thus the possible mechanism of gene shuffling mediated by this type of mobile element.
Collapse
Affiliation(s)
- Marco Catoni
- The Sainsbury Laboratory, University of Cambridge, Cambridge, CB2 1LR, UK.,School of Biosciences, University of Birmingham, Edgbaston, Birmingham, B15 2TT, UK
| | - Thomas Jonesman
- The Sainsbury Laboratory, University of Cambridge, Cambridge, CB2 1LR, UK
| | - Elisa Cerruti
- The Sainsbury Laboratory, University of Cambridge, Cambridge, CB2 1LR, UK
| | - Jerzy Paszkowski
- The Sainsbury Laboratory, University of Cambridge, Cambridge, CB2 1LR, UK
| |
Collapse
|
7
|
Transposition favors the generation of large effect mutations that may facilitate rapid adaption. Nat Commun 2019; 10:3421. [PMID: 31366887 PMCID: PMC6668482 DOI: 10.1038/s41467-019-11385-5] [Citation(s) in RCA: 90] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2019] [Accepted: 07/12/2019] [Indexed: 12/13/2022] Open
Abstract
Transposable elements (TEs) are mobile parasitic sequences that have been repeatedly coopted during evolution to generate new functions and rewire gene regulatory networks. Yet, the contribution of active TEs to the creation of heritable mutations remains unknown. Using TE accumulation lines in Arabidopsis thaliana we show that once initiated, transposition produces an exponential spread of TE copies, which rapidly leads to high mutation rates. Most insertions occur near or within genes and targets differ between TE families. Furthermore, we uncover an essential role of the histone variant H2A.Z in the preferential integration of Ty1/copia retrotransposons within environmentally responsive genes and away from essential genes. We also show that epigenetic silencing of new Ty1/copia copies can affect their impact on major fitness-related traits, including flowering time. Our findings demonstrate that TEs are potent episodic (epi)mutagens that, thanks to marked chromatin tropisms, limit the mutation load and increase the potential for rapid adaptation.
Collapse
|
8
|
Liégard B, Baillet V, Etcheverry M, Joseph E, Lariagon C, Lemoine J, Evrard A, Colot V, Gravot A, Manzanares‐Dauleux MJ, Jubault M. Quantitative resistance to clubroot infection mediated by transgenerational epigenetic variation in Arabidopsis. THE NEW PHYTOLOGIST 2019; 222:468-479. [PMID: 30393890 PMCID: PMC6587750 DOI: 10.1111/nph.15579] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/25/2018] [Accepted: 10/26/2018] [Indexed: 05/02/2023]
Abstract
Quantitative disease resistance, often influenced by environmental factors, is thought to be the result of DNA sequence variants segregating at multiple loci. However, heritable differences in DNA methylation, so-called transgenerational epigenetic variants, also could contribute to quantitative traits. Here, we tested this possibility using the well-characterized quantitative resistance of Arabidopsis to clubroot, a Brassica major disease caused by Plasmodiophora brassicae. For that, we used the epigenetic recombinant inbred lines (epiRIL) derived from the cross ddm1-2 × Col-0, which show extensive epigenetic variation but limited DNA sequence variation. Quantitative loci under epigenetic control (QTLepi ) mapping was carried out on 123 epiRIL infected with P. brassicae and using various disease-related traits. EpiRIL displayed a wide range of continuous phenotypic responses. Twenty QTLepi were detected across the five chromosomes, with a bona fide epigenetic origin for 16 of them. The effect of five QTLepi was dependent on temperature conditions. Six QTLepi co-localized with previously identified clubroot resistance genes and QTL in Arabidopsis. Co-localization of clubroot resistance QTLepi with previously detected DNA-based QTL reveals a complex model in which a combination of allelic and epiallelic variations interacts with the environment to lead to variation in clubroot quantitative resistance.
Collapse
Affiliation(s)
- Benjamin Liégard
- IGEPPINRAAGROCAMPUS OUESTUniversité de RennesF‐35000RennesFrance
| | - Victoire Baillet
- Institut de Biologie de l'Ecole Normale Supérieure (IBENS)Ecole Normale SupérieureCentre National de la Recherche Scientifique (CNRS)Institut National de la Santé et de la Recherche Médicale (INSERM)F‐75005ParisFrance
| | - Mathilde Etcheverry
- Institut de Biologie de l'Ecole Normale Supérieure (IBENS)Ecole Normale SupérieureCentre National de la Recherche Scientifique (CNRS)Institut National de la Santé et de la Recherche Médicale (INSERM)F‐75005ParisFrance
| | - Evens Joseph
- IGEPPINRAAGROCAMPUS OUESTUniversité de RennesF‐35000RennesFrance
| | | | - Jocelyne Lemoine
- IGEPPINRAAGROCAMPUS OUESTUniversité de RennesF‐35000RennesFrance
| | - Aurélie Evrard
- IGEPPINRAAGROCAMPUS OUESTUniversité de RennesF‐35000RennesFrance
| | - Vincent Colot
- Institut de Biologie de l'Ecole Normale Supérieure (IBENS)Ecole Normale SupérieureCentre National de la Recherche Scientifique (CNRS)Institut National de la Santé et de la Recherche Médicale (INSERM)F‐75005ParisFrance
| | - Antoine Gravot
- IGEPPINRAAGROCAMPUS OUESTUniversité de RennesF‐35000RennesFrance
| | | | - Mélanie Jubault
- IGEPPINRAAGROCAMPUS OUESTUniversité de RennesF‐35000RennesFrance
| |
Collapse
|
9
|
Furci L, Jain R, Stassen J, Berkowitz O, Whelan J, Roquis D, Baillet V, Colot V, Johannes F, Ton J. Identification and characterisation of hypomethylated DNA loci controlling quantitative resistance in Arabidopsis. eLife 2019; 8:40655. [PMID: 30608232 PMCID: PMC6342528 DOI: 10.7554/elife.40655] [Citation(s) in RCA: 60] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2018] [Accepted: 01/03/2019] [Indexed: 12/12/2022] Open
Abstract
Variation in DNA methylation enables plants to inherit traits independently of changes to DNA sequence. Here, we have screened an Arabidopsis population of epigenetic recombinant inbred lines (epiRILs) for resistance against Hyaloperonospora arabidopsidis (Hpa). These lines share the same genetic background, but show variation in heritable patterns of DNA methylation. We identified four epigenetic quantitative trait loci (epiQTLs) that provide quantitative resistance without reducing plant growth or resistance to other (a)biotic stresses. Phenotypic characterisation and RNA-sequencing analysis revealed that Hpa-resistant epiRILs are primed to activate defence responses at the relatively early stages of infection. Collectively, our results show that hypomethylation at selected pericentromeric regions is sufficient to provide quantitative disease resistance, which is associated with genome-wide priming of defence-related genes. Based on comparisons of global gene expression and DNA methylation between the wild-type and resistant epiRILs, we discuss mechanisms by which the pericentromeric epiQTLs could regulate the defence-related transcriptome. In plants, animals and microbes genetic information is encoded by DNA, which are made up of sequences of building blocks, called nucleotide bases. These sequences can be separated into sections known as genes that each encode specific traits. It was previously thought that only changes to the sequence of bases in a DNA molecule could alter the traits passed on to future generations. However, it has recently become clear that some traits can also be inherited through modifications to the DNA that do not alter its sequence. One such modification is to attach a tag, known as a methyl group, to a nucleotide base known as cytosine. These methyl tags can be added to, or removed from, DNA to create different patterns of methylation. Previous studies have shown that plants whose DNA is less methylated than normal (‘hypo-methylated’) are more resistant to plant diseases. However, the location and identity of the hypo-methylated DNA regions controlling this resistance remained unknown. To address this problem, Furci, Jain et al. studied how DNA methylation in a small weed known as Arabidopsis thaliana affects how well the plants can resist a disease known as downy mildew. Furci, Jain et al. studied a population of over 100 A. thaliana lines that have the same DNA sequences but different patterns of DNA methylation. The experiments identified four DNA locations that were less methylated in lines with enhanced resistance to downy mildew. Importantly, this form of resistance did not appear to reduce how well the plants grew, or make them less able to resist other diseases or environmental stresses. The results of further experiments suggested that reduced methylation at the four DNA regions prime the plant’s immune system, enabling a faster and stronger activation of a multitude of defence genes across the genome after attack by downy mildew. The next steps following on from this work are to investigate exactly how the four DNA regions with reduced methylation can prime so many different defence genes in the plant. Further research is also needed to determine whether it is possible to breed crop plants with lower levels of methylation at specific DNA locations to improve disease resistance, but without decreasing the amount and quality of food produced.
Collapse
Affiliation(s)
- Leonardo Furci
- P3 Centre for Plant and Soil Biology, Department of Animal and Plant Sciences, University of Sheffield, Sheffield, United Kingdom
| | - Ritushree Jain
- P3 Centre for Plant and Soil Biology, Department of Animal and Plant Sciences, University of Sheffield, Sheffield, United Kingdom
| | - Joost Stassen
- P3 Centre for Plant and Soil Biology, Department of Animal and Plant Sciences, University of Sheffield, Sheffield, United Kingdom
| | - Oliver Berkowitz
- Department of Animal, Plant and Soil Science, ARC Centre of Excellence in Plant Energy Biology, La Trobe University, Melbourne, Australia
| | - James Whelan
- Department of Animal, Plant and Soil Science, ARC Centre of Excellence in Plant Energy Biology, La Trobe University, Melbourne, Australia
| | - David Roquis
- Department of Plant Sciences, Technical University of Munich, Freising, Germany.,Institute for Advanced Study, Technical University of Munich, Garching, Germany
| | - Victoire Baillet
- Institut de Biologie de l'Ecole Normale Supérieure (IBENS), Ecole Normale Supérieure, Centre National de la Recherche Scientifique (CNRS), Institut National de la Santé et de la Recherche Médicale (INSERM), PSL Université Paris, Paris, France
| | - Vincent Colot
- Institut de Biologie de l'Ecole Normale Supérieure (IBENS), Ecole Normale Supérieure, Centre National de la Recherche Scientifique (CNRS), Institut National de la Santé et de la Recherche Médicale (INSERM), PSL Université Paris, Paris, France
| | - Frank Johannes
- Department of Plant Sciences, Technical University of Munich, Freising, Germany.,Institute for Advanced Study, Technical University of Munich, Garching, Germany
| | - Jurriaan Ton
- P3 Centre for Plant and Soil Biology, Department of Animal and Plant Sciences, University of Sheffield, Sheffield, United Kingdom
| |
Collapse
|
10
|
Bae J, Lee KW, Islam MN, Yim HS, Park H, Rho M. iMGEins: detecting novel mobile genetic elements inserted in individual genomes. BMC Genomics 2018; 19:944. [PMID: 30563451 PMCID: PMC6299635 DOI: 10.1186/s12864-018-5290-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2018] [Accepted: 11/20/2018] [Indexed: 11/10/2022] Open
Abstract
Background Recent advances in sequencing technology have allowed us to investigate personal genomes to find structural variations, which have been studied extensively to identify their association with the physiology of diseases such as cancer. In particular, mobile genetic elements (MGEs) are one of the major constituents of the human genomes, and cause genome instability by insertion, mutation, and rearrangement. Result We have developed a new program, iMGEins, to identify such novel MGEs by using sequencing reads of individual genomes, and to explore the breakpoints with the supporting reads and MGEs detected. iMGEins is the first MGE detection program that integrates three algorithmic components: discordant read-pair mapping, split-read mapping, and insertion sequence assembly. Our evaluation results showed its outstanding performance in detecting novel MGEs from simulated genomes, as well as real personal genomes. In detail, the average recall and precision rates of iMGEins are 96.67 and 100%, respectively, which are the highest among the programs compared. In the testing with real human genomes of the NA12878 sample, iMGEins shows the highest accuracy in detecting MGEs within 20 bp proximity of the breakpoints annotated. Conclusion In order to study the dynamics of MGEs in individual genomes, iMGEins was developed to accurately detect breakpoints and report inserted MGEs. Compared with other programs, iMGEins has valuable features of identifying novel MGEs and assembling the MGEs inserted. Electronic supplementary material The online version of this article (10.1186/s12864-018-5290-9) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Junwoo Bae
- Department of Electronics and Computer Engineering, Hanyang University, Seoul, Korea
| | - Kyeong Won Lee
- Marine Biotechnology Research Center, Korea Institute of Ocean Science and Technology, Ansan, Korea
| | - Mohammad Nazrul Islam
- Marine Biotechnology Research Center, Korea Institute of Ocean Science and Technology, Ansan, Korea.,Department of Marine Biotechnology, Korea University of Science and Technology, Daejeon, Korea.,Department of Biotechnology, Sher-e-Bangla Agricultural University, Dhaka, 1207, Bangladesh
| | - Hyung-Soon Yim
- Marine Biotechnology Research Center, Korea Institute of Ocean Science and Technology, Ansan, Korea.,Department of Marine Biotechnology, Korea University of Science and Technology, Daejeon, Korea
| | - Heejin Park
- Department of Computer Science and Engineering, Hanyang University, Seoul, Korea. .,Department of Biomedical Informatics, Hanyang University, Seoul, Korea.
| | - Mina Rho
- Department of Computer Science and Engineering, Hanyang University, Seoul, Korea. .,Department of Biomedical Informatics, Hanyang University, Seoul, Korea.
| |
Collapse
|
11
|
Sheppard AE, Stoesser N, German-Mesner I, Vegesana K, Sarah Walker A, Crook DW, Mathers AJ. TETyper: a bioinformatic pipeline for classifying variation and genetic contexts of transposable elements from short-read whole-genome sequencing data. Microb Genom 2018; 4:e000232. [PMID: 30465646 PMCID: PMC6412039 DOI: 10.1099/mgen.0.000232] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2018] [Accepted: 10/12/2018] [Indexed: 01/04/2023] Open
Abstract
Much of the worldwide dissemination of antibiotic resistance has been driven by resistance gene associations with mobile genetic elements (MGEs), such as plasmids and transposons. Although increasing, our understanding of resistance spread remains relatively limited, as methods for tracking mobile resistance genes through multiple species, strains and plasmids are lacking. We have developed a bioinformatic pipeline for tracking variation within, and mobility of, specific transposable elements (TEs), such as transposons carrying antibiotic-resistance genes. TETyper takes short-read whole-genome sequencing data as input and identifies single-nucleotide mutations and deletions within the TE of interest, to enable tracking of specific sequence variants, as well as the surrounding genetic context(s), to enable identification of transposition events. A major advantage of TETyper over previous methods is that it does not require a genome reference. To investigate global dissemination of Klebsiella pneumoniae carbapenemase (KPC) and its associated transposon Tn4401, we applied TETyper to a collection of over 3000 publicly available Illumina datasets containing blaKPC. This revealed surprising diversity, with over 200 distinct flanking genetic contexts for Tn4401, indicating high levels of transposition. Integration of sample metadata revealed insights into associations between geographic locations, host species, Tn4401 sequence variants and flanking genetic contexts. To demonstrate the ability of TETyper to cope with high-copy-number TEs and to track specific short-term evolutionary changes, we also applied it to the insertion sequence IS26 within a defined K. pneumoniae outbreak. TETyper is implemented in python and is freely available at https://github.com/aesheppard/TETyper.
Collapse
Affiliation(s)
- Anna E. Sheppard
- Nuffield Department of Medicine, University of Oxford, Oxford, UK
- National Institute for Health Research Health Protection Research Unit in Healthcare Associated Infections and Antimicrobial Resistance, University of Oxford, Oxford, UK
| | - Nicole Stoesser
- Nuffield Department of Medicine, University of Oxford, Oxford, UK
| | - Ian German-Mesner
- Health Information & Technology, University of Virginia Health System, Charlottesville, Virginia, USA
| | - Kasi Vegesana
- Health Information & Technology, University of Virginia Health System, Charlottesville, Virginia, USA
| | - A. Sarah Walker
- Nuffield Department of Medicine, University of Oxford, Oxford, UK
| | - Derrick W. Crook
- Nuffield Department of Medicine, University of Oxford, Oxford, UK
| | - Amy J. Mathers
- Division of Infectious Disease and International Health, Department of Medicine, University of Virginia Health System, Charlottesville, Virginia, USA
- Clinical Microbiology Laboratory, Department of Pathology, University of Virginia Health System, Charlottesville, Virginia, USA
| |
Collapse
|
12
|
|
13
|
Serrato-Capuchina A, Matute DR. The Role of Transposable Elements in Speciation. Genes (Basel) 2018; 9:E254. [PMID: 29762547 PMCID: PMC5977194 DOI: 10.3390/genes9050254] [Citation(s) in RCA: 99] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2018] [Revised: 04/26/2018] [Accepted: 04/26/2018] [Indexed: 01/20/2023] Open
Abstract
Understanding the phenotypic and molecular mechanisms that contribute to genetic diversity between and within species is fundamental in studying the evolution of species. In particular, identifying the interspecific differences that lead to the reduction or even cessation of gene flow between nascent species is one of the main goals of speciation genetic research. Transposable elements (TEs) are DNA sequences with the ability to move within genomes. TEs are ubiquitous throughout eukaryotic genomes and have been shown to alter regulatory networks, gene expression, and to rearrange genomes as a result of their transposition. However, no systematic effort has evaluated the role of TEs in speciation. We compiled the evidence for TEs as potential causes of reproductive isolation across a diversity of taxa. We find that TEs are often associated with hybrid defects that might preclude the fusion between species, but that the involvement of TEs in other barriers to gene flow different from postzygotic isolation is still relatively unknown. Finally, we list a series of guides and research avenues to disentangle the effects of TEs on the origin of new species.
Collapse
Affiliation(s)
- Antonio Serrato-Capuchina
- Biology Department, Genome Sciences Building, University of North Carolina, Chapel Hill, NC 27514, USA.
| | - Daniel R Matute
- Biology Department, Genome Sciences Building, University of North Carolina, Chapel Hill, NC 27514, USA.
| |
Collapse
|
14
|
Sanchez DH, Gaubert H, Drost HG, Zabet NR, Paszkowski J. High-frequency recombination between members of an LTR retrotransposon family during transposition bursts. Nat Commun 2017; 8:1283. [PMID: 29097664 PMCID: PMC5668417 DOI: 10.1038/s41467-017-01374-x] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2017] [Accepted: 09/14/2017] [Indexed: 12/04/2022] Open
Abstract
Retrotransposons containing long terminal repeats (LTRs) form a substantial fraction of eukaryotic genomes. The timing of past transposition can be estimated by quantifying the accumulation of mutations in initially identical LTRs. This way, retrotransposons are divided into young, potentially mobile elements, and old that moved thousands or even millions of years ago. Both types are found within a single retrotransposon family and it is assumed that the old members will remain immobile and degenerate further. Here, we provide evidence in Arabidopsis that old members enter into replication/transposition cycles through high rates of intra-family recombination. The recombination occurs pairwise, resembling the formation of recombinant retroviruses. Thus, each transposition burst generates a novel progeny population of chromosomally integrated LTR retrotransposons consisting of pairwise recombination products produced in a process comparable the sexual exchange of genetic information. Our observations provide an explanation for the reported high rates of sequence diversification in retrotransposons. Retrotransposons are abundant in eukaryotic genomes. Here, Sanchez et al. show evidence of high-frequency recombination between members of an LTR retrotransposon family during transposition bursts in Arabidopsis.
Collapse
Affiliation(s)
- Diego H Sanchez
- The Sainsbury Laboratory, University of Cambridge, Cambridge, CB2 1LR, UK
| | - Hervé Gaubert
- The Sainsbury Laboratory, University of Cambridge, Cambridge, CB2 1LR, UK
| | - Hajk-Georg Drost
- The Sainsbury Laboratory, University of Cambridge, Cambridge, CB2 1LR, UK
| | - Nicolae Radu Zabet
- The Sainsbury Laboratory, University of Cambridge, Cambridge, CB2 1LR, UK.,School of Biological Sciences, University of Essex, Colchester, CO4 3SQ, UK
| | - Jerzy Paszkowski
- The Sainsbury Laboratory, University of Cambridge, Cambridge, CB2 1LR, UK.
| |
Collapse
|
15
|
McClintock: An Integrated Pipeline for Detecting Transposable Element Insertions in Whole-Genome Shotgun Sequencing Data. G3-GENES GENOMES GENETICS 2017. [PMID: 28637810 PMCID: PMC5555480 DOI: 10.1534/g3.117.043893] [Citation(s) in RCA: 48] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Transposable element (TE) insertions are among the most challenging types of variants to detect in genomic data because of their repetitive nature and complex mechanisms of replication . Nevertheless, the recent availability of large resequencing data sets has spurred the development of many new methods to detect TE insertions in whole-genome shotgun sequences. Here we report an integrated bioinformatics pipeline for the detection of TE insertions in whole-genome shotgun data, called McClintock (https://github.com/bergmanlab/mcclintock), which automatically runs and standardizes output for multiple TE detection methods. We demonstrate the utility of McClintock by evaluating six TE detection methods using simulated and real genome data from the model microbial eukaryote, Saccharomyces cerevisiae We find substantial variation among McClintock component methods in their ability to detect nonreference TEs in the yeast genome, but show that nonreference TEs at nearly all biologically realistic locations can be detected in simulated data by combining multiple methods that use split-read and read-pair evidence. In general, our results reveal that split-read methods detect fewer nonreference TE insertions than read-pair methods, but generally have much higher positional accuracy. Analysis of a large sample of real yeast genomes reveals that most McClintock component methods can recover known aspects of TE biology in yeast such as the transpositional activity status of families, target preferences, and target site duplication structure, albeit with varying levels of accuracy. Our work provides a general framework for integrating and analyzing results from multiple TE detection methods, as well as useful guidance for researchers studying TEs in yeast resequencing data.
Collapse
|
16
|
Oberlin S, Sarazin A, Chevalier C, Voinnet O, Marí-Ordóñez A. A genome-wide transcriptome and translatome analysis of Arabidopsis transposons identifies a unique and conserved genome expression strategy for Ty1/Copia retroelements. Genome Res 2017; 27:1549-1562. [PMID: 28784835 PMCID: PMC5580714 DOI: 10.1101/gr.220723.117] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2017] [Accepted: 07/18/2017] [Indexed: 01/17/2023]
Abstract
Retroelements, the prevalent class of plant transposons, have major impacts on host genome integrity and evolution. They produce multiple proteins from highly compact genomes and, similar to viruses, must have evolved original strategies to optimize gene expression, although this aspect has been seldom investigated thus far. Here, we have established a high-resolution transcriptome/translatome map for the near-entirety of Arabidopsis thaliana transposons, using two distinct DNA methylation mutants in which transposon expression is broadly de-repressed. The value of this map to study potentially intact and transcriptionally active transposons in A. thaliana is illustrated by our comprehensive analysis of the cotranscriptional and translational features of Ty1/Copia elements, a family of young and active retroelements in plant genomes, and how such features impact their biology. Genome-wide transcript profiling revealed a unique and widely conserved alternative splicing event coupled to premature termination that allows for the synthesis of a short subgenomic RNA solely dedicated to production of the GAG structural protein and that preferentially associates with polysomes for efficient translation. Mutations engineered in a transgenic version of the Arabidopsis EVD Ty1/Copia element further show how alternative splicing is crucial for the appropriate coordination of full-length and subgenomic RNA transcription. We propose that this hitherto undescribed genome expression strategy, conserved among plant Ty1/Copia elements, enables an excess of structural versus catalytic components, mandatory for mobilization.
Collapse
Affiliation(s)
- Stefan Oberlin
- Department of Biology, ETH Zurich, 8092 Zurich, Switzerland
| | - Alexis Sarazin
- Department of Biology, ETH Zurich, 8092 Zurich, Switzerland
| | | | | | | |
Collapse
|
17
|
Affiliation(s)
| | - Vincent Colot
- Institut de Biologie de l'Ecole Normale Supérieure (IBENS), Ecole Normale Supérieure, Centre National de la Recherche Scientifique (CNRS), Institut National de la Santé et de la Recherche Médicale (INSERM), F-75005 Paris, France;
| |
Collapse
|
18
|
Kofler R, Gómez-Sánchez D, Schlötterer C. PoPoolationTE2: Comparative Population Genomics of Transposable Elements Using Pool-Seq. Mol Biol Evol 2016; 33:2759-64. [PMID: 27486221 PMCID: PMC5026257 DOI: 10.1093/molbev/msw137] [Citation(s) in RCA: 64] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023] Open
Abstract
The evolutionary dynamics of transposable elements (TEs) are still poorly understood. One reason is that TE abundance needs to be studied at the population level, but sequencing individuals on a population scale is still too expensive to characterize TE abundance in multiple populations. Although sequencing pools of individuals dramatically reduces sequencing costs, a comparison of TE abundance between pooled samples has been difficult, if not impossible, due to various biases. Here, we introduce a novel bioinformatic tool, PoPoolationTE2, which is specifically tailored for the comparison of TE abundance among pooled population samples or different tissues. Using computer simulations, we demonstrate that PoPoolationTE2 not only faithfully recovers TE insertion frequencies and positions but, by homogenizing the power to identify TEs across samples, it provides an unbiased comparison of TE abundance between pooled population samples. We anticipate that PoPoolationTE2 will greatly facilitate the analysis of TE insertion patterns in a broad range of applications.
Collapse
Affiliation(s)
- Robert Kofler
- Institut für Populationsgenetik, Vetmeduni Vienna, Veterinärplatz 1, 1210 Wien, Austria
| | - Daniel Gómez-Sánchez
- Institut für Populationsgenetik, Vetmeduni Vienna, Veterinärplatz 1, 1210 Wien, Austria
| | - Christian Schlötterer
- Institut für Populationsgenetik, Vetmeduni Vienna, Veterinärplatz 1, 1210 Wien, Austria
| |
Collapse
|
19
|
Next-generation sequencing-based detection of germline L1-mediated transductions. BMC Genomics 2016; 17:342. [PMID: 27161561 PMCID: PMC4862182 DOI: 10.1186/s12864-016-2670-x] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2015] [Accepted: 04/26/2016] [Indexed: 01/01/2023] Open
Abstract
Background While active LINE-1 (L1) elements possess the ability to mobilize flanking sequences to different genomic loci through a process termed transduction influencing genomic content and structure, an approach for detecting polymorphic germline non-reference transductions in massively-parallel sequencing data has been lacking. Results Here we present the computational approach TIGER (Transduction Inference in GERmline genomes), enabling the discovery of non-reference L1-mediated transductions by combining L1 discovery with detection of unique insertion sequences and detailed characterization of insertion sites. We employed TIGER to characterize polymorphic transductions in fifteen genomes from non-human primate species (chimpanzee, orangutan and rhesus macaque), as well as in a human genome. We achieved high accuracy as confirmed by PCR and two single molecule DNA sequencing techniques, and uncovered differences in relative rates of transduction between primate species. Conclusions By enabling detection of polymorphic transductions, TIGER makes this form of relevant structural variation amenable for population and personal genome analysis. Electronic supplementary material The online version of this article (doi:10.1186/s12864-016-2670-x) contains supplementary material, which is available to authorized users.
Collapse
|
20
|
Ecovoiu AA, Ghionoiu IC, Ciuca AM, Ratiu AC. Genome ARTIST: a robust, high-accuracy aligner tool for mapping transposon insertions and self-insertions. Mob DNA 2016; 7:3. [PMID: 26855675 PMCID: PMC4744444 DOI: 10.1186/s13100-016-0061-0] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2015] [Accepted: 01/19/2016] [Indexed: 01/16/2023] Open
Abstract
Background A critical topic of insertional mutagenesis experiments performed on model organisms is mapping the hits of artificial transposons (ATs) at nucleotide level accuracy. Mapping errors may occur when sequencing artifacts or mutations as single nucleotide polymorphisms (SNPs) and small indels are present very close to the junction between a genomic sequence and a transposon inverted repeat (TIR). Another particular item of insertional mutagenesis is mapping of the transposon self-insertions and, to our best knowledge, there is no publicly available mapping tool designed to analyze such molecular events. Results We developed Genome ARTIST, a pairwise gapped aligner tool which works out both issues by means of an original, robust mapping strategy. Genome ARTIST is not designed to use next-generation sequencing (NGS) data but to analyze ATs insertions obtained in small to medium-scale mutagenesis experiments. Genome ARTIST employs a heuristic approach to find DNA sequence similarities and harnesses a multi-step implementation of a Smith-Waterman adapted algorithm to compute the mapping alignments. The experience is enhanced by easily customizable parameters and a user-friendly interface that describes the genomic landscape surrounding the insertion. Genome ARTIST is functional with many genomes of bacteria and eukaryotes available in Ensembl and GenBank repositories. Our tool specifically harnesses the sequence annotation data provided by FlyBase for Drosophila melanogaster (the fruit fly), which enables mapping of insertions relative to various genomic features such as natural transposons. Genome ARTIST was tested against other alignment tools using relevant query sequences derived from the D. melanogaster and Mus musculus (mouse) genomes. Real and simulated query sequences were also comparatively inquired, revealing that Genome ARTIST is a very robust solution for mapping transposon insertions. Conclusions Genome ARTIST is a stand-alone user-friendly application, designed for high-accuracy mapping of transposon insertions and self-insertions. The tool is also useful for routine aligning assessments like detection of SNPs or checking the specificity of primers and probes. Genome ARTIST is an open source software and is available for download at www.genomeartist.ro and at GitHub (https://github.com/genomeartist/genomeartist ). Electronic supplementary material The online version of this article (doi:10.1186/s13100-016-0061-0) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Alexandru Al Ecovoiu
- Department of Genetics, Faculty of Biology, University of Bucharest, Bucharest, Romania
| | | | | | - Attila Cristian Ratiu
- Department of Genetics, Faculty of Biology, University of Bucharest, Bucharest, Romania
| |
Collapse
|
21
|
Ewing AD. Transposable element detection from whole genome sequence data. Mob DNA 2015; 6:24. [PMID: 26719777 PMCID: PMC4696183 DOI: 10.1186/s13100-015-0055-3] [Citation(s) in RCA: 123] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2015] [Accepted: 12/21/2015] [Indexed: 11/25/2022] Open
Abstract
The number of software tools available for detecting transposable element insertions from whole genome sequence data has been increasing steadily throughout the last ~5 years. Some of these methods have unique features suiting them for particular use cases, but in general they follow one or more of a common set of approaches. Here, detection and filtering approaches are reviewed in the light of transposable element biology and the current state of whole genome sequencing. We demonstrate that the current state-of-the-art methods still do not produce highly concordant results and provide resources to assist future development in transposable element detection methods.
Collapse
Affiliation(s)
- Adam D Ewing
- Mater Research Institute - University of Queensland, 37 Kent St Level 4, Woolloongabba, QLD 4102 Australia
| |
Collapse
|