1
|
Hannon-Hatfield JA, Chen J, Bergman CM, Garfinkel DJ. Evolution of a Restriction Factor by Domestication of a Yeast Retrotransposon. Mol Biol Evol 2024; 41:msae050. [PMID: 38442736 PMCID: PMC10951436 DOI: 10.1093/molbev/msae050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2023] [Revised: 02/13/2024] [Accepted: 02/23/2024] [Indexed: 03/07/2024] Open
Abstract
Transposable elements drive genome evolution in all branches of life. Transposable element insertions are often deleterious to their hosts and necessitate evolution of control mechanisms to limit their spread. The long terminal repeat retrotransposon Ty1 prime (Ty1'), a subfamily of the Ty1 family, is present in many Saccharomyces cerevisiae strains, but little is known about what controls its copy number. Here, we provide evidence that a novel gene from an exapted Ty1' sequence, domesticated restriction of Ty1' relic 2 (DRT2), encodes a restriction factor that inhibits Ty1' movement. DRT2 arose through domestication of a Ty1' GAG gene and contains the C-terminal domain of capsid, which in the related Ty1 canonical subfamily functions as a self-encoded restriction factor. Bioinformatic analysis reveals the widespread nature of DRT2, its evolutionary history, and pronounced structural variation at the Ty1' relic 2 locus. Ty1' retromobility analyses demonstrate DRT2 restriction factor functionality, and northern blot and RNA-seq analysis indicate that DRT2 is transcribed in multiple strains. Velocity cosedimentation profiles indicate an association between Drt2 and Ty1' virus-like particles or assembly complexes. Chimeric Ty1' elements containing DRT2 retain retromobility, suggesting an ancestral role of productive Gag C-terminal domain of capsid functionality is present in the sequence. Unlike Ty1 canonical, Ty1' retromobility increases with copy number, suggesting that C-terminal domain of capsid-based restriction is not limited to the Ty1 canonical subfamily self-encoded restriction factor and drove the endogenization of DRT2. The discovery of an exapted Ty1' restriction factor provides insight into the evolution of the Ty1 family, evolutionary hot-spots, and host-transposable element interactions.
Collapse
Affiliation(s)
- J Adam Hannon-Hatfield
- Department of Biochemistry and Molecular Biology, University of Georgia, Athens, GA, USA
| | - Jingxuan Chen
- Institute of Bioinformatics, University of Georgia, Athens, GA, USA
| | - Casey M Bergman
- Institute of Bioinformatics, University of Georgia, Athens, GA, USA
- Department of Genetics, University of Georgia, Athens, GA, USA
| | - David J Garfinkel
- Department of Biochemistry and Molecular Biology, University of Georgia, Athens, GA, USA
| |
Collapse
|
2
|
Chen J, Basting PJ, Han S, Garfinkel DJ, Bergman CM. Reproducible evaluation of transposable element detectors with McClintock 2 guides accurate inference of Ty insertion patterns in yeast. Mob DNA 2023; 14:8. [PMID: 37452430 PMCID: PMC10347736 DOI: 10.1186/s13100-023-00296-4] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Accepted: 06/09/2023] [Indexed: 07/18/2023] Open
Abstract
BACKGROUND Many computational methods have been developed to detect non-reference transposable element (TE) insertions using short-read whole genome sequencing data. The diversity and complexity of such methods often present challenges to new users seeking to reproducibly install, execute, or evaluate multiple TE insertion detectors. RESULTS We previously developed the McClintock meta-pipeline to facilitate the installation, execution, and evaluation of six first-generation short-read TE detectors. Here, we report a completely re-implemented version of McClintock written in Python using Snakemake and Conda that improves its installation, error handling, speed, stability, and extensibility. McClintock 2 now includes 12 short-read TE detectors, auxiliary pre-processing and analysis modules, interactive HTML reports, and a simulation framework to reproducibly evaluate the accuracy of component TE detectors. When applied to the model microbial eukaryote Saccharomyces cerevisiae, we find substantial variation in the ability of McClintock 2 components to identify the precise locations of non-reference TE insertions, with RelocaTE2 showing the highest recall and precision in simulated data. We find that RelocaTE2, TEMP, TEMP2 and TEBreak provide consistent estimates of [Formula: see text]50 non-reference TE insertions per strain and that Ty2 has the highest number of non-reference TE insertions in a species-wide panel of [Formula: see text]1000 yeast genomes. Finally, we show that best-in-class predictors for yeast applied to resequencing data have sufficient resolution to reveal a dyad pattern of integration in nucleosome-bound regions upstream of yeast tRNA genes for Ty1, Ty2, and Ty4, allowing us to extend knowledge about fine-scale target preferences revealed previously for experimentally-induced Ty1 insertions to spontaneous insertions for other copia-superfamily retrotransposons in yeast. CONCLUSION McClintock ( https://github.com/bergmanlab/mcclintock/ ) provides a user-friendly pipeline for the identification of TEs in short-read WGS data using multiple TE detectors, which should benefit researchers studying TE insertion variation in a wide range of different organisms. Application of the improved McClintock system to simulated and empirical yeast genome data reveals best-in-class methods and novel biological insights for one of the most widely-studied model eukaryotes and provides a paradigm for evaluating and selecting non-reference TE detectors in other species.
Collapse
Affiliation(s)
- Jingxuan Chen
- Institute of Bioinformatics, University of Georgia, Athens, GA USA
| | | | - Shunhua Han
- Institute of Bioinformatics, University of Georgia, Athens, GA USA
| | - David J. Garfinkel
- Department of Biochemistry and Molecular Biology, University of Georgia, Athens, GA USA
| | - Casey M. Bergman
- Institute of Bioinformatics, University of Georgia, Athens, GA USA
- Department of Genetics, University of Georgia, Athens, GA USA
| |
Collapse
|
3
|
Chen J, Basting PJ, Han S, Garfinkel DJ, Bergman CM. Reproducible evaluation of transposable element detectors with McClintock 2 guides accurate inference of Ty insertion patterns in yeast. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.13.528343. [PMID: 36824955 PMCID: PMC9948991 DOI: 10.1101/2023.02.13.528343] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/17/2023]
Abstract
BACKGROUND Many computational methods have been developed to detect non-reference transposable element (TE) insertions using short-read whole genome sequencing data. The diversity and complexity of such methods often present challenges to new users seeking to reproducibly install, execute, or evaluate multiple TE insertion detectors. RESULTS We previously developed the McClintock meta-pipeline to facilitate the installation, execution, and evaluation of six first-generation short-read TE detectors. Here, we report a completely re-implemented version of McClintock written in Python using Snakemake and Conda that improves its installation, error handling, speed, stability, and extensibility. McClintock 2 now includes 12 short-read TE detectors, auxiliary pre-processing and analysis modules, interactive HTML reports, and a simulation framework to reproducibly evaluate the accuracy of component TE detectors. When applied to the model microbial eukaryote Saccharomyces cerevisiae, we find substantial variation in the ability of McClintock 2 components to identify the precise locations of non-reference TE insertions, with RelocaTE2 showing the highest recall and precision in simulated data. We find that RelocaTE2, TEMP, TEMP2 and TEBreak provide a consistent and biologically meaningful view of non-reference TE insertions in a species-wide panel of ∼1000 yeast genomes, as evaluated by coverage-based abundance estimates and expected patterns of tRNA promoter targeting. Finally, we show that best-in-class predictors for yeast have sufficient resolution to reveal a dyad pattern of integration in nucleosome-bound regions upstream of yeast tRNA genes for Ty1, Ty2, and Ty4, allowing us to extend knowledge about fine-scale target preferences first revealed experimentally for Ty1 to natural insertions and related copia-superfamily retrotransposons in yeast. CONCLUSION McClintock (https://github.com/bergmanlab/mcclintock/) provides a user-friendly pipeline for the identification of TEs in short-read WGS data using multiple TE detectors, which should benefit researchers studying TE insertion variation in a wide range of different organisms. Application of the improved McClintock system to simulated and empirical yeast genome data reveals best-in-class methods and novel biological insights for one of the most widely-studied model eukaryotes and provides a paradigm for evaluating and selecting non-reference TE detectors for other species.
Collapse
Affiliation(s)
- Jingxuan Chen
- Institute of Bioinformatics, University of Georgia, Athens, GA
| | | | - Shunhua Han
- Institute of Bioinformatics, University of Georgia, Athens, GA
| | - David J. Garfinkel
- Department of Biochemistry and Molecular Biology, University of Georgia, Athens, GA
| | - Casey M. Bergman
- Institute of Bioinformatics, University of Georgia, Athens, GA
- Department of Genetics, University of Georgia, Athens, GA
| |
Collapse
|
4
|
Abascal-Palacios G, Jochem L, Pla-Prats C, Beuron F, Vannini A. Structural basis of Ty3 retrotransposon integration at RNA Polymerase III-transcribed genes. Nat Commun 2021; 12:6992. [PMID: 34848735 PMCID: PMC8632968 DOI: 10.1038/s41467-021-27338-w] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2021] [Accepted: 11/15/2021] [Indexed: 12/29/2022] Open
Abstract
Retrotransposons are endogenous elements that have the ability to mobilise their DNA between different locations in the host genome. The Ty3 retrotransposon integrates with an exquisite specificity in a narrow window upstream of RNA Polymerase (Pol) III-transcribed genes, representing a paradigm for harmless targeted integration. Here we present the cryo-EM reconstruction at 4.0 Å of an active Ty3 strand transfer complex bound to TFIIIB transcription factor and a tRNA gene. The structure unravels the molecular mechanisms underlying Ty3 targeting specificity at Pol III-transcribed genes and sheds light into the architecture of retrotransposon machinery during integration. Ty3 intasome contacts a region of TBP, a subunit of TFIIIB, which is blocked by NC2 transcription regulator in RNA Pol II-transcribed genes. A newly-identified chromodomain on Ty3 integrase interacts with TFIIIB and the tRNA gene, defining with extreme precision the integration site position.
Collapse
Affiliation(s)
| | - Laura Jochem
- Division of Structural Biology, The Institute of Cancer Research, London, SW7 3RP, UK
| | - Carlos Pla-Prats
- Friedrich Miescher Institute for Biomedical Research, Basel, Switzerland
| | - Fabienne Beuron
- Division of Structural Biology, The Institute of Cancer Research, London, SW7 3RP, UK
| | - Alessandro Vannini
- Division of Structural Biology, The Institute of Cancer Research, London, SW7 3RP, UK.
- Human Technopole, 20157, Milan, Italy.
| |
Collapse
|
5
|
Cui Y, Guo Y. The local integration preference of the Tf1 retrotransposon in Schizosaccharomyces pombe. Virology 2021; 565:52-57. [PMID: 34736160 DOI: 10.1016/j.virol.2021.10.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2021] [Revised: 10/17/2021] [Accepted: 10/25/2021] [Indexed: 10/20/2022]
Abstract
Transposons are mobile DNAs that can move to different locations in host genomes. The integration site selection of transposons is critical for both themselves and host cells. Studies on the integration of retrotransposons and retroviruses have focused more on the global preference than on the local preference. The local preferences of retrotransposons are usually weak and of large diversity. Here, we analyzed hundreds of thousands of independent integration events of the Tf1 retrotransposon in Schizosaccharomyces pombe. The consensus sequence at the Tf1 integration sites shows a palindromic pattern, which can be divided into four sections, each of them contains one or more CGnTA units with a period of 10 base pairs, indicating interaction with subunits of the integrase oligomer in the pre-integration complex. Moreover, the analysis on the nucleosome occupancy flanking Tf1 target sites shows that Tf1 integration favors regions with one entire nucleosome depletion.
Collapse
Affiliation(s)
- Yujin Cui
- Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene Regulation, Guangdong-Hong Kong Joint Laboratory for RNA Medicine, Medical Research Center, Sun Yat-Sen Memorial Hospital, Sun Yat-Sen University, Guangzhou, 510120, China; Guangzhou PharmaRays Technology Co., Ltd, Guangzhou, 510000, China
| | - Yabin Guo
- Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene Regulation, Guangdong-Hong Kong Joint Laboratory for RNA Medicine, Medical Research Center, Sun Yat-Sen Memorial Hospital, Sun Yat-Sen University, Guangzhou, 510120, China.
| |
Collapse
|
6
|
Bleykasten-Grosshans C, Fabrizio R, Friedrich A, Schacherer J. Species-wide transposable element repertoires retrace the evolutionary history of the Saccharomyces cerevisiae host. Mol Biol Evol 2021; 38:4334-4345. [PMID: 34115140 PMCID: PMC8476168 DOI: 10.1093/molbev/msab171] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
Transposable elements (TE) are an important source of genetic variation with a dynamic and content that greatly differ in a wide range of species. The origin of the intraspecific content variation is not always clear and little is known about the precise nature of it. Here, we surveyed the species-wide content of the Ty LTR-retrotransposons in a broad collection of 1,011 Saccharomyces cerevisiae natural isolates to understand what can stand behind the variation of the repertoire that is the type and number of Ty elements. We have compiled an exhaustive catalog of all the TE sequence variants present in the S. cerevisiae species by identifying a large set of new sequence variants. The characterization of the TE content in each isolate clearly highlighted that each subpopulation exhibits a unique and specific repertoire, retracing the evolutionary history of the species. Most interestingly, we have shown that ancient interspecific hybridization events had a major impact in the birth of new sequence variants and therefore in the shaping of the TE repertoires. We also investigated the transpositional activity of these elements in a large set of natural isolates, and we found a broad variability related to the level of ploidy as well as the genetic background. Overall, our results pointed out that the evolution of the Ty content is deeply impacted by clade-specific events such as introgressions and therefore follows the population structure. In addition, our study lays the foundation for future investigations to better understand the transpositional regulation and more broadly the TE–host interactions.
Collapse
Affiliation(s)
| | - Romeo Fabrizio
- Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France
| | - Anne Friedrich
- Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France
| | - Joseph Schacherer
- Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France.,Institut Universitaire de France (IUF)
| |
Collapse
|
7
|
Bonnet A, Lesage P. Light and shadow on the mechanisms of integration site selection in yeast Ty retrotransposon families. Curr Genet 2021; 67:347-357. [PMID: 33590295 DOI: 10.1007/s00294-021-01154-7] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2020] [Revised: 01/04/2021] [Accepted: 01/07/2021] [Indexed: 12/21/2022]
Abstract
Transposable elements are ubiquitous in genomes. Their successful expansion depends in part on their sites of integration in their host genome. In Saccharomyces cerevisiae, evolution has selected various strategies to target the five Ty LTR-retrotransposon families into gene-poor regions in a genome, where coding sequences occupy 70% of the DNA. The integration of Ty1/Ty2/Ty4 and Ty3 occurs upstream and at the transcription start site of the genes transcribed by RNA polymerase III, respectively. Ty5 has completely different integration site preferences, targeting heterochromatin regions. Here, we review the history that led to the identification of the cellular tethering factors that play a major role in anchoring Ty retrotransposons to their preferred sites. We also question the involvement of additional factors in the fine-tuning of the integration site selection, with several studies converging towards an importance of the structure and organization of the chromatin.
Collapse
Affiliation(s)
- Amandine Bonnet
- INSERM U944, CNRS UMR 7212, Genomes and Cell Biology of Disease Unit, Institut de Recherche Saint-Louis, Université de Paris, Hôpital Saint-Louis, Paris, France
| | - Pascale Lesage
- INSERM U944, CNRS UMR 7212, Genomes and Cell Biology of Disease Unit, Institut de Recherche Saint-Louis, Université de Paris, Hôpital Saint-Louis, Paris, France.
| |
Collapse
|
8
|
Asif‐Laidin A, Conesa C, Bonnet A, Grison C, Adhya I, Menouni R, Fayol H, Palmic N, Acker J, Lesage P. A small targeting domain in Ty1 integrase is sufficient to direct retrotransposon integration upstream of tRNA genes. EMBO J 2020; 39:e104337. [PMID: 32677087 PMCID: PMC7459421 DOI: 10.15252/embj.2019104337] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2020] [Revised: 06/09/2020] [Accepted: 06/18/2020] [Indexed: 12/25/2022] Open
Abstract
Integration of transposable elements into the genome is mutagenic. Mechanisms targeting integrations into relatively safe locations, hence minimizing deleterious consequences for cell fitness, have emerged during evolution. In budding yeast, integration of the Ty1 LTR retrotransposon upstream of RNA polymerase III (Pol III)-transcribed genes requires interaction between Ty1 integrase (IN1) and AC40, a subunit common to Pol I and Pol III. Here, we identify the Ty1 targeting domain of IN1 that ensures (i) IN1 binding to Pol I and Pol III through AC40, (ii) IN1 genome-wide recruitment to Pol I- and Pol III-transcribed genes, and (iii) Ty1 integration only at Pol III-transcribed genes, while IN1 recruitment by AC40 is insufficient to target Ty1 integration into Pol I-transcribed genes. Swapping the targeting domains between Ty5 and Ty1 integrases causes Ty5 integration at Pol III-transcribed genes, indicating that the targeting domain of IN1 alone confers Ty1 integration site specificity.
Collapse
Affiliation(s)
- Amna Asif‐Laidin
- INSERM U944, CNRS UMR 7212Genomes& Cell Biology of Disease UnitInstitut de Recherche Saint‐LouisHôpital Saint‐LouisUniversité de ParisParisFrance
| | - Christine Conesa
- CEACNRSInstitute for Integrative Biology of the Cell (I2BC)Université Paris‐SaclayGif‐sur‐YvetteFrance
| | - Amandine Bonnet
- INSERM U944, CNRS UMR 7212Genomes& Cell Biology of Disease UnitInstitut de Recherche Saint‐LouisHôpital Saint‐LouisUniversité de ParisParisFrance
| | - Camille Grison
- INSERM U944, CNRS UMR 7212Genomes& Cell Biology of Disease UnitInstitut de Recherche Saint‐LouisHôpital Saint‐LouisUniversité de ParisParisFrance
| | - Indranil Adhya
- CEACNRSInstitute for Integrative Biology of the Cell (I2BC)Université Paris‐SaclayGif‐sur‐YvetteFrance
| | - Rachid Menouni
- INSERM U944, CNRS UMR 7212Genomes& Cell Biology of Disease UnitInstitut de Recherche Saint‐LouisHôpital Saint‐LouisUniversité de ParisParisFrance
| | - Hélène Fayol
- INSERM U944, CNRS UMR 7212Genomes& Cell Biology of Disease UnitInstitut de Recherche Saint‐LouisHôpital Saint‐LouisUniversité de ParisParisFrance
| | - Noé Palmic
- INSERM U944, CNRS UMR 7212Genomes& Cell Biology of Disease UnitInstitut de Recherche Saint‐LouisHôpital Saint‐LouisUniversité de ParisParisFrance
| | - Joël Acker
- CEACNRSInstitute for Integrative Biology of the Cell (I2BC)Université Paris‐SaclayGif‐sur‐YvetteFrance
| | - Pascale Lesage
- INSERM U944, CNRS UMR 7212Genomes& Cell Biology of Disease UnitInstitut de Recherche Saint‐LouisHôpital Saint‐LouisUniversité de ParisParisFrance
| |
Collapse
|
9
|
Maxwell PH. Diverse transposable element landscapes in pathogenic and nonpathogenic yeast models: the value of a comparative perspective. Mob DNA 2020; 11:16. [PMID: 32336995 PMCID: PMC7175516 DOI: 10.1186/s13100-020-00215-x] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2020] [Accepted: 04/16/2020] [Indexed: 12/14/2022] Open
Abstract
Genomics and other large-scale analyses have drawn increasing attention to the potential impacts of transposable elements (TEs) on their host genomes. However, it remains challenging to transition from identifying potential roles to clearly demonstrating the level of impact TEs have on genome evolution and possible functions that they contribute to their host organisms. I summarize TE content and distribution in four well-characterized yeast model systems in this review: the pathogens Candida albicans and Cryptococcus neoformans, and the nonpathogenic species Saccharomyces cerevisiae and Schizosaccharomyces pombe. I compare and contrast their TE landscapes to their lifecycles, genomic features, as well as the presence and nature of RNA interference pathways in each species to highlight the valuable diversity represented by these models for functional studies of TEs. I then review the regulation and impacts of the Ty1 and Ty3 retrotransposons from Saccharomyces cerevisiae and Tf1 and Tf2 retrotransposons from Schizosaccharomyces pombe to emphasize parallels and distinctions between these well-studied elements. I propose that further characterization of TEs in the pathogenic yeasts would enable this set of four yeast species to become an excellent set of models for comparative functional studies to address outstanding questions about TE-host relationships.
Collapse
|