1
|
Craig RJ, Gallaher SD, Shu S, Salomé PA, Jenkins JW, Blaby-Haas CE, Purvine SO, O’Donnell S, Barry K, Grimwood J, Strenkert D, Kropat J, Daum C, Yoshinaga Y, Goodstein DM, Vallon O, Schmutz J, Merchant SS. The Chlamydomonas Genome Project, version 6: Reference assemblies for mating-type plus and minus strains reveal extensive structural mutation in the laboratory. THE PLANT CELL 2023; 35:644-672. [PMID: 36562730 PMCID: PMC9940879 DOI: 10.1093/plcell/koac347] [Citation(s) in RCA: 27] [Impact Index Per Article: 27.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/02/2022] [Revised: 10/12/2022] [Accepted: 12/16/2022] [Indexed: 05/20/2023]
Abstract
Five versions of the Chlamydomonas reinhardtii reference genome have been produced over the last two decades. Here we present version 6, bringing significant advances in assembly quality and structural annotations. PacBio-based chromosome-level assemblies for two laboratory strains, CC-503 and CC-4532, provide resources for the plus and minus mating-type alleles. We corrected major misassemblies in previous versions and validated our assemblies via linkage analyses. Contiguity increased over ten-fold and >80% of filled gaps are within genes. We used Iso-Seq and deep RNA-seq datasets to improve structural annotations, and updated gene symbols and textual annotation of functionally characterized genes via extensive manual curation. We discovered that the cell wall-less classical reference strain CC-503 exhibits genomic instability potentially caused by deletion of the helicase RECQ3, with major structural mutations identified that affect >100 genes. We therefore present the CC-4532 assembly as the primary reference, although this strain also carries unique structural mutations and is experiencing rapid proliferation of a Gypsy retrotransposon. We expect all laboratory strains to harbor gene-disrupting mutations, which should be considered when interpreting and comparing experimental results. Collectively, the resources presented here herald a new era of Chlamydomonas genomics and will provide the foundation for continued research in this important reference organism.
Collapse
Affiliation(s)
- Rory J Craig
- California Institute for Quantitative Biosciences, University of California, Berkeley, California 94720, USA
- Institute of Ecology and Evolution, School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3FL, UK
| | - Sean D Gallaher
- California Institute for Quantitative Biosciences, University of California, Berkeley, California 94720, USA
| | - Shengqiang Shu
- United States Department of Energy, Joint Genome Institute, Berkeley, California 94720, USA
| | - Patrice A Salomé
- Department of Chemistry and Biochemistry, University of California, Los Angeles, California 90095, USA
- Institute for Genomics and Proteomics, University of California, Los Angeles, California 90095, USA
| | - Jerry W Jenkins
- HudsonAlpha Genome Sequencing Center, HudsonAlpha Institute for Biotechnology, Huntsville, Alabama 35806, USA
| | - Crysten E Blaby-Haas
- The Molecular Foundry, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Samuel O Purvine
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington 99354, USA
| | - Samuel O’Donnell
- Laboratory of Computational and Quantitative Biology, UMR 7238, CNRS, Institut de Biologie Paris-Seine, Sorbonne Université, Paris 75005, France
| | - Kerrie Barry
- United States Department of Energy, Joint Genome Institute, Berkeley, California 94720, USA
| | - Jane Grimwood
- HudsonAlpha Genome Sequencing Center, HudsonAlpha Institute for Biotechnology, Huntsville, Alabama 35806, USA
| | - Daniela Strenkert
- California Institute for Quantitative Biosciences, University of California, Berkeley, California 94720, USA
| | - Janette Kropat
- Department of Chemistry and Biochemistry, University of California, Los Angeles, California 90095, USA
| | - Chris Daum
- United States Department of Energy, Joint Genome Institute, Berkeley, California 94720, USA
| | - Yuko Yoshinaga
- United States Department of Energy, Joint Genome Institute, Berkeley, California 94720, USA
| | - David M Goodstein
- United States Department of Energy, Joint Genome Institute, Berkeley, California 94720, USA
| | - Olivier Vallon
- Unité Mixte de Recherche 7141, CNRS, Institut de Biologie Physico-Chimique, Sorbonne Université, Paris 75005, France
| | - Jeremy Schmutz
- United States Department of Energy, Joint Genome Institute, Berkeley, California 94720, USA
- HudsonAlpha Genome Sequencing Center, HudsonAlpha Institute for Biotechnology, Huntsville, Alabama 35806, USA
| | - Sabeeha S Merchant
- California Institute for Quantitative Biosciences, University of California, Berkeley, California 94720, USA
- Department of Molecular and Cell Biology, University of California, Berkeley, California 94720, USA
- Department of Plant and Microbial Biology, University of California, Berkeley, California 94720, USA
- Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| |
Collapse
|
2
|
Craig RJ, Yushenova IA, Rodriguez F, Arkhipova IR. An ancient clade of Penelope-like retroelements with permuted domains is present in the green lineage and protists, and dominates many invertebrate genomes. Mol Biol Evol 2021; 38:5005-5020. [PMID: 34320655 PMCID: PMC8557442 DOI: 10.1093/molbev/msab225] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
Penelope-like elements (PLEs) are an enigmatic clade of retrotransposons whose reverse transcriptases (RTs) share a most recent common ancestor with telomerase RTs. The single ORF of canonical endonuclease (EN)+ PLEs encodes RT and a C-terminal GIY–YIG EN that enables intrachromosomal integration, whereas EN− PLEs lack EN and are generally restricted to chromosome termini. EN+ PLEs have only been found in animals, except for one case of horizontal transfer to conifers, whereas EN− PLEs occur in several kingdoms. Here, we report a new, deep-branching PLE clade with a permuted domain order, whereby an N-terminal GIY–YIG EN is linked to a C-terminal RT by a short domain with a characteristic CxC motif. These N-terminal EN+ PLEs share a structural organization, including pseudo-LTRs and complex tandem/inverted insertions, with canonical EN+ PLEs from Penelope/Poseidon, Neptune, and Nematis clades, and show insertion bias for microsatellites, but lack canonical hammerhead ribozyme motifs. However, their phylogenetic distribution is much broader. The Naiads, found in numerous invertebrate phyla, can reach tens of thousands of copies per genome. In spiders and clams, Naiads independently evolved to encode selenoproteins containing multiple selenocysteines. Chlamys, which lack the CCHH motif universal to PLE ENs, occur in green algae, spike mosses (targeting ribosomal DNA), and slime molds. Unlike canonical PLEs, RTs of N-terminal EN+ PLEs contain the insertion-in-fingers domain (IFD), strengthening the link between PLEs and telomerases. Additionally, we describe Hydra, a novel metazoan C-terminal EN+ clade. Overall, we conclude that PLE diversity, taxonomic distribution, and abundance are comparable with non-LTR and LTR-retrotransposons.
Collapse
Affiliation(s)
- Rory J Craig
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh, UK
| | - Irina A Yushenova
- Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, Marine Biological Laboratory, Woods Hole, MA, USA
| | - Fernando Rodriguez
- Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, Marine Biological Laboratory, Woods Hole, MA, USA
| | - Irina R Arkhipova
- Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, Marine Biological Laboratory, Woods Hole, MA, USA
| |
Collapse
|
3
|
Solovyeva A, Levakin I, Zorin E, Adonin L, Khotimchenko Y, Podgornaya O. Transposons-Based Clonal Diversity in Trematode Involves Parts of CR1 (LINE) in Eu- and Heterochromatin. Genes (Basel) 2021; 12:1129. [PMID: 34440303 PMCID: PMC8392823 DOI: 10.3390/genes12081129] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2021] [Revised: 07/22/2021] [Accepted: 07/23/2021] [Indexed: 01/21/2023] Open
Abstract
Trematode parthenitae have long been believed to form clonal populations, but clonal diversity has been discovered in this asexual stage of the lifecycle. Clonal polymorphism in the model species Himasthla elongata has been previously described, but the source of this phenomenon remains unknown. In this work, we traced cercarial clonal diversity using a simplified amplified fragment length polymorphism (SAFLP) method and characterised the nature of fragments in diverse electrophoretic bands. The repetitive elements were identified in both the primary sequence of the H. elongata genome and in the transcriptome data. Long-interspersed nuclear elements (LINEs) and long terminal repeat retrotransposons (LTRs) were found to represent an overwhelming majority of the genome and the transposon transcripts. Most sequenced fragments from SAFLP pattern contained the reverse transcriptase (RT, ORF2) domains of LINEs, and only a few sequences belonged to ORFs of LTRs and ORF1 of LINEs. A fragment corresponding to a CR1-like (LINE) spacer region was discovered and named CR1-renegade (CR1-rng). In addition to RT-containing CR1 transcripts, we found short CR1-rng transcripts in the redia transcriptome and short contigs in the mobilome. Probes against CR1-RT and CR1-rng presented strikingly different pictures in FISH mapping, despite both being fragments of CR1. In silico data and Southern blotting indicated that CR1-rng is not tandemly organised. CR1 involvement in clonal diversity is discussed.
Collapse
Affiliation(s)
- Anna Solovyeva
- Institute of Cytology of the Russian Academy of Science, Tikhoretsky Ave 4, 194064 Saint Petersburg, Russia;
- Zoological Institute of the Russian Academy of Sciences, Universitetskaya Nab 1, 199034 Saint Petersburg, Russia;
| | - Ivan Levakin
- Zoological Institute of the Russian Academy of Sciences, Universitetskaya Nab 1, 199034 Saint Petersburg, Russia;
| | - Evgeny Zorin
- All-Russia Research Institute for Agricultural Microbiology, Pushkin 8, 196608 Saint Petersburg, Russia;
| | - Leonid Adonin
- Moscow Institute of Physics and Technology, Institutskiy per 9, 141701 Dolgoprudny, Russia;
| | - Yuri Khotimchenko
- School of Biomedicine, Far Eastern Federal University, Sukhanova St 8, 690091 Vladivostok, Russia;
| | - Olga Podgornaya
- Institute of Cytology of the Russian Academy of Science, Tikhoretsky Ave 4, 194064 Saint Petersburg, Russia;
- Department of Cytology and Histology, Saint Petersburg State University, Universitetskaya Nab 7/9, 199034 Saint Petersburg, Russia
| |
Collapse
|
4
|
Maršavelski A, Sabljić I, Sugimori D, Kojić-Prodić B. The substrate selectivity of the two homologous SGNH hydrolases from Streptomyces bacteria: Molecular dynamics and experimental study. Int J Biol Macromol 2020; 158:222-230. [PMID: 32348859 DOI: 10.1016/j.ijbiomac.2020.04.198] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2020] [Revised: 04/22/2020] [Accepted: 04/23/2020] [Indexed: 11/24/2022]
Abstract
Two extracellular enzymes of the SGNH hydrolase superfamily reveal highly homologous 3D structures, but act on different substrates; one is a true phospholipase A1 from Streptomyces albidoflavus (SaPLA1, EC: 3.1.1.32, PDB code: 4HYQ), whereas the promiscuous enzyme from Streptomyces rimosus (SrLip, EC: 3.1.1.3, PDB code: 5MAL) exhibits lipase, phospholipase, esterase, thioesterase, and Tweenase activities. To get insight into binding modes of phospholipid and triglyceride substrates in both enzymes and understand their chain-length preferences, we opted for computational approach based on in silico prepared enzyme-substrate complexes. Docking procedure and molecular dynamics simulations at microsecond time scale were applied. The modelled complexes of SaPLA1 and SrLip enzymes revealed substrate accommodation: a) the acyl-chain attached to sn-1 position fits into the hydrophobic pocket, b) the acyl-chain attached to sn-2 position fits in the hydrophobic cleft, whereas c) the sn-3 bound acyl chain of the triglyceride or polar head of the glycerophospholipid fits into the binding groove. Moreover, our results pinpointed subtle amino acid differences in the hydrophobic pockets of these two enzymes which accommodate the acyl chain attached to sn-1 position of glycerol to be responsible for the chain length preference. Slight differences in the binding grooves of SaPLA1 and SrLip, which accommodate the acyl chain attached to sn-3 position are responsible for exclusive phospholipase and both phospholipase/lipase activities of these two enzymes, respectively. The results of modelling correlate with the experimentally obtained kinetic parameters given in the literature and are important for protein engineering that aims to obtain a variant of enzyme, which would preferably act on the substrate of interest.
Collapse
Affiliation(s)
| | - Igor Sabljić
- Department of Molecular Sciences, Swedish University of Agricultural Sciences, Uppsala SE-75651, Sweden; Ruđer Bošković Institute, Zagreb, Croatia
| | - Daisuke Sugimori
- Department of Symbiotic Systems Science and Technology, Fukushima University, 1 Kanayagawa, Fukushima 960-1296, Japan
| | | |
Collapse
|
5
|
The duck EB66® cell substrate reveals a novel retrotransposon. Biologicals 2019; 61:22-31. [DOI: 10.1016/j.biologicals.2019.08.001] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2018] [Revised: 07/31/2019] [Accepted: 08/02/2019] [Indexed: 11/18/2022] Open
|
6
|
Wang PL, Luchetti A, Alberto Ruggieri A, Xiong XM, Xu MRX, Zhang XG, Zhang HH. Successful Invasions of Short Internally Deleted Elements (SIDEs) and Its Partner CR1 in Lepidoptera Insects. Genome Biol Evol 2019; 11:2505-2516. [PMID: 31384954 PMCID: PMC6740152 DOI: 10.1093/gbe/evz174] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/01/2019] [Indexed: 11/28/2022] Open
Abstract
Although DNA transposons often generated internal deleted derivatives such as miniature inverted-repeat transposable elements, short internally deleted elements (SIDEs) derived from nonlong terminal-repeat retrotransposons are rare. Here, we found a novel SIDE, named Persaeus, that originated from the chicken repeat 1 (CR1) retrotransposon Zenon and it has been found widespread in Lepidoptera insects. Our findings suggested that Persaeus and the partner Zenon have experienced a transposition burst in their host genomes and the copy number of Persaeus and Zenon in assayed genomes are significantly correlated. Accordingly, the activity though age analysis indicated that the replication wave of Persaeus coincided with that of Zenon. Phylogenetic analyses suggested that Persaeus may have evolved at least four times independently, and that it has been vertically transferred into its host genomes. Together, our results provide new insights into the evolution dynamics of SIDEs and its partner non-LTRs.
Collapse
Affiliation(s)
- Ping-Lan Wang
- College of Pharmacy and Life Science, Jiujiang University, China
| | - Andrea Luchetti
- Dipartimento di Scienze Biologiche, Geologiche e Ambientali, Università di Bologna, Italy
| | | | | | - Min-Rui-Xuan Xu
- College of Pharmacy and Life Science, Jiujiang University, China
| | - Xiao-Gu Zhang
- College of Pharmacy and Life Science, Jiujiang University, China
| | - Hua-Hao Zhang
- College of Pharmacy and Life Science, Jiujiang University, China
| |
Collapse
|
7
|
de Mendoza A, Pflueger J, Lister R. Capture of a functionally active methyl-CpG binding domain by an arthropod retrotransposon family. Genome Res 2019; 29:1277-1286. [PMID: 31239280 PMCID: PMC6673714 DOI: 10.1101/gr.243774.118] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2018] [Accepted: 06/20/2019] [Indexed: 12/30/2022]
Abstract
The repressive capacity of cytosine DNA methylation is mediated by recruitment of silencing complexes by methyl-CpG binding domain (MBD) proteins. Despite MBD proteins being associated with silencing, we discovered that a family of arthropod Copia retrotransposons have incorporated a host-derived MBD. We functionally show how retrotransposon-encoded MBDs preferentially bind to CpG-dense methylated regions, which correspond to transposable element regions of the host genome, in the myriapod Strigamia maritima Consistently, young MBD-encoding Copia retrotransposons (CopiaMBD) accumulate in regions with higher CpG densities than other LTR-retrotransposons also present in the genome. This would suggest that retrotransposons use MBDs to integrate into heterochromatic regions in Strigamia, avoiding potentially harmful insertions into host genes. In contrast, CopiaMBD insertions in the spider Stegodyphus dumicola genome disproportionately accumulate in methylated gene bodies compared with other spider LTR-retrotransposons. Given that transposons are not actively targeted by DNA methylation in the spider genome, this distribution bias would also support a role for MBDs in the integration process. Together, these data show that retrotransposons can co-opt host-derived epigenome readers, potentially harnessing the host epigenome landscape to advantageously tune the retrotransposition process.
Collapse
Affiliation(s)
- Alex de Mendoza
- Australian Research Council Centre of Excellence in Plant Energy Biology, School of Molecular Sciences, The University of Western Australia, Perth, Western Australia, 6009, Australia.,Harry Perkins Institute of Medical Research, Perth, Western Australia, 6009, Australia
| | - Jahnvi Pflueger
- Australian Research Council Centre of Excellence in Plant Energy Biology, School of Molecular Sciences, The University of Western Australia, Perth, Western Australia, 6009, Australia.,Harry Perkins Institute of Medical Research, Perth, Western Australia, 6009, Australia
| | - Ryan Lister
- Australian Research Council Centre of Excellence in Plant Energy Biology, School of Molecular Sciences, The University of Western Australia, Perth, Western Australia, 6009, Australia.,Harry Perkins Institute of Medical Research, Perth, Western Australia, 6009, Australia
| |
Collapse
|
8
|
Bertocchi NA, de Oliveira TD, Del Valle Garnero A, Coan RLB, Gunski RJ, Martins C, Torres FP. Distribution of CR1-like transposable element in woodpeckers (Aves Piciformes): Z sex chromosomes can act as a refuge for transposable elements. Chromosome Res 2018; 26:333-343. [PMID: 30499043 DOI: 10.1007/s10577-018-9592-1] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2018] [Revised: 10/14/2018] [Accepted: 11/13/2018] [Indexed: 11/28/2022]
Abstract
Birds have relatively few repetitive sequences compared to other groups of vertebrates; however, the members of order Piciformes (woodpeckers) have more of these sequences, composed mainly of transposable elements (TE). The TE most often found in birds is a retrotransposon chicken repeat 1 (CR1). Piciformes lineages were subjected to an expansion of the CR1 elements, carrying a larger fraction of transposable elements. This study compared patterns of chromosome distribution among five bird species, through chromosome mapping of the CR1 sequence and reconstructed their phylogenetic tree. We analyzed several members of Piciformes (Colaptes campestris, Colaptes melanochloros, Melanerpes candidus, and Veniliornis spilogaster), as well as Galliformes (Gallus gallus). Gallus gallus is the species with which most genomic and hence cytogenetic studies have been performed. The results showed that CR1 sequences are a monophyletic group and do not depend on their hosts. All species analyzed showed a hybridization signal by fluorescence in situ hybridization (FISH). In all species, the chromosomal distribution of CR1 was not restricted to heterochromatin regions in the macrochromosomes, principally pair 1 and the Z sex chromosome. Accumulation in the Z sex chromosomes can serve as a refuge for transposable elements. These results highlight the importance of transposable elements in host genomes and karyotype evolution.
Collapse
Affiliation(s)
- Natasha Avila Bertocchi
- Programa de Pós-graduação em Genética e Biologia Molecular, Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre, Rio Grande do Sul, 91501-970, Brazil.
| | - Thays Duarte de Oliveira
- Programa de Pós-graduação em Biologia Animal, Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre, Rio Grande do Sul, 91540-000, Brazil
| | - Analía Del Valle Garnero
- Programa de Pós-graduação em Ciências Biológicas, Universidade Federal do Pampa (Unipampa), São Gabriel, Rio Grande do Sul, 97300-000, Brazil.,Laboratório de Diversidade Genética Animal, Universidade Federal do Pampa (Unipampa), São Gabriel, Rio Grande do Sul, 97300-000, Brazil
| | - Rafael Luiz Buogo Coan
- Departamento de Morfologia, Laboratório Genômica Integrativa, Universidade Estadual Paulista (UNESP), Botucatu, São Paulo, 18618-689, Brazil
| | - Ricardo José Gunski
- Programa de Pós-graduação em Ciências Biológicas, Universidade Federal do Pampa (Unipampa), São Gabriel, Rio Grande do Sul, 97300-000, Brazil.,Laboratório de Diversidade Genética Animal, Universidade Federal do Pampa (Unipampa), São Gabriel, Rio Grande do Sul, 97300-000, Brazil
| | - Cesar Martins
- Departamento de Morfologia, Laboratório Genômica Integrativa, Universidade Estadual Paulista (UNESP), Botucatu, São Paulo, 18618-689, Brazil
| | - Fabiano Pimentel Torres
- Programa de Pós-graduação em Ciências Biológicas, Universidade Federal do Pampa (Unipampa), São Gabriel, Rio Grande do Sul, 97300-000, Brazil.,Laboratório de Diversidade Genética Animal, Universidade Federal do Pampa (Unipampa), São Gabriel, Rio Grande do Sul, 97300-000, Brazil
| |
Collapse
|
9
|
Khazina E, Weichenrieder O. Human LINE-1 retrotransposition requires a metastable coiled coil and a positively charged N-terminus in L1ORF1p. eLife 2018; 7:34960. [PMID: 29565245 PMCID: PMC5940361 DOI: 10.7554/elife.34960] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2018] [Accepted: 03/21/2018] [Indexed: 12/22/2022] Open
Abstract
LINE-1 (L1) is an autonomous retrotransposon, which acted throughout mammalian evolution and keeps contributing to human genotypic diversity, genetic disease and cancer. L1 encodes two essential proteins: L1ORF1p, a unique RNA-binding protein, and L1ORF2p, an endonuclease and reverse transcriptase. L1ORF1p contains an essential, but rapidly evolving N-terminal portion, homo-trimerizes via a coiled coil and packages L1RNA into large assemblies. Here, we determined crystal structures of the entire coiled coil domain of human L1ORF1p. We show that retrotransposition requires a non-ideal and metastable coiled coil structure, and a strongly basic L1ORF1p amino terminus. Human L1ORF1p therefore emerges as a highly calibrated molecular machine, sensitive to mutation but functional in different hosts. Our analysis rationalizes the locally rapid L1ORF1p sequence evolution and reveals striking mechanistic parallels to coiled coil-containing membrane fusion proteins. It also suggests how trimeric L1ORF1p could form larger meshworks and indicates critical novel steps in L1 retrotransposition. Almost half of the human genome consists of DNA strings that have been copied and pasted from one part of the genome to another many thousands of times. These strings of DNA are called mobile genetic elements. Mobile elements can disrupt important genes, causing disease and cancer, but they can also drive evolution. Presently, only one type of mobile element, called LINE-1, is active in the human genome and able to multiply without help from other mobile elements. LINE-1 DNA is ‘transcribed’ to form molecules of LINE-1 RNA, which can then be ‘translated’ into two distinct proteins. These bind to LINE-1 RNA, which then gets back-transcribed into DNA and inserted as a new LINE-1 element in a new region of the genome. One of the two proteins, called L1ORF1p, forms complexes where three copies of the protein come together. These ‘trimers’ cover and protect LINE-1 RNA and are required for LINE-1 mobility. Different versions of L1ORF1p are found in different animals. Part of the protein is the same across all mammals, and this ‘conserved’ part controls the ability of L1ORF1p to bind to RNA. The non-conserved part of L1ORF1p differs even between humans and their closest animal relatives and little was known about its structure or role. However, this rapidly evolving part of L1ORF1p is essential for LINE-1 mobility. Using X-ray crystallography, Khazina and Weichenrieder obtained a molecular snapshot of the part of L1ORF1p that interacts with other copies of the protein to form trimers. Combined with earlier snapshots of L1ORF1p’s conserved part, this generated a complete structural model of the L1ORF1p trimer. Additional biophysical characterizations suggest that L1ORF1p trimers form a semi-stable structure that can partially open up, indicating how trimers could form larger assemblies of L1ORF1p on LINE-1 RNA. Indeed, the need to maintain a semi-stable structure could explain why L1ORF1p is evolving so rapidly. A second important finding is that the beginning of L1ORF1p needs to be positively charged – a requirement that warrants further exploration. The structural and mechanistic insight into L1ORF1p points to critical new steps in LINE-1 mobilization. It will help to design inhibitor molecules with the goal to halt the mobilization process at various points and to dissect such steps in great detail. Understanding how to control LINE-1 mobility could help to improve stem cell therapies and reproduction assistance techniques, due to the fact that LINE-1 mobility is a potential source of mutation in stem cells, egg and sperm cells, and newly formed embryos.
Collapse
Affiliation(s)
- Elena Khazina
- Department of Biochemistry, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Oliver Weichenrieder
- Department of Biochemistry, Max Planck Institute for Developmental Biology, Tübingen, Germany
| |
Collapse
|
10
|
Arkhipova IR, Yushenova IA, Rodriguez F. Giant Reverse Transcriptase-Encoding Transposable Elements at Telomeres. Mol Biol Evol 2017; 34:2245-2257. [PMID: 28575409 PMCID: PMC5850863 DOI: 10.1093/molbev/msx159] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Transposable elements are omnipresent in eukaryotic genomes and have a profound impact on chromosome structure, function and evolution. Their structural and functional diversity is thought to be reasonably well-understood, especially in retroelements, which transpose via an RNA intermediate copied into cDNA by the element-encoded reverse transcriptase, and are characterized by a compact structure. Here, we report a novel type of expandable eukaryotic retroelements, which we call Terminons. These elements can attach to G-rich telomeric repeat overhangs at the chromosome ends, in a process apparently facilitated by complementary C-rich repeats at the 3′-end of the RNA template immediately adjacent to a hammerhead ribozyme motif. Terminon units, which can exceed 40 kb in length, display an unusually complex and diverse structure, and can form very long chains, with host genes often captured between units. As the principal polymerizing component, Terminons contain Athena reverse transcriptases previously described in bdelloid rotifers and belonging to the enigmatic group of Penelope-like elements, but can additionally accumulate multiple cooriented ORFs, including DEDDy 3′-exonucleases, GDSL esterases/lipases, GIY-YIG-like endonucleases, rolling-circle replication initiator (Rep) proteins, and putatively structural ORFs with coiled-coil motifs and transmembrane domains. The extraordinary length and complexity of Terminons and the high degree of interfamily variability in their ORF content challenge the current views on the structural organization of eukaryotic retroelements, and highlight their possible connections with the viral world and the implications for the elevated frequency of gene transfer.
Collapse
Affiliation(s)
- Irina R Arkhipova
- Marine Biological Laboratory, Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, Woods Hole, MA
| | - Irina A Yushenova
- Marine Biological Laboratory, Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, Woods Hole, MA
| | - Fernando Rodriguez
- Marine Biological Laboratory, Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, Woods Hole, MA
| |
Collapse
|
11
|
Horn AV, Celic I, Dong C, Martirosyan I, Han JS. A conserved role for the ESCRT membrane budding complex in LINE retrotransposition. PLoS Genet 2017; 13:e1006837. [PMID: 28586350 PMCID: PMC5478143 DOI: 10.1371/journal.pgen.1006837] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2017] [Revised: 06/20/2017] [Accepted: 05/23/2017] [Indexed: 11/18/2022] Open
Abstract
Long interspersed nuclear element-1s (LINE-1s, or L1s) are an active family of retrotransposable elements that continue to mutate mammalian genomes. Despite the large contribution of L1 to mammalian genome evolution, we do not know where active L1 particles (particles in the process of retrotransposition) are located in the cell, or how they move towards the nucleus, the site of L1 reverse transcription. Using a yeast model of LINE retrotransposition, we identified ESCRT (endosomal sorting complex required for transport) as a critical complex for LINE retrotransposition, and verified that this interaction is conserved for human L1. ESCRT interacts with L1 via a late domain motif, and this interaction facilitates L1 replication. Loss of the L1/ESCRT interaction does not impair RNP formation or enzymatic activity, but leads to loss of retrotransposition and reduced L1 endonuclease activity in the nucleus. This study highlights the importance of the ESCRT complex in the L1 life cycle and suggests an unusual mode for L1 RNP trafficking. Long interspersed nuclear elements (LINEs) are a class of retrotransposable elements that mutate mammalian genomes. LINEs have been highly successful in the human genome, multiplying to over 800,000 copies. The LINE-encoded replication machinery is also used by other retrotransposons, and in total, has been responsible for the generation of over 1/3 of human DNA sequence. To replicate, a LINE mRNA forms a ribonucleoprotein particle (RNP) with its proteins. This RNP eventually enters the nucleus to integrate a cDNA copy of itself into chromosomes. The events between RNP formation and successful integration are difficult to study and largely unknown. Here we show that the ESCRT complex plays a conserved role in LINE retrotransposition in both yeast and humans. ESCRT is a membrane budding complex involved in cellular trafficking and membrane budding/fusion. Our results imply that membranes play an integral part of LINE replication, and ESCRT may be required for RNP trafficking towards the nucleus.
Collapse
Affiliation(s)
- Axel V. Horn
- Department of Biochemistry and Molecular Biology, Tulane University School of Medicine, New Orleans, LA, United States of America
- Department of Embryology, Carnegie Institution for Science, Baltimore, MD, United States of America
| | - Ivana Celic
- Department of Biochemistry and Molecular Biology, Tulane University School of Medicine, New Orleans, LA, United States of America
| | - Chun Dong
- Department of Embryology, Carnegie Institution for Science, Baltimore, MD, United States of America
| | - Irena Martirosyan
- Department of Embryology, Carnegie Institution for Science, Baltimore, MD, United States of America
| | - Jeffrey S. Han
- Department of Biochemistry and Molecular Biology, Tulane University School of Medicine, New Orleans, LA, United States of America
- Department of Embryology, Carnegie Institution for Science, Baltimore, MD, United States of America
- * E-mail:
| |
Collapse
|
12
|
Rodriguez F, Kenefick AW, Arkhipova IR. LTR-Retrotransposons from Bdelloid Rotifers Capture Additional ORFs Shared between Highly Diverse Retroelement Types. Viruses 2017; 9:v9040078. [PMID: 28398238 PMCID: PMC5408684 DOI: 10.3390/v9040078] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2017] [Revised: 04/04/2017] [Accepted: 04/04/2017] [Indexed: 12/16/2022] Open
Abstract
Rotifers of the class Bdelloidea, microscopic freshwater invertebrates, possess a highlydiversified repertoire of transposon families, which, however, occupy less than 4% of genomic DNA in the sequenced representative Adineta vaga. We performed a comprehensive analysis of A. vaga retroelements, and found that bdelloid long terminal repeat (LTR)retrotransposons, in addition to conserved open reading frame (ORF) 1 and ORF2 corresponding to gag and pol genes, code for an unusually high variety of ORF3 sequences. Retrovirus-like LTR families in A. vaga belong to four major lineages, three of which are rotiferspecific and encode a dUTPase domain. However only one lineage contains a canonical envlike fusion glycoprotein acquired from paramyxoviruses (non-segmented negative-strand RNA viruses), although smaller ORFs with transmembrane domains may perform similar roles. A different ORF3 type encodes a GDSL esterase/lipase, which was previously identified as ORF1 in several clades of non-LTR retrotransposons, and implicated in membrane targeting. Yet another ORF3 type appears in unrelated LTR-retrotransposon lineages, and displays strong homology to DEDDy-type exonucleases involved in 3'-end processing of RNA and single-stranded DNA. Unexpectedly, each of the enzymatic ORF3s is also associated with different subsets of Penelope-like Athena retroelement families. The unusual association of the same ORF types with retroelements from different classes reflects their modular structure with a high degree of flexibility, and points to gene sharing between different groups of retroelements.
Collapse
Affiliation(s)
- Fernando Rodriguez
- Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, Marine Biological Laboratory, 7 MBL Street, Woods Hole, MA 02543, USA.
| | - Aubrey W Kenefick
- Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, Marine Biological Laboratory, 7 MBL Street, Woods Hole, MA 02543, USA.
- Present address: UC Davis Genome Center-GBSF, University of California, Davis, CA 95616, USA.
| | - Irina R Arkhipova
- Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, Marine Biological Laboratory, 7 MBL Street, Woods Hole, MA 02543, USA.
| |
Collapse
|
13
|
Ivancevic AM, Kortschak RD, Bertozzi T, Adelson DL. LINEs between Species: Evolutionary Dynamics of LINE-1 Retrotransposons across the Eukaryotic Tree of Life. Genome Biol Evol 2016; 8:3301-3322. [PMID: 27702814 PMCID: PMC5203782 DOI: 10.1093/gbe/evw243] [Citation(s) in RCA: 51] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
LINE-1 (L1) retrotransposons are dynamic elements. They have the potential to cause great genomic change because of their ability to ‘jump’ around the genome and amplify themselves, resulting in the duplication and rearrangement of regulatory DNA. Active L1, in particular, are often thought of as tightly constrained, homologous and ubiquitous elements with well-characterized domain organization. For the past 30 years, model organisms have been used to define L1s as 6–8 kb sequences containing a 5′-UTR, two open reading frames working harmoniously in cis, and a 3′-UTR with a polyA tail. In this study, we demonstrate the remarkable and overlooked diversity of L1s via a comprehensive phylogenetic analysis of elements from over 500 species from widely divergent branches of the tree of life. The rapid and recent growth of L1 elements in mammalian species is juxtaposed against the diverse lineages found in other metazoans and plants. In fact, some of these previously unexplored mammalian species (e.g. snub-nosed monkey, minke whale) exhibit L1 retrotranspositional ‘hyperactivity’ far surpassing that of human or mouse. In contrast, non-mammalian L1s have become so varied that the current classification system seems to inadequately capture their structural characteristics. Our findings illustrate how both long-term inherited evolutionary patterns and random bursts of activity in individual species can significantly alter genomes, highlighting the importance of L1 dynamics in eukaryotes.
Collapse
Affiliation(s)
- Atma M Ivancevic
- School of Biological Sciences, University of Adelaide, Adelaide, South Australia, Australia
| | - R Daniel Kortschak
- School of Biological Sciences, University of Adelaide, Adelaide, South Australia, Australia
| | - Terry Bertozzi
- School of Biological Sciences, University of Adelaide, Adelaide, South Australia, Australia.,Evolutionary Biology Unit, South Australian Museum, Adelaide, South Australia, Australia
| | - David L Adelson
- School of Biological Sciences, University of Adelaide, Adelaide, South Australia, Australia
| |
Collapse
|
14
|
da Silva M, Barbosa P, Artoni RF, Feldberg E. Evolutionary Dynamics of 5S rDNA and Recurrent Association of Transposable Elements in Electric Fish of the Family Gymnotidae (Gymnotiformes): The Case of Gymnotus mamiraua. Cytogenet Genome Res 2016; 149:297-303. [PMID: 27750255 DOI: 10.1159/000449431] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/17/2016] [Indexed: 11/19/2022] Open
Abstract
Gymnotidae is a family of electric fish endemic to the Neotropics consisting of 2 genera: Electrophorus and Gymnotus. The genus Gymnotus is widely distributed and is found in all of the major Brazilian river systems. Physical and molecular mapping data for the ribosomal DNA (rDNA) in this genus are still scarce, with its chromosomal location known in only 11 species. As other species of Gymnotus with 2n = 54 chromosomes from the Paraná-Paraguay basin, G. mamiraua was found to have a large number of 5S rDNA sites. Isolation and cloning of the 5S rDNA sequences from G. mamiraua identified a fragment of a transposable element similar to the Tc1/mariner transposon associated with a non-transcribed spacer. Double fluorescence in situ hybridization analysis of this element and the 5S rDNA showed that they were colocalized on several chromosomes, in addition to acting as nonsyntenic markers on others. Our data show the association between these sequences and suggest that the Tc1 retrotransposon may be the agent that drives the spread of these 5S rDNA-like sequences in the G. mamiraua genome.
Collapse
Affiliation(s)
- Maelin da Silva
- Programa de Pós Graduação em Genética, Conservação e Biologia Evolutiva, Instituto Nacional de Pesquisas da Amazônia, Manaus, Brazil
| | | | | | | |
Collapse
|
15
|
Distribution patterns and impact of transposable elements in genes of green algae. Gene 2016; 594:151-159. [PMID: 27614292 DOI: 10.1016/j.gene.2016.09.012] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2016] [Revised: 09/01/2016] [Accepted: 09/06/2016] [Indexed: 11/23/2022]
Abstract
Transposable elements (TEs) are DNA sequences able to transpose in the host genome, a remarkable feature that enables them to influence evolutive trajectories of species. An investigation about the TE distribution and TE impact in different gene regions of the green algae species Chlamydomonas reinhardtii and Volvox carteri was performed. Our results indicate that TEs are very scarce near introns boundaries, suggesting that insertions in this region are negatively selected. This contrasts with previous results showing enrichment of tandem repeats in introns boundaries and suggests that different evolutionary forces are acting in these different classes of repeats. Despite the relatively low abundance of TEs in the genome of green algae when compared to mammals, the proportion of poly(A) sites derived from TEs found in C. reinhardtii was similar to that described in human and mice. This fact, associated with the enrichment of TEs in gene 5' and 3' flanks of C. reinhardtii, opens up the possibility that TEs may have considerably contributed for gene regulatory sequences evolution in this species. Moreover, it was possible identify several instances of TE exonization for C. reinhardtii, with a particularly interesting case from a gene coding for Condensin II, a protein involved in the maintenance of chromosomal structure, where the addition of a transposomal PHD finger may contribute to binding specificity of this protein. Taken together, our results suggest that the low abundance of TEs in green algae genomes is correlated with a strict negative selection process, combined with the retention of copies that contribute positively with gene structures.
Collapse
|
16
|
Abstract
Retrotransposons carrying tyrosine recombinases (YR) are widespread in eukaryotes. The first described tyrosine recombinase mobile element, DIRS1, is a retroelement from the slime mold Dictyostelium discoideum. The YR elements are bordered by terminal repeats related to their replication via free circular dsDNA intermediates. Site-specific recombination is believed to integrate the circle without creating duplications of the target sites. Recently a large number of YR retrotransposons have been described, including elements from fungi (mucorales and basidiomycetes), plants (green algae) and a wide range of animals including nematodes, insects, sea urchins, fish, amphibia and reptiles. YR retrotransposons can be divided into three major groups: the DIRS elements, PAT-like and the Ngaro elements. The three groups form distinct clades on phylogenetic trees based on alignments of reverse transcriptase/ribonuclease H (RT/RH) and YR sequences, and also having some structural distinctions. A group of eukaryote DNA transposons, cryptons, also carry tyrosine recombinases. These DNA transposons do not encode a reverse transcriptase. They have been detected in several pathogenic fungi and oomycetes. Sequence comparisons suggest that the crypton YRs are related to those of the YR retrotransposons. We suggest that the YR retrotransposons arose from the combination of a crypton-like YR DNA transposon and the RT/RH encoding sequence of a retrotransposon. This acquisition must have occurred at a very early point in the evolution of eukaryotes.
Collapse
|
17
|
Suh A. The Specific Requirements for CR1 Retrotransposition Explain the Scarcity of Retrogenes in Birds. J Mol Evol 2015. [DOI: 10.1007/s00239-015-9692-x] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
|
18
|
Suh A, Churakov G, Ramakodi MP, Platt RN, Jurka J, Kojima KK, Caballero J, Smit AF, Vliet KA, Hoffmann FG, Brosius J, Green RE, Braun EL, Ray DA, Schmitz J. Multiple lineages of ancient CR1 retroposons shaped the early genome evolution of amniotes. Genome Biol Evol 2014; 7:205-17. [PMID: 25503085 PMCID: PMC4316615 DOI: 10.1093/gbe/evu256] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Chicken repeat 1 (CR1) retroposons are long interspersed elements (LINEs) that are ubiquitous within amniote genomes and constitute the most abundant family of transposed elements in birds, crocodilians, turtles, and snakes. They are also present in mammalian genomes, where they reside as numerous relics of ancient retroposition events. Yet, despite their relevance for understanding amniote genome evolution, the diversity and evolution of CR1 elements has never been studied on an amniote-wide level. We reconstruct the temporal and quantitative activity of CR1 subfamilies via presence/absence analyses across crocodilian phylogeny and comparative analyses of 12 crocodilian genomes, revealing relative genomic stasis of retroposition during genome evolution of extant Crocodylia. Our large-scale phylogenetic analysis of amniote CR1 subfamilies suggests the presence of at least seven ancient CR1 lineages in the amniote ancestor; and amniote-wide analyses of CR1 successions and quantities reveal differential retention (presence of ancient relics or recent activity) of these CR1 lineages across amniote genome evolution. Interestingly, birds and lepidosaurs retained the fewest ancient CR1 lineages among amniotes and also exhibit smaller genome sizes. Our study is the first to analyze CR1 evolution in a genome-wide and amniote-wide context and the data strongly suggest that the ancestral amniote genome contained myriad CR1 elements from multiple ancient lineages, and remnants of these are still detectable in the relatively stable genomes of crocodilians and turtles. Early mammalian genome evolution was thus characterized by a drastic shift from CR1 prevalence to dominance and hyperactivity of L2 LINEs in monotremes and L1 LINEs in therians.
Collapse
Affiliation(s)
- Alexander Suh
- Institute of Experimental Pathology (ZMBE), University of Münster, Germany Department of Evolutionary Biology (EBC), Uppsala University, Sweden
| | - Gennady Churakov
- Institute of Experimental Pathology (ZMBE), University of Münster, Germany
| | - Meganathan P Ramakodi
- Department of Biochemistry, Molecular Biology, Entomology and Plant Pathology, Mississippi State University Institute for Genomics, Biocomputing and Biotechnology, Mississippi State University Present address: Cancer Prevention and Control Program, Fox Chase Cancer Center, Philadelphia, PA Present address: Department of Biology, Temple University, Philadelphia, PA
| | - Roy N Platt
- Department of Biochemistry, Molecular Biology, Entomology and Plant Pathology, Mississippi State University Institute for Genomics, Biocomputing and Biotechnology, Mississippi State University Department of Biological Sciences, Texas Tech University
| | - Jerzy Jurka
- Genetic Information Research Institute, Mountain View, California
| | - Kenji K Kojima
- Genetic Information Research Institute, Mountain View, California
| | | | - Arian F Smit
- Institute for Systems Biology, Seattle, Washington
| | | | - Federico G Hoffmann
- Department of Biochemistry, Molecular Biology, Entomology and Plant Pathology, Mississippi State University Institute for Genomics, Biocomputing and Biotechnology, Mississippi State University
| | - Jürgen Brosius
- Institute of Experimental Pathology (ZMBE), University of Münster, Germany
| | - Richard E Green
- Department of Biomolecular Engineering, University of California
| | - Edward L Braun
- Department of Biology and Genetics Institute, University of Florida
| | - David A Ray
- Department of Biochemistry, Molecular Biology, Entomology and Plant Pathology, Mississippi State University Institute for Genomics, Biocomputing and Biotechnology, Mississippi State University Department of Biological Sciences, Texas Tech University
| | - Jürgen Schmitz
- Institute of Experimental Pathology (ZMBE), University of Münster, Germany
| |
Collapse
|
19
|
Naville M, Chalopin D, Volff JN. Interspecies insertion polymorphism analysis reveals recent activity of transposable elements in extant coelacanths. PLoS One 2014; 9:e114382. [PMID: 25470617 PMCID: PMC4255032 DOI: 10.1371/journal.pone.0114382] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2014] [Accepted: 11/10/2014] [Indexed: 01/29/2023] Open
Abstract
Coelacanths are lobe-finned fish represented by two extant species, Latimeria chalumnae in South Africa and Comoros and L. menadoensis in Indonesia. Due to their intermediate phylogenetic position between ray-finned fish and tetrapods in the vertebrate lineage, they are of great interest from an evolutionary point of view. In addition, extant specimens look similar to 300 million-year-old fossils; because of their apparent slowly evolving morphology, coelacanths have been often described as « living fossils ». As an underlying cause of such a morphological stasis, several authors have proposed a slow evolution of the coelacanth genome. Accordingly, sequencing of the L. chalumnae genome has revealed a globally low substitution rate for protein-coding regions compared to other vertebrates. However, genome and gene evolution can also be influenced by transposable elements, which form a major and dynamic part of vertebrate genomes through their ability to move, duplicate and recombine. In this work, we have searched for evidence of transposition activity in coelacanth genomes through the comparative analysis of orthologous genomic regions from both Latimeria species. Comparison of 5.7 Mb (0.2%) of the L. chalumnae genome with orthologous Bacterial Artificial Chromosome clones from L. menadoensis allowed the identification of 27 species-specific transposable element insertions, with a strong relative contribution of CR1 non-LTR retrotransposons. Species-specific homologous recombination between the long terminal repeats of a new coelacanth endogenous retrovirus was also detected. Our analysis suggests that transposon activity is responsible for at least 0.6% of genome divergence between both Latimeria species. Taken together, this study demonstrates that coelacanth genomes are not evolutionary inert: they contain recently active transposable elements, which have significantly contributed to post-speciation genome divergence in Latimeria.
Collapse
Affiliation(s)
- Magali Naville
- Institut de Génomique Fonctionnelle de Lyon, Ecole Normale Supérieure de Lyon, Lyon, France
| | - Domitille Chalopin
- Institut de Génomique Fonctionnelle de Lyon, Ecole Normale Supérieure de Lyon, Lyon, France
| | - Jean-Nicolas Volff
- Institut de Génomique Fonctionnelle de Lyon, Ecole Normale Supérieure de Lyon, Lyon, France
- * E-mail:
| |
Collapse
|
20
|
Yamaguchi K, Kajikawa M, Okada N. Integrated mechanism for the generation of the 5' junctions of LINE inserts. Nucleic Acids Res 2014; 42:13269-79. [PMID: 25378331 PMCID: PMC4245944 DOI: 10.1093/nar/gku1067] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open
Abstract
To elucidate the molecular mechanism of the integration of long interspersed elements (LINEs), we characterized the 5′ ends of more than 200 LINE de novo retrotransposition events into chicken DT40 or human HeLa cells. Human L1 inserts produced 15-bp target-site duplications (TSDs) and zebrafish ZfL2-1 inserts produced 5-bp TSDs in DT40 cells, suggesting that TSD length depends on the LINE species. Further analysis of 5′ junctions revealed that the 5′-end-joining pathways of LINEs can be divided into two fundamental types—annealing or direct. We also found that the generation of 5′ inversions depends on host and LINE species. These results led us to propose a new model for 5′-end joining, the type of which is determined by the extent of exposure of 3′ overhangs generated after the second-strand cleavage and by the involvement of host factors.
Collapse
Affiliation(s)
- Katsumi Yamaguchi
- Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, 4259-B-15 Nagatsuta-cho, Midori-ku, Yokohama, Kanagawa 226-8501, Japan
| | - Masaki Kajikawa
- Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, 4259-B-15 Nagatsuta-cho, Midori-ku, Yokohama, Kanagawa 226-8501, Japan
| | - Norihiro Okada
- Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, 4259-B-15 Nagatsuta-cho, Midori-ku, Yokohama, Kanagawa 226-8501, Japan Department of Life Sciences, National Cheng Kung University, Tainan 701, Taiwan Foundation for Advancement of International Science, Tsukuba 305-0821, Japan
| |
Collapse
|
21
|
Metcalfe CJ, Casane D. Modular organization and reticulate evolution of the ORF1 of Jockey superfamily transposable elements. Mob DNA 2014; 5:19. [PMID: 25093042 PMCID: PMC4120745 DOI: 10.1186/1759-8753-5-19] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2014] [Accepted: 05/30/2014] [Indexed: 02/03/2023] Open
Abstract
Background Long interspersed nuclear elements (LINES) are the most common transposable element (TE) in almost all metazoan genomes examined. In most LINE superfamilies there are two open reading frames (ORFs), and both are required for transposition. The ORF2 is well characterized, while the structure and function of the ORF1 is less well understood. ORF1s have been classified into five types based on structural organization and the domains identified. Here we perform a large scale analysis of ORF1 domains of 448 elements from the Jockey superfamily using multiple alignments and Hidden Markov Model (HMM)-HMM comparisons. Results Three major lineages, Chicken repeat 1 (CR1), LINE2 (L2) and Jockey, were identified. All Jockey lineage elements have the same type of ORF1. In contrast, in the L2 and CR1 lineage elements, all five ORF1 types are found, with no one type of ORF1 predominating. A plant homeodomain (PHD) is much more prevalent than previously suspected. ORF1 type variations involving the PHD domain were found in many subgroups of the L2 and CR1 lineages. A Jockey lineage-like ORF1 with a PHD domain was found in both lineages. A phylogenetic analysis of this ORF1 suggests that it has been horizontally transferred. Likewise, an esterase containing ORF1 type was only found in two exclusively vertebrate L2 and CR1 groups, indicating that it may have been acquired in a vertebrate common ancestor and then transferred between the lineages. Conclusions The ORF1 of the CR1 and L2 lineages is very structurally diverse. The presence of a PHD domain in many ORF1s of the L2 and CR1 lineages is suggestive of domain shuffling. There is also evidence of possible horizontal transfer of entire ORF1s between lineages. In conclusion, while the structure of the ORF2 appears to be highly constrained and its evolution tree-like, the structure of the ORF1 within the CR1 and L2 lineages is much more variable and its evolution reticulate.
Collapse
Affiliation(s)
- Cushla J Metcalfe
- Universidade de São Paulo, Instituto de Biociências, Rua do Matão 277, Cidade Universitária, São Paulo 05508-090 SP, Brazil
| | - Didier Casane
- Laboratoire Evolution, Génomes et Spéciation, UPR9034 CNRS, 1 avenue de la terrasse, 91198 Gif-sur-Yvette, France ; Université Paris Diderot, Sorbonne Paris Cité, 5 rue Thomas-Mann, 75205 Paris, France
| |
Collapse
|
22
|
A multicopy Y-chromosomal SGNH hydrolase gene expressed in the testis of the platyfish has been captured and mobilized by a Helitron transposon. BMC Genet 2014; 15:44. [PMID: 24712907 PMCID: PMC4021074 DOI: 10.1186/1471-2156-15-44] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2013] [Accepted: 03/19/2014] [Indexed: 01/25/2023] Open
Abstract
Background Teleost fish present a high diversity of sex determination systems, with possible frequent evolutionary turnover of sex chromosomes and sex-determining genes. In order to identify genes involved in male sex determination and differentiation in the platyfish Xiphophorus maculatus, bacterial artificial chromosome contigs from the sex-determining region differentiating the Y from the X chromosome have been assembled and analyzed. Results A novel three-copy gene called teximY (for testis-expressed in Xiphophorus maculatus on the Y) was identified on the Y but not on the X chromosome. A highly related sequence called texim1, probably at the origin of the Y-linked genes, as well as three more divergent texim genes were detected in (pseudo)autosomal regions of the platyfish genome. Texim genes, for which no functional data are available so far in any organism, encode predicted esterases/lipases with a SGNH hydrolase domain. Texim proteins are related to proteins from very different origins, including proteins encoded by animal CR1 retrotransposons, animal platelet-activating factor acetylhydrolases (PAFah) and bacterial hydrolases. Texim gene distribution is patchy in animals. Texim sequences were detected in several fish species including killifish, medaka, pufferfish, sea bass, cod and gar, but not in zebrafish. Texim-like genes are also present in Oikopleura (urochordate), Amphioxus (cephalochordate) and sea urchin (echinoderm) but absent from mammals and other tetrapods. Interestingly, texim genes are associated with a Helitron transposon in different fish species but not in urochordates, cephalochordates and echinoderms, suggesting capture and mobilization of an ancestral texim gene in the bony fish lineage. RT-qPCR analyses showed that Y-linked teximY genes are preferentially expressed in testis, with expression at late stages of spermatogenesis (late spermatids and spermatozeugmata). Conclusions These observations suggest either that TeximY proteins play a role in Helitron transposition in the male germ line in fish, or that texim genes are spermatogenesis genes mobilized and spread by transposable elements in fish genomes.
Collapse
|
23
|
Schneider AM, Schmidt S, Jonas S, Vollmer B, Khazina E, Weichenrieder O. Structure and properties of the esterase from non-LTR retrotransposons suggest a role for lipids in retrotransposition. Nucleic Acids Res 2013; 41:10563-72. [PMID: 24003030 PMCID: PMC3905857 DOI: 10.1093/nar/gkt786] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Non-LTR retrotransposons are mobile genetic elements and play a major role in eukaryotic genome evolution and disease. Similar to retroviruses they encode a reverse transcriptase, but their genomic integration mechanism is fundamentally different, and they lack homologs of the retroviral nucleocapsid-forming protein Gag. Instead, their first open reading frames encode distinct multi-domain proteins (ORF1ps) presumed to package the retrotransposon-encoded RNA into ribonucleoprotein particles (RNPs). The mechanistic roles of ORF1ps are poorly understood, particularly of ORF1ps that appear to harbor an enzymatic function in the form of an SGNH-type lipolytic acetylesterase. We determined the crystal structures of the coiled coil and esterase domains of the ORF1p from the Danio rerio ZfL2-1 element. We demonstrate a dimerization of the coiled coil and a hydrolytic activity of the esterase. Furthermore, the esterase binds negatively charged phospholipids and liposomes, but not oligo-(A) RNA. Unexpectedly, the esterase can split into two dynamic half-domains, suited to engulf long fatty acid substrates extending from the active site. These properties indicate a role for lipids and membranes in non-LTR retrotransposition. We speculate that Gag-like membrane targeting properties of ORF1ps could play a role in RNP assembly and in membrane-dependent transport or localization processes.
Collapse
Affiliation(s)
- Anna M Schneider
- Department of Biochemistry, Max Planck Institute for Developmental Biology, Spemannstrasse 35, 72076 Tübingen, Germany and Friedrich Miescher Laboratory of the Max Planck Society, Spemannstrasse 39, 72076 Tübingen, Germany
| | | | | | | | | | | |
Collapse
|
24
|
RNA-Mediated Gene Duplication and Retroposons: Retrogenes, LINEs, SINEs, and Sequence Specificity. INTERNATIONAL JOURNAL OF EVOLUTIONARY BIOLOGY 2013; 2013:424726. [PMID: 23984183 PMCID: PMC3747384 DOI: 10.1155/2013/424726] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/14/2013] [Accepted: 07/01/2013] [Indexed: 11/18/2022]
Abstract
A substantial number of “retrogenes” that are derived from the mRNA of various intron-containing genes have been reported. A class of mammalian retroposons, long interspersed element-1 (LINE1, L1), has been shown to be involved in the reverse transcription of retrogenes (or processed pseudogenes) and non-autonomous short interspersed elements (SINEs). The 3′-end sequences of various SINEs originated from a corresponding LINE. As the 3′-untranslated regions of several LINEs are essential for retroposition, these LINEs presumably require “stringent” recognition of the 3′-end sequence of the RNA template. However, the 3′-ends of mammalian L1s do not exhibit any similarity to SINEs, except for the presence of 3′-poly(A) repeats. Since the 3′-poly(A) repeats of L1 and Alu SINE are critical for their retroposition, L1 probably recognizes the poly(A) repeats, thereby mobilizing not only Alu SINE but also cytosolic mRNA. Many flowering plants only harbor L1-clade LINEs and a significant number of SINEs with poly(A) repeats, but no homology to the LINEs. Moreover, processed pseudogenes have also been found in flowering plants. I propose that the ancestral L1-clade LINE in the common ancestor of green plants may have recognized a specific RNA template, with stringent recognition then becoming relaxed during the course of plant evolution.
Collapse
|
25
|
Kovačić F, Granzin J, Wilhelm S, Kojić-Prodić B, Batra-Safferling R, Jaeger KE. Structural and functional characterisation of TesA - a novel lysophospholipase A from Pseudomonas aeruginosa. PLoS One 2013; 8:e69125. [PMID: 23874889 PMCID: PMC3715468 DOI: 10.1371/journal.pone.0069125] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2013] [Accepted: 06/04/2013] [Indexed: 11/19/2022] Open
Abstract
TesA from Pseudomonas aeruginosa belongs to the GDSL hydrolase family of serine esterases and lipases that possess a broad substrate- and regiospecificity. It shows high sequence homology to TAP, a multifunctional enzyme from Escherichia coli exhibiting thioesterase, lysophospholipase A, protease and arylesterase activities. Recently, we demonstrated high arylesterase activity for TesA, but only minor thioesterase and no protease activity. Here, we present a comparative analysis of TesA and TAP at the structural, biochemical and physiological levels. The crystal structure of TesA was determined at 1.9 Å and structural differences were identified, providing a possible explanation for the differences in substrate specificities. The comparison of TesA with other GDSL-hydrolase structures revealed that the flexibility of active-site loops significantly affects their substrate specificity. This assumption was tested using a rational approach: we have engineered the putative coenzyme A thioester binding site of E. coli TAP into TesA of P. aeruginosa by introducing mutations D17S and L162R. This TesA variant showed increased thioesterase activity comparable to that of TAP. TesA is the first lysophospholipase A described for the opportunistic human pathogen P. aeruginosa. The enzyme is localized in the periplasm and may exert important functions in the homeostasis of phospholipids or detoxification of lysophospholipids.
Collapse
Affiliation(s)
- Filip Kovačić
- Institut für Molekulare Enzymtechnologie, Heinrich-Heine Universität Düsseldorf, Forschungszentrum Jülich, Jülich, Germany
| | - Joachim Granzin
- Institute of Complex Systems (ICS-6), Forschungszentrum Jülich, Jülich, Germany
| | - Susanne Wilhelm
- Institut für Molekulare Enzymtechnologie, Heinrich-Heine Universität Düsseldorf, Forschungszentrum Jülich, Jülich, Germany
| | | | | | - Karl-Erich Jaeger
- Institut für Molekulare Enzymtechnologie, Heinrich-Heine Universität Düsseldorf, Forschungszentrum Jülich, Jülich, Germany
| |
Collapse
|
26
|
Metcalfe CJ, Filée J, Germon I, Joss J, Casane D. Evolution of the Australian lungfish (Neoceratodus forsteri) genome: a major role for CR1 and L2 LINE elements. Mol Biol Evol 2012; 29:3529-39. [PMID: 22734051 DOI: 10.1093/molbev/mss159] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Haploid genomes greater than 25,000 Mb are rare, within the animals only the lungfish and some of the salamanders and crustaceans are known to have genomes this large. There is very little data on the structure of genomes this size. It is known, however, that for animal genomes up to 3,000 Mb, there is in general a good correlation between genome size and the percent of the genome composed of repetitive sequence and that this repetitive component is highly dynamic. In this study, we sampled the Australian lungfish genome using three mini-genomic libraries and found that with very little sequence, the results converged on an estimate of 40% of the genome being composed of recognizable transposable elements (TEs), chiefly from the CR1 and L2 long interspersed nuclear element clades. We further characterized the CR1 and L2 elements in the lungfish genome and show that although most CR1 elements probably represent recent amplifications, the L2 elements are more diverse and are more likely the result of a series of amplifications. We suggest that our sampling method has probably underestimated the recognizable TE content. However, on the basis of the most likely sources of error, we suggest that this very large genome is not largely composed of recently amplified, undetected TEs but may instead include a large component of older degenerate TEs. Based on these estimates, and on Thomson's (Thomson K. 1972. An attempt to reconstruct evolutionary changes in the cellular DNA content of lungfish. J Exp Zool. 180:363-372) inference that in the lineage leading to the extant Australian lungfish, there was massive increase in genome size between 350 and 200 mya, after which the size of the genome changed little, we speculate that the very large Australian lungfish genome may be the result of a massive amplification of TEs followed by a long period with a very low rate of sequence removal and some ongoing TE activity.
Collapse
Affiliation(s)
- Cushla J Metcalfe
- Laboratoire Evolution, Génomes et Spéciation, Centre National de la Recherche Scientifique, Gif-sur-Yvette, and Université Paris Diderot, Paris, France
| | | | | | | | | |
Collapse
|
27
|
CR1 retroposons provide a new insight into the phylogeny of Phasianidae species (Aves: Galliformes). Gene 2012; 502:125-32. [PMID: 22565186 DOI: 10.1016/j.gene.2012.04.068] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2011] [Revised: 04/20/2012] [Accepted: 04/22/2012] [Indexed: 01/21/2023]
Abstract
Chicken repeat 1 (CR1) elements, a class of retroposons belonging to non-long-terminal repeats, have been recognized as powerful tools for phylogenetic studies. Here we examine the phylogenetic relationships of 11 Phasianidae species based on CR1 retroposons. Together with 19 loci reported previously, a total of 99 CR1 loci were identified from chicken genome and turkey BAC clone sequences. 75 insertion events were used to address the branching order of 11 species in Phasianidae. The topology of our tree suggests that: 1) Gallus gallus possessed a basal phylogenetic position within Phasianidae and was related to Bambusicola thoracica (BSP=100%); 2) After the split of G. gallus and B. thoracica, Arborophila rufipectus diverged from Phasianidae (BSP=100%). Nine unambiguous insertion events supported a phylogenetic position of A. rufipectus different to previous mitochondrial data suggesting a hybrid origin or an ancient introgression of A. rufipectus; and 3) 22 CR1 insertion events strongly supported the eight phasianids under investigation sharing a common ancestor. Our study has revisited the phylogenetic position of G. gallus and A. rufipectus and provided a new insight into the phylogeny of Phasianidae birds. It showed that a CR1-based methodology has a great potential to be informative within Phasianidae in resolving relationships of closely related species whose radiation and speciation have occurred very recently.
Collapse
|
28
|
Kajikawa M, Sugano T, Sakurai R, Okada N. Low dependency of retrotransposition on the ORF1 protein of the zebrafish LINE, ZfL2-1. Gene 2012; 499:41-7. [PMID: 22405944 DOI: 10.1016/j.gene.2012.02.048] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2011] [Revised: 02/20/2012] [Accepted: 02/22/2012] [Indexed: 10/28/2022]
Abstract
The zebrafish long interspersed element (LINE), ZfL2-1, which belongs to the L2 clade, contains two open reading frames, ORF1 and ORF2. ORF1 encodes a protein containing a coiled-coil motif and an esterase domain, whereas ORF2 encodes a protein containing an endonuclease and a reverse transcriptase domain. To elucidate the functional significance of ORF1 in retrotransposition, we constructed many variants of ZfL2-1 and examined their retrotransposition ability. We concluded: 1) the ORF1 protein is not essential for ZfL2-1 retrotransposition in cultured cells; 2) the translation of ORF1 is required for the translation of ORF2; and 3) ORF2 translation probably occurs via suppression of the ORF1 stop codon, the efficiency of which is influenced by the context of the sequence juxtaposed to the 3' side of the stop codon. These results offer a new perspective on the evolution of the L2 clade LINEs.
Collapse
Affiliation(s)
- Masaki Kajikawa
- Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, 4259-B-15 Nagatsuta-cho, Midori-ku, Yokohama, Kanagawa 226–8501, Japan.
| | | | | | | |
Collapse
|
29
|
Kajikawa M, Yamaguchi K, Okada N. A new mechanism to ensure integration during LINE retrotransposition: a suggestion from analyses of the 5' extra nucleotides. Gene 2012; 505:345-51. [PMID: 22405943 DOI: 10.1016/j.gene.2012.02.047] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2012] [Revised: 02/21/2012] [Accepted: 02/22/2012] [Indexed: 10/28/2022]
Abstract
Long interspersed elements (LINEs) are transposable elements that exist in the chromosomal DNA of most eukaryotes; as such, they have a large impact on the genome evolution of their hosts. LINEs mobilize by a mechanism called retrotransposition in which the LINE RNA is reverse-transcribed into DNA and then integrated into the host chromosome. The integration of the 3' end of the LINE element simultaneously occurs with the initiation of reverse transcription; this process is called target-primed reverse transcription and is one of the important characteristics of LINEs. However, the molecular mechanism of the integration of the 5' end is not well understood. Here, we show that, in cultured cells, the integrants of the zebrafish ZfL2-2 LINE produce extra nucleotides at their 5' ends, and the extra nucleotides originate from their flanking sequences. We also found that, in cultured cells, some integrants of the human L1 LINE and, in their native hosts, some endogenous elements of two other LINEs also contain 5' extra nucleotides of similar origin, suggesting that the mechanism for generation of the 5' extra nucleotides is universal among various LINEs. From these data, we propose a general mechanism for 5' integration in LINE retrotransposition.
Collapse
Affiliation(s)
- Masaki Kajikawa
- Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, 4259-B-15 Nagatsuta-cho, Midori-ku, Yokohama, Kanagawa 226-8501, Japan.
| | | | | |
Collapse
|
30
|
Self-interaction, nucleic acid binding, and nucleic acid chaperone activities are unexpectedly retained in the unique ORF1p of zebrafish LINE. Mol Cell Biol 2011; 32:458-69. [PMID: 22106409 DOI: 10.1128/mcb.06162-11] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Long interspersed elements (LINEs) are mobile elements that comprise a large proportion of many eukaryotic genomes. Although some LINE-encoded open reading frame 1 proteins (ORF1ps) were suggested to be required for LINE mobilization through binding to their RNA, their general role is not known. The ZfL2-1 ORF1p, which belongs to the esterase-type ORF1p, is especially interesting because it has no known RNA-binding domain. Here we demonstrate that ZfL2-1 ORF1p has all the canonical activities associated with known ORF1ps, including self-interaction, nucleic acid binding, and nucleic acid chaperone activities. In particular, we showed that its chaperone activity is reversible, suggesting that the chaperone activities of many other ORF1ps are also reversible. From this discovery, we propose that LINE ORF1ps play a general role in LINE integration by forming a complex with LINE RNA and rearranging its conformation.
Collapse
|
31
|
Fabrick JA, Mathew LG, Tabashnik BE, Li X. Insertion of an intact CR1 retrotransposon in a cadherin gene linked with Bt resistance in the pink bollworm, Pectinophora gossypiella. INSECT MOLECULAR BIOLOGY 2011; 20:651-665. [PMID: 21815956 DOI: 10.1111/j.1365-2583.2011.01095.x] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2023]
Abstract
Three mutations in the Pectinophora gossypiella cadherin gene PgCad1 are linked with resistance to Bacillus thuringiensis (Bt) toxin Cry1Ac. Here we show that the r3 mutation entails recent insertion into PgCad1 of an active chicken repeat (CR1) retrotransposon, designated CR1-1_Pg. Unlike most other CR1 elements, CR1-1_Pg is intact, transcribed by a flanking promoter, contains target site duplications and has a relatively low number of copies. Examination of transcripts from the PgCad1 locus revealed that CR1-1_Pg disrupts both the cadherin protein and a long noncoding RNA of unknown function. Together with previously reported data, these findings show that transposable elements disrupt eight of 12 cadherin alleles linked with resistance to Cry1Ac in three lepidopteran species, indicating that the cadherin locus is a common target for disruption by transposable elements.
Collapse
Affiliation(s)
- Jeffrey A Fabrick
- USDA, ARS, US Arid Land Agricultural Research Center, Maricopa, AZ 85138, USA.
| | | | | | | |
Collapse
|
32
|
Khazina E, Truffault V, Büttner R, Schmidt S, Coles M, Weichenrieder O. Trimeric structure and flexibility of the L1ORF1 protein in human L1 retrotransposition. Nat Struct Mol Biol 2011; 18:1006-14. [PMID: 21822284 DOI: 10.1038/nsmb.2097] [Citation(s) in RCA: 107] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2011] [Accepted: 06/02/2011] [Indexed: 02/07/2023]
Abstract
The LINE-1 (L1) retrotransposon emerges as a major source of human interindividual genetic variation, with important implications for evolution and disease. L1 retrotransposition is poorly understood at the molecular level, and the mechanistic details and evolutionary origin of the L1-encoded L1ORF1 protein (L1ORF1p) are particularly obscure. Here three crystal structures of trimeric L1ORF1p and NMR solution structures of individual domains reveal a sophisticated and highly structured, yet remarkably flexible, RNA-packaging protein. It trimerizes via an N-terminal, ion-containing coiled coil that serves as scaffold for the flexible attachment of the central RRM and the C-terminal CTD domains. The structures explain the specificity for single-stranded RNA substrates, and a mutational analysis indicates that the precise control of domain flexibility is critical for retrotransposition. Although the evolutionary origin of L1ORF1p remains unclear, our data reveal previously undetected structural and functional parallels to viral proteins.
Collapse
Affiliation(s)
- Elena Khazina
- Department of Biochemistry, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | | | | | | | | | | |
Collapse
|
33
|
Kojima KK, Kapitonov VV, Jurka J. Recent expansion of a new Ingi-related clade of Vingi non-LTR retrotransposons in hedgehogs. Mol Biol Evol 2010; 28:17-20. [PMID: 20716533 DOI: 10.1093/molbev/msq220] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
Autonomous non-long terminal repeat (non-LTR) retrotransposons and their repetitive remnants are ubiquitous components of mammalian genomes. Recently, we identified non-LTR retrotransposon families, Ingi-1_AAl and Ingi-1_EE, in two hedgehog genomes. Here we rename them to Vingi-1_AAl and Vingi-1_EE and report a new clade "Vingi," which is a sister clade of Ingi that lacks the ribonuclease H domain. In the European hedgehog genome, there are 11 non-autonomous families of elements derived from Vingi-1_EE by internal deletions. No retrotransposons related to Vingi elements were found in any of the remaining 33 mammalian genomes nearly completely sequenced to date, but we identified several new families of Vingi and Ingi retrotransposons outside mammals. Our data suggest the horizontal transfer of Vingi elements to hedgehog, although the vertical transfer cannot be ruled out. The compact structure and trans-mobilization of nonautonomous derivatives of Vingi can make them useful for in vivo retrotransposition assay system.
Collapse
|
34
|
Lowe CB, Bejerano G, Salama SR, Haussler D. Endangered species hold clues to human evolution. J Hered 2010; 101:437-47. [PMID: 20332163 PMCID: PMC2884192 DOI: 10.1093/jhered/esq016] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2009] [Revised: 01/19/2010] [Accepted: 01/27/2010] [Indexed: 12/20/2022] Open
Abstract
We report that 18 conserved, and by extension functional, elements in the human genome are the result of retroposon insertions that are evolving under purifying selection in mammals. We show evidence that 1 of the 18 elements regulates the expression of ASXL3 during development by encoding an alternatively spliced exon that causes nonsense-mediated decay of the transcript. The retroposon that gave rise to these functional elements was quickly inactivated in the mammalian ancestor, and all traces of it have been lost due to neutral decay. However, the tuatara has maintained a near-ancestral version of this retroposon in its extant genome, which allows us to connect the 18 human elements to the evolutionary events that created them. We propose that conservation efforts over more than 100 years may not have only prevented the tuatara from going extinct but could have preserved our ability to understand the evolutionary history of functional elements in the human genome. Through simulations, we argue that species with historically low population sizes are more likely to harbor ancient mobile elements for long periods of time and in near-ancestral states, making these species indispensable in understanding the evolutionary origin of functional elements in the human genome.
Collapse
Affiliation(s)
- Craig B Lowe
- Center for Biomolecular Science and Engineering, University of California, Santa Cruz, CA 95064, USA
| | | | | | | |
Collapse
|
35
|
Plötner J, Köhler F, Uzzell T, Beerli P, Schreiber R, Guex GD, Hotz H. Evolution of serum albumin intron-1 is shaped by a 5' truncated non-long terminal repeat retrotransposon in western Palearctic water frogs (Neobatrachia). Mol Phylogenet Evol 2009; 53:784-91. [PMID: 19665056 DOI: 10.1016/j.ympev.2009.07.037] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2009] [Revised: 07/30/2009] [Accepted: 07/31/2009] [Indexed: 10/20/2022]
Abstract
A 5' truncated non-LTR CR1-like retrotransposon, named RanaCR1, was identified in the serum albumin intron-1 (SAI-1) of at least seven species of western Palearctic water frogs (WPWF). Based on sequence similarity of the carboxy-terminal region (CTR) of ORF2 and/or the highly conserved 3' untranslated region (3' UTR), RanaCR1-like elements occur also in the genome of Xenopus tropicalis and Rana temporaria. Unlike other CR1 elements, RanaCR1 contains a CA microsatellite in its 3' UTR. The low nucleotide diversity of the 3' UTR compared to the CTR and to SAI-1 suggests that this region still plays a role in WPWF, either as a structure-stabilizing element, or within a species-specific transcriptional network. Length variation of water frog SAI-1 sequences is caused by deletions that extend in some cases beyond the 5' or 3' ends of RanaCR1, probably a result of selection for structural and functional stability of the primary transcript. The impact of RanaCR1 on SAI-1 evolution is also indicated by the significant negative correlation between the length of both SAI-1 and RanaCR1 and the percentage GC content of RanaCR1. Both SAI-1 and RanaCR1 sequences support the sister group relationship of R. perezi and R. saharica, which are placed in the phylogenetic tree at a basal position, the sister clade to other water frog taxa. It also supports the monophyly of the R. lessonae group; of Anatolian water frogs (R. cf. bedriagae), which are not conspecific with R. bedriagae, and of the European ridibunda group. Within the ridibunda clade, Greek frogs are clearly separated, supporting the hypothesis that Balkan water frogs represent a distinct species. Frogs from Atyrau (Kazakhstan), the type locality of R. ridibunda, were heterozygous for a ridibunda and a cf. bedriagae specific allele.
Collapse
Affiliation(s)
- Jörg Plötner
- Museum für Naturkunde, Leibniz-Institut für Evolutions - und Biodiversitätsforschung an der Humboldt-Universität zu Berlin, Invalidenstrasse 43, 10115 Berlin, Germany.
| | | | | | | | | | | | | |
Collapse
|
36
|
Kapitonov VV, Tempel S, Jurka J. Simple and fast classification of non-LTR retrotransposons based on phylogeny of their RT domain protein sequences. Gene 2009; 448:207-13. [PMID: 19651192 DOI: 10.1016/j.gene.2009.07.019] [Citation(s) in RCA: 76] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2009] [Revised: 07/19/2009] [Accepted: 07/22/2009] [Indexed: 11/29/2022]
Abstract
Rapidly growing number of sequenced genomes requires fast and accurate computational tools for analysis of different transposable elements (TEs). In this paper we focus on a rapid and reliable procedure for classification of autonomous non-LTR retrotransposons based on alignment and clustering of their reverse transcriptase (RT) domains. Typically, the RT domain protein sequences encoded by different non-LTR retrotransposons are similar to each other in terms of significant BLASTP E-values. Therefore, they can be easily detected by the routine BLASTP searches of genomic DNA sequences coding for proteins similar to the RT domains of known non-LTR retrotransposons. However, detailed classification of non-LTR retrotransposons, i.e. their assignment to specific clades, is a slow and complex procedure that is not formalized or integrated as a standard set of computational methods and data. Here we describe a tool (RTclass1) designed for the fast and accurate automated assignment of novel non-LTR retrotransposons to known or novel clades using phylogenetic analysis of the RT domain protein sequences. RTclass1 classifies a particular non-LTR retrotransposon based on its RT domain in less than 10 min on a standard desktop computer and achieves 99.5% accuracy. RT1class1 works either as a stand-alone program installed locally or as a web-server that can be accessed distantly by uploading sequence data through the internet (http://www.girinst.org/RTphylogeny/RTclass1).
Collapse
Affiliation(s)
- Vladimir V Kapitonov
- Genetic Information Research Institute, 1925 Landings Dr, Mountain View, CA 94041, USA.
| | | | | |
Collapse
|
37
|
Non-LTR retrotransposons encode noncanonical RRM domains in their first open reading frame. Proc Natl Acad Sci U S A 2009; 106:731-6. [PMID: 19139409 DOI: 10.1073/pnas.0809964106] [Citation(s) in RCA: 107] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Non-LTR retrotransposons (NLRs) are a unique class of mobile genetic elements that have significant impact on the evolution of eukaryotic genomes. However, the molecular details and functions of their encoded proteins, in particular of the accessory ORF1p proteins, are poorly understood. Here, we identify noncanonical RNA-recognition-motifs (RRMs) in several phylogenetically unrelated NLR ORF1p proteins. This provides an explanation for their RNA-binding properties and clearly shows that they are not related to the retroviral nucleocapsid protein Gag, despite the frequent presence of CCHC zinc knuckles. In particular, we characterize the ORF1p protein of the human long interspersed nuclear element 1 (LINE-1 or L1). We show that L1ORF1p is a multidomain protein, consisting of a coiled coil (cc), RRM, and C-terminal domain (CTD). Most importantly, we solved the crystal structure of the RRM domain, which is characterized by extended loops stabilized by unique salt bridges. Furthermore, we demonstrate that L1ORF1p trimerizes via its N-terminal cc domain, and we suggest that this property is functionally important for all homologues. The formation of distinct complexes with single-stranded nucleic acids requires the presence of the RRM and CTD domains on the same polypeptide chain as well as their close cooperation. Finally, the phylogenetic analysis of mammalian L1ORF1p shows an ancient origin of the RRM domain and supports a modular evolution of NLRs.
Collapse
|
38
|
Pidpala OV, Yatsishina AP, Lukash LL. Human mobile genetic elements: Structure, distribution and functional role. CYTOL GENET+ 2008. [DOI: 10.3103/s009545270806011x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
|
39
|
Kapitonov VV, Jurka J. A universal classification of eukaryotic transposable elements implemented in Repbase. Nat Rev Genet 2008; 9:411-2; author reply 414. [PMID: 18421312 DOI: 10.1038/nrg2165-c1] [Citation(s) in RCA: 317] [Impact Index Per Article: 19.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
|
40
|
Ichiyanagi K, Okada N. Mobility pathways for vertebrate L1, L2, CR1, and RTE clade retrotransposons. Mol Biol Evol 2008; 25:1148-57. [PMID: 18343891 DOI: 10.1093/molbev/msn061] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
Autonomous non-long terminal repeat retrotransposons (NLRs) are ubiquitous mobile genetic elements that insert their DNA copies at new locations by retrotransposition. In vertebrates, there are 4 NLR clades, L1, L2, CR1, and RTE, which diverged in the Precambrian era. It has been demonstrated that retrotransposition of L1 and L2 members proceeds via coordinated reactions of targeted DNA cleavage and reverse transcription catalyzed by the NLR-encoded proteins, which are followed by the joining of the 5' (upstream) junction. However, the study on the mobility pathways for vertebrate NLRs is so far limited to L1 and L2. In this report, using target analysis of nested transposons for genomic copies, we studied retrotransposition pathways for a variety of vertebrate NLRs, including those of the L1, L2, CR1, and RTE clades in the human, cow, opossum, chicken, and zebrafish genomes. Thus, this study constitutes the first comprehensive analysis of NLR retrotransposition products in vertebrates. Our data revealed that these elements share similar mechanisms for the cleavages of the 2 target DNA strands and for the initiation of reverse transcription. Possible endonuclease-independent insertions were also identified. Overall, our results suggest the existence of multiple retrotransposition pathways that are conserved among the diverse NLR clades in various vertebrate hosts.
Collapse
Affiliation(s)
- Kenji Ichiyanagi
- Division of Human Genetics, Department of Integrated Genetics, National Institute of Genetics, Yata, Mishima, Shizuoka, Japan
| | | |
Collapse
|
41
|
Kriegs JO, Matzke A, Churakov G, Kuritzin A, Mayr G, Brosius J, Schmitz J. Waves of genomic hitchhikers shed light on the evolution of gamebirds (Aves: Galliformes). BMC Evol Biol 2007; 7:190. [PMID: 17925025 PMCID: PMC2169234 DOI: 10.1186/1471-2148-7-190] [Citation(s) in RCA: 76] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2007] [Accepted: 10/09/2007] [Indexed: 02/04/2023] Open
Abstract
BACKGROUND The phylogenetic tree of Galliformes (gamebirds, including megapodes, currassows, guinea fowl, New and Old World quails, chicken, pheasants, grouse, and turkeys) has been considerably remodeled over the last decades as new data and analytical methods became available. Analyzing presence/absence patterns of retroposed elements avoids the problems of homoplastic characters inherent in other methodologies. In gamebirds, chicken repeats 1 (CR1) are the most prevalent retroposed elements, but little is known about the activity of their various subtypes over time. Ascertaining the fixation patterns of CR1 elements would help unravel the phylogeny of gamebirds and other poorly resolved avian clades. RESULTS We analyzed 1,978 nested CR1 elements and developed a multidimensional approach taking advantage of their transposition in transposition character (TinT) to characterize the fixation patterns of all 22 known chicken CR1 subtypes. The presence/absence patterns of those elements that were active at different periods of gamebird evolution provided evidence for a clade (Cracidae + (Numididae + (Odontophoridae + Phasianidae))) not including Megapodiidae; and for Rollulus as the sister taxon of the other analyzed Phasianidae. Genomic trace sequences of the turkey genome further demonstrated that the endangered African Congo Peafowl (Afropavo congensis) is the sister taxon of the Asian Peafowl (Pavo), rejecting other predominantly morphology-based groupings, and that phasianids are monophyletic, including the sister taxa Tetraoninae and Meleagridinae. CONCLUSION The TinT information concerning relative fixation times of CR1 subtypes enabled us to efficiently investigate gamebird phylogeny and to reconstruct an unambiguous tree topology. This method should provide a useful tool for investigations in other taxonomic groups as well.
Collapse
Affiliation(s)
- Jan Ole Kriegs
- Institute of Experimental Pathology (ZMBE) University of Münster, Von-Esmarch-Str. 56, D-48149 Münster, Germany
| | - Andreas Matzke
- Institute of Experimental Pathology (ZMBE) University of Münster, Von-Esmarch-Str. 56, D-48149 Münster, Germany
| | - Gennady Churakov
- Institute of Experimental Pathology (ZMBE) University of Münster, Von-Esmarch-Str. 56, D-48149 Münster, Germany
| | - Andrej Kuritzin
- Department of Physics and Mathematics, Saint Petersburg State Institute of Technology, 26 Moskovsky av., St.-Petersburg 198013, Russia
| | - Gerald Mayr
- Forschungsinstitut Senckenberg, Division of Ornithology, Senckenberganlage 25, D-60325 Frankfurt am Main, Germany
| | - Jürgen Brosius
- Institute of Experimental Pathology (ZMBE) University of Münster, Von-Esmarch-Str. 56, D-48149 Münster, Germany
| | - Jürgen Schmitz
- Institute of Experimental Pathology (ZMBE) University of Münster, Von-Esmarch-Str. 56, D-48149 Münster, Germany
| |
Collapse
|
42
|
Biedler JK, Tu Z. The Juan non-LTR retrotransposon in mosquitoes: genomic impact, vertical transmission and indications of recent and widespread activity. BMC Evol Biol 2007; 7:112. [PMID: 17620143 PMCID: PMC1947958 DOI: 10.1186/1471-2148-7-112] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2007] [Accepted: 07/09/2007] [Indexed: 01/16/2023] Open
Abstract
BACKGROUND In contrast to DNA-mediated transposable elements (TEs), retrotransposons, particularly non-long terminal repeat retrotransposons (non-LTRs), are generally considered to have a much lower propensity towards horizontal transfer. Detailed studies on site-specific non-LTR families have demonstrated strict vertical transmission. More studies are needed with non-site-specific non-LTR families to determine whether strict vertical transmission is a phenomenon related to site specificity or a more general characteristic of all non-LTRs. Juan is a Jockey clade non-LTR retrotransposon first discovered in mosquitoes that is widely distributed in the mosquito family Culicidae. Being a non-site specific non-LTR, Juan offers an opportunity to further investigate the hypothesis that non-LTRs are genomic elements that are primarily vertically transmitted. RESULTS Systematic analysis of the ~1.3 Gbp Aedes aegypti (Ae. aegypti) genome sequence suggests that Juan-A is the only Juan-type non-LTR in Aedes aegypti. Juan-A is highly reiterated and comprises approximately 3% of the genome. Using minimum cutoffs of 90% length and 70% nucleotide (nt) identity, 663 copies were found by BLAST using the published Juan-A sequence as the query. All 663 copies are at least 95% identical to Juan-A, while 378 of these copies are 99% identical to Juan-A, indicating that the Juan-A family has been transposing recently in evolutionary history. Using the 0.34 Kb 5' UTR as the query, over 2000 copies were identified that may contain internal promoters, leading to questions on the genomic impact of Juan-A. Juan sequences were obtained by PCR, library screening, and database searches for 18 mosquito species of six genera including Aedes, Ochlerotatus, Psorophora, Culex, Deinocerites, and Wyeomyia. Comparison of host and Juan phylogenies shows overall congruence with few exceptions. CONCLUSION Juan-A is a major genomic component in Ae. aegypti and it has been retrotransposing recently in evolutionary history. There are also indications that Juan has been recently active in a wide range of mosquito species. Furthermore, our research demonstrates that a Jockey clade non-LTR without target site-specificity has been sustained by vertical transmission in the mosquito family. These results strengthen the argument that non-LTRs tend to be genomic elements capable of persistence by vertical descent over a long evolutionary time.
Collapse
Affiliation(s)
- James K Biedler
- Department of Biochemistry, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061, USA
| | - Zhijian Tu
- Department of Biochemistry, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061, USA
| |
Collapse
|
43
|
Souza RT, Santos MRM, Lima FM, El-Sayed NM, Myler PJ, Ruiz JC, da Silveira JF. New Trypanosoma cruzi repeated element that shows site specificity for insertion. EUKARYOTIC CELL 2007; 6:1228-38. [PMID: 17526721 PMCID: PMC1951114 DOI: 10.1128/ec.00036-07] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
A new family of site-specific repeated elements identified in Trypanosoma cruzi, which we named TcTREZO, is described here. TcTREZO appears to be a composite repeated element, since three subregions may be defined within it on the basis of sequence similarities with other T. cruzi sequences. Analysis of the distribution of TcTREZO in the genome clearly indicates that it displays site specificity for insertion. Most TcTREZO elements are flanked by conserved sequences. There is a highly conserved 68-bp sequence at the 5' end of the element and a sequence domain of approximately 500 bp without a well-defined borderline at the 3' end. Northern blot hybridization and reverse transcriptase PCR analyses showed that TcTREZO transcripts are expressed as oligo(A)-terminated transcripts whose length corresponds to the unit size of the element (1.6 kb). Transcripts of approximately 0.2 kb derived from a small part of TcTREZO are also detected in steady-state RNA. TcTREZO transcripts are unspliced and not translated. The copy number of TcTREZO sequences was estimated to be approximately 173 copies per haploid genome. TcTREZO appears to have been assembled by insertions of sequences into a progenitor element. Once associated with each other, these subunits were amplified as a new transposable element. TcTREZO shows site specificity for insertion, suggesting that a sequence-specific endonuclease could be responsible for its insertion at a unique site.
Collapse
Affiliation(s)
- Renata T Souza
- Department of Microbiology, Immunology and Parasitology, Escola Paulista de Medicina, UNIFESP, Rua Botucatu, São Paulo, Brazil
| | | | | | | | | | | | | |
Collapse
|
44
|
Honda H, Ichiyanagi K, Suzuki J, Ono T, Koyama H, Kajikawa M, Okada N. A new system for analyzing LINE retrotransposition in the chicken DT40 cell line widely used for reverse genetics. Gene 2007; 395:116-24. [PMID: 17434692 DOI: 10.1016/j.gene.2007.02.017] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2006] [Revised: 02/14/2007] [Accepted: 02/19/2007] [Indexed: 11/15/2022]
Abstract
Long interspersed elements (LINEs) are autonomous transposable elements that proliferate via retrotransposition, which involves reverse transcription of LINE RNAs. It is anticipated that LINE retrotransposition requires both LINE-encoded proteins and host-encoded proteins. However, identification of the host factors, their roles, and the steps at which they act on retrotransposition are poorly understood because of the lack of an appropriate genetic system to study LINE retrotransposition in a series of mutant hosts. To construct such a genetic system, we applied the retrotransposition-indicative cassette method to DT40 cells, a chicken cell line for which a variety of isogenic mutants have been established by gene targeting. Because DT40 cells are non-adherent, we utilized a selective soft agarose medium to allow the formation of colonies of cells that had undergone LINE retrotransposition. Colony formation was completely dependent on the activities of the LINE-encoded proteins and on the presence of the essential 3' region of the LINE RNA. Moreover, the selected colonies indeed carried retrotransposed LINE copies in their chromosomes, with integration features similar to those of genomic (native) LINE copies. This method thus allows the authentic selection of LINE-retrotransposed cells and the approximate recapitulation of retrotransposition events that occur in nature. Therefore, the DT40 cell system established here provides a powerful tool for the elucidation of LINE retrotransposition pathways, the host factors involved, and their roles.
Collapse
Affiliation(s)
- Hiroshi Honda
- Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, 4259-B-21 Nagatsuta-cho, Midori-ku, Yokohama, Kanagawa 226-8501, Japan
| | | | | | | | | | | | | |
Collapse
|
45
|
Ichiyanagi K, Nakajima R, Kajikawa M, Okada N. Novel retrotransposon analysis reveals multiple mobility pathways dictated by hosts. Genome Res 2006; 17:33-41. [PMID: 17151346 PMCID: PMC1716264 DOI: 10.1101/gr.5542607] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
Autonomous non-long-terminal-repeat retrotransposons (NLRs) proliferate by retrotransposition via coordinated reactions of target DNA cleavage and reverse transcription by a mechanism called target-primed reverse transcription (TPRT). Whereas this mechanism guarantees the covalent attachment of the NLR and its target site at the 3' junction, mechanisms for the joining at the 5' junction have been conjectural. To better understand the retrotransposition pathways, we analyzed target-NLR junctions of zebrafish NLRs with a new method of identifying genomic copies that reside within other transposons, termed "target analysis of nested transposons" (TANT). Application of the TANT method revealed various features of the zebrafish NLR integrants; for example, half of the integrants carry extra nucleotides at the 5' junction, which is in stark contrast to the major human NLR, LINE-1. Interestingly, in a cell culture assay, retrotransposition of the zebrafish NLR in heterologous human cells did not bear extra 5' nucleotides, indicating that the choice of the 5' joining pathway is affected by the host. Our results suggest that several pathways exist for NLR retrotransposition and argue in favor of host protein involvement. With genomic sequence information accumulating exponentially, our data demonstrate the general applicability of the TANT method for the analysis of a wide variety of retrotransposons.
Collapse
Affiliation(s)
- Kenji Ichiyanagi
- Department of Biological Sciences, Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, Midori-ku, Yokohama 226-8501, Japan
| | - Ryo Nakajima
- Department of Biological Sciences, Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, Midori-ku, Yokohama 226-8501, Japan
| | - Masaki Kajikawa
- Department of Biological Sciences, Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, Midori-ku, Yokohama 226-8501, Japan
| | - Norihiro Okada
- Department of Biological Sciences, Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, Midori-ku, Yokohama 226-8501, Japan
- Corresponding author.E-mail fax: 81-45-924-5835
| |
Collapse
|
46
|
Schön I, Arkhipova IR. Two families of non-LTR retrotransposons, Syrinx and Daphne, from the Darwinulid ostracod, Darwinula stevensoni. Gene 2006; 371:296-307. [PMID: 16469453 DOI: 10.1016/j.gene.2005.12.007] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2005] [Revised: 12/14/2005] [Accepted: 12/15/2005] [Indexed: 10/25/2022]
Abstract
Two novel families of non-LTR retrotransposons, named Syrinx and Daphne, were cloned and characterized in a putative ancient asexual ostracod Darwinula stevensoni. Phylogenetic analysis reveals that Daphne is the founding member of a novel clade of non-LTR retroelements, which also contains retrotransposon families from the sea urchin and the silkworm and forms a sister clade to L2-like elements. The Syrinx family of non-LTR retrotransposons exhibits evidence of relatively recent activity, manifested in high levels of sequence similarity between individual copies and a three- to ten-fold excess of synonymous substitutions, which is indicative of purifying selection. The Daphne family may have very few copies with intact open reading frames, and exhibits neutral within-family ratio of non-synonymous to synonymous substitutions. It can additionally be characterized by formation of inverted truncated head-to-head structures. All of these features make recent activity less likely than in the Syrinx family. Our results are discussed in light of the evolutionary consequences of long-term asexuality in general and in D. stevensoni in particular.
Collapse
Affiliation(s)
- Isabelle Schön
- Freshwater Biology Section, Royal Belgian Institute of Natural Sciences, Vautierstraat 29, B-1000 Brussels, Belgium
| | | |
Collapse
|
47
|
Pérez-Alegre M, Dubus A, Fernández E. REM1, a new type of long terminal repeat retrotransposon in Chlamydomonas reinhardtii. Mol Cell Biol 2005; 25:10628-38. [PMID: 16287873 PMCID: PMC1291216 DOI: 10.1128/mcb.25.23.10628-10638.2005] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
A new long terminal repeat (LTR) retrotransposon, named REM1, has been identified in the green alga Chlamydomonas reinhardtii. It was found in low copy number, highly methylated, and with an inducible transpositional activity. This retrotransposon is phylogenetically related to Ty3-gypsy LTR retrotransposons and possesses new and unusual structural features. A regulatory module, ORF3p, is present in an inverse transcriptional orientation to that of the polyprotein and contains PHD-finger and chromodomains, which might confer specificity of the target site and are highly conserved in proteins involved in transcriptional regulation by chromatin remodeling. By using different wild-type and mutant strains, we show that CrREM1 was active with a strong transcriptional activity and amplified its copy number in strains that underwent foreign DNA integration and/or genetic crosses. However, integration of CrREM1 was restricted to these events even though the expression of its full-length transcripts remained highly activated. A regulatory mechanism of CrREM1 retrotransposition which would help to minimize its deleterious effects in the host genome is proposed.
Collapse
Affiliation(s)
- Mónica Pérez-Alegre
- Departamento de Bioquímica y Biología Molecular, Edificio Severo Ochoa Planta baja, Facultad de Ciencias, Campus de Rabanales, Universidad de Córdoba, 14071 Córdoba, Spain
| | | | | |
Collapse
|
48
|
Sugano T, Kajikawa M, Okada N. Isolation and characterization of retrotransposition-competent LINEs from zebrafish. Gene 2005; 365:74-82. [PMID: 16356661 DOI: 10.1016/j.gene.2005.09.037] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2005] [Revised: 09/01/2005] [Accepted: 09/27/2005] [Indexed: 11/30/2022]
Abstract
Long interspersed elements (LINEs) are a type of retroposon and are widely distributed in most eukaryotic genomes. LINEs are classified into two groups, the stringent type and relaxed type, based on the recognition of the 3' tail of their own RNA by reverse transcriptase (RT) during retrotransposition. Although most LINEs are thought to belong to the stringent type, retrotransposition studies of the stringent type LINEs are relatively limited compared with those of the relaxed type. We have now isolated two retrotransposition-competent LINEs (ZfL2-1 and ZfL2-2) from the zebrafish genome. Both ZfL2-1 and ZfL2-2 are members of the L2 clade; ZfL2-1 encodes two open reading frames (ORFs) and ZfL2-2 encodes one ORF, and each of the ORFs is required for retrotransposition. Using a retrotransposition assay in HeLa cells, we established that both ZfL2-1 and Zfl2-2 belong to the stringent type. We also demonstrated that an esterase (ES) domain encoded by ZfL2-1 ORF1 strongly enhances its own retrotransposition. The ES domain is encoded only in ORF1 of LINEs classified in the CR1 and L2 clades, although its function or significance in retrotransposition has not been elucidated. Thus, this is the first experimental evidence that the ES domain has an enhancing function during retrotransposition. These zebrafish LINEs will be useful for determining the function of ORF1 and the retrotransposition mechanism of stringent-type LINEs.
Collapse
Affiliation(s)
- Tomohiro Sugano
- Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, Yokohama, Kanagawa 226-8501, Japan
| | | | | |
Collapse
|
49
|
Jurka J, Kapitonov VV, Pavlicek A, Klonowski P, Kohany O, Walichiewicz J. Repbase Update, a database of eukaryotic repetitive elements. Cytogenet Genome Res 2005; 110:462-7. [PMID: 16093699 DOI: 10.1159/000084979] [Citation(s) in RCA: 2338] [Impact Index Per Article: 123.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2003] [Accepted: 04/06/2004] [Indexed: 12/13/2022] Open
Abstract
Repbase Update is a comprehensive database of repetitive elements from diverse eukaryotic organisms. Currently, it contains over 3600 annotated sequences representing different families and subfamilies of repeats, many of which are unreported anywhere else. Each sequence is accompanied by a short description and references to the original contributors. Repbase Update includes Repbase Reports, an electronic journal publishing newly discovered transposable elements, and the Transposon Pub, a web-based browser of selected chromosomal maps of transposable elements. Sequences from Repbase Update are used to screen and annotate repetitive elements using programs such as Censor and RepeatMasker. Repbase Update is available on the worldwide web at http://www.girinst.org/Repbase_Update.html.
Collapse
Affiliation(s)
- J Jurka
- Genetic Information Research Institute, Mountain View, CA 94043, USA.
| | | | | | | | | | | |
Collapse
|
50
|
Ohshima K, Okada N. SINEs and LINEs: symbionts of eukaryotic genomes with a common tail. Cytogenet Genome Res 2005; 110:475-90. [PMID: 16093701 DOI: 10.1159/000084981] [Citation(s) in RCA: 120] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2004] [Accepted: 04/27/2004] [Indexed: 01/26/2023] Open
Abstract
Many SINEs and LINEs have been characterized to date, and examples of the SINE and LINE pair that have the same 3' end sequence have also increased. We report the phylogenetic relationships of nearly all known LINEs from which SINEs are derived, including a new example of a SINE/LINE pair identified in the salmon genome. We also use several biological examples to discuss the impact and significance of SINEs and LINEs in the evolution of vertebrate genomes.
Collapse
Affiliation(s)
- K Ohshima
- School and Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, Yokohama, Japan.
| | | |
Collapse
|