1
|
Kaur D, Agrahari M, Bhattacharya A, Bhattacharya S. The non-LTR retrotransposons of Entamoeba histolytica: genomic organization and biology. Mol Genet Genomics 2022; 297:1-18. [PMID: 34999963 DOI: 10.1007/s00438-021-01843-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2021] [Accepted: 11/26/2021] [Indexed: 11/24/2022]
Abstract
Genome sequence analysis of Entamoeba species revealed various classes of transposable elements. While E. histolytica and E. dispar are rich in non-long terminal repeat (LTR) retrotransposons, E. invadens contains predominantly DNA transposons. Non-LTR retrotransposons of E. histolytica constitute three families of long interspersed nuclear elements (LINEs), and their short, nonautonomous partners, SINEs. They occupy ~ 11% of the genome. The EhLINE1/EhSINE1 family is the most abundant and best studied. EhLINE1 is 4.8 kb, with two ORFs that encode functions needed for retrotransposition. ORF1 codes for the nucleic acid-binding protein, and ORF2 has domains for reverse transcriptase (RT) and endonuclease (EN). Most copies of EhLINEs lack complete ORFs. ORF1p is expressed constitutively, but ORF2p is not detected. Retrotransposition could be demonstrated upon ectopic over expression of ORF2p, showing that retrotransposition machinery is functional. The newly retrotransposed sequences showed a high degree of recombination. In transcriptomic analysis, RNA-Seq reads were mapped to individual EhLINE1 copies. Although full-length copies were transcribed, no full-length 4.8 kb transcripts were seen. Rather, sense transcripts mapped to ORF1, RT and EN domains. Intriguingly, there was strong antisense transcription almost exclusively from the RT domain. These unique features of EhLINE1 could serve to attenuate retrotransposition in E. histolytica.
Collapse
|
2
|
Talavera-López C, Messenger LA, Lewis MD, Yeo M, Reis-Cunha JL, Matos GM, Bartholomeu DC, Calzada JE, Saldaña A, Ramírez JD, Guhl F, Ocaña-Mayorga S, Costales JA, Gorchakov R, Jones K, Nolan MS, Teixeira SMR, Carrasco HJ, Bottazzi ME, Hotez PJ, Murray KO, Grijalva MJ, Burleigh B, Grisard EC, Miles MA, Andersson B. Repeat-Driven Generation of Antigenic Diversity in a Major Human Pathogen, Trypanosoma cruzi. Front Cell Infect Microbiol 2021; 11:614665. [PMID: 33747978 PMCID: PMC7966520 DOI: 10.3389/fcimb.2021.614665] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2020] [Accepted: 01/22/2021] [Indexed: 12/18/2022] Open
Abstract
Trypanosoma cruzi, a zoonotic kinetoplastid protozoan parasite, is the causative agent of American trypanosomiasis (Chagas disease). Having a very plastic, repetitive and complex genome, the parasite displays a highly diverse repertoire of surface molecules, with pivotal roles in cell invasion, immune evasion and pathogenesis. Before 2016, the complexity of the genomic regions containing these genes impaired the assembly of a genome at chromosomal level, making it impossible to study the structure and function of the several thousand repetitive genes encoding the surface molecules of the parasite. We here describe the genome assembly of the Sylvio X10/1 genome sequence, which since 2016 has been used as a reference genome sequence for T. cruzi clade I (TcI), produced using high coverage PacBio single-molecule sequencing. It was used to analyze deep Illumina sequence data from 34 T. cruzi TcI isolates and clones from different geographic locations, sample sources and clinical outcomes. Resolution of the surface molecule gene distribution showed the unusual duality in the organization of the parasite genome, a synteny of the core genomic region with related protozoa flanked by unique and highly plastic multigene family clusters encoding surface antigens. The presence of abundant interspersed retrotransposons in these multigene family clusters suggests that these elements are involved in a recombination mechanism for the generation of antigenic variation and evasion of the host immune response on these TcI strains. The comparative genomic analysis of the cohort of TcI strains revealed multiple cases of such recombination events involving surface molecule genes and has provided new insights into T. cruzi population structure.
Collapse
Affiliation(s)
- Carlos Talavera-López
- Department of Cell and Molecular Biology, Karolinska Institutet, Stockholm, Sweden
- European Bioinformatics Institute, Wellcome Sanger Institute, Hinxton, United Kingdom
| | - Louisa A. Messenger
- Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London, United Kingdom
| | - Michael D. Lewis
- Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London, United Kingdom
| | - Matthew Yeo
- Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London, United Kingdom
| | - João Luís Reis-Cunha
- Departamento de Parasitologia, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
| | - Gabriel Machado Matos
- Departamento de Biologia Celular, Embriologia e Genética, Universidade Federal Santa Catarina, Florianópolis, Brazil
| | | | - José E. Calzada
- Departamento de Parasitología, Instituto Conmemorativo Gorgas de Estudios de la Salud, Ciudad de Panamá, Panama
| | - Azael Saldaña
- Departamento de Parasitología, Instituto Conmemorativo Gorgas de Estudios de la Salud, Ciudad de Panamá, Panama
| | - Juan David Ramírez
- Grupo de Investigaciones Microbiológicas-UR (GIMUR), Departamento de Biología, Facultad de Ciencias Naturales, Universidad del Rosario, Bogotá, Colombia
| | - Felipe Guhl
- Grupo de Investigaciones en Microbiología y Parasitología Tropical (CIMPAT), Tropical Parasitology Research Center, Universidad de Los Andes, Bogotá, Colombia
| | - Sofía Ocaña-Mayorga
- Centro de Investigación para la Salud en América Latina (CISeAL), Escuela de Ciencias Biológicas, Pontificia Universidad Católica del Ecuador, Quito, Ecuador
| | - Jaime A. Costales
- Centro de Investigación para la Salud en América Latina (CISeAL), Escuela de Ciencias Biológicas, Pontificia Universidad Católica del Ecuador, Quito, Ecuador
| | - Rodion Gorchakov
- Sabin Vaccine Institute and Texas Children’s Hospital Center for Vaccine Development, National School of Tropical Medicine, Department of Pediatrics - Tropical Medicine, Baylor College of Medicine, Houston, TX, United States
| | - Kathryn Jones
- Sabin Vaccine Institute and Texas Children’s Hospital Center for Vaccine Development, National School of Tropical Medicine, Department of Pediatrics - Tropical Medicine, Baylor College of Medicine, Houston, TX, United States
| | - Melissa S. Nolan
- Sabin Vaccine Institute and Texas Children’s Hospital Center for Vaccine Development, National School of Tropical Medicine, Department of Pediatrics - Tropical Medicine, Baylor College of Medicine, Houston, TX, United States
| | - Santuza M. R. Teixeira
- Departamento de Bioquímica e Imunologia, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
| | - Hernán José Carrasco
- Laboratorio de Biología Molecular de Protozoarios, Instituto de Medicina Tropical, Facultad de Medicina, Universidad Central de Venezuela, Caracas, Venezuela
| | - Maria Elena Bottazzi
- Sabin Vaccine Institute and Texas Children’s Hospital Center for Vaccine Development, National School of Tropical Medicine, Department of Pediatrics - Tropical Medicine, Baylor College of Medicine, Houston, TX, United States
| | - Peter J. Hotez
- Sabin Vaccine Institute and Texas Children’s Hospital Center for Vaccine Development, National School of Tropical Medicine, Department of Pediatrics - Tropical Medicine, Baylor College of Medicine, Houston, TX, United States
| | - Kristy O. Murray
- Sabin Vaccine Institute and Texas Children’s Hospital Center for Vaccine Development, National School of Tropical Medicine, Department of Pediatrics - Tropical Medicine, Baylor College of Medicine, Houston, TX, United States
| | - Mario J. Grijalva
- Centro de Investigación para la Salud en América Latina (CISeAL), Escuela de Ciencias Biológicas, Pontificia Universidad Católica del Ecuador, Quito, Ecuador
- Department of Biomedical Sciences, Heritage College of Osteopathic Medicine, Infectious and Tropical Disease Institute, Ohio University, Athens, OH, United States
| | - Barbara Burleigh
- Department of Immunology and Infectious Diseases, T.H. Chan School of Public Health, Harvard University, Boston, MA, United States
| | - Edmundo C. Grisard
- Departamento de Microbiologia, Imunologia e Parasitologia, Universidade Federal Santa Catarina, Florianópolis, Brazil
| | - Michael A. Miles
- Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London, United Kingdom
| | - Björn Andersson
- Department of Cell and Molecular Biology, Karolinska Institutet, Stockholm, Sweden
| |
Collapse
|
3
|
Macías F, Afonso-Lehmann R, López MC, Gómez I, Thomas MC. Biology of Trypanosoma cruzi Retrotransposons: From an Enzymatic to a Structural Point of View. Curr Genomics 2018; 19:110-118. [PMID: 29491739 PMCID: PMC5814959 DOI: 10.2174/1389202918666170815150738] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2017] [Revised: 02/13/2017] [Accepted: 04/16/2017] [Indexed: 12/15/2022] Open
Abstract
INTRODUCTION An important portion of the Trypanosoma cruzi genome is composed of mobile genetic elements, which are interspersed with genes on all chromosomes. The L1Tc non-LTR retrotransposon and its truncated version NARTc are the most highly represented and best studied of these elements. L1Tc is actively transcribed in all three forms of the Trypanosoma parasite and encodes the proteins that enable it to autonomously mobilize. This mini review discusses the enzymatic properties of L1Tc that enable its mobilization and possibly the mobilization of other non-autonomous retrotransposons in Trypanosoma. We also briefly review the Hepatitis Delta Virus-like autocatalytic and 2A self-cleaving viral-like sequences contained in L1Tc that regulate post-transcriptional properties such as relative protein abundance and mRNA stability. Special emphasis is placed on the Pr77 dual system, which is based on the RNA pol II-dependent internal promoter of L1Tc and NARTc and the HDV-like ribozyme activity encoded by the first 77 nucleotides of the element's DNA and RNA. The high degree of conservation of the Pr77 sequence, referred to as the "Pr77-hallmark", among different trypanosomatid retroelements suggests that these mobile elements are responsible for the distribution of regulatory sequences within the genome they inhabit. CONCLUSION We also discuss how the involvement of L1Tc and NARTc in the gene regulatory processes of these parasites could justify their domestication and long-term coexistence in these ancient organisms.
Collapse
Affiliation(s)
- Francisco Macías
- Departamento de Biología Molecular, Instituto de Parasitología y Biomedicina López Neyra-CSIC, Consejo Superior de Investigaciones Científicas (IPBLN-CSIC). PTS-Granada. Avda. del Conocimiento S/N, 18016-Granada, Spain
| | - Raquel Afonso-Lehmann
- Departamento de Biología Molecular, Instituto de Parasitología y Biomedicina López Neyra-CSIC, Consejo Superior de Investigaciones Científicas (IPBLN-CSIC). PTS-Granada. Avda. del Conocimiento S/N, 18016-Granada, Spain
| | - Manuel C. López
- Departamento de Biología Molecular, Instituto de Parasitología y Biomedicina López Neyra-CSIC, Consejo Superior de Investigaciones Científicas (IPBLN-CSIC). PTS-Granada. Avda. del Conocimiento S/N, 18016-Granada, Spain
| | - Inmaculada Gómez
- Departamento de Biología Molecular, Instituto de Parasitología y Biomedicina López Neyra-CSIC, Consejo Superior de Investigaciones Científicas (IPBLN-CSIC). PTS-Granada. Avda. del Conocimiento S/N, 18016-Granada, Spain
| | - M. Carmen Thomas
- Departamento de Biología Molecular, Instituto de Parasitología y Biomedicina López Neyra-CSIC, Consejo Superior de Investigaciones Científicas (IPBLN-CSIC). PTS-Granada. Avda. del Conocimiento S/N, 18016-Granada, Spain
| |
Collapse
|
4
|
Govindaraju A, Cortez JD, Reveal B, Christensen SM. Endonuclease domain of non-LTR retrotransposons: loss-of-function mutants and modeling of the R2Bm endonuclease. Nucleic Acids Res 2016; 44:3276-87. [PMID: 26961309 PMCID: PMC4838377 DOI: 10.1093/nar/gkw134] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2015] [Revised: 02/22/2016] [Accepted: 02/23/2016] [Indexed: 01/07/2023] Open
Abstract
Non-LTR retrotransposons are an important class of mobile elements that insert into host DNA by target-primed reverse transcription (TPRT). Non-LTR retrotransposons must bind to their mRNA, recognize and cleave their target DNA, and perform TPRT at the site of DNA cleavage. As DNA binding and cleavage are such central parts of the integration reaction, a better understanding of the endonuclease encoded by non-LTR retrotransposons is needed. This paper explores the R2 endonuclease domain from Bombyx mori using in vitro studies and in silico modeling. Mutations in conserved sequences located across the putative PD-(D/E)XK endonuclease domain reduced DNA cleavage, DNA binding and TPRT. A mutation at the beginning of the first α-helix of the modeled endonuclease obliterated DNA cleavage and greatly reduced DNA binding. It also reduced TPRT when tested on pre-cleaved DNA substrates. The catalytic K was located to a non-canonical position within the second α-helix. A mutation located after the fourth β-strand reduced DNA binding and cleavage. The motifs that showed impaired activity form an extensive basic region. The R2 biochemical and structural data are compared and contrasted with that of two other well characterized PD-(D/E)XK endonucleases, restriction endonucleases and archaeal Holliday junction resolvases.
Collapse
Affiliation(s)
- Aruna Govindaraju
- Department of Biology, University of Texas at Arlington, Arlington, TX 76019-0498, USA
| | - Jeremy D. Cortez
- Department of Biology, University of Texas at Arlington, Arlington, TX 76019-0498, USA
| | - Brad Reveal
- Department of Biology, University of Texas at Arlington, Arlington, TX 76019-0498, USA
| | - Shawn M. Christensen
- Department of Biology, University of Texas at Arlington, Arlington, TX 76019-0498, USA
| |
Collapse
|
5
|
Comparative in-silico genome analysis of Leishmania (Leishmania) donovani: A step towards its species specificity. Meta Gene 2014; 2:782-98. [PMID: 25606461 PMCID: PMC4287845 DOI: 10.1016/j.mgene.2014.10.003] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2014] [Revised: 09/12/2014] [Accepted: 10/04/2014] [Indexed: 12/24/2022] Open
Abstract
Comparative genome analysis of recently sequenced Leishmania (L.) donovani was unexplored so far. The present study deals with the complete scanning of L. (L.) donovani genome revealing its interspecies variations. 60 distinctly present genes in L. (L.) donovani were identified when the whole genome was compared with Leishmania (L.) infantum. Similarly 72, 159, and 265 species specific genes were identified in L. (L.) donovani when compared to Leishmania (L.) major, Leishmania (L.) mexicana and Leishmania (Viannia) braziliensis respectively. The cross comparison of L. (L.) donovani in parallel with the other sequenced species of leishmanial led to the identification of 55 genes which are highly specific and expressed exclusively in L. (L.) donovani. We found mainly the discrepancies of surface proteins such as amastins, proteases, and peptidases. Also 415 repeat containing proteins in L. (L.) donovani and their differential distribution in other leishmanial species were identified which might have a potential role during pathogenesis. The genes identified can be evaluated as drug targets for anti-leishmanial treatment, exploring the scope for extensive future investigations. Comparative genome analysis identifies 55 species specific L. (L.) donovani genes. Discrepancies of surface proteins such as amastins, proteases, and peptidases are identified in L. (L.) donovani. Apical Membrane Antigen (AMA1) might be a novel factor which helps L. (L.) donovani invasion. Novel A2 and amastin genes in L. (L.) donovani genome are identified. Our study identifies differential gene distribution in L. (L.) donovani with respect to other leishmanial species.
Collapse
|
6
|
Mukha DV, Pasyukova EG, Kapelinskaya TV, Kagramanova AS. Endonuclease domain of the Drosophila melanogaster R2 non-LTR retrotransposon and related retroelements: a new model for transposition. Front Genet 2013; 4:63. [PMID: 23637706 PMCID: PMC3636483 DOI: 10.3389/fgene.2013.00063] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2012] [Accepted: 04/05/2013] [Indexed: 01/25/2023] Open
Abstract
The molecular mechanisms of the transposition of non-long terminal repeat (non-LTR) retrotransposons are not well understood; the key questions of how the 3′-ends of cDNA copies integrate and how site-specific integration occurs remain unresolved. Integration depends on properties of the endonuclease (EN) domain of retrotransposons. Using the EN domain of the Drosophila R2 retrotransposon as a model for other, closely related non-LTR retrotransposons, we investigated the EN domain and found that it resembles archaeal Holliday-junction resolvases. We suggest that these non-LTR retrotransposons are co-transcribed with the host transcript. Combined with the proposed resolvase activity of the EN domain, this model yields a novel mechanism for site-specific retrotransposition within this class of retrotransposons, with resolution proceeding via a Holliday junction intermediate.
Collapse
Affiliation(s)
- Dmitry V Mukha
- Vavilov Institute of General Genetics, Russian Academy of Sciences Moscow, Russia
| | | | | | | |
Collapse
|
7
|
Teixeira SM, de Paiva RMC, Kangussu-Marcolino MM, Darocha WD. Trypanosomatid comparative genomics: Contributions to the study of parasite biology and different parasitic diseases. Genet Mol Biol 2012; 35:1-17. [PMID: 22481868 PMCID: PMC3313497 DOI: 10.1590/s1415-47572012005000008] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2011] [Accepted: 10/18/2011] [Indexed: 01/23/2023] Open
Abstract
In 2005, draft sequences of the genomes of Trypanosoma brucei, Trypanosoma cruzi and Leishmania major, also known as the Tri-Tryp genomes, were published. These protozoan parasites are the causative agents of three distinct insect-borne diseases, namely sleeping sickness, Chagas disease and leishmaniasis, all with a worldwide distribution. Despite the large estimated evolutionary distance among them, a conserved core of ~6,200 trypanosomatid genes was found among the Tri-Tryp genomes. Extensive analysis of these genomic sequences has greatly increased our understanding of the biology of these parasites and their host-parasite interactions. In this article, we review the recent advances in the comparative genomics of these three species. This analysis also includes data on additional sequences derived from other trypanosmatid species, as well as recent data on gene expression and functional genomics. In addition to facilitating the identification of key parasite molecules that may provide a better understanding of these complex diseases, genome studies offer a rich source of new information that can be used to define potential new drug targets and vaccine candidates for controlling these parasitic infections.
Collapse
Affiliation(s)
- Santuza M Teixeira
- Departamento de Bioquímica e Imunologia, Universidade Federal de Minas Gerais, Belo Horizonte, MG, Brazil
| | | | | | | |
Collapse
|
8
|
R2 retrotransposons encode a self-cleaving ribozyme for processing from an rRNA cotranscript. Mol Cell Biol 2010; 30:3142-50. [PMID: 20421411 DOI: 10.1128/mcb.00300-10] [Citation(s) in RCA: 75] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
The non-long terminal repeat (non-LTR) retrotransposon R2 is inserted into the 28S rRNA genes of many animals. Expression of the element appears to be by cotranscription with the rRNA gene unit. We show here that processing of the rRNA cotranscript at the 5' end of the R2 element in Drosophila simulans is rapid and utilizes an unexpected mechanism. Using RNA synthesized in vitro, the 5' untranslated region of R2 was shown capable of rapid and efficient self-cleavage of the 28S-R2 cotranscript. The 5' end generated in vitro by the R2 ribozyme was at the position identical to that found for in vivo R2 transcripts. The RNA segment corresponding to the R2 ribozyme could be folded into a double pseudoknot structure similar to that of the hepatitis delta virus (HDV) ribozyme. Remarkably, 21 of the nucleotide positions in and around the active site of the HDV ribozyme were identical in R2. R2 elements from other Drosophila species were also shown to encode HDV-like ribozymes capable of self-cleavage. Tracing their sequence evolution in the Drosophila lineage suggests that the extensive similarity of the R2 ribozyme from D. simulans to that of HDV was a result of convergent evolution, not common descent.
Collapse
|
9
|
Respuela P, Ferella M, Rada-Iglesias A, Åslund L. Histone acetylation and methylation at sites initiating divergent polycistronic transcription in Trypanosoma cruzi. J Biol Chem 2008; 283:15884-92. [PMID: 18400752 PMCID: PMC3259629 DOI: 10.1074/jbc.m802081200] [Citation(s) in RCA: 73] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2008] [Indexed: 01/23/2023] Open
Abstract
Trypanosomes are ancient eukaryotic parasites in which the protein-coding genes, organized in large polycistronic clusters on both strands, are transcribed from as yet unidentified promoters. In an effort to reveal transcriptional initiation sites, we examined the Trypanosoma cruzi genome for histone modification patterns shown to be linked to active genes in various organisms. Here, we show that acetylated and methylated histones were found to be enriched at strand switch regions of divergent gene arrays, not at convergent clusters or intra- and intergenic regions within clusters. The modified region showed a bimodular profile with two peaks centered over the 5'-regions of the gene pair flanking the strand switch region. This pattern, which demarcates polycistronic transcription units originating from bidirectional initiation sites, is likely to be common in kinetoplastid parasites as well as in other organisms with polycistronic transcription. In contrast, no acetylation was found at promoters of the highly expressed rRNA and spliced leader genes or satellite DNA or at tested retrotransposonal elements. These results reveal, for the first time, the presence of specific epigenetic marks in T. cruzi with potential implications for transcriptional regulation; they indicate that both histone modifications and bidirectional transcription are evolutionarily conserved.
Collapse
Affiliation(s)
- Patricia Respuela
- Department of Genetics and Pathology,
Rudbeck Laboratory, Uppsala University, SE-751 85 Uppsala, Sweden and the
Program for Genomics and Bioinformatics,
Department of Cell and Molecular Biology, Karolinska Institute, SE-171 77
Stockholm, Sweden
| | - Marcela Ferella
- Department of Genetics and Pathology,
Rudbeck Laboratory, Uppsala University, SE-751 85 Uppsala, Sweden and the
Program for Genomics and Bioinformatics,
Department of Cell and Molecular Biology, Karolinska Institute, SE-171 77
Stockholm, Sweden
| | - Alvaro Rada-Iglesias
- Department of Genetics and Pathology,
Rudbeck Laboratory, Uppsala University, SE-751 85 Uppsala, Sweden and the
Program for Genomics and Bioinformatics,
Department of Cell and Molecular Biology, Karolinska Institute, SE-171 77
Stockholm, Sweden
| | - Lena Åslund
- Department of Genetics and Pathology,
Rudbeck Laboratory, Uppsala University, SE-751 85 Uppsala, Sweden and the
Program for Genomics and Bioinformatics,
Department of Cell and Molecular Biology, Karolinska Institute, SE-171 77
Stockholm, Sweden
| |
Collapse
|
10
|
Patrick KL, Luz PM, Ruan JP, Shi H, Ullu E, Tschudi C. Genomic rearrangements and transcriptional analysis of the spliced leader-associated retrotransposon in RNA interference-deficient Trypanosoma brucei. Mol Microbiol 2007; 67:435-47. [PMID: 18067542 DOI: 10.1111/j.1365-2958.2007.06057.x] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
The Trypanosoma brucei genome is colonized by the site-specific non-LTR retrotransposon SLACS, or spliced leader-associated conserved sequence, which integrates exclusively into the spliced leader (SL) RNA genes. Although there is evidence that the RNA interference (RNAi) machinery regulates SLACS transcript levels, we do not know whether RNAi deficiency affects the genomic stability of SLACS, nor do we understand the mechanism of SLACS transcription. Here, we report that prolonged culturing of RNAi-deficient T. brucei cells, but not wild-type cells, results in genomic rearrangements of SLACS. Furthermore, two populations of SLACS transcripts persist in RNAi-deficient cells: a full-length transcript of approximately 7 kb and a heterogeneous population of small SLACS transcripts ranging in size from 450 to 550 nt. We provide evidence that SLACS transcription initiates at the +1 of the interrupted SL RNA gene and proceeds into the 5' UTR and open reading frame 1 (ORF1). This transcription is carried out by an RNA polymerase with alpha-amanitin sensitivity reminiscent of SL RNA synthesis and is dependent on the SL RNA promoter. Additionally, we show that both sense and antisense small SLACS transcripts originate from ORF1 and that they are associated with proteins in vivo. We speculate that the small SLACS transcripts serve as substrates for the production of siRNAs to regulate SLACS expression.
Collapse
Affiliation(s)
- Kristin L Patrick
- Department of Epidemiology and Public Health, Yale University Medical School, 295 Congress Avenue, New Haven, CT 06536-0812, USA
| | | | | | | | | | | |
Collapse
|
11
|
Ekanayake DK, Cipriano MJ, Sabatini R. Telomeric co-localization of the modified base J and contingency genes in the protozoan parasite Trypanosoma cruzi. Nucleic Acids Res 2007; 35:6367-77. [PMID: 17881368 PMCID: PMC2095807 DOI: 10.1093/nar/gkm693] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2007] [Revised: 08/14/2007] [Accepted: 08/22/2007] [Indexed: 12/20/2022] Open
Abstract
Base J or beta-d-glucosylhydroxymethyluracil is a modification of thymine residues within the genome of kinetoplastid parasites. In organisms known to contain the modified base, J is located mainly within the telomeric repeats. However, in Trypanosoma brucei, a small fraction of J is also located within the silent subtelomeric variant surface glycoprotein (VSG) gene expression sites, but not in the active expression site, suggesting a role for J in regulating telomeric genes involved in pathogenesis. With the identification of surface glycoprotein genes adjacent to telomeres in the South American Trypanosome, Trypanosoma cruzi, we became interested in the telomeric distribution of base J. Analysis of J and telomeric repeat sequences by J immunoblots and Southern blots following DNA digestion, reveals approximately 25% of J outside the telomeric repeat sequences. Moreover, the analysis of DNA sequences immunoprecipitated with J antiserum, localized J within subtelomeric regions rich in life-stage-specific surface glycoprotein genes involved in pathogenesis. Interestingly, the pattern of J within these regions is developmentally regulated. These studies provide a framework to characterize the role of base J in the regulation of telomeric gene expression/diversity in T. cruzi.
Collapse
Affiliation(s)
| | | | - Robert Sabatini
- Department of Biochemistry and Molecular Biology, University of Georgia, Athens, GA, USA
| |
Collapse
|
12
|
Souza RT, Santos MRM, Lima FM, El-Sayed NM, Myler PJ, Ruiz JC, da Silveira JF. New Trypanosoma cruzi repeated element that shows site specificity for insertion. EUKARYOTIC CELL 2007; 6:1228-38. [PMID: 17526721 PMCID: PMC1951114 DOI: 10.1128/ec.00036-07] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
A new family of site-specific repeated elements identified in Trypanosoma cruzi, which we named TcTREZO, is described here. TcTREZO appears to be a composite repeated element, since three subregions may be defined within it on the basis of sequence similarities with other T. cruzi sequences. Analysis of the distribution of TcTREZO in the genome clearly indicates that it displays site specificity for insertion. Most TcTREZO elements are flanked by conserved sequences. There is a highly conserved 68-bp sequence at the 5' end of the element and a sequence domain of approximately 500 bp without a well-defined borderline at the 3' end. Northern blot hybridization and reverse transcriptase PCR analyses showed that TcTREZO transcripts are expressed as oligo(A)-terminated transcripts whose length corresponds to the unit size of the element (1.6 kb). Transcripts of approximately 0.2 kb derived from a small part of TcTREZO are also detected in steady-state RNA. TcTREZO transcripts are unspliced and not translated. The copy number of TcTREZO sequences was estimated to be approximately 173 copies per haploid genome. TcTREZO appears to have been assembled by insertions of sequences into a progenitor element. Once associated with each other, these subunits were amplified as a new transposable element. TcTREZO shows site specificity for insertion, suggesting that a sequence-specific endonuclease could be responsible for its insertion at a unique site.
Collapse
Affiliation(s)
- Renata T Souza
- Department of Microbiology, Immunology and Parasitology, Escola Paulista de Medicina, UNIFESP, Rua Botucatu, São Paulo, Brazil
| | | | | | | | | | | | | |
Collapse
|
13
|
Smith DF, Peacock CS, Cruz AK. Comparative genomics: from genotype to disease phenotype in the leishmaniases. Int J Parasitol 2007; 37:1173-86. [PMID: 17645880 PMCID: PMC2696322 DOI: 10.1016/j.ijpara.2007.05.015] [Citation(s) in RCA: 90] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2007] [Revised: 05/18/2007] [Accepted: 05/21/2007] [Indexed: 11/29/2022]
Abstract
Recent progress in sequencing the genomes of several Leishmania species, causative agents of cutaneous, mucocutaneous and visceral leishmaniasis, is revealing unusual features of potential relevance to parasite virulence and pathogenesis in the host. While the genomes of Leishmania major, Leishmania braziliensis and Leishmania infantum are highly similar in content and organisation, species-specific genes and mechanisms distinguish one from another. In particular, the presence of retrotransposons and the components of a putative RNA interference machinery in L. braziliensis suggest the potential for both greater diversity and more tractable experimentation in this Leishmania Viannia species.
Collapse
Affiliation(s)
- Deborah F Smith
- Immunology and Infection Unit, Department of Biology/Hull York Medical School, University of York, Heslington, York YO10 5YW, UK.
| | | | | |
Collapse
|
14
|
Peacock CS, Seeger K, Harris D, Murphy L, Ruiz JC, Quail MA, Peters N, Adlem E, Tivey A, Aslett M, Kerhornou A, Ivens A, Fraser A, Rajandream MA, Carver T, Norbertczak H, Chillingworth T, Hance Z, Jagels K, Moule S, Ormond D, Rutter S, Squares R, Whitehead S, Rabbinowitsch E, Arrowsmith C, White B, Thurston S, Bringaud F, Baldauf SL, Faulconbridge A, Jeffares D, Depledge DP, Oyola SO, Hilley JD, Brito LO, Tosi LRO, Barrell B, Cruz AK, Mottram JC, Smith DF, Berriman M. Comparative genomic analysis of three Leishmania species that cause diverse human disease. Nat Genet 2007; 39:839-47. [PMID: 17572675 PMCID: PMC2592530 DOI: 10.1038/ng2053] [Citation(s) in RCA: 567] [Impact Index Per Article: 33.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2006] [Accepted: 05/04/2007] [Indexed: 12/23/2022]
Abstract
Leishmania parasites cause a broad spectrum of clinical disease. Here we report the sequencing of the genomes of two species of Leishmania: Leishmania infantum and Leishmania braziliensis. The comparison of these sequences with the published genome of Leishmania major reveals marked conservation of synteny and identifies only approximately 200 genes with a differential distribution between the three species. L. braziliensis, contrary to Leishmania species examined so far, possesses components of a putative RNA-mediated interference pathway, telomere-associated transposable elements and spliced leader-associated SLACS retrotransposons. We show that pseudogene formation and gene loss are the principal forces shaping the different genomes. Genes that are differentially distributed between the species encode proteins implicated in host-pathogen interactions and parasite survival in the macrophage.
Collapse
Affiliation(s)
- Christopher S Peacock
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridgeshire CB10 1SA, UK.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
15
|
Wickstead B, Ersfeld K, Gull K. Repetitive elements in genomes of parasitic protozoa. Microbiol Mol Biol Rev 2003; 67:360-75, table of contents. [PMID: 12966140 PMCID: PMC193867 DOI: 10.1128/mmbr.67.3.360-375.2003] [Citation(s) in RCA: 58] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Repetitive DNA elements have been a part of the genomic fauna of eukaryotes perhaps since their very beginnings. Millions of years of coevolution have given repeats central roles in chromosome maintenance and genetic modulation. Here we review the genomes of parasitic protozoa in the context of the current understanding of repetitive elements. Particular reference is made to repeats in five medically important species with ongoing or completed genome sequencing projects: Plasmodium falciparum, Leishmania major, Trypanosoma brucei, Trypanosoma cruzi, and Giardia lamblia. These organisms are used to illustrate five thematic classes of repeats with different structures and genomic locations. We discuss how these repeat classes may interact with parasitic life-style and also how they can be used as experimental tools. The story which emerges is one of opportunism and upheaval which have been employed to add genetic diversity and genomic flexibility.
Collapse
Affiliation(s)
- Bill Wickstead
- Sir William Dunn School of Pathology, University of Oxford, Oxford OX1 3RE, United Kingdom
| | | | | |
Collapse
|
16
|
Bringaud F, Biteau N, Melville SE, Hez S, El-Sayed NM, Leech V, Berriman M, Hall N, Donelson JE, Baltz T. A new, expressed multigene family containing a hot spot for insertion of retroelements is associated with polymorphic subtelomeric regions of Trypanosoma brucei. EUKARYOTIC CELL 2002; 1:137-51. [PMID: 12455980 PMCID: PMC118050 DOI: 10.1128/ec.1.1.137-151.2002] [Citation(s) in RCA: 75] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
We describe a novel gene family that forms clusters in subtelomeric regions of Trypanosoma brucei chromosomes and partially accounts for the observed clustering of retrotransposons. The ingi and ribosomal inserted mobile element (RIME) non-LTR retrotransposons share 250 bp at both extremities and are the most abundant putatively mobile elements, with about 500 copies per haploid genome. From cDNA clones and subsequently in the T. brucei genomic DNA databases, we identified 52 homologous gene and pseudogene sequences, 16 of which contain a RIME and/or ingi retrotransposon inserted at exactly the same relative position. Here these genes are called the RHS family, for retrotransposon hot spot. Comparison of the protein sequences encoded by RHS genes (21 copies) and pseudogenes (24 copies) revealed a conserved central region containing an ATP/GTP-binding motif and the RIME/ingi insertion site. The RHS proteins share between 13 and 96% identity, and six subfamilies, RHS1 to RHS6, can be defined on the basis of their divergent C-terminal domains. Immunofluorescence and Western blot analyses using RHS subfamily-specific immune sera show that RHS proteins are constitutively expressed and occur mainly in the nucleus. Analysis of Genome Survey Sequence databases indicated that the Trypanosoma brucei diploid genome contains about 280 RHS (pseudo)genes. Among the 52 identified RHS (pseudo)genes, 48 copies are in three RHS clusters located in subtelomeric regions of chromosomes Ia and II and adjacent to the active bloodstream form expression site in T. brucei strain TREU927/4 GUTat10.1. RHS genes comprise the remaining sequence of the size-polymorphic "repetitive region" described for T. brucei chromosome I, and a homologous gene family is present in the Trypanosoma cruzi genome.
Collapse
Affiliation(s)
- Frederic Bringaud
- Laboratoire de Parasitologie Moléculaire, Université Victor Segalen Bordeaux II, UMR-5016 CNRS, 33076 Bordeaux, France.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
17
|
Arkhipova IR, Morrison HG. Three retrotransposon families in the genome of Giardia lamblia: two telomeric, one dead. Proc Natl Acad Sci U S A 2001; 98:14497-502. [PMID: 11734649 PMCID: PMC64710 DOI: 10.1073/pnas.231494798] [Citation(s) in RCA: 78] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023] Open
Abstract
Transposable elements inhabiting eukaryotic genomes are generally regarded either as selfish DNA, which is selectively neutral to the host organism, or as parasitic DNA, deleterious to the host. Thus far, the only agreed-upon example of beneficial eukaryotic transposons is provided by Drosophila telomere-associated retrotransposons, which transpose directly to the chromosome ends and thereby protect them from degradation. This article reports the transposon content of the genome of the protozoan Giardia lamblia, one of the earliest-branching eukaryotes. A total of three non-long terminal repeat retrotransposon families have been identified, two of which are located at the ends of chromosomes, and the third one contains exclusively dead copies with multiple internal deletions, nucleotide substitutions, and frame shifts. No other reverse transcriptase- or transposase-related sequences were found. Thus, the entire genome of this protozoan, which is not known to reproduce sexually, contains only retrotransposons that are either confined to telomeric regions and possibly beneficial, or inactivated and completely nonfunctional.
Collapse
Affiliation(s)
- I R Arkhipova
- Department of Molecular and Cellular Biology, Harvard University, 7 Divinity Avenue, Cambridge, MA 02138, USA.
| | | |
Collapse
|
18
|
Anzai T, Takahashi H, Fujiwara H. Sequence-specific recognition and cleavage of telomeric repeat (TTAGG)(n) by endonuclease of non-long terminal repeat retrotransposon TRAS1. Mol Cell Biol 2001; 21:100-8. [PMID: 11113185 PMCID: PMC88784 DOI: 10.1128/mcb.21.1.100-108.2001] [Citation(s) in RCA: 44] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The telomere of the silkworm Bombyx mori consists of (TTAGG/CCTAA)(n) repeats and harbors a large number of telomeric repeat-specific non-long terminal repeat retrotransposons, such as TRAS1 and SART1. To understand how these retrotransposons recognize and integrate into the telomeric repeat in a sequence-specific manner, we expressed the apurinic-apryrimidinic endonuclease-like endonuclease domain of TRAS1 (TRAS1 EN), which is supposed to digest the target DNA, and characterized its enzymatic properties. Purified TRAS1 EN could generate specific nicks on both strands of the telomeric repeat sequence between T and A of the (TTAGG)(n) strand (bottom strand) and between C and T of the (CCTAA)(n) strand (top strand). These sites are consistent with insertion sites expected from the genomic structure of boundary regions of TRAS1. Time course studies of nicking activities on both strands revealed that the cleavages on the bottom strand preceded those on the top strand, supporting the target-primed reverse transcription model. TRAS1 EN could cleave the telomeric repeats specifically even if it was flanked by longer tracts of nontelomeric sequence, indicating that the target site specificity of the TRAS1 element was mainly determined by its EN domain. Based on mutation analyses, TRAS1 EN recognizes less than 10 bp around the initial cleavage site (upstream 7 bp and downstream 3 bp), and the GTTAG sequence especially is essential for the cleavage reaction on the bottom strand (5'. TTAGGTT downward arrow AGG. 3'). TRAS1 EN, the first identified endonuclease digesting telomeric repeats, may be used as a genetic tool to shorten the telomere in insects and some other organisms.
Collapse
Affiliation(s)
- T Anzai
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, University of Tokyo, Bunkyo-ku, Tokyo 113-0033, Japan
| | | | | |
Collapse
|
19
|
Vazquez M, Ben-Dov C, Lorenzi H, Moore T, Schijman A, Levin MJ. The short interspersed repetitive element of Trypanosoma cruzi, SIRE, is part of VIPER, an unusual retroelement related to long terminal repeat retrotransposons. Proc Natl Acad Sci U S A 2000; 97:2128-33. [PMID: 10688909 PMCID: PMC15765 DOI: 10.1073/pnas.050578397] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The short interspersed repetitive element (SIRE) of Trypanosoma cruzi was first detected when comparing the sequences of loci that encode the TcP2beta genes. It is present in about 1,500-3,000 copies per genome, depending on the strain, and it is distributed in all chromosomes. An initial analysis of SIRE sequences from 21 genomic fragments allowed us to derive a consensus nucleotide sequence and structure for the element, consisting of three regions (I, II, and III) each harboring distinctive features. Analysis of 158 transcribed SIREs demonstrates that the consensus is highly conserved. The sequences of 51 cDNAs show that SIRE is included in the 3' end of several mRNAs, always transcribed from the sense strand, contributing the polyadenylation site in 63% of the cases. This study led to the characterization of VIPER (vestigial interposed retroelement), a 2,326-bp-long unusual retroelement. VIPER's 5' end is formed by the first 182 bp of SIRE, whereas its 3' end is formed by the last 220 bp of the element. Both SIRE moieties are connected by a 1,924-bp-long fragment that carries a unique ORF encoding a complete reverse transcriptase-RNase H gene whose 15 C-terminal amino acids derive from codons specified by SIRE's region II. The amino acid sequence of VIPER's reverse transcriptase-RNase H shares significant homology to that of long terminal repeat retrotransposons. The fact that SIRE and VIPER sequences are found only in the T. cruzi genome may be of relevance for studies concerning the evolution and the genome flexibility of this protozoan parasite.
Collapse
Affiliation(s)
- M Vazquez
- Laboratorio de Biología Molecular de la Enfermedad de Chagas, Instituto de Investigaciones en Ingenieria Genética y Biologia Molecular, Vuelta de Obligado 2490, 1428 Buenos Aires, Argentina
| | | | | | | | | | | |
Collapse
|
20
|
Yang J, Malik HS, Eickbush TH. Identification of the endonuclease domain encoded by R2 and other site-specific, non-long terminal repeat retrotransposable elements. Proc Natl Acad Sci U S A 1999; 96:7847-52. [PMID: 10393910 PMCID: PMC22150 DOI: 10.1073/pnas.96.14.7847] [Citation(s) in RCA: 110] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The non-long terminal repeat (LTR) retrotransposon, R2, encodes a sequence-specific endonuclease responsible for its insertion at a unique site in the 28S rRNA genes of arthropods. Although most non-LTR retrotransposons encode an apurinic-like endonuclease upstream of a common reverse transcriptase domain, R2 and many other site-specific non-LTR elements do not (CRE1 and 2, SLACS, CZAR, Dong, R4). Sequence comparison of these site-specific elements has revealed that the region downstream of their reverse transcriptase domain is conserved and shares sequence features with various prokaryotic restriction endonucleases. In particular, these non-LTR elements have a Lys/Arg-Pro-Asp-X12-14aa-Asp/Glu motif known to lie near the scissile phosphodiester bonds in the protein-DNA complexes of restriction enzymes. Site-directed mutagenesis of the R2 protein was used to provide evidence that this motif is also part of the active site of the endonuclease encoded by this element. Mutations of this motif eliminate both DNA-cleavage activities of the R2 protein: first-strand cleavage in which the exposed 3' end is used to prime reverse transcription of the RNA template and second-strand cleavage, which occurs after reverse transcription. The general organization of the R2 protein appears similar to the type IIS restriction enzyme, FokI, in which specific DNA binding is controlled by a separate domain located amino terminal to the cleavage domain. Previous phylogenetic analysis of their reverse transcriptase domains has indicated that the non-LTR elements identified here as containing restriction-like endonucleases are the oldest lineages of non-LTR elements, suggesting a scenario for the evolution of non-LTR elements.
Collapse
Affiliation(s)
- J Yang
- Department of Biology, University of Rochester, Rochester, NY 14627-0211, USA
| | | | | |
Collapse
|
21
|
Takahashi H, Okazaki S, Fujiwara H. A new family of site-specific retrotransposons, SART1, is inserted into telomeric repeats of the silkworm, Bombyx mori. Nucleic Acids Res 1997; 25:1578-84. [PMID: 9092665 PMCID: PMC146635 DOI: 10.1093/nar/25.8.1578] [Citation(s) in RCA: 77] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
The telomeres of the silkworm, Bombyx mori, consist of pentanucleotide repeats (TTAGG)n . We previously characterized the non-LTR element TRAS1, which terminates with oligo (A) in a head to tail orientation at the exact position (between A and C) of the (CCTAA) n repeats. Here we characterized another family of telomere-specific non-LTR retrotransposon named SART1. The SART1 family was inserted at another site of the (TTAGG) n in a reverse orientation from that of TRAS1. The complete unit of SART1, 6.7 kb in length with a poly (A) stretch, contains two open reading frames encoding putative gag and pol products, overlapping by 54 bp in the -1 reading frame. Most of the 600 SART1 copies in the silkworm haploid genome are completely conserved in structure without 5'truncation. All SART1 sequences analyzed were inserted at the same position (between T and A) within the (TTAGG) n repeats. Fluorescence in situ hybridization showed that many of the SART1 copies were localized in the chromosomal ends. A phylogenetic tree showed that the SART1, TRAS1 and two other site-specific elements, R1 and RT, which insert into 28S ribosomal RNA genes in insects, belong to the same group. Based on the orientation for the chromosomal insertion and structural similarities, these elements could be further classified into two subgroups, R1/TRAS1 and RT/SART1, suggesting that the target specificity of the two telomere-associated elements was changed independently.
Collapse
Affiliation(s)
- H Takahashi
- Department of Biological Sciences, Graduate School of Science, University of Tokyo, Hongo, Bunkyo-ku, Tokyo 113, Japan
| | | | | |
Collapse
|
22
|
Luan DD, Eickbush TH. Downstream 28S gene sequences on the RNA template affect the choice of primer and the accuracy of initiation by the R2 reverse transcriptase. Mol Cell Biol 1996; 16:4726-34. [PMID: 8756630 PMCID: PMC231473 DOI: 10.1128/mcb.16.9.4726] [Citation(s) in RCA: 43] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open
Abstract
R2 non-long terminal repeat retrotransposable elements insert at a unique site in the 28S rRNA genes of insects. The protein encoded by the single open reading frame of R2 is capable of conducting the initial steps of its integration in vitro. The protein nicks the noncoding strand of the 28S target DNA (the strand which serves as a template for RNA synthesis) and uses the 3' hydroxyl group exposed by this nick to prime reverse transcription of the R2 RNA template. This target-primed reverse transcription (TPRT) reaction requires that the RNA template contains the 250-nucleotide 3' untranslated region of the R2 element. If this RNA template ends at the precise 3' end of the R2 element, then extra nucleotides, which we refer to as nontemplated nucleotides, are added to the target before cDNA synthesis. The presence of downstream 28S gene sequences on the RNA template reduces the total efficiency but eliminates these nontemplated additions, resulting in nearly 90% of all TPRT products reproducing the 3' junctions seen in vivo. Templates with 5 to 10 nucleotides of the 28S sequence are used most efficiently in this in vitro TPRT reaction. The requirement for downstream 28S rRNA sequences probably explains why the R2 elements of most insects differ from the majority of non-long terminal repeat retrotransposons in that they do not contain an A-rich repeat at their 3' junction with the target DNA. The presence of downstream sequences on these in vitro R2 templates also revealed that the R2 reverse transcriptase can prime cDNA synthesis by using the 3' end of another RNA molecule. This RNA-primed cDNA synthesis is not based on sequence complementarity between the RNA primer and the R2 template. The ability to use the 3' end of a noncomplementary RNA molecule has also been seen with the reverse transcriptase of the mitochondrial Mauriceville plasmid of Neurospora crassa.
Collapse
Affiliation(s)
- D D Luan
- Department of Biology, University of Rochester, New York 14627, USA
| | | |
Collapse
|
23
|
Teng SC, Wang SX, Gabriel A. A new non-LTR retrotransposon provides evidence for multiple distinct site-specific elements in Crithidia fasciculata miniexon arrays. Nucleic Acids Res 1995; 23:2929-36. [PMID: 7659515 PMCID: PMC307132 DOI: 10.1093/nar/23.15.2929] [Citation(s) in RCA: 19] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023] Open
Abstract
We have identified a new member of the family of trypanosome site-specific retrotransposons, using a degenerate oligonucleotide PCR strategy. The 9595 bp element, termed Crithidia retrotransposable element 2 (CRE2), was cloned and found to be inserted in the tandemly arrayed miniexon genes of Crithidia fasciculata. The element is flanked by 29 bp target site duplications but lacks the 3' poly dA tract characteristic of most other non-long terminal repeat retrotransposons. The amino terminal region of the single 2518-codon open reading frame contains a putative metal-binding motif and a proline-rich region similar to gag-like domains of other retrotransposons. The carboxy terminal region of this open reading frame shares sequence homology with the reverse transcriptase and putative endonuclease regions of three previously described trypanosomatid site-specific retrotransposons. All four of these retrotransposons are specifically inserted between nucleotides 11 and 12 of the highly conserved 39mer sequence of the miniexon gene. Most copies of CRE2 and the previously characterized CRE1 are located on different sized chromosomes. Additional CRE-related sequences were identified by screening Crithidia libraries. These results suggest that a particular sequence in the C. fasciculata miniexon repeat is the target for multiple distinct site-specific retrotransposon insertions.
Collapse
Affiliation(s)
- S C Teng
- Department of Molecular Biology and Biochemistry, Rutgers University, Piscataway, NJ 08855, USA
| | | | | |
Collapse
|
24
|
Okazaki S, Ishikawa H, Fujiwara H. Structural analysis of TRAS1, a novel family of telomeric repeat-associated retrotransposons in the silkworm, Bombyx mori. Mol Cell Biol 1995; 15:4545-52. [PMID: 7623845 PMCID: PMC230694 DOI: 10.1128/mcb.15.8.4545] [Citation(s) in RCA: 84] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023] Open
Abstract
We characterized TRAS1, a retrotransposable element which was inserted into the telomeric repetitive sequence (CCTAA)n of the silkworm, Bombyx mori. The complete sequence of TRAS1, a stretch of 7.8 kb with a poly(A) tract at the 3' end, was determined. No long terminal repeat (LTR) was found at the termini of the element. TRAS1 contains gag- and pol-like open reading frames (ORFs) which are similar to those of non-LTR retrotransposons. The two ORFs overlap but are one nucleotide out of frame (+1 frameshift). Most of the approximately 250 copies of TRAS1 elements in the genome were highly conserved in the structure. Chromosomal in situ hybridization showed that TRAS1 elements are clustered at the telomeres of Bombyx chromosomes. A phylogenetic analysis using the amino acid sequence of the reverse transcriptase domain within the pol-like ORF revealed that TRAS1 falls into one lineage with R1, which is a family of non-LTR retrotransposons inserted into the same site within the 28S ribosomal DNA unit in most insects. TRAS1 may have been derived from R1 and changed the target specificity so that TRAS1 inserts into the telomeric repetitive sequence (CCTAA)n. Southern hybridization and Bal 31 exonuclease analyses showed that TRAS1 elements are clustered proximal to the terminal long tract of (CCTAA)n. TRAS1 is a novel family of non-LTR retrotransposons which are inserted into the telomeric repetitive sequences as target sites.
Collapse
Affiliation(s)
- S Okazaki
- Zoological Institute, Graduate School of Science, University of Tokyo, Japan
| | | | | |
Collapse
|
25
|
Expression of a retroposon-like sequence upstream of the putative Trypanosoma brucei variant surface glycoprotein gene expression site promoter. Mol Cell Biol 1993. [PMID: 8413293 DOI: 10.1128/mcb.13.11.7036] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
We have cloned the region spanning the putative promoter from two variant surface glycoprotein gene expression sites that are at each end of chromosome M4 of Trypanosoma brucei IsTat 7. Both expression sites contain a retroposon-like sequence (ESR) pseudogene whose 3' end is approximately 30 bp upstream of the putative expression site promoter. The ESRs from both expression sites share considerable sequence homology and are related to LINE-like elements, especially the T. brucei ingi retroposon. Other ESRs are located on large, but not intermediate or mini-, chromosomes in the IsTaR 1 serodeme, and the total copy number is 10 to 20, similar to that estimated for variant surface glycoprotein expression sites. No DNA rearrangements in the vicinity of the ESR and putative expression site promoter were detected following antigenic switches in the IsTaR 1 serodeme. ESR transcripts are present in bloodstream, but not procyclic, forms. Variation in transcript size and sequence between bloodstream variant antigenic types implies that only the ESR from the active expression site is transcribed. This pattern of expression reflects that of sequences downstream of the putative expression site promoter, suggesting that the region of coordinately controlled expression extends upstream of this promoter.
Collapse
|
26
|
Lodes MJ, Smiley BL, Stadnyk AW, Bennett JL, Myler PJ, Stuart K. Expression of a retroposon-like sequence upstream of the putative Trypanosoma brucei variant surface glycoprotein gene expression site promoter. Mol Cell Biol 1993; 13:7036-44. [PMID: 8413293 PMCID: PMC364765 DOI: 10.1128/mcb.13.11.7036-7044.1993] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023] Open
Abstract
We have cloned the region spanning the putative promoter from two variant surface glycoprotein gene expression sites that are at each end of chromosome M4 of Trypanosoma brucei IsTat 7. Both expression sites contain a retroposon-like sequence (ESR) pseudogene whose 3' end is approximately 30 bp upstream of the putative expression site promoter. The ESRs from both expression sites share considerable sequence homology and are related to LINE-like elements, especially the T. brucei ingi retroposon. Other ESRs are located on large, but not intermediate or mini-, chromosomes in the IsTaR 1 serodeme, and the total copy number is 10 to 20, similar to that estimated for variant surface glycoprotein expression sites. No DNA rearrangements in the vicinity of the ESR and putative expression site promoter were detected following antigenic switches in the IsTaR 1 serodeme. ESR transcripts are present in bloodstream, but not procyclic, forms. Variation in transcript size and sequence between bloodstream variant antigenic types implies that only the ESR from the active expression site is transcribed. This pattern of expression reflects that of sequences downstream of the putative expression site promoter, suggesting that the region of coordinately controlled expression extends upstream of this promoter.
Collapse
Affiliation(s)
- M J Lodes
- Seattle Biomedical Research Institute, Washington 98109-1651
| | | | | | | | | | | |
Collapse
|
27
|
Distinct families of site-specific retrotransposons occupy identical positions in the rRNA genes of Anopheles gambiae. Mol Cell Biol 1992. [PMID: 1328871 DOI: 10.1128/mcb.12.11.5102] [Citation(s) in RCA: 30] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Two distinct site-specific retrotransposon families, named RT1 and RT2, from the sibling mosquito species Anopheles gambiae and A. arabiensis, respectively, were previously identified. Both were shown to occupy identical nucleotide positions in the 28S rRNA gene and to be flanked by identical 17-bp target site duplications. Full-length representatives of each have been isolated from a single species, A. gambiae, and the nucleotide sequences have been analyzed. Beyond insertion specificity, RT1 and RT2 share several structural and sequence features which show them to be members of the LINE-like, or non-long-terminal-repeat retrotransposon, class of reverse transcriptase-encoding mobile elements. These features include two long overlapping open reading frames (ORFs), poly(A) tails, the absence of long terminal repeats, and heterogeneous 5' truncation of most copies. The first ORF of both elements, particularly ORF1 of RT1, is glutamine rich and contains long tracts of polyglutamine reminiscent of the opa repeat. Near the carboxy ends, three cysteine-histidine motifs occur in ORF1 and one occurs in ORF2. In addition, each ORF2 contains a region of sequence similarity to reverse transcriptases and integrases. Alignments of the protein sequences from RT1 and RT2 reveal 36% identity over the length of ORF1 and 60% identity over the length of ORF2, but the elements cannot be aligned in the 5' and 3' noncoding regions. Unlike that of RT2, the 5' noncoding region of RT1 contains 3.5 copies of a 500-bp subrepeat, followed by a poly(T) tract and two imperfect 55-bp subrepeats, the second spanning the beginning of ORF1. The pattern of distribution of these elements among five siblings species in the A. gambiae complex is nonuniform. RT1 is present in laboratory and wild A. gambiae, A. arabiensis, and A. melas but has not been detected in A. quadriannulatus or A. merus. RT2 has been detected in all available members of the A. gambiae complex except A. merus. Copy number fluctuates, even among the offspring of individual wild female A. gambiae mosquitoes. These findings reflect a complex evolutionary history balancing gain and loss of copies against the coexistence of two elements competing for a conserved target site in the same species for perhaps millions of years.
Collapse
|
28
|
Besansky NJ, Paskewitz SM, Hamm DM, Collins FH. Distinct families of site-specific retrotransposons occupy identical positions in the rRNA genes of Anopheles gambiae. Mol Cell Biol 1992; 12:5102-10. [PMID: 1328871 PMCID: PMC360444 DOI: 10.1128/mcb.12.11.5102-5110.1992] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
Two distinct site-specific retrotransposon families, named RT1 and RT2, from the sibling mosquito species Anopheles gambiae and A. arabiensis, respectively, were previously identified. Both were shown to occupy identical nucleotide positions in the 28S rRNA gene and to be flanked by identical 17-bp target site duplications. Full-length representatives of each have been isolated from a single species, A. gambiae, and the nucleotide sequences have been analyzed. Beyond insertion specificity, RT1 and RT2 share several structural and sequence features which show them to be members of the LINE-like, or non-long-terminal-repeat retrotransposon, class of reverse transcriptase-encoding mobile elements. These features include two long overlapping open reading frames (ORFs), poly(A) tails, the absence of long terminal repeats, and heterogeneous 5' truncation of most copies. The first ORF of both elements, particularly ORF1 of RT1, is glutamine rich and contains long tracts of polyglutamine reminiscent of the opa repeat. Near the carboxy ends, three cysteine-histidine motifs occur in ORF1 and one occurs in ORF2. In addition, each ORF2 contains a region of sequence similarity to reverse transcriptases and integrases. Alignments of the protein sequences from RT1 and RT2 reveal 36% identity over the length of ORF1 and 60% identity over the length of ORF2, but the elements cannot be aligned in the 5' and 3' noncoding regions. Unlike that of RT2, the 5' noncoding region of RT1 contains 3.5 copies of a 500-bp subrepeat, followed by a poly(T) tract and two imperfect 55-bp subrepeats, the second spanning the beginning of ORF1. The pattern of distribution of these elements among five siblings species in the A. gambiae complex is nonuniform. RT1 is present in laboratory and wild A. gambiae, A. arabiensis, and A. melas but has not been detected in A. quadriannulatus or A. merus. RT2 has been detected in all available members of the A. gambiae complex except A. merus. Copy number fluctuates, even among the offspring of individual wild female A. gambiae mosquitoes. These findings reflect a complex evolutionary history balancing gain and loss of copies against the coexistence of two elements competing for a conserved target site in the same species for perhaps millions of years.
Collapse
Affiliation(s)
- N J Besansky
- Division of Parasitic Diseases, Centers for Disease Control, Atlanta, Georgia 30333
| | | | | | | |
Collapse
|
29
|
Keller M, Tessier LH, Chan RL, Weil JH, Imbault P. In Euglena, spliced-leader RNA (SL-RNA) and 5S rRNA genes are tandemly repeated. Nucleic Acids Res 1992; 20:1711-5. [PMID: 1579464 PMCID: PMC312261 DOI: 10.1093/nar/20.7.1711] [Citation(s) in RCA: 25] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
In Euglena gracilis, a 26 nucleotide leader sequence (spliced leader sequence = SL) is transferred by trans-splicing to the 5' end of a vast majority of cytoplasmic mRNAs (8). The SL originates from the 5' extremity of a family of closely related snRNAs (SL-RNAs) which are about 100 nucleotide long. In this paper we present the nucleotide sequences of two SL-RNA genes, confirming the sequences previously established by sequencing purified SL-RNAs. Although some SL-RNA genes are dispersed throughout the genome, we show that the majority of SL-RNA genes are located on 0.6 kb repeated units which also encode the cytoplasmic 5S rRNA. We estimate that the copy number of these repeated units is about 300 per haploid genome. The association of SL-RNA and 5S rRNA genes in tandemly repeated units is also found in nematodes but paradoxically does not exist in trypanosomes which are phylogenically much closer to Euglena. We also show that a high number of sequences analogous to the 26 nucleotide SL are dispersed throughout the genome and are not associated with SL-RNAs.
Collapse
Affiliation(s)
- M Keller
- Institut de Biologie Moléculaire des Plantes du CNRS, Université Louis Pasteur, Strasbourg, France
| | | | | | | | | |
Collapse
|