1
Kaur D, Agrahari M, Bhattacharya A, Bhattacharya S. The non-LTR retrotransposons of Entamoeba histolytica: genomic organization and biology. Mol Genet Genomics 2022; 297:1-18. [PMID: 34999963 DOI: 10.1007/s00438-021-01843-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/21/2021] [Accepted: 11/26/2021] [Indexed: 11/24/2022]
Abstract
Genome sequence analysis of Entamoeba species revealed various classes of transposable elements. While E. histolytica and E. dispar are rich in non-long terminal repeat (LTR) retrotransposons, E. invadens contains predominantly DNA transposons. Non-LTR retrotransposons of E. histolytica constitute three families of long interspersed nuclear elements (LINEs), and their short, nonautonomous partners, SINEs. They occupy ~11% of the genome. The EhLINE1/EhSINE1 family is the most abundant and best studied. EhLINE1 is 4.8 kb, with two ORFs that encode functions needed for retrotransposition. ORF1 codes for the nucleic acid-binding protein, and ORF2 has domains for reverse transcriptase (RT) and endonuclease (EN). Most copies of EhLINEs lack complete ORFs. ORF1p is expressed constitutively, but ORF2p is not detected. Retrotransposition could be demonstrated upon ectopic overexpression of ORF2p, showing that the retrotransposition machinery is functional. The newly retrotransposed sequences showed a high degree of recombination. In transcriptomic analysis, RNA-Seq reads were mapped to individual EhLINE1 copies. Although full-length copies were transcribed, no full-length 4.8 kb transcripts were seen. Rather, sense transcripts mapped to ORF1, RT and EN domains. Intriguingly, there was strong antisense transcription almost exclusively from the RT domain. These unique features of EhLINE1 could serve to attenuate retrotransposition in E. histolytica.
2
Whole genome sequencing of Entamoeba nuttalli reveals mammalian host-related molecular signatures and a novel octapeptide-repeat surface protein. PLoS Negl Trop Dis 2019; 13:e0007923. [PMID: 31805050 PMCID: PMC6917348 DOI: 10.1371/journal.pntd.0007923] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Received: 06/28/2019] [Revised: 12/17/2019] [Accepted: 11/12/2019] [Indexed: 11/19/2022] [Open Access]
Abstract
The enteric protozoan Entamoeba histolytica is the causative agent of amebiasis, which is one of the most common parasitic diseases in developed and developing countries. Entamoeba nuttalli is the genetically closest species to E. histolytica in current phylogenetic analyses of Entamoeba species, and is prevalent in wild macaques. Therefore, E. nuttalli may be a key organism in which to investigate molecules required for infection of human or non-human primates. To explore the molecular signatures of host-parasite interactions, we conducted de novo assembly of the E. nuttalli genome, utilizing self-correction of PacBio long reads and polishing corrected reads using Illumina short reads, followed by comparative genomic analysis with two other mammalian and a reptilian Entamoeba species. The final draft assembly of E. nuttalli included 395 contigs with a total length of approximately 23 Mb, and 9,647 predicted genes, of which 6,940 were conserved with E. histolytica. In addition, we found an E. histolytica-specific repeat known as ERE2 in the E. nuttalli genome. GO-term enrichment analysis of mammalian host-related molecules indicated diversification of transmembrane proteins, including AIG1 family and BspA-like proteins that may be involved in the host-parasite interaction. Furthermore, we identified an E. nuttalli-specific protein that contained 42 repeats of an octapeptide ([G,E]KPTDTPS). This protein was shown to be localized on the cell surface using immunofluorescence. Since many repeat-containing proteins in parasites play important roles in interactions with host cells, this unique octapeptide repeat-containing protein may be involved in colonization of E. nuttalli in the intestine of macaques. Overall, our draft assembly provides a valuable resource for studying Entamoeba evolution and host-parasite selection.
3
RNA-Mediated Gene Duplication and Retroposons: Retrogenes, LINEs, SINEs, and Sequence Specificity. Int J Evol Biol 2013; 2013:424726. [PMID: 23984183 PMCID: PMC3747384 DOI: 10.1155/2013/424726] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Received: 05/14/2013] [Accepted: 07/01/2013] [Indexed: 11/18/2022]
Abstract
A substantial number of “retrogenes” that are derived from the mRNA of various intron-containing genes have been reported. A class of mammalian retroposons, long interspersed element-1 (LINE1, L1), has been shown to be involved in the reverse transcription of retrogenes (or processed pseudogenes) and non-autonomous short interspersed elements (SINEs). The 3′-end sequences of various SINEs originated from a corresponding LINE. As the 3′-untranslated regions of several LINEs are essential for retroposition, these LINEs presumably require “stringent” recognition of the 3′-end sequence of the RNA template. However, the 3′-ends of mammalian L1s do not exhibit any similarity to SINEs, except for the presence of 3′-poly(A) repeats. Since the 3′-poly(A) repeats of L1 and Alu SINE are critical for their retroposition, L1 probably recognizes the poly(A) repeats, thereby mobilizing not only Alu SINE but also cytosolic mRNA. Many flowering plants only harbor L1-clade LINEs and a significant number of SINEs with poly(A) repeats, but no homology to the LINEs. Moreover, processed pseudogenes have also been found in flowering plants. I propose that the ancestral L1-clade LINE in the common ancestor of green plants may have recognized a specific RNA template, with stringent recognition then becoming relaxed during the course of plant evolution.
4
Kumari V, Iyer LR, Roy R, Bhargava V, Panda S, Paul J, Verweij JJ, Clark CG, Bhattacharya A, Bhattacharya S. Genomic distribution of SINEs in Entamoeba histolytica strains: implication for genotyping. BMC Genomics 2013; 14:432. [PMID: 23815468 PMCID: PMC3716655 DOI: 10.1186/1471-2164-14-432] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Received: 11/26/2012] [Accepted: 06/20/2013] [Indexed: 11/01/2022] [Open Access]
Abstract
BACKGROUND The major clinical manifestations of Entamoeba histolytica infection include amebic colitis and liver abscess. However, the majority of infections remain asymptomatic. Earlier reports have shown that some E. histolytica isolates are more virulent than others, suggesting that virulence may be linked to genotype. Here we have looked at the genomic distribution of the retrotransposable short interspersed nuclear elements EhSINE1 and EhSINE2. Due to their mobile nature, some EhSINE copies may occupy different genomic locations among isolates of E. histolytica, possibly affecting adjacent gene expression; this variability in location can be exploited to differentiate strains.
RESULTS We have looked for EhSINE1- and EhSINE2-occupied loci in the genome sequence of Entamoeba histolytica HM-1:IMSS and searched for homologous loci in other strains to determine the insertion status of these elements. A total of 393 EhSINE1 and 119 EhSINE2 loci were analyzed in the available sequenced strains (Rahman, DS4-868, HM1:CA, KU48, KU50, KU27 and MS96-3382). Seventeen loci (13 EhSINE1 and 4 EhSINE2) were identified where an EhSINE1/EhSINE2 sequence was missing from the corresponding locus of other strains. Most of these loci were unoccupied in more than one strain. Some of the loci were analyzed experimentally for SINE occupancy using DNA from strain Rahman. These data helped to correctly assemble the nucleotide sequence at three loci in Rahman. SINE occupancy was also checked at these three loci in 7 other axenically cultivated E. histolytica strains and 16 clinical isolates. Each locus gave a single, specific amplicon with the primer sets used, making this a suitable method for strain typing. Based on presence/absence of SINE and amplification with locus-specific primers, the 23 strains could be divided into eleven genotypes. The results obtained by our method correlated with the data from other typing methods. We also report a bioinformatic analysis of EhSINE2 copies.
CONCLUSIONS Our results reveal several loci with extensive polymorphism of SINE occupancy among different strains of E. histolytica and prove the principle that the genomic distribution of SINEs is a valid method for typing of E. histolytica strains.
Affiliation(s)
- Vandana Kumari
- School of Environmental Sciences, Jawaharlal Nehru University, New Delhi 110067, India
- Lakshmi Rani Iyer
- School of Life Sciences, Jawaharlal Nehru University, New Delhi 110067, India
- Riti Roy
- School of Computational and Integrative Sciences, Jawaharlal Nehru University, New Delhi, India
- Varsha Bhargava
- School of Environmental Sciences, Jawaharlal Nehru University, New Delhi 110067, India
- Suchita Panda
- School of Environmental Sciences, Jawaharlal Nehru University, New Delhi 110067, India
- Jaishree Paul
- School of Life Sciences, Jawaharlal Nehru University, New Delhi 110067, India
- Jaco J Verweij
- Laboratory for Medical Microbiology and Immunology, Laboratory for Clinical Pathology, St. Elisabeth Hospital, Tilburg, The Netherlands
- C Graham Clark
- Department of Pathogen Molecular Biology, London School of Hygiene and Tropical Medicine, Keppel Street, London, WC1E 7HT, UK
- Alok Bhattacharya
- School of Life Sciences, Jawaharlal Nehru University, New Delhi 110067, India
- School of Computational and Integrative Sciences, Jawaharlal Nehru University, New Delhi, India
- Sudha Bhattacharya
- School of Environmental Sciences, Jawaharlal Nehru University, New Delhi 110067, India
5
Kumari V, Sharma R, Yadav VP, Gupta AK, Bhattacharya A, Bhattacharya S. Differential distribution of a SINE element in the Entamoeba histolytica and Entamoeba dispar genomes: role of the LINE-encoded endonuclease. BMC Genomics 2011; 12:267. [PMID: 21612594 PMCID: PMC3118788 DOI: 10.1186/1471-2164-12-267] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Received: 01/27/2011] [Accepted: 05/25/2011] [Indexed: 11/10/2022] [Open Access]
Abstract
BACKGROUND Entamoeba histolytica and Entamoeba dispar are closely related protistan parasites, but while E. histolytica can be invasive, E. dispar is completely nonpathogenic. Transposable elements constitute a significant portion of the genome in these species; there are three families of LINEs and SINEs. These elements can profoundly influence the expression of neighboring genes. Thus their genomic location can have important phenotypic consequences. A genome-wide comparison of the location of these elements in the E. histolytica and E. dispar genomes has not been carried out. It is also not known whether the retrotransposition machinery works similarly in both species. The present study was undertaken to address these issues.
RESULTS Here we extracted all genomic occurrences of full-length copies of EhSINE1 in the E. histolytica genome and matched them with the homologous regions in E. dispar, and vice versa, wherever it was possible to establish synteny. We found that only about 20% of syntenic sites were occupied by SINE1 in both species. We checked whether the different genomic location in the two species was due to differences in the activity of the LINE-encoded endonuclease, which is required for nicking the target site. We found that the endonucleases of both species were essentially very similar, both in their kinetic properties and in their substrate sequence specificity. Hence the differential distribution of SINEs in these species is not likely to be influenced by the endonuclease. Further, we found that the physical properties of the DNA sequences adjoining the insertion sites were similar in both species.
CONCLUSIONS Our data show that the basic retrotransposition machinery is conserved in these sibling species. SINEs may indeed have occupied all of the insertion sites in the genome of the common ancestor of E. histolytica and E. dispar, but these may have been subsequently lost from some locations. Alternatively, SINE expansion took place after the divergence of the two species. The absence of SINE1 in 80% of syntenic loci could affect the phenotype of the two species, including their pathogenic properties, which needs to be explored.
Affiliation(s)
- Vandana Kumari
- School of Environmental Sciences, Jawaharlal Nehru University, New Delhi 110067, India
6
Gadzalski M, Sakowicz T. Novel SINEs families in Medicago truncatula and Lotus japonicus: bioinformatic analysis. Gene 2011; 480:21-7. [PMID: 21352903 DOI: 10.1016/j.gene.2011.01.020] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Received: 09/01/2010] [Revised: 12/01/2010] [Accepted: 01/31/2011] [Indexed: 02/02/2023]
Abstract
Although short interspersed elements (SINEs) were discovered nearly 30 years ago, the studies of these genomic repeats were mostly limited to animal genomes. Very little is known about SINEs in legumes, one of the most important plant families. Here we report the identification, genomic distribution and molecular features of six novel SINE elements in Lotus japonicus (named LJ_SINE-1, -2, -3) and Medicago truncatula (MT_SINE-1, -2, -3), model legume species. They possess all the structural features commonly found in short interspersed elements, including an RNA polymerase III promoter, a polyA tail and flanking repeats. The SINEs described here are present in low to moderate copy numbers, from 150 to 3000. Bioinformatic analyses were used to search public databases; we have shown that three of the new SINE elements from M. truncatula seem to be characteristic of the Medicago and Trifolium genera. Two SINE families have been found in L. japonicus and one is present in both M. truncatula and L. japonicus. In addition, we discuss potential activities of the described elements.
Affiliation(s)
- Marek Gadzalski
- Department of General Genetics, Plant Molecular Biology and Biotechnology, University of Lodz, Banacha 12/16, Lodz, Poland.
7
Weedall GD, Hall N. Evolutionary genomics of Entamoeba. Res Microbiol 2011; 162:637-45. [PMID: 21288488 PMCID: PMC3268252 DOI: 10.1016/j.resmic.2011.01.007] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.1] [Received: 12/16/2010] [Accepted: 12/17/2010] [Indexed: 11/06/2022]
Abstract
Entamoeba histolytica is a human pathogen that causes amoebic dysentery and leads to significant morbidity and mortality worldwide. Understanding the genome and evolution of the parasite will help explain how, when and why it causes disease. Here we review current knowledge about the evolutionary genomics of Entamoeba: how differences between the genomes of different species may help explain different phenotypes, and how variation among E. histolytica parasites reveals patterns of population structure. The imminent expansion of the amount of genome data will greatly improve our knowledge of the genus and of pathogenic species within it.
Affiliation(s)
- Gareth D Weedall
- Institute of Integrative Biology, University of Liverpool, Crown Street, Liverpool L69 7ZB, UK.
8
Lorenzi H, Thiagarajan M, Haas B, Wortman J, Hall N, Caler E. Genome wide survey, discovery and evolution of repetitive elements in three Entamoeba species. BMC Genomics 2008; 9:595. [PMID: 19077187 PMCID: PMC2657916 DOI: 10.1186/1471-2164-9-595] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.6] [Received: 08/22/2008] [Accepted: 12/10/2008] [Indexed: 11/14/2022] [Open Access]
Abstract
Background Identification and mapping of repetitive elements is a key step for accurate gene prediction and overall structural annotation of genomes. During the assembly and annotation of three highly repetitive amoeba genomes, Entamoeba histolytica, Entamoeba dispar, and Entamoeba invadens, we performed comparative sequence analysis to identify and map all class I and class II transposable elements in their sequences.
Results Here, we report the identification of two novel Entamoeba-specific repeats: ERE1 and ERE2; ERE1 is spread across the three genomes and associated with different repeats in a species-specific manner, while ERE2 is unique to E. histolytica. We also report the identification of two novel subfamilies of LINE and SINE retrotransposons in E. dispar and provide evidence for how the different LINE and SINE subfamilies evolved in these species. Additionally, we found a putative transposase-coding gene in E. histolytica and E. dispar related to the mariner transposon Hydargos from E. invadens. The distribution of transposable elements in these genomes is markedly skewed with a tendency of forming clusters. More than 70% of the three genomes have a repeat density below their corresponding average value indicating that transposable elements are not evenly distributed. We show that repeats and repeat-clusters are found at syntenic break points between E. histolytica and E. dispar and hence, could work as recombination hot spots promoting genome rearrangements.
Conclusion The mapping of all transposable elements found in these parasites shows that repeat coverage is up to three times higher than previously reported. LINE, ERE1 and mariner elements were present in the common ancestor to the three Entamoeba species while ERE2 was likely acquired by E. histolytica after its separation from E. dispar. We demonstrate that E. histolytica and E. dispar share their entire repertoire of LINE and SINE retrotransposons and that Eh_SINE3/Ed_SINE1 originated as a chimeric SINE from Eh/Ed_SINE2 and Eh_SINE1/Ed_SINE3. Our work shows that transposable elements are organized in clusters, frequently found at syntenic break points providing insights into their contribution to chromosome instability and therefore, to genomic variation and speciation in these parasites.
Affiliation(s)
- Hernan Lorenzi
- J. Craig Venter Institute, 9704 Medical Center Drive, Rockville, MD 20850, USA.
9
Clark CG, Alsmark UCM, Tazreiter M, Saito-Nakano Y, Ali V, Marion S, Weber C, Mukherjee C, Bruchhaus I, Tannich E, Leippe M, Sicheritz-Ponten T, Foster PG, Samuelson J, Noël CJ, Hirt RP, Embley TM, Gilchrist CA, Mann BJ, Singh U, Ackers JP, Bhattacharya S, Bhattacharya A, Lohia A, Guillén N, Duchêne M, Nozaki T, Hall N. Structure and content of the Entamoeba histolytica genome. Adv Parasitol 2008; 65:51-190. [PMID: 18063096 DOI: 10.1016/s0065-308x(07)65002-7] [Citation(s) in RCA: 133] [Impact Index Per Article: 8.3] [Indexed: 01/18/2023]
Abstract
The intestinal parasite Entamoeba histolytica is one of the first protists for which a draft genome sequence has been published. Although the genome is still incomplete, it is unlikely that many genes are missing from the list of those already identified. In this chapter we summarise the features of the genome as they are currently understood and provide previously unpublished analyses of many of the genes.
Affiliation(s)
- C G Clark
- Department of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London WC1E 7HT, UK
10
Trichomonas vaginalis surface proteins: a view from the genome. Trends Parasitol 2007; 23:540-7. [DOI: 10.1016/j.pt.2007.08.020] [Citation(s) in RCA: 52] [Impact Index Per Article: 3.1] [Received: 08/20/2007] [Revised: 08/20/2007] [Accepted: 08/20/2007] [Indexed: 01/22/2023]