1
|
Teixeira SM, de Paiva RMC, Kangussu-Marcolino MM, Darocha WD. Trypanosomatid comparative genomics: Contributions to the study of parasite biology and different parasitic diseases. Genet Mol Biol 2012; 35:1-17. [PMID: 22481868 PMCID: PMC3313497 DOI: 10.1590/s1415-47572012005000008] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2011] [Accepted: 10/18/2011] [Indexed: 01/23/2023] Open
Abstract
In 2005, draft sequences of the genomes of Trypanosoma brucei, Trypanosoma cruzi and Leishmania major, also known as the Tri-Tryp genomes, were published. These protozoan parasites are the causative agents of three distinct insect-borne diseases, namely sleeping sickness, Chagas disease and leishmaniasis, all with a worldwide distribution. Despite the large estimated evolutionary distance among them, a conserved core of ~6,200 trypanosomatid genes was found among the Tri-Tryp genomes. Extensive analysis of these genomic sequences has greatly increased our understanding of the biology of these parasites and their host-parasite interactions. In this article, we review the recent advances in the comparative genomics of these three species. This analysis also includes data on additional sequences derived from other trypanosmatid species, as well as recent data on gene expression and functional genomics. In addition to facilitating the identification of key parasite molecules that may provide a better understanding of these complex diseases, genome studies offer a rich source of new information that can be used to define potential new drug targets and vaccine candidates for controlling these parasitic infections.
Collapse
Affiliation(s)
- Santuza M Teixeira
- Departamento de Bioquímica e Imunologia, Universidade Federal de Minas Gerais, Belo Horizonte, MG, Brazil
| | | | | | | |
Collapse
|
2
|
Quijada L, Soto M, Requena JM. Genomic DNA macroarrays as a tool for analysis of gene expression in Leishmania. Exp Parasitol 2005; 111:64-70. [PMID: 15936759 DOI: 10.1016/j.exppara.2005.04.006] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2005] [Revised: 04/19/2005] [Accepted: 04/27/2005] [Indexed: 11/15/2022]
Abstract
Gene-array technologies have been applied in a wide number of organisms to study gene expression profiling under several physiological and experimental conditions. Gene-array implementations combined with the information arising from emerging genome sequencing projects are expected to be in the near future a major tool to characterize genes involved in different processes. So far, gene expression profile studies in trypanosomatids have been performed in microarrays that use a glass support to immobilize fragments of genomic DNA followed by fluorescent detection. Here, we wanted to test the potential of genomic DNA macroarrays of Leishmania infantum using nylon membranes and radioactive detection. Nylon macroarrays present a number of advantages since the processing of the membranes is based on standard Southern blotting protocols familiar to molecular biologists, and the data acquisition equipment is available to most research institutions. Nylon macroarrays were employed to search for genes showing increased mRNA abundance during an axenic differentiation of L. infantum promastigotes to amastigotes. Several clones were rescued and, after validation by Northern blot assays, these L. infantum sequences were used to screen the Leishmania major gene database. The L. major contigs with high homology to the L. infantum sequences allowed a consistent identification of the regulated genes.
Collapse
Affiliation(s)
- Luis Quijada
- Centro de Biología Molecular Severo Ochoa, Lab CV-401, Universidad Autónoma de Madrid, Cantoblanco, E-28049 Madrid, Spain.
| | | | | |
Collapse
|
3
|
Guerreiro LTA, Souza SS, Wagner G, De Souza EA, Mendes PN, Campos LM, Barros L, Pires PF, Campos MLM, Grisard EC, Dávila AMR. Exploring the Genome of Trypanosoma vivax through GSS and In Silico Comparative Analysis. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2005; 9:116-28. [PMID: 15805782 DOI: 10.1089/omi.2005.9.116] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
A survey of the Trypanosoma vivax genome was carried out by the genome sequence survey (GSS) approach resulting in 1,086 genomic sequences. A total of 455 high-quality GSS sequences were generated, consisting of 331 non-redundant sequences distributed in 264 singlets and 67 clusters in a total of 135.5 Kb of the T. vivax genome. The estimation of the overall G+C content, and the prediction of the presence of ORFs and putative genes were carried out using the Glimmer and Jemboss packages. Analysis of the obtained sequences was carried out by BLAST programs against 12 different databases and also using the Conserved Domain Database, InterProScan, and tRNAscan-SE. Along with the existing 23 T. vivax entries in the GenBank, the 32 putative genes predicted and the 331 non-redundant GSS sequences reported herein represent new potential markers for the development of PCRbased assays for specific diagnosis and typing of Trypanosoma vivax.
Collapse
Affiliation(s)
- Luana Tatiana A Guerreiro
- Departamento de Bioquímica e Biologia Molecular, Instituto Oswaldo Cruz/Fundação Oswaldo Cruz, Rio de Janeiro, Brazil
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
4
|
Laurentino EC, Ruiz JC, Fazelinia G, Myler PJ, Degrave W, Alves-Ferreira M, Ribeiro JMC, Cruz AK. A survey of Leishmania braziliensis genome by shotgun sequencing. Mol Biochem Parasitol 2005; 137:81-6. [PMID: 15279954 DOI: 10.1016/j.molbiopara.2004.05.001] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2004] [Revised: 05/12/2004] [Accepted: 05/14/2004] [Indexed: 11/23/2022]
Abstract
We have carried out a survey of the genome of Leishmania (Viannia) braziliensis by shotgun sequencing. Approximately 15% of the haploid genome of the parasite (5.15 Mb of genomic sequence) was obtained. A large number of known and putative genes, predicted to be involved in several cellular processes, were identified. Some genomic features were investigated, such as the general G + C content, which was found to be lower than L. major (57% versus 63%). BlastN searches revealed that 60.2% of the clusterized GSS sequences displayed similarity to L. major genomic sequences, while a BlastX search showed that 45.3% of the thus obtained predicted protein sequences showed similarity to annotated proteins of L. major. Further comparison of the degree of conservation between L. major and L. braziliensis revealed that coding regions are much more conserved than non-coding ones. The shotgun sequence analysis of Leishmania braziliensis appears to be an efficient and suitable strategy contributing to the search for vaccines and novel drug targets. The sequence data described in this paper have been submitted to the dbGSS database under the following accession numbers (BX530413 to BX530454; BX530456 to BX530718; BX538354 to BX539305; BX539350 to BX540325; BX541002 to BX544869; BX544893 to BX545685; BX897701 to BX897710; BX905184 to BX907797; BX907798 to BX908381; BX908403 to BX908718). All data including sequences are also available at (www.ebi.ac.uk/embl/).
Collapse
Affiliation(s)
- Eliane C Laurentino
- Departamento de Biologia Celular e Molecular e Bioagentes Patogênicos, Faculdade de Medicina de Ribeirão Preto, Universidade de São Paulo, Av. Bandeirantes, 3900, 14049-900, Brasil
| | | | | | | | | | | | | | | |
Collapse
|
5
|
Whitton C, Daub J, Quail M, Hall N, Foster J, Ware J, Ganatra M, Slatko B, Barrell B, Blaxter M. A genome sequence survey of the filarial nematode Brugia malayi: repeats, gene discovery, and comparative genomics. Mol Biochem Parasitol 2004; 137:215-27. [PMID: 15383292 DOI: 10.1016/j.molbiopara.2004.05.013] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2004] [Revised: 04/01/2004] [Accepted: 05/07/2004] [Indexed: 11/18/2022]
Abstract
Comparative nematode genomics has thus far been largely constrained to the genus Caenorhabditis, but a huge diversity of other nematode species, and genomes, exist. The Brugia malayi genome is approximately 100 Mb in size, and distributed across five chromosome pairs. Previous genomic investigations have included definition of major repeat classes and sequencing of selected genes. We have generated over 18,000 sequences from the ends of large-insert clones from bacterial artificial chromosome libraries. These end sequences, totalling over 10 Mb of sequence, contain just under 8 Mb of unique sequence. We identified the known Mbo I and Hha I repeat families in the sequence data, and also identified several new repeats based on their abundance. Genomic copies of 17% of B. malayi genes defined by expressed sequence tags have been identified. Nearly one quarter of end sequences can encode peptides with significant similarity to protein sequences in the public databases, and we estimate that we have identified more than 2700 new B. malayi genes. Importantly, 459 end sequences had homologues in other organisms, but lacked a match in the completely sequenced genomes of Caenorhabditis briggsae and Caenorhabditis elegans, emphasising the role of gene loss in genome evolution. B. malayi is estimated to have over 18,500 protein-coding genes.
Collapse
Affiliation(s)
- Claire Whitton
- Ashworth Laboratories, Institution of Cell, Animal and Polulation Biology, School of Biological Sciences, University of Edinburgh, King's Buildings, Edinburgh EH9 3JT, UK
| | | | | | | | | | | | | | | | | | | |
Collapse
|
6
|
Dávila AMR, Lukeš J. Towards a framework for the evolutionary genomics of Kinetoplastids: what kind of data and how much? KINETOPLASTID BIOLOGY AND DISEASE 2003; 2:16. [PMID: 14613494 PMCID: PMC280664 DOI: 10.1186/1475-9292-2-16] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 07/30/2003] [Accepted: 10/28/2003] [Indexed: 11/10/2022]
Abstract
The current status of kinetoplastids phylogeny and evolution is discussed in view of the recent progresses on genomics. Some ideas on a potential framework for the evolutionary genomics of kinetoplastids are presented.
Collapse
Affiliation(s)
- Alberto MR Dávila
- Lab. de Biologia Molecular de Tripanosomatídeos, Departamento de Bioquímica e Biologia Molecular, Instituto Oswaldo Cruz, Fiocruz. Av. Brasil 4365, Rio de Janeiro, RJ, Brasil. 21045-900
| | - Julius Lukeš
- Institute of Parasitology, Czech Academy of Sciences and Faculty of Biology, University of South Bohemia, Branišovská 31, 37005 Ceske Budejovice, Czech Republic
| |
Collapse
|
7
|
LaCount DJ, Gruszynski AE, Grandgenett PM, Bangs JD, Donelson JE. Expression and function of the Trypanosoma brucei major surface protease (GP63) genes. J Biol Chem 2003; 278:24658-64. [PMID: 12707278 DOI: 10.1074/jbc.m301451200] [Citation(s) in RCA: 60] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
The genome of the African trypanosome Trypanosoma brucei (Tb) contains at least three gene families (TbMSP-A, -B, and -C) encoding homologues of the abundant major surface protease (MSP, previously called GP63), which is found in all Leishmania species. TbMSP-B mRNA occurs in both procyclic and bloodstream trypanosomes, whereas TbMSP-A and -C mRNAs are detected only in bloodstream organisms. RNA interference (RNAi)-mediated gene silencing was used to investigate the function of TbMSP-B protein. RNAi directed against TbMSP-B but not TbMSP-A ablated the steady state TbMSP-B mRNA levels in both procyclic and bloodstream cells but had no effect on the kinetics of cultured trypanosome growth in either stage. Procyclic trypanosomes have been shown previously to have an uncharacterized cell surface metalloprotease activity that can release ectopically expressed surface proteins. To determine whether TbMSP-B is responsible for this release, transgenic variant surface glycoprotein 117 (VSG117) was expressed constitutively in T. brucei procyclic TbMSP-RNAi cell lines, and the amount of surface VSG117 was determined using a surface biotinylation assay. Ablation of TbMSP-B but not TbMSP-A mRNA resulted in a marked decrease in VSG release with a concomitant increase in steady state cell-associated VSG117, indicating that TbMSP-B mediates the surface protease activity of procyclic trypanosomes. This finding is consistent with previous pharmacological studies showing that peptidomimetic collagenase inhibitors block release of transgenic VSG from procyclic trypanosomes and are toxic for bloodstream but not procyclic organisms.
Collapse
Affiliation(s)
- Douglas J LaCount
- Department of Biochemistry, University of Iowa, Iowa City 52242, USA
| | | | | | | | | |
Collapse
|
8
|
Agbo EC, Majiwa PAO, Büscher P, Claassen E, te Pas MFW. Trypanosoma brucei genomics and the challenge of identifying drug and vaccine targets. Trends Microbiol 2003; 11:322-9. [PMID: 12875816 DOI: 10.1016/s0966-842x(03)00151-3] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Affiliation(s)
- Eddy C Agbo
- Division of Animal Sciences, Institute for Animal Science and Health (ID-Lelystad), Edelhertweg 15, 8200 AB Lelystad, The Netherlands.
| | | | | | | | | |
Collapse
|
9
|
Ramos MA, Sanchez-Lopez R, Olvera F, Alagón A. Entamoeba histolytica genomic organization: identification, structure, and phylogenetic relationship of two serine-threonine protein kinases. Exp Parasitol 2002; 100:135-9. [PMID: 12054704 DOI: 10.1016/s0014-4894(02)00011-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Affiliation(s)
- Marco A Ramos
- Instituto de Biotecnología, Universidad Nacional Autónoma de México. Ave., Universidad 2001, Cuernavaca, Morelos 62210, Mexico
| | | | | | | |
Collapse
|
10
|
Zeng C, Kouprina N, Zhu B, Cairo A, Hoek M, Cross G, Osoegawa K, Larionov V, de Jong P. Large-insert BAC/YAC libraries for selective re-isolation of genomic regions by homologous recombination in yeast. Genomics 2001; 77:27-34. [PMID: 11543629 DOI: 10.1006/geno.2001.6616] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
We constructed representative large-insert bacterial artificial chromosome (BAC) libraries of two human pathogens (Trypanosoma brucei and Giardia lamblia) using a new hybrid vector, pTARBAC1, containing a yeast artificial chromosome (YAC) cassette (a yeast selectable marker and a centromere). The cassette allows transferring of BACs into yeast for their further modification. Furthermore, the new hybrid vector provides the opportunity to re-isolate each DNA insert without construction of a new library of random clones. Digestion of a BAC DNA by an endonuclease that has no recognition site in the vector, but which deletes most of the internal insert sequence and leaves the unique flanking sequences, converts a BAC into a TAR vector, thus allowing direct gene isolation. Cotransformation of a TAR vector and genomic DNA into yeast spheroplasts, and subsequent recombination between the TAR vector's flanking ends and a specific genomic fragment, allows rescue of the fragment as a circular YAC/BAC molecule. Here we prove a new cloning strategy by re-isolation of randomly chosen genomic fragments of different size from T. brucei cloned in BACs. We conclude that genomic regions of unicellular eukaryotes can be easily re-isolated using this technique, which provides an opportunity to study evolution of these genomes and the role of genome instability in pathogenicity.
Collapse
MESH Headings
- Animals
- Chromosomes, Artificial, Bacterial/genetics
- Chromosomes, Artificial, Yeast/genetics
- Cloning, Molecular/methods
- DNA Fingerprinting
- DNA, Protozoan/chemistry
- DNA, Protozoan/genetics
- Escherichia coli/genetics
- Gene Library
- Genetic Vectors/genetics
- Genome, Protozoan
- Molecular Sequence Data
- Recombination, Genetic
- Saccharomyces cerevisiae/genetics
- Sequence Analysis, DNA
- Trypanosoma brucei brucei/genetics
Collapse
Affiliation(s)
- C Zeng
- Department of Cancer Genetics, Roswell Park Cancer Institute, Buffalo, New York 14263, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
11
|
Akopyants NS, Clifton SW, Martin J, Pape D, Wylie T, Li L, Kissinger JC, Roos DS, Beverley SM. A survey of the Leishmania major Friedlin strain V1 genome by shotgun sequencing: a resource for DNA microarrays and expression profiling. Mol Biochem Parasitol 2001; 113:337-40. [PMID: 11295190 DOI: 10.1016/s0166-6851(01)00227-4] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Affiliation(s)
- N S Akopyants
- Department of Molecular Microbiology, Washington University School of Medicine, St. Louis MO, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
12
|
Janssen CS, Barrett MP, Lawson D, Quail MA, Harris D, Bowman S, Phillips RS, Turner CM. Gene discovery in Plasmodium chabaudi by genome survey sequencing. Mol Biochem Parasitol 2001; 113:251-60. [PMID: 11295179 DOI: 10.1016/s0166-6851(01)00224-9] [Citation(s) in RCA: 21] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
The first genome survey sequencing of the rodent malaria parasite Plasmodium chabaudi is presented here. In 766 sequences, 131 putative gene sequences have been identified by sequence similarity database searches. Further, 7 potential gene families, four of which have not previously been described, were discovered. These genes may be important in understanding the biology of malaria, as well as offering potential new drug targets. We have also identified a number of candidate minisatellite sequences that could be helpful in genetic studies. Genome survey sequencing in P. chabaudi is a productive strategy in further developing this in vivo model of malaria, in the context of the malaria genome projects.
Collapse
Affiliation(s)
- C S Janssen
- Division of Infection & Immunity, IBLS, University of Glasgow, G12 8QQ, Glasgow, UK.
| | | | | | | | | | | | | | | |
Collapse
|
13
|
Agüero F, Verdún RE, Frasch AC, Sánchez DO. A random sequencing approach for the analysis of the Trypanosoma cruzi genome: general structure, large gene and repetitive DNA families, and gene discovery. Genome Res 2000; 10:1996-2005. [PMID: 11116094 PMCID: PMC313047 DOI: 10.1101/gr.gr-1463r] [Citation(s) in RCA: 46] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2000] [Accepted: 09/20/2000] [Indexed: 11/24/2022]
Abstract
A random sequence survey of the genome of Trypanosoma cruzi, the agent of Chagas disease, was performed and 11,459 genomic sequences were obtained, resulting in approximately 4.3 Mb of readable sequences or approximately 10% of the parasite haploid genome. The estimated total GC content was 50.9%, with a high representation of A and T di- and trinucleotide repeats. Out of the estimated 5000 parasite genes, 947 putative new genes were identified. Another 1723 sequences corresponded to genes detected previously in T. cruzi through expression sequence tag analysis. 7735 sequences had no matches in the database, but the presence of open reading frames that passed Fickett's test suggests that some might contain coding DNA. The survey was highly redundant, with approximately 35% of the sequences included in a few large sequence families. Some of them code for protein families present in dozens of copies, including proteins essential for parasite survival and retrotransposons. Other sequence families include repetitive DNA present in thousands of copies per haploid genome. Some families in the latter group are new, parasite-specific, repetitive DNAs. These results suggest that T. cruzi could constitute an interesting model to analyze gene and genome evolution due to its plasticity in terms of sequence amplification and divergence. Additional information can be found at http://www.iib.unsam.edu.ar/tcruzi.gss. html.
Collapse
Affiliation(s)
- F Agüero
- Instituto de Investigaciones Biotecnológicas, Instituto Tecnológico de Chascomús, Universidad Nacional de General San Martín, Consejo Nacional de Investigaciones Científicas y Técnicas, San Martín, Provincia de Buenos Aires, Argentina
| | | | | | | |
Collapse
|
14
|
Ingram AK, Cross GA, Horn D. Genetic manipulation indicates that ARD1 is an essential N(alpha)-acetyltransferase in Trypanosoma brucei. Mol Biochem Parasitol 2000; 111:309-17. [PMID: 11163439 DOI: 10.1016/s0166-6851(00)00322-4] [Citation(s) in RCA: 50] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
Abstract
N(alpha)-acetylation, the most common protein modification, involves the transfer of an acetyl group from acetyl-coenzyme A to the N-terminus of a protein or peptide. The major N(alpha)-acetyltransferase in Saccharomyces cerevisiae is the ARDI-NATI complex. To investigate N(alpha) -acetylation in Trypanosoma brucei we have cloned and characterised genes encoding putative homologues of ARD1 and NAT1. Both genes are single copy and ARD1, the putative catalytic component, is expressed in both bloodstream-form and insect-stage cells. In either of these life-cycle stages, disruption of both ARD1 alleles was only possible when another copy was generated via gene duplication or when ARD1 was expressed from elsewhere in the genome. These genetic manipulations demonstrate that, unlike the situation in S. cerevisiae, ARD1 is an essential gene in T. brucei. We propose that protein modification by ARD1 is essential for viability in mammalian and insect-stage T. brucei cells.
Collapse
Affiliation(s)
- A K Ingram
- London School of Hygiene and Tropical Medicine, UK
| | | | | |
Collapse
|
15
|
Agüero F, Verdún RE, Frasch ACC, Sánchez DO. A Random Sequencing Approach for the Analysis of the Trypanosoma cruzi Genome: General Structure, Large Gene and Repetitive DNA Families, and Gene Discovery. Genome Res 2000. [DOI: 10.1101/gr.146300] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
A random sequence survey of the genome of Trypanosoma cruzi, the agent of Chagas disease, was performed and 11,459 genomic sequences were obtained, resulting in ∼4.3 Mb of readable sequences or ∼10% of the parasite haploid genome. The estimated total GC content was 50.9%, with a high representation of A and T di- and trinucleotide repeats. Out of the estimated 5000 parasite genes, 947 putative new genes were identified. Another 1723 sequences corresponded to genes detected previously in T. cruzi through expression sequence tag analysis. 7735 sequences had no matches in the database, but the presence of open reading frames that passed Fickett's test suggests that some might contain coding DNA. The survey was highly redundant, with ∼35% of the sequences included in a few large sequence families. Some of them code for protein families present in dozens of copies, including proteins essential for parasite survival and retrotransposons. Other sequence families include repetitive DNA present in thousands of copies per haploid genome. Some families in the latter group are new, parasite-specific, repetitive DNAs. These results suggest that T. cruzi could constitute an interesting model to analyze gene and genome evolution due to its plasticity in terms of sequence amplification and divergence. Additional information can be found at http://www.iib.unsam.edu.ar/tcruzi.gss.html.[The sequence data described in this paper have been submitted to the dbGSS database under the following GenBank accession nos.:AQ443439–AQ443513, AQ443743–AQ445667, AQ902981–AQ911366,AZ049857–AZ051184, and AZ302116–AZ302563.]
Collapse
|
16
|
Abstract
The haploid nuclear genome of the African trypanosome, Trypanosoma brucei, is about 35 Mb and varies in size among different trypanosome isolates by as much as 25%. The nuclear DNA of this diploid organism is distributed among three size classes of chromosomes: the megabase chromosomes of which there are at least 11 pairs ranging from 1 Mb to more than 6 Mb (numbered I-XI from smallest to largest); several intermediate chromosomes of 200-900 kb and uncertain ploidy; and about 100 linear minichromosomes of 50-150 kb. Size differences of as much as four-fold can occur, both between the two homologues of a megabase chromosome pair in a specific trypanosome isolate and among chromosome pairs in different isolates. The genomic DNA sequences determined to date indicated that about 50% of the genome is coding sequence. The chromosomal telomeres possess TTAGGG repeats and many, if not all, of the telomeres of the megabase and intermediate chromosomes are linked to expression sites for genes encoding variant surface glycoproteins (VSGs). The minichromosomes serve as repositories for VSG genes since some but not all of their telomeres are linked to unexpressed VSG genes. A gene discovery program, based on sequencing the ends of cloned genomic DNA fragments, has generated more than 20 Mb of discontinuous single-pass genomic sequence data during the past year, and the complete sequences of chromosomes I and II (about 1 Mb each) in T. brucei GUTat 10.1 are currently being determined. It is anticipated that the entire genomic sequence of this organism will be known in a few years. Analysis of a test microarray of 400 cDNAs and small random genomic DNA fragments probed with RNAs from two developmental stages of T. brucei demonstrates that the microarray technology can be used to identify batteries of genes differentially expressed during the various life cycle stages of this parasite.
Collapse
Affiliation(s)
- N M El-Sayed
- The Institute for Genomic Research (TIGR), 9712 Medical Center Drive, Rockville, MD 20850, USA.
| | | | | | | | | |
Collapse
|
17
|
Králová I, Rigden DJ, Opperdoes FR, Michels PA. Glycerol kinase of Trypanosoma brucei. Cloning, molecular characterization and mutagenesis. EUROPEAN JOURNAL OF BIOCHEMISTRY 2000; 267:2323-33. [PMID: 10759857 DOI: 10.1046/j.1432-1327.2000.01238.x] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
Trypanosoma brucei contains two tandemly arranged genes for glycerol kinase. The downstream gene was analysed in detail. It contains an ORF for a polypeptide of 512 amino acids. The polypeptide has a calculated molecular mass of 56 363 Da and a pI of 8.6. Comparison of the T. brucei glycerol kinase amino-acid sequence with the glycerol kinase sequences available in databases revealed positional identities of 39.0-50.4%. The T. brucei glycerol kinase gene was overexpressed in Escherichia coli cells and the recombinant protein obtained was purified and characterized biochemically. Its kinetic properties with regard to both the forward and reverse reaction were measured. The values corresponded to those determined previously for the natural glycerol kinase purified from the parasite, and confirmed that the apparent Km values of the trypanosome enzyme for its substrates are relatively high compared with those of other glycerol kinases. Alignment of the amino-acid sequences of T. brucei glycerol kinase and other eukaryotic and prokaryotic glycerol kinases, as well as inspection of the available three-dimensional structure of E. coli glycerol kinase showed that most residues of the magnesium-, glycerol- and ADP-binding sites are well conserved in T. brucei glycerol kinase. However, a number of remarkable substitutions was identified, which could be responsible for the low affinity for the substrates. Most striking is amino-acid Ala137 in T. brucei glycerol kinase; in all other organisms a serine is present at the corresponding position. We mutated Ala137 of T. brucei glycerol kinase into a serine and this mutant glycerol kinase was over-expressed and purified. The affinity of the mutant enzyme for its substrates glycerol and glycerol 3-phosphate appeared to be 3. 1-fold to 3.6-fold higher than in the wild-type enzyme. Part of the glycerol kinase gene comprising this residue 137 was amplified in eight different kinetoplastid species and sequenced. Interestingly, an alanine occurs not only in T. brucei, but also in other trypanosomatids which can convert glucose into equimolar amounts of glycerol and pyruvate: T. gambiense, T. equiperdum and T. evansi. In trypanosomatids with no or only a limited capacity to produce glycerol, a hydroxy group-containing residue is found as in all other organisms: T. vivax and T. congolense possess a serine while Phytomonas sp., Leishmania brasiliensis and L. mexicana have a threonine.
Collapse
Affiliation(s)
- I Králová
- Research Unit for Tropical Diseases, Christian de Duve Institute of Cellular Pathology and Laboratory of Biochemistry, Université Catholique de Louvain, Brussels, Belgium
| | | | | | | |
Collapse
|
18
|
Abstract
Cryptosporidium parvum is an obligate intracellular pathogen responsible for widespread infections in humans and animals. The inability to obtain purified samples of this organism's various developmental stages has limited the understanding of the biochemical mechanisms important for C. parvum development or host-parasite interaction. To identify C. parvum genes independent of their developmental expression, a random sequence analysis of the 10.4-megabase genome of C. parvum was undertaken. Total genomic DNA was sheared by nebulization, and fragments between 800 and 1,500 bp were gel purified and cloned into a plasmid vector. A total of 442 clones were randomly selected and subjected to automated sequencing by using one or two primers flanking the cloning site. In this way, 654 genomic survey sequences (GSSs) were generated, corresponding to >320 kb of genomic sequence. These sequences were assembled into 408 contigs containing >250 kb of unique sequence, representing approximately 2.5% of the C. parvum genome. Comparison of the GSSs with sequences in the public DNA and protein databases revealed that 107 contigs (26%) displayed similarity to previously identified proteins and rRNA and tRNA genes. These included putative genes involved in the glycolytic pathway, DNA, RNA, and protein metabolism, and signal transduction pathways. The repetitive sequence elements identified included a telomere-like sequence containing hexamer repeats, 57 microsatellite-like elements composed of dinucleotide or trinucleotide repeats, and a direct repeat sequence. This study demonstrates that large-scale genomic sequencing is an efficient approach to analyze the organizational characteristics and information content of the C. parvum genome.
Collapse
Affiliation(s)
- C Liu
- Department of Veterinary PathoBiology, University of Minnesota, St. Paul, Minnesota, USA
| | | | | | | |
Collapse
|
19
|
Abstract
Histidyl-tRNA synthetase (HisRS) is responsible for the synthesis of histidyl-transfer RNA, which is essential for the incorporation of histidine into proteins. This amino acid has uniquely moderate basic properties and is an important group in many catalytic functions of enzymes. A compilation of currently known primary structures of HisRS shows that the subunits of these homo-dimeric enzymes consist of 420-550 amino acid residues. This represents a relatively short chain length among aminoacyl-tRNA synthetases (aaRS), whose peptide chain sizes range from about 300 to 1100 amino acid residues. The crystal structures of HisRS from two organisms and their complexes with histidine, histidyl-adenylate and histidinol with ATP have been solved. HisRS from Escherichia coli and Thermus thermophilus are very similar dimeric enzymes consisting of three domains: the N-terminal catalytic domain containing the six-stranded antiparallel beta-sheet and the three motifs characteristic of class II aaRS, a HisRS-specific helical domain inserted between motifs 2 and 3 that may contact the acceptor stem of the tRNA, and a C-terminal alpha/beta domain that may be involved in the recognition of the anticodon stem and loop of tRNA(His). The aminoacylation reaction follows the standard two-step mechanism. HisRS also belongs to the group of aaRS that can rapidly synthesize diadenosine tetraphosphate, a compound that is suspected to be involved in several regulatory mechanisms of cell metabolism. Many analogs of histidine have been tested for their properties as substrates or inhibitors of HisRS, leading to the elucidation of structure-activity relationships concerning configuration, importance of the carboxy and amino group, and the nature of the side chain. HisRS has been found to act as a particularly important antigen in autoimmune diseases such as rheumatic arthritis or myositis. Successful attempts have been made to identify epitopes responsible for the complexation with such auto-antibodies.
Collapse
Affiliation(s)
- W Freist
- Max-Planck-Institut für experimentelle Medizin, Abteilung Molekulare Biologie Neuronaler Signale, Göttingen, Germany
| | | | | | | | | |
Collapse
|
20
|
Abstract
The Leishmania Genome Network (LGN) was born in Rio de Janeiro, Brazil in 1994. In the short period that has elapsed since then, the LGN has focused solely on the acquisition of the resources, and hence data, that have enabled a rational approach to genomic sequencing of the reference strain, Leishmania major Friedlin. This has now been achieved. In this review, Alasdair Ivens and Jennie Blackwell, secretary and chairman of the LGN, respectively, re-examine the approaches that were adopted, comment on some of the interesting data that have been obtained and introduce some genome-wide approaches that will facilitate functional studies of the parasite.
Collapse
Affiliation(s)
- A C Ivens
- The Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge, UK.
| | | |
Collapse
|
21
|
Abstract
Despite the advances of modern medicine, the threat of chronic illness, disfigurement, or death that can result from parasitic infection still affects the majority of the world population, retarding economic development. For most parasitic diseases, current therapeutics often leave much to be desired in terms of administration regime, toxicity, or effectiveness and potential vaccines are a long way from market. Our best prospects for identifying new targets for drug, vaccine, and diagnostics development and for dissecting the biological basis of drug resistance, antigenic diversity, infectivity and pathology lie in parasite genome analysis, and international mapping and gene discovery initiatives are under way for a variety of protozoan and helminth parasites. These are far from ideal experimental organisms, and the influence of biological and genomic characteristics on experimental approaches is discussed, progress is reviewed and future prospects are examined.
Collapse
Affiliation(s)
- D A Johnston
- Department of Zoology, Natural History Museum, London, United Kingdom
| | | | | | | | | | | |
Collapse
|
22
|
Abstract
African trypanosomes combine antigenic variation of their surface coat with the ability to take up nutrients from their mammalian hosts. Uptake of small molecules such as glucose or nucleosides is mediated by translocators hidden from host antibodies by the surface coat. The multiple glucose transporters and transporters for nucleobases and nucleosides have been characterized. Receptors for host macromolecules such as transferrin and lipoproteins are visible to antibodies but hidden from the cellular arm of the host immune system in an invagination of the trypanosome surface, the flagellar pocket. The trypanosomal transferrin receptor is a heterodimer that resembles the major component of the surface coat of Trypanosoma brucei. The ability to make several versions of this receptor allows T. brucei to bind transferrins from a range of mammals with high affinity. The proteins required for uptake of nutrients by trypanosomes provide a target for chemotherapy that remains to be fully exploited.
Collapse
Affiliation(s)
- P Borst
- The Netherlands Cancer Institute, Division of Molecular Biology, Amsterdam, The Netherlands
| | | |
Collapse
|
23
|
Verdun RE, Di Paolo N, Urmenyi TP, Rondinelli E, Frasch AC, Sanchez DO. Gene discovery through expressed sequence Tag sequencing in Trypanosoma cruzi. Infect Immun 1998; 66:5393-8. [PMID: 9784549 PMCID: PMC108675 DOI: 10.1128/iai.66.11.5393-5398.1998] [Citation(s) in RCA: 51] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/1998] [Accepted: 08/10/1998] [Indexed: 11/20/2022] Open
Abstract
Analysis of expressed sequence tags (ESTs) constitutes a useful approach for gene identification that, in the case of human pathogens, might result in the identification of new targets for chemotherapy and vaccine development. As part of the Trypanosoma cruzi genome project, we have partially sequenced the 5' ends of 1, 949 clones to generate ESTs. The clones were randomly selected from a normalized CL Brener epimastigote cDNA library. A total of 14.6% of the clones were homologous to previously identified T. cruzi genes, while 18.4% had significant matches to genes from other organisms in the database. A total of 67% of the ESTs had no matches in the database, and thus, some of them might be T. cruzi-specific genes. Functional groups of those sequences with matches in the database were constructed according to their putative biological functions. The two largest categories were protein synthesis (23.3%) and cell surface molecules (10.8%). The information reported in this paper should be useful for researchers in the field to analyze genes and proteins of their own interest.
Collapse
Affiliation(s)
- R E Verdun
- Instituto de Investigaciones Biotecnológicas, Universidad Nacional de General San Martín, Buenos Aires, Argentina
| | | | | | | | | | | |
Collapse
|
24
|
Djikeng A, Agufa C, Donelson JE, Majiwa PA. Generation of expressed sequence tags as physical landmarks in the genome of Trypanosoma brucei. Gene 1998; 221:93-106. [PMID: 9852954 DOI: 10.1016/s0378-1119(98)00427-2] [Citation(s) in RCA: 20] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Previous molecular genetic studies on the African trypanosome have focused on only a few genes and gene products, the majority of which are concerned with surface antigenic variation; consequently, an insignificant number of the genes of this organism have been characterized to date. In order to: (1) identify new genes and analyze their expression profile, (2) generate expressed sequence tags (ESTs) for derivation of a physical map of the trypanosome genome, and (3) make available the partial sequence information and the corresponding clones for general biomedical research on the parasite, we have performed single-pass sequencing of random, directionally cloned cDNAs from a bloodstream form Trypanosoma brucei rhodesiense library. Analysis of 2128 such ESTs sequenced so far in this study showed significant similarities [BLASTX P(n)-value < 10(-4), and a match > 10 amino acid residues] with proteins whose genes have been described in diverse organisms including man, rodents, kinetoplastids, yeasts and plants. A number of the ESTs encode homologues of proteins involved in various functions including signal reception and transduction, cell division, gene regulation, DNA repair and replication, general metabolism, and structural integrity. Although some of these genes may have been expected to be present in the African trypanosomes, the majority of them had not previously been described in these organisms. A large proportion, 768 individual ESTs (36%, representing 385 different transcripts), had a significant homology with genes described in organisms other than the African trypanosomes; however, 15% of the ESTs were from genes already described in trypanosomes. Among the ESTs analysed were 462 distinct known genes, only 77 of which have been described in T. brucei. Approximately 52% of the ESTs did not show any significant homology with the sequences in any of the public domain databases.
Collapse
Affiliation(s)
- A Djikeng
- International Livestock Research Institute (ILRI), Nairobi, Kenya
| | | | | | | |
Collapse
|
25
|
|
26
|
Mathieu-Daudé F, Welsh J, Davis C, McClelland M. Differentially expressed genes in the Trypanosoma brucei life cycle identified by RNA fingerprinting. Mol Biochem Parasitol 1998; 92:15-28. [PMID: 9574906 DOI: 10.1016/s0166-6851(97)00221-1] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
RNA fingerprinting by arbitrarily primed polymerase chain reaction (RAP-PCR) was used to identify genes that were differentially expressed during the life cycle of Trypanosoma brucei, as well as in response to heat shock. The standard RAP-PCR protocol was varied in two ways. First, the PCR reactions sometimes included a primer derived from the 5' mini-exon sequence, to ensure that most of the products contained the 5' end of mRNAs. Second, differentially amplified products were reamplified, isolated on single strand conformation polymorphism (SSCP) gels, cloned, and sequenced. Clones representing 32 different expressed sequence tags (ESTs) were obtained. Twenty-four ESTs were confirmed as differentially expressed by RT-PCR between different stages of the parasite cycle, or in response to temperature elevation. Nine clones had significant similarities to sequences already in the database. These transcripts included genes encoding cell surface proteins, metabolic enzymes, and heat shock proteins, either from trypanosomes or other organisms. Of particular interest, ESAG1 was shown to be heat-inducible in the procyclic stage. Most of the transcripts were unrelated to any other sequences in the database, and were deposited as new ESTs. The identification of stage-specific and heat shock-regulated transcripts will complement the growing T. brucei database. In addition, this experimental approach allows previous entries in the sequence database to be annotated with regulatory information.
Collapse
|
27
|
El-Sayed NM, Donelson JE. African trypanosomes have differentially expressed genes encoding homologues of the Leishmania GP63 surface protease. J Biol Chem 1997; 272:26742-8. [PMID: 9334260 DOI: 10.1074/jbc.272.42.26742] [Citation(s) in RCA: 45] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
Abstract
The genomes of various Leishmania parasites contain tandemly arrayed genes encoding an abundant 63-kDa surface glycoprotein called GP63. Leishmania GP63s are metalloproteases that play an important role in the invasion and survival of the parasites within the macrophage, and their presence on the Leishmania surface has been correlated with resistance to complement-mediated lysis. Here we report the identification of GP63-like genes in African trypanosomes. The predicted trypanosome and Leishmania GP63s share a metalloprotease catalytic site motif of HEXXH as well as 19 cysteines and 10 prolines, implying a conservation of enzymatic activity and secondary/tertiary structure. The trypanosome GP63 genes are transcribed equally in procyclic and bloodstream trypanosomes, but their mRNAs accumulate to a 50-fold higher steady state level in bloodstream trypanosomes, where the ratio of mRNAs for GP63 and variant surface glycoprotein is about 1:150. Transcription of the GP63 genes is sensitive to alpha-amanitin, indicating that they are transcribed by a different polymerase than the variant surface glycoprotein genes. These results lead to a reconsideration of the potential functions of GP63, inasmuch as African trypanosomes are not known to interact with macrophages and do not have an intracellular stage during their life cycle.
Collapse
Affiliation(s)
- N M El-Sayed
- Department of Biochemistry, University of Iowa, Iowa City, Iowa 52242, USA
| | | |
Collapse
|