1
|
Chakrabarty B, Parekh N. DbStRiPs: Database of structural repeats in proteins. Protein Sci 2022; 31:23-36. [PMID: 33641184 PMCID: PMC8740836 DOI: 10.1002/pro.4052] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2020] [Revised: 02/11/2021] [Accepted: 02/15/2021] [Indexed: 01/03/2023]
Abstract
Recent interest in repeat proteins has arisen due to stable structural folds, high evolutionary conservation and repertoire of functions provided by these proteins. However, repeat proteins are poorly characterized because of high sequence variation between repeating units and structure-based identification and classification of repeats is desirable. Using a robust network-based pipeline, manual curation and Kajava's structure-based classification schema, we have developed a database of tandem structural repeats, Database of Structural Repeats in Proteins (DbStRiPs). A unique feature of this database is that available knowledge on sequence repeat families is incorporated by mapping Pfam classification scheme onto structural classification. Integration of sequence and structure-based classifications help in identifying different functional groups within the same structural subclass, leading to refinement in the annotation of repeat proteins. Analysis of complete Protein Data Bank revealed 16,472 repeat annotations in 15,141 protein chains, one previously uncharacterized novel protein repeat family (PRF), named left-handed beta helix, and 33 protein repeat clusters (PRCs). Based on their unique structural motif, ~79% of these repeat proteins are classified in one of the 14 PRFs or 33 PRCs, and the remaining are grouped as unclassified repeat proteins. Each repeat protein is provided with a detailed annotation in DbStRiPs that includes start and end boundaries of repeating units, copy number, secondary and tertiary structure view, repeat class/subclass, disease association, MSA of repeating units and cross-references to various protein pattern databases, human protein atlas and interaction resources. DbStRiPs provides easy search and download options to high-quality annotations of structural repeat proteins (URL: http://bioinf.iiit.ac.in/dbstrips/).
Collapse
Affiliation(s)
- Broto Chakrabarty
- Centre for Computational Natural Sciences and Bioinformatics, International Institute of Information TechnologyHyderabadIndia
| | - Nita Parekh
- Centre for Computational Natural Sciences and Bioinformatics, International Institute of Information TechnologyHyderabadIndia
| |
Collapse
|
2
|
Siqueira WF, Viana AG, Reis Cunha JL, Rosa LM, Bueno LL, Bartholomeu DC, Cardoso MS, Fujiwara RT. The increased presence of repetitive motifs in the KDDR-plus recombinant protein, a kinesin-derived antigen from Leishmania infantum, improves the diagnostic performance of serological tests for human and canine visceral leishmaniasis. PLoS Negl Trop Dis 2021; 15:e0009759. [PMID: 34534217 PMCID: PMC8480608 DOI: 10.1371/journal.pntd.0009759] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2021] [Revised: 09/29/2021] [Accepted: 08/24/2021] [Indexed: 02/07/2023] Open
Abstract
Visceral leishmaniasis (VL) is caused by protozoa belonging to the Leishmania donovani complex and is considered the most serious and fatal form among the different types of leishmaniasis, if not early diagnosed and treated. Among the measures of disease control stand out the management of infected dogs and the early diagnosis and appropriate treatment of human cases. Several antigens have been characterized for use in the VL diagnosis, among them are the recombinant kinesin-derived antigens from L. infantum, as rK39 and rKDDR. The main difference between these antigens is the size of the non-repetitive kinesin region and the number of repetitions of the 39 amino acid degenerate motif (6.5 and 8.5 repeats in rK39 and rKDDR, respectively). This repetitive region has a high antigenicity score. To evaluate the effect of increasing the number of repeats on diagnostic performance, we designed the rKDDR-plus antigen, containing 15.3 repeats of the 39 amino acid degenerate motif, besides the absence of the non-repetitive portion from L. infantum kinesin. Its performance was evaluated by enzyme-linked immunosorbent assay (ELISA) and rapid immunochromatographic test (ICT), and compared with the kinesin-derived antigens (rKDDR and rK39). In ELISA with human sera, all recombinant antigens had a sensitivity of 98%, whereas the specificity for rKDDR-plus, rKDDR and rK39 was 100%, 96% and 71%, respectively. When evaluated canine sera, the ELISA sensitivity was 97% for all antigens, and the specificity for rKDDR-plus, rKDDR and rK39 was 98%, 91% and 83%, respectively. Evaluation of the ICT/rKDDR-plus, using human sera, showed greater diagnostic sensitivity (90%) and specificity (100%), when compared to the IT LEISH (79% and 98%, respectively), which is based on the rK39 antigen. These results suggest that the increased presence of repetitive motifs in the rKDDR-plus protein improves the diagnostic performance of serological tests by increasing the specificity and accuracy of the diagnosis.
Collapse
Affiliation(s)
- Williane Fernanda Siqueira
- Programa de Pós-graduação em Ciências da Saúde: Infectologia e Medicina Tropical, Faculdade de Medicina, Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais, Brazil
| | - Agostinho Gonçalves Viana
- Departamento de Parasitologia, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais, Brazil
| | - João Luís Reis Cunha
- Departamento de Parasitologia, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais, Brazil
| | - Leticia Mansur Rosa
- Departamento de Parasitologia, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais, Brazil
| | - Lilian Lacerda Bueno
- Programa de Pós-graduação em Ciências da Saúde: Infectologia e Medicina Tropical, Faculdade de Medicina, Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais, Brazil.,Departamento de Parasitologia, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais, Brazil
| | - Daniella Castanheira Bartholomeu
- Departamento de Parasitologia, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais, Brazil
| | - Mariana Santos Cardoso
- Departamento de Parasitologia, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais, Brazil
| | - Ricardo Toshio Fujiwara
- Programa de Pós-graduação em Ciências da Saúde: Infectologia e Medicina Tropical, Faculdade de Medicina, Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais, Brazil.,Departamento de Parasitologia, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais, Brazil
| |
Collapse
|
3
|
Mary Rajathei D, Parthasarathy S, Selvaraj S. HPREP: a comprehensive database for human proteome repeats. J Integr Bioinform 2020; 0:/j/jib.ahead-of-print/jib-2020-0024/jib-2020-0024.xml. [PMID: 33136065 DOI: 10.1515/jib-2020-0024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2020] [Accepted: 09/17/2020] [Indexed: 11/15/2022] Open
Abstract
Amino acid repeats are found to play important roles in both structures and functions of the proteins. These are commonly found in all kingdoms of life, especially in eukaryotes and a larger fraction of human proteins composed of repeats. Further, the abnormal expansions of shorter repeats cause various diseases to humans. Therefore, the analysis of repeats of the entire human proteome along with functional, mutational and disease information would help to better understand their roles in proteins. To fulfill this need, we developed a web database HPREP (http://bioinfo.bdu.ac.in/hprep) for human proteome repeats using Perl and HTML programming. We identified different categories of well-characterized repeats and domain repeats that are present in the human proteome of UniProtKB/Swiss-Prot by using in-house Perl programming and novel repeats by using the repeat detection T-REKS tool as well as XSTREAM web server. Further, these proteins are annotated with functional, mutational and disease information and grouped according to specific repeat types. The developed database enables the users to search by specific repeat type in order to understand their involvement in proteins. Thus, the HPREP database is expected to be a useful resource to gain better insight regarding the different repeats in human proteome and their biological roles.
Collapse
Affiliation(s)
- David Mary Rajathei
- Department of Bioinformatics, School of Life Sciences, Bharathidasan University, Tiruchirappalli 620 024, India
| | - Subbiah Parthasarathy
- Department of Bioinformatics, School of Life Sciences, Bharathidasan University, Tiruchirappalli 620 024, India
| | - Samuel Selvaraj
- Department of Bioinformatics, School of Life Sciences, Bharathidasan University, Tiruchirappalli 620 024, India
| |
Collapse
|
4
|
Stage-Specific Transcriptome and Proteome Analyses of the Filarial Parasite Onchocerca volvulus and Its Wolbachia Endosymbiont. mBio 2016; 7:mBio.02028-16. [PMID: 27881553 PMCID: PMC5137501 DOI: 10.1128/mbio.02028-16] [Citation(s) in RCA: 42] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022] Open
Abstract
Onchocerciasis (river blindness) is a neglected tropical disease that has been successfully targeted by mass drug treatment programs in the Americas and small parts of Africa. Achieving the long-term goal of elimination of onchocerciasis, however, requires additional tools, including drugs, vaccines, and biomarkers of infection. Here, we describe the transcriptome and proteome profiles of the major vector and the human host stages (L1, L2, L3, molting L3, L4, adult male, and adult female) of Onchocerca volvulus along with the proteome of each parasitic stage and of its Wolbachia endosymbiont (wOv). In so doing, we have identified stage-specific pathways important to the parasite’s adaptation to its human host during its early development. Further, we generated a protein array that, when screened with well-characterized human samples, identified novel diagnostic biomarkers of O. volvulus infection and new potential vaccine candidates. This immunomic approach not only demonstrates the power of this postgenomic discovery platform but also provides additional tools for onchocerciasis control programs. The global onchocerciasis (river blindness) elimination program will have to rely on the development of new tools (drugs, vaccines, biomarkers) to achieve its goals by 2025. As an adjunct to the completed genomic sequencing of O. volvulus, we used a comprehensive proteomic and transcriptomic profiling strategy to gain a comprehensive understanding of both the vector-derived and human host-derived parasite stages. In so doing, we have identified proteins and pathways that enable novel drug targeting studies and the discovery of novel vaccine candidates, as well as useful biomarkers of active infection.
Collapse
|
5
|
Pellegrini M. Tandem Repeats in Proteins: Prediction Algorithms and Biological Role. Front Bioeng Biotechnol 2015; 3:143. [PMID: 26442257 PMCID: PMC4585158 DOI: 10.3389/fbioe.2015.00143] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2015] [Accepted: 09/07/2015] [Indexed: 12/30/2022] Open
Abstract
Tandem repetitions in protein sequence and structure is a fascinating subject of research which has been a focus of study since the late 1990s. In this survey, we give an overview on the multi-faceted aspects of research on protein tandem repeats (PTR for short), including prediction algorithms, databases, early classification efforts, mechanisms of PTR formation and evolution, and synthetic PTR design. We also touch on the rather open issue of the relationship between PTR and flexibility (or disorder) in proteins. Detection of PTR either from protein sequence or structure data is challenging due to inherent high (biological) signal-to-noise ratio that is a key feature of this problem. As early in silico analytic tools have been key enablers for starting this field of study, we expect that current and future algorithmic and statistical breakthroughs will have a high impact on the investigations of the biological role of PTR.
Collapse
Affiliation(s)
- Marco Pellegrini
- Laboratory for Integrative Systems Medicine (LISM), Istituto di Informatica e Telematica, and Istituto di Fisiologia Clinica, Consiglio Nazionale delle Ricerche , Pisa , Italy
| |
Collapse
|
6
|
Comparative in-silico genome analysis of Leishmania (Leishmania) donovani: A step towards its species specificity. Meta Gene 2014; 2:782-98. [PMID: 25606461 PMCID: PMC4287845 DOI: 10.1016/j.mgene.2014.10.003] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2014] [Revised: 09/12/2014] [Accepted: 10/04/2014] [Indexed: 12/24/2022] Open
Abstract
Comparative genome analysis of recently sequenced Leishmania (L.) donovani was unexplored so far. The present study deals with the complete scanning of L. (L.) donovani genome revealing its interspecies variations. 60 distinctly present genes in L. (L.) donovani were identified when the whole genome was compared with Leishmania (L.) infantum. Similarly 72, 159, and 265 species specific genes were identified in L. (L.) donovani when compared to Leishmania (L.) major, Leishmania (L.) mexicana and Leishmania (Viannia) braziliensis respectively. The cross comparison of L. (L.) donovani in parallel with the other sequenced species of leishmanial led to the identification of 55 genes which are highly specific and expressed exclusively in L. (L.) donovani. We found mainly the discrepancies of surface proteins such as amastins, proteases, and peptidases. Also 415 repeat containing proteins in L. (L.) donovani and their differential distribution in other leishmanial species were identified which might have a potential role during pathogenesis. The genes identified can be evaluated as drug targets for anti-leishmanial treatment, exploring the scope for extensive future investigations. Comparative genome analysis identifies 55 species specific L. (L.) donovani genes. Discrepancies of surface proteins such as amastins, proteases, and peptidases are identified in L. (L.) donovani. Apical Membrane Antigen (AMA1) might be a novel factor which helps L. (L.) donovani invasion. Novel A2 and amastin genes in L. (L.) donovani genome are identified. Our study identifies differential gene distribution in L. (L.) donovani with respect to other leishmanial species.
Collapse
|
7
|
Russell K, Cheng CH, Bizzaro JW, Ponts N, Emes RD, Le Roch K, Marx KA, Horrocks P. Homopolymer tract organization in the human malarial parasite Plasmodium falciparum and related Apicomplexan parasites. BMC Genomics 2014; 15:848. [PMID: 25281558 PMCID: PMC4194402 DOI: 10.1186/1471-2164-15-848] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2014] [Accepted: 09/24/2014] [Indexed: 01/22/2023] Open
Abstract
BACKGROUND Homopolymeric tracts, particularly poly dA.dT, are enriched within the intergenic sequences of eukaryotic genomes where they appear to act as intrinsic regulators of nucleosome positioning. A previous study of the incomplete genome of the human malarial parasite Plasmodium falciparum reports a higher than expected enrichment of poly dA.dT tracts, far above that anticipated even in this highly AT rich genome. Here we report an analysis of the relative frequency, length and spatial arrangement of homopolymer tracts for the complete P. falciparum genome, extending this analysis to twelve additional genomes of Apicomplexan parasites important to human and animal health. In addition, using nucleosome-positioning data available for P. falciparum, we explore the correlation of poly dA.dT tracts with nucleosome-positioning data over key expression landmarks within intergenic regions. RESULTS We describe three apparent lineage-specific patterns of homopolymeric tract organization within the intergenic regions of these Apicomplexan parasites. Moreover, a striking pattern of enrichment of overly long poly dA.dT tracts in the intergenic regions of Plasmodium spp. uniquely extends into protein coding sequences. There is a conserved spatial arrangement of poly dA.dT immediately flanking open reading frames and over predicted core promoter sites. These key landmarks are all relatively depleted in nucleosomes in P. falciparum, as would be expected for poly dA.dT acting as nucleosome exclusion sequences. CONCLUSIONS Previous comparative studies of homopolymer tract organization emphasize evolutionary diversity; this is the first report of such an analysis within a single phylum. Our data provide insights into the evolution of homopolymeric tracts and the selective pressures at play in their maintenance and expansion.
Collapse
Affiliation(s)
- Karen Russell
- />Institute for Science and Technology in Medicine, Keele University, Stoke-on-Trent, ST5 5BG Staffordshire, UK
| | - Chia-Ho Cheng
- />Center for Intelligent Biomaterials, University of Massachusetts Lowell, Lowell, MA 01854 USA
- />Hebrew SeniorLife, Institute for Aging Research, Boston, MA 02131 USA
| | | | - Nadia Ponts
- />National Institute for Agricultural Research (INRA), UR1264-Mycology and Food Safety (MycSA), CS20032, 33882 Villenave d’Ornon Cedex, France
| | - Richard D Emes
- />School of Veterinary Medicine and Science, University of Nottingham, LE12 5RD Nottingham, Leicestershire, UK
- />Advanced Data Analysis Centre, University of Nottingham, Nottingham, UK
| | - Karine Le Roch
- />Department Cell Biology and Neuroscience, University of California, Riverside, CA 92521 USA
| | - Kenneth A Marx
- />Center for Intelligent Biomaterials, University of Massachusetts Lowell, Lowell, MA 01854 USA
| | - Paul Horrocks
- />Institute for Science and Technology in Medicine, Keele University, Stoke-on-Trent, ST5 5BG Staffordshire, UK
| |
Collapse
|
8
|
Maier J, Adzhubei AA, Egge-Jacobsen W. SAPA tool: finding protein regions by combination of amino acid composition, scaled profiles, patterns and rules. Bioinformatics 2013; 29:2496-7. [DOI: 10.1093/bioinformatics/btt411] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
|
9
|
María Velasco A, Becerra A, Hernández-Morales R, Delaye L, Jiménez-Corona ME, Ponce-de-Leon S, Lazcano A. Low complexity regions (LCRs) contribute to the hypervariability of the HIV-1 gp120 protein. J Theor Biol 2013; 338:80-6. [PMID: 24021867 DOI: 10.1016/j.jtbi.2013.08.039] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2013] [Revised: 08/01/2013] [Accepted: 08/31/2013] [Indexed: 01/27/2023]
Abstract
Low complexity regions (LCRs) are sequences of nucleic acids or proteins defined by a compositional bias. Their occurrence has been confirmed in sequences of the three cellular lineages (Bacteria, Archaea and Eucarya), and has also been reported in viral genomes. We present here the results of a detailed computer analysis of the LCRs present in the HIV-1 glycoprotein 120 (gp120) encoded by the viral gene env. The analysis was performed using a sample of 3637 Env polyprotein sequences derived from 4117 completely sequenced and translated HIV-1 genomes available in public databases as of December 2012. We have identified 1229 LCRs located in four different regions of the gp120 protein that correspond to four of the five regions that have been identified as hypervariable (V1, V2, V4 and V5). The remaining 29 LCRs are found in the signal peptide and in the conserved regions C2, C3, C4 and C5. No LCR has been identified in the hypervariable region V3. The LCRs detected in the V1, V2, V4, and V5 hypervariable regions exhibit a high Asn content in their amino acid composition, which very likely correspond to glycosylation sites, which may contribute to the retroviral ability to avoid the immune system. In sharp contrast with what is observed in gp120 proteins lacking LCRs, the glycosylation sites present in LCRs tend to be clustered towards the center of the region forming well-defined islands. The results presented here suggest that LCRs represent a hitherto undescribed source of genomic variability in lentivirus, and that these repeats may represent an important source of antigenic variation in HIV-1 populations. The results reported here may exemplify the evolutionary processes that may have increased the size of primitive cellular RNA genomes and the role of LCRs as a source of raw material during the processes of evolutionary acquisition of new functions.
Collapse
Affiliation(s)
- Ana María Velasco
- Facultad de Ciencias, UNAM, Ciudad Universitaria, Apdo. Postal 70-407, México D. F. 04510, Mexico; Laboratorios de Biológicos y Reactivos de México, Amores 1240, Colonia Del Valle, México D. F. 03100, Mexico
| | | | | | | | | | | | | |
Collapse
|
10
|
Kubrycht J, Sigler K, Souček P, Hudeček J. Structures composing protein domains. Biochimie 2013; 95:1511-24. [DOI: 10.1016/j.biochi.2013.04.001] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2013] [Accepted: 04/02/2013] [Indexed: 12/21/2022]
|
11
|
Homepeptide repeats: implications for protein structure, function and evolution. GENOMICS PROTEOMICS & BIOINFORMATICS 2012; 10:217-25. [PMID: 23084777 PMCID: PMC5054710 DOI: 10.1016/j.gpb.2012.04.001] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/18/2011] [Revised: 04/03/2012] [Accepted: 04/19/2012] [Indexed: 11/20/2022]
Abstract
Analysis of protein sequences from Mycobacterium tuberculosis H37Rv (Mtb H37Rv) was performed to identify homopeptide repeat-containing proteins (HRCPs). Functional annotation of the HRCPs showed that they are preferentially involved in cellular metabolism. Furthermore, these homopeptide repeats might play some specific roles in protein–protein interaction. Repeat length differences among Bacteria, Archaea and Eukaryotes were calculated in order to identify the conservation of the repeats in these divergent kingdoms. From the results, it was evident that these repeats have a higher degree of conservation in Bacteria and Archaea than in Eukaryotes. In addition, there seems to be a direct correlation between the repeat length difference and the degree of divergence between the species. Our study supports the hypothesis that the presence of homopeptide repeats influences the rate of evolution of the protein sequences in which they are embedded. Thus, homopeptide repeat may have structural, functional and evolutionary implications on proteins.
Collapse
|
12
|
Maroof A, Brown N, Smith B, Hodgkinson MR, Maxwell A, Losch FO, Fritz U, Walden P, Lacey CNJ, Smith DF, Aebischer T, Kaye PM. Therapeutic vaccination with recombinant adenovirus reduces splenic parasite burden in experimental visceral leishmaniasis. J Infect Dis 2012; 205:853-63. [PMID: 22301630 PMCID: PMC3274377 DOI: 10.1093/infdis/jir842] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Therapeutic vaccines, when used alone or in combination therapy with antileishmanial drugs, may have an important place in the control of a variety of forms of human leishmaniasis. Here, we describe the development of an adenovirus-based vaccine (Ad5-KH) comprising a synthetic haspb gene linked to a kmp11 gene via a viral 2A sequence. In nonvaccinated Leishmania donovani–infected BALB/c mice, HASPB- and KMP11-specific CD8+ T cell responses were undetectable, although IgG1 and IgG2a antibodies were evident. After therapeutic vaccination, antibody responses were boosted, and IFNγ+CD8+ T cell responses, particularly to HASPB, became apparent. A single vaccination with Ad5-KH inhibited splenic parasite growth by ∼66%, a level of efficacy comparable to that observed in early stage testing of clinically approved antileishmanial drugs in this model. These studies indicate the usefulness of adenoviral vectors to deliver leishmanial antigens in a potent and host protective manner to animals with existing L. donovani infection.
Collapse
Affiliation(s)
- Asher Maroof
- Centre for Immunology and Infection, Hull York Medical School and Department of Biology, University of York, Heslington, UK
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
13
|
Luo H, Lin K, David A, Nijveen H, Leunissen JAM. ProRepeat: an integrated repository for studying amino acid tandem repeats in proteins. Nucleic Acids Res 2011; 40:D394-9. [PMID: 22102581 PMCID: PMC3245022 DOI: 10.1093/nar/gkr1019] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open
Abstract
ProRepeat (http://prorepeat.bioinformatics.nl/) is an integrated curated repository and analysis platform for in-depth research on the biological characteristics of amino acid tandem repeats. ProRepeat collects repeats from all proteins included in the UniProt knowledgebase, together with 85 completely sequenced eukaryotic proteomes contained within the RefSeq collection. It contains non-redundant perfect tandem repeats, approximate tandem repeats and simple, low-complexity sequences, covering the majority of the amino acid tandem repeat patterns found in proteins. The ProRepeat web interface allows querying the repeat database using repeat characteristics like repeat unit and length, number of repetitions of the repeat unit and position of the repeat in the protein. Users can also search for repeats by the characteristics of repeat containing proteins, such as entry ID, protein description, sequence length, gene name and taxon. ProRepeat offers powerful analysis tools for finding biological interesting properties of repeats, such as the strong position bias of leucine repeats in the N-terminus of eukaryotic protein sequences, the differences of repeat abundance among proteomes, the functional classification of repeat containing proteins and GC content constrains of repeats’ corresponding codons.
Collapse
Affiliation(s)
- Hong Luo
- Laboratory of Bioinformatics, Wageningen University and Research Centre, PO Box 569, 6700 AN Wageningen, Netherlands
| | | | | | | | | |
Collapse
|
14
|
Bennuru S, Meng Z, Ribeiro JMC, Semnani RT, Ghedin E, Chan K, Lucas DA, Veenstra TD, Nutman TB. Stage-specific proteomic expression patterns of the human filarial parasite Brugia malayi and its endosymbiont Wolbachia. Proc Natl Acad Sci U S A 2011; 108:9649-54. [PMID: 21606368 PMCID: PMC3111283 DOI: 10.1073/pnas.1011481108] [Citation(s) in RCA: 88] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Global proteomic analyses of pathogens have thus far been limited to unicellular organisms (e.g., protozoa and bacteria). Proteomic analyses of most eukaryotic pathogens (e.g., helminths) have been restricted to specific organs, specific stages, or secretomes. We report here a large-scale proteomic characterization of almost all the major mammalian stages of Brugia malayi, a causative agent of lymphatic filariasis, resulting in the identification of more than 62% of the products predicted from the Bm draft genome. The analysis also yielded much of the proteome of Wolbachia, the obligate endosymbiont of Bm that also expressed proteins in a stage-specific manner. Of the 11,610 predicted Bm gene products, 7,103 were definitively identified from adult male, adult female, blood-borne and uterine microfilariae, and infective L3 larvae. Among the 4,956 gene products (42.5%) inferred from the genome as "hypothetical," the present study was able to confirm 2,336 (47.1%) as bona fide proteins. Analysis of protein families and domains coupled with stage-specific expression highlight the important pathways that benefit the parasite during its development in the host. Gene set enrichment analysis identified extracellular matrix proteins and those with immunologic effects as enriched in the microfilarial and L3 stages. Parasite sex- and stage-specific protein expression identified those pathways related to parasite differentiation and demonstrates stage-specific expression by the Bm endosymbiont Wolbachia as well.
Collapse
Affiliation(s)
- Sasisekhar Bennuru
- Laboratory of Parasitic Diseases, National Institute of Allergy and Infectious Diseases, Bethesda, MD 20892, USA.
| | | | | | | | | | | | | | | | | |
Collapse
|
15
|
Castagnone-Sereno P, Danchin EGJ, Deleury E, Guillemaud T, Malausa T, Abad P. Genome-wide survey and analysis of microsatellites in nematodes, with a focus on the plant-parasitic species Meloidogyne incognita. BMC Genomics 2010; 11:598. [PMID: 20973953 PMCID: PMC3091743 DOI: 10.1186/1471-2164-11-598] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2010] [Accepted: 10/25/2010] [Indexed: 11/13/2022] Open
Abstract
Background Microsatellites are the most popular source of molecular markers for studying population genetic variation in eukaryotes. However, few data are currently available about their genomic distribution and abundance across the phylum Nematoda. The recent completion of the genomes of several nematode species, including Meloidogyne incognita, a major agricultural pest worldwide, now opens the way for a comparative survey and analysis of microsatellites in these organisms. Results Using MsatFinder, the total numbers of 1-6 bp perfect microsatellites detected in the complete genomes of five nematode species (Brugia malayi, Caenorhabditis elegans, M. hapla, M. incognita, Pristionchus pacificus) ranged from 2,842 to 61,547, and covered from 0.09 to 1.20% of the nematode genomes. Under our search criteria, the most common repeat motifs for each length class varied according to the different nematode species considered, with no obvious relation to the AT-richness of their genomes. Overall, (AT)n, (AG)n and (CT)n were the three most frequent dinucleotide microsatellite motifs found in the five genomes considered. Except for two motifs in P. pacificus, all the most frequent trinucleotide motifs were AT-rich, with (AAT)n and (ATT)n being the only common to the five nematode species. A particular attention was paid to the microsatellite content of the plant-parasitic species M. incognita. In this species, a repertoire of 4,880 microsatellite loci was identified, from which 2,183 appeared suitable to design markers for population genetic studies. Interestingly, 1,094 microsatellites were identified in 801 predicted protein-coding regions, 99% of them being trinucleotides. When compared against the InterPro domain database, 497 of these CDS were successfully annotated, and further assigned to Gene Ontology terms. Conclusions Contrasted patterns of microsatellite abundance and diversity were characterized in five nematode genomes, even in the case of two closely related Meloidogyne species. 2,245 di- to hexanucleotide loci were identified in the genome of M. incognita, providing adequate material for the future development of a wide range of microsatellite markers in this major plant parasite.
Collapse
|
16
|
Noël CJ, Diaz N, Sicheritz-Ponten T, Safarikova L, Tachezy J, Tang P, Fiori PL, Hirt RP. Trichomonas vaginalis vast BspA-like gene family: evidence for functional diversity from structural organisation and transcriptomics. BMC Genomics 2010; 11:99. [PMID: 20144183 PMCID: PMC2843621 DOI: 10.1186/1471-2164-11-99] [Citation(s) in RCA: 68] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2009] [Accepted: 02/08/2010] [Indexed: 12/31/2022] Open
Abstract
Background Trichomonas vaginalis is the most common non-viral human sexually transmitted pathogen and importantly, contributes to facilitating the spread of HIV. Yet very little is known about its surface and secreted proteins mediating interactions with, and permitting the invasion and colonisation of, the host mucosa. Initial annotations of T. vaginalis genome identified a plethora of candidate extracellular proteins. Results Data mining of the T. vaginalis genome identified 911 BspA-like entries (TvBspA) sharing TpLRR-like leucine-rich repeats, which represent the largest gene family encoding potential extracellular proteins for the pathogen. A broad range of microorganisms encoding BspA-like proteins was identified and these are mainly known to live on mucosal surfaces, among these T. vaginalis is endowed with the largest gene family. Over 190 TvBspA proteins with inferred transmembrane domains were characterised by a considerable structural diversity between their TpLRR and other types of repetitive sequences and two subfamilies possessed distinct classic sorting signal motifs for endocytosis. One TvBspA subfamily also shared a glycine-rich protein domain with proteins from Clostridium difficile pathogenic strains and C. difficile phages. Consistent with the hypothesis that TvBspA protein structural diversity implies diverse roles, we demonstrated for several TvBspA genes differential expression at the transcript level in different growth conditions. Identified variants of repetitive segments between several TvBspA paralogues and orthologues from two clinical isolates were also consistent with TpLRR and other repetitive sequences to be functionally important. For one TvBspA protein cell surface expression and antibody responses by both female and male T. vaginalis infected patients were also demonstrated. Conclusions The biased mucosal habitat for microbial species encoding BspA-like proteins, the characterisation of a vast structural diversity for the TvBspA proteins, differential expression of a subset of TvBspA genes and the cellular localisation and immunological data for one TvBspA; all point to the importance of the TvBspA proteins to various aspects of T. vaginalis pathobiology at the host-pathogen interface.
Collapse
Affiliation(s)
- Christophe J Noël
- Institute for Cell and Molecular Biosciences, Newcastle University, Newcastle upon Tyne, NE2 4HH, UK
| | | | | | | | | | | | | | | |
Collapse
|
17
|
Lokanathan Y, Mohd-Adnan A, Wan KL, Nathan S. Transcriptome analysis of the Cryptocaryon irritans tomont stage identifies potential genes for the detection and control of cryptocaryonosis. BMC Genomics 2010; 11:76. [PMID: 20113487 PMCID: PMC2828411 DOI: 10.1186/1471-2164-11-76] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2009] [Accepted: 01/29/2010] [Indexed: 01/26/2023] Open
Abstract
Background Cryptocaryon irritans is a parasitic ciliate that causes cryptocaryonosis (white spot disease) in marine fish. Diagnosis of cryptocaryonosis often depends on the appearance of white spots on the surface of the fish, which are usually visible only during later stages of the disease. Identifying suitable biomarkers of this parasite would aid the development of diagnostic tools and control strategies for C. irritans. The C. irritans genome is virtually unexplored; therefore, we generated and analyzed expressed sequence tags (ESTs) of the parasite to identify genes that encode for surface proteins, excretory/secretory proteins and repeat-containing proteins. Results ESTs were generated from a cDNA library of C. irritans tomonts isolated from infected Asian sea bass, Lates calcarifer. Clustering of the 5356 ESTs produced 2659 unique transcripts (UTs) containing 1989 singletons and 670 consensi. BLAST analysis showed that 74% of the UTs had significant similarity (E-value < 10-5) to sequences that are currently available in the GenBank database, with more than 15% of the significant hits showing unknown function. Forty percent of the UTs had significant similarity to ciliates from the genera Tetrahymena and Paramecium. Comparative gene family analysis with related taxa showed that many protein families are conserved among the protozoans. Based on gene ontology annotation, functional groups were successfully assigned to 790 UTs. Genes encoding excretory/secretory proteins and membrane and membrane-associated proteins were identified because these proteins often function as antigens and are good antibody targets. A total of 481 UTs were classified as encoding membrane proteins, 54 were classified as encoding for membrane-bound proteins, and 155 were found to contain excretory/secretory protein-coding sequences. Amino acid repeat-containing proteins and GPI-anchored proteins were also identified as potential candidates for the development of diagnostic and control strategies for C. irritans. Conclusions We successfully discovered and examined a large portion of the previously unexplored C. irritans transcriptome and identified potential genes for the development and validation of diagnostic and control strategies for cryptocaryonosis.
Collapse
Affiliation(s)
- Yogeswaran Lokanathan
- School of Biosciences and Biotechnology, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, Selangor, Malaysia
| | | | | | | |
Collapse
|
18
|
Frugier M, Bour T, Ayach M, Santos MAS, Rudinger-Thirion J, Théobald-Dietrich A, Pizzi E. Low Complexity Regions behave as tRNA sponges to help co-translational folding of plasmodial proteins. FEBS Lett 2009; 584:448-54. [PMID: 19900443 DOI: 10.1016/j.febslet.2009.11.004] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2009] [Revised: 11/02/2009] [Accepted: 11/03/2009] [Indexed: 10/20/2022]
Abstract
In most organisms, the information necessary to specify the native 3D-structures of proteins is encoded in the corresponding mRNA sequences. Translational accuracy and efficiency are coupled and sequences that are slowly translated play an essential role in the concomitant folding of protein domains. Here, we suggest that the well-known mechanisms for the regulation of translational efficiency, which involves mRNA structure and/or asymmetric tRNA abundance, do not apply to all organisms. We propose that Plasmodium, the parasite responsible for malaria, uses an alternative strategy to slow down ribosomal speed and avoid multidomain protein misfolding during translation. In our model, the abundant Low Complexity Regions present in Plasmodium proteins replace the codon preferences, which influence the assembly of protein secondary structures.
Collapse
Affiliation(s)
- Magali Frugier
- Architecture et Réactivité de l'ARN, Université de Strasbourg, CNRS, IBMC, 15 rue René Descartes, 67084 Strasbourg Cedex, France.
| | | | | | | | | | | | | |
Collapse
|
19
|
Castagnone-Sereno P, Semblat JP, Castagnone C. Modular architecture and evolution of the map-1 gene family in the root-knot nematode Meloidogyne incognita. Mol Genet Genomics 2009; 282:547-54. [PMID: 19787376 DOI: 10.1007/s00438-009-0487-x] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2009] [Accepted: 09/13/2009] [Indexed: 10/20/2022]
Abstract
In eukaryotes, repeat proteins (i.e. proteins that contain a tandem arrangement of repeated structural elements) are often considered as an extra source of variability, and gains and losses of repeats may be an important force driving the evolution and diversification of such proteins, that could allow fast adaptation to new environments. Here, we report genomic sequences of the MAP-1 protein family from of the asexual, plant-parasitic nematode Meloidogyne incognita. The encoded proteins exhibited highly conserved repeats of 13 and 58 aa, and variation in the number and arrangement of these repeats in the MAP-1 proteins was correlated with nematode (a)virulence, suggesting a possible role in the specificity of the plant-nematode interaction. Search in the complete genome sequence of M. incognita confirmed that a small gene family encoding proteins harboring conserved 58 and 13 aa-repeats is present in this nematode, and that the repetitive region of these proteins is modular. Both gene duplication and intragenic gain and loss of repeats have contributed to the complex evolutionary history of the map-1 gene family, and active selection pressure of the plant host probably induced recent additional gene loss, finally resulting in the present-day gene and repeat diversity observed among nematode lines. The genomic differences characterized here between avirulent and virulent individuals are assumed to reflect, at the DNA level, the adaptive capacity of these asexual root-knot nematodes.
Collapse
|
20
|
Dalby AR. A comparative proteomic analysis of the simple amino acid repeat distributions in Plasmodia reveals lineage specific amino acid selection. PLoS One 2009; 4:e6231. [PMID: 19597555 PMCID: PMC2705789 DOI: 10.1371/journal.pone.0006231] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2009] [Accepted: 06/17/2009] [Indexed: 11/19/2022] Open
Abstract
Background Microsatellites have been used extensively in the field of comparative genomics. By studying microsatellites in coding regions we have a simple model of how genotypic changes undergo selection as they are directly expressed in the phenotype as altered proteins. The simplest of these tandem repeats in coding regions are the tri-nucleotide repeats which produce a repeat of a single amino acid when translated into proteins. Tri-nucleotide repeats are often disease associated, and are also known to be unstable to both expansion and contraction. This makes them sensitive markers for studying proteome evolution, in closely related species. Results The evolutionary history of the family of malarial causing parasites Plasmodia is complex because of the life-cycle of the organism, where it interacts with a number of different hosts and goes through a series of tissue specific stages. This study shows that the divergence between the primate and rodent malarial parasites has resulted in a lineage specific change in the simple amino acid repeat distribution that is correlated to A–T content. The paper also shows that this altered use of amino acids in SAARs is consistent with the repeat distributions being under selective pressure. Conclusions The study shows that simple amino acid repeat distributions can be used to group related species and to examine their phylogenetic relationships. This study also shows that an outgroup species with a similar A–T content can be distinguished based only on the amino acid usage in repeats, and suggest that this might be a useful feature for proteome clustering. The lineage specific use of amino acids in repeat regions suggests that comparative studies of SAAR distributions between proteomes gives an insight into the mechanisms of expansion and the selective pressures acting on the organism.
Collapse
Affiliation(s)
- Andrew R Dalby
- Department of Statistics, University of Oxford, Oxford, UK.
| |
Collapse
|
21
|
Depledge DP, Evans KJ, Ivens AC, Aziz N, Maroof A, Kaye PM, Smith DF. Comparative expression profiling of Leishmania: modulation in gene expression between species and in different host genetic backgrounds. PLoS Negl Trop Dis 2009; 3:e476. [PMID: 19582145 PMCID: PMC2701600 DOI: 10.1371/journal.pntd.0000476] [Citation(s) in RCA: 71] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2009] [Accepted: 06/02/2009] [Indexed: 12/03/2022] Open
Abstract
Background Genome sequencing of Leishmania species that give rise to a range of disease phenotypes in the host has revealed highly conserved gene content and synteny across the genus. Only a small number of genes are differentially distributed between the three species sequenced to date, L. major, L. infantum and L. braziliensis. It is not yet known how many of these genes are expressed in the disease-promoting intracellular amastigotes of these species or whether genes conserved between the species are differentially expressed in the host. Methods/Principal Findings We have used customised oligonucleotide microarrays to confirm that all of the differentially distributed genes identified by genome comparisons are expressed in intracellular amastigotes, with only a few of these subject to regulation at the RNA level. In the first large-scale study of gene expression in L. braziliensis, we show that only ∼9% of the genes analysed are regulated in their RNA expression during the L. braziliensis life cycle, a figure consistent with that observed in other Leishmania species. Comparing amastigote gene expression profiles between species confirms the proposal that Leishmania transcriptomes undergo little regulation but also identifies conserved genes that are regulated differently between species in the host. We have also investigated whether host immune competence influences parasite gene expression, by comparing RNA expression profiles in L. major amastigotes derived from either wild-type (BALB/c) or immunologically compromised (Rag2−/− γc−/−) mice. While parasite dissemination from the site of infection is enhanced in the Rag2−/− γc−/− genetic background, parasite RNA expression profiles are unperturbed. Conclusion/Significance These findings support the hypothesis that Leishmania amastigotes are pre-adapted for intracellular survival and undergo little dynamic modulation of gene expression at the RNA level. Species-specific parasite factors contributing to virulence and pathogenicity in the host may be limited to the products of a small number of differentially distributed genes or the differential regulation of conserved genes, either of which are subject to translational and/or post-translational controls. The single-celled parasite Leishmania, transmitted by sand flies in more than 88 tropical and sub-tropical countries globally, infects man and other mammals, causing a spectrum of diseases called the leishmaniases. Over 12 million people are currently infected worldwide with 2 million new cases reported each year. The type of leishmaniasis that develops in the mammalian host is dependent on the species of infecting parasite and the immune response to infection (that can be influenced by host genetic variation). Our research is focused on identifying parasite factors that contribute to pathogenicity in the host and understanding how these might differ between parasite species that give rise to the different clinical forms of leishmaniasis. Molecules of this type might lead to new therapeutic tools in the longer term. In this paper, we report a comparative analysis of gene expression profiles in three Leishmania species that give rise to different types of disease, focusing on the intracellular stages that reside in mammalian macrophages. Our results show that there are only a small number of differences between these parasite species, with host genetics playing only a minor role in influencing the parasites' response to their intracellular habitat. These small changes may be significant, however, in determining the clinical outcome of infection.
Collapse
Affiliation(s)
- Daniel P. Depledge
- Centre for Immunology and Infection, Department of Biology/Hull York Medical School, University of York, York, United Kingdom
| | - Krystal J. Evans
- Centre for Immunology and Infection, Department of Biology/Hull York Medical School, University of York, York, United Kingdom
| | | | - Naveed Aziz
- Technology Facility, Department of Biology, University of York, York, United Kingdom
| | - Asher Maroof
- Centre for Immunology and Infection, Department of Biology/Hull York Medical School, University of York, York, United Kingdom
| | - Paul M. Kaye
- Centre for Immunology and Infection, Department of Biology/Hull York Medical School, University of York, York, United Kingdom
| | - Deborah F. Smith
- Centre for Immunology and Infection, Department of Biology/Hull York Medical School, University of York, York, United Kingdom
- * E-mail:
| |
Collapse
|
22
|
Bour T, Akaddar A, Lorber B, Blais S, Balg C, Candolfi E, Frugier M. Plasmodial aspartyl-tRNA synthetases and peculiarities in Plasmodium falciparum. J Biol Chem 2009; 284:18893-903. [PMID: 19443655 DOI: 10.1074/jbc.m109.015297] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Distinctive features of aspartyl-transfer RNA (tRNA) synthetases (AspRS) from the protozoan Plasmodium genus are described. These apicomplexan AspRSs contain 29-31 amino acid insertions in their anticodon binding domains, a remarkably long N-terminal appendix that varies in size from 110 to 165 amino acids and two potential initiation codons. This article focuses on the atypical functional and structural properties of Plasmodium falciparum cytosolic AspRS, the causative parasite of human malaria. This species encodes a 626 or 577 amino acids AspRS depending on whether initiation starts on the first or second in-frame initiation codon. The longer protein has poor solubility and a propensity to aggregate. Production of the short version was favored as shown by the comparison of the recombinant protein with endogenous AspRS. Comparison of the tRNA aminoacylation activity of wild-type and mutant parasite AspRSs with those of yeast and human AspRSs revealed unique properties. The N-terminal extension contains a motif that provides unexpectedly strong RNA binding to plasmodial AspRS. Furthermore, experiments demonstrated the requirement of the plasmodial insertion for AspRS dimerization and, therefore, tRNA aminoacylation and other putative functions. Implications for the parasite biology are proposed. These data provide a robust background for unraveling the precise functional properties of the parasite AspRS and for developing novel lead compounds against malaria, targeting its idiosyncratic domains.
Collapse
Affiliation(s)
- Tania Bour
- Architecture et Réactivité de l'ARN, Université de Strasbourg, CNRS, Institut de Biologie Moléculaire et Cellulaire, 15 rue René Descartes, 67084 Strasbourg Cedex, France
| | | | | | | | | | | | | |
Collapse
|
23
|
Abu-Farha M, Elisma F, Zhou H, Tian R, Zhou H, Asmer MS, Figeys D. Proteomics: From Technology Developments to Biological Applications. Anal Chem 2009; 81:4585-99. [DOI: 10.1021/ac900735j] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Affiliation(s)
- Mohamed Abu-Farha
- Ottawa Institute of Systems Biology (OISB), University of Ottawa, Ottawa, Ontario, Canada, and Department of Biochemistry, Microbiology and Immunology, University of Ottawa, Ottawa, Ontario, Canada
| | - Fred Elisma
- Ottawa Institute of Systems Biology (OISB), University of Ottawa, Ottawa, Ontario, Canada, and Department of Biochemistry, Microbiology and Immunology, University of Ottawa, Ottawa, Ontario, Canada
| | - Houjiang Zhou
- Ottawa Institute of Systems Biology (OISB), University of Ottawa, Ottawa, Ontario, Canada, and Department of Biochemistry, Microbiology and Immunology, University of Ottawa, Ottawa, Ontario, Canada
| | - Ruijun Tian
- Ottawa Institute of Systems Biology (OISB), University of Ottawa, Ottawa, Ontario, Canada, and Department of Biochemistry, Microbiology and Immunology, University of Ottawa, Ottawa, Ontario, Canada
| | - Hu Zhou
- Ottawa Institute of Systems Biology (OISB), University of Ottawa, Ottawa, Ontario, Canada, and Department of Biochemistry, Microbiology and Immunology, University of Ottawa, Ottawa, Ontario, Canada
| | - Mehmet Selim Asmer
- Ottawa Institute of Systems Biology (OISB), University of Ottawa, Ottawa, Ontario, Canada, and Department of Biochemistry, Microbiology and Immunology, University of Ottawa, Ottawa, Ontario, Canada
| | - Daniel Figeys
- Ottawa Institute of Systems Biology (OISB), University of Ottawa, Ottawa, Ontario, Canada, and Department of Biochemistry, Microbiology and Immunology, University of Ottawa, Ottawa, Ontario, Canada
| |
Collapse
|
24
|
|
25
|
Fankhauser N, Nguyen-Ha TM, Adler J, Mäser P. Surface antigens and potential virulence factors from parasites detected by comparative genomics of perfect amino acid repeats. Proteome Sci 2007; 5:20. [PMID: 18096064 PMCID: PMC2254594 DOI: 10.1186/1477-5956-5-20] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2007] [Accepted: 12/20/2007] [Indexed: 11/16/2022] Open
Abstract
Background Many parasitic organisms, eukaryotes as well as bacteria, possess surface antigens with amino acid repeats. Making up the interface between host and pathogen such repetitive proteins may be virulence factors involved in immune evasion or cytoadherence. They find immunological applications in serodiagnostics and vaccine development. Here we use proteins which contain perfect repeats as a basis for comparative genomics between parasitic and free-living organisms. Results We have developed Reptile , a program for proteome-wide probabilistic description of perfect repeats in proteins. Parasite proteomes exhibited a large variance regarding the proportion of repeat-containing proteins. Interestingly, there was a good correlation between the percentage of highly repetitive proteins and mean protein length in parasite proteomes, but not at all in the proteomes of free-living eukaryotes. Reptile combined with programs for the prediction of transmembrane domains and GPI-anchoring resulted in an effective tool for in silico identification of potential surface antigens and virulence factors from parasites. Conclusion Systemic surveys for perfect amino acid repeats allowed basic comparisons between free-living and parasitic organisms that were directly applicable to predict proteins of serological and parasitological importance. An on-line tool is available at .
Collapse
Affiliation(s)
- Niklaus Fankhauser
- University of Bern, Institute of Cell Biology, Baltzerstrasse 4, CH-3012 Bern, Switzerland.
| | | | | | | |
Collapse
|
26
|
Peacock CS, Seeger K, Harris D, Murphy L, Ruiz JC, Quail MA, Peters N, Adlem E, Tivey A, Aslett M, Kerhornou A, Ivens A, Fraser A, Rajandream MA, Carver T, Norbertczak H, Chillingworth T, Hance Z, Jagels K, Moule S, Ormond D, Rutter S, Squares R, Whitehead S, Rabbinowitsch E, Arrowsmith C, White B, Thurston S, Bringaud F, Baldauf SL, Faulconbridge A, Jeffares D, Depledge DP, Oyola SO, Hilley JD, Brito LO, Tosi LRO, Barrell B, Cruz AK, Mottram JC, Smith DF, Berriman M. Comparative genomic analysis of three Leishmania species that cause diverse human disease. Nat Genet 2007; 39:839-47. [PMID: 17572675 PMCID: PMC2592530 DOI: 10.1038/ng2053] [Citation(s) in RCA: 567] [Impact Index Per Article: 33.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2006] [Accepted: 05/04/2007] [Indexed: 12/23/2022]
Abstract
Leishmania parasites cause a broad spectrum of clinical disease. Here we report the sequencing of the genomes of two species of Leishmania: Leishmania infantum and Leishmania braziliensis. The comparison of these sequences with the published genome of Leishmania major reveals marked conservation of synteny and identifies only approximately 200 genes with a differential distribution between the three species. L. braziliensis, contrary to Leishmania species examined so far, possesses components of a putative RNA-mediated interference pathway, telomere-associated transposable elements and spliced leader-associated SLACS retrotransposons. We show that pseudogene formation and gene loss are the principal forces shaping the different genomes. Genes that are differentially distributed between the species encode proteins implicated in host-pathogen interactions and parasite survival in the macrophage.
Collapse
Affiliation(s)
- Christopher S Peacock
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridgeshire CB10 1SA, UK.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|