1
|
What is new in FungiDB: a web-based bioinformatics platform for omics-scale data analysis for fungal and oomycete species. Genetics 2024; 227:iyae035. [PMID: 38529759 DOI: 10.1093/genetics/iyae035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2023] [Accepted: 02/15/2024] [Indexed: 03/27/2024] Open
Abstract
FungiDB (https://fungidb.org) serves as a valuable online resource that seamlessly integrates genomic and related large-scale data for a wide range of fungal and oomycete species. As an integral part of the VEuPathDB Bioinformatics Resource Center (https://veupathdb.org), FungiDB continually integrates both published and unpublished data addressing various aspects of fungal biology. Established in early 2011, the database has evolved to support 674 datasets. The datasets include over 300 genomes spanning various taxa (e.g. Ascomycota, Basidiomycota, Blastocladiomycota, Chytridiomycota, Mucoromycota, as well as Albuginales, Peronosporales, Pythiales, and Saprolegniales). In addition to genomic assemblies and annotation, over 300 extra datasets encompassing diverse information, such as expression and variation data, are also available. The resource also provides an intuitive web-based interface, facilitating comprehensive approaches to data mining and visualization. Users can test their hypotheses and navigate through omics-scale datasets using a built-in search strategy system. Moreover, FungiDB offers capabilities for private data analysis via the integrated VEuPathDB Galaxy platform. FungiDB also permits genome improvements by capturing expert knowledge through the User Comments system and the Apollo genome annotation editor for structural and functional gene curation. FungiDB facilitates data exploration and analysis and contributes to advancing research efforts by capturing expert knowledge for fungal and oomycete species.
Collapse
|
2
|
The first Cryptosporidium meeting: a concerted effort to fight cryptosporidiosis. Trends Parasitol 2024:S1471-4922(24)00088-6. [PMID: 38702267 DOI: 10.1016/j.pt.2024.04.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2024] [Accepted: 04/10/2024] [Indexed: 05/06/2024]
|
3
|
A new chromosome-level genome assembly and annotation of Cryptosporidium meleagridis. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.02.16.580748. [PMID: 38405792 PMCID: PMC10888889 DOI: 10.1101/2024.02.16.580748] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/27/2024]
Abstract
Cryptosporidium spp. are medically and scientifically relevant protozoan parasites that cause severe diarrheal illness in infants and immunosuppressed populations as well as animals. Although most human Cryptosporidium infections are caused by C. parvum and C. hominis, there are several other human-infecting species including C. meleagridis, which is commonly observed in developing countries. Here, we polished and annotated a long-read genome sequence assembly for C. meleagridis TU1867, a species which infects birds and humans. The genome sequence was generated using a combination of whole genome amplification (WGA) and long-read Oxford Nanopore Technologies sequencing. The assembly was then polished with Illumina data. The chromosome-level genome assembly is 9.2 Mbp with a contig N50 of 1.1 Mb. Annotation revealed 3,923 protein-coding genes. A BUSCO analysis indicates a completeness of 96.6% (n=446), including 430 (96.4%) single-copy and 1 (0.224%) duplicated apicomplexan conserved gene(s). The new C. meleagridis genome assembly is nearly gap-free and provides a valuable new resource for the Cryptosporidium community and future studies on evolution and host-specificity.
Collapse
|
4
|
"Evaluating the Benefits and Limits of Multiple Displacement Amplification with Whole-Genome Oxford Nanopore Sequencing". BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.02.09.579537. [PMID: 38405857 PMCID: PMC10888933 DOI: 10.1101/2024.02.09.579537] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/27/2024]
Abstract
Multiple Displacement Amplification (MDA) outperforms conventional PCR in long fragment and whole genome amplification which makes it attractive to couple with long-read sequencing of samples with limited quantities of DNA to obtain improved genome assemblies. Here, we explore the efficacy and limits of MDA for genome sequence assembly using Oxford Nanopore Technologies (ONT) rapid library preparations and minION sequencing. We successfully generated almost complete genome sequences for all organisms examined, including Cryptosporidium meleagridis, Staphylococcus aureus, Enterococcus faecium, and Escherichia coli, with the ability to generate high-quality data from samples starting with only 0.025 ng of total DNA. Controlled sheared DNA samples exhibited a distinct pattern of size-increase after MDA, which may be associated with the amplification of long, low-abundance fragments present in the assay, as well as generating concatemeric sequences during amplification. To address concatemers, we developed a computational pipeline (CADECT: Concatemer Detection Tool) to identify and remove putative concatemeric sequences. This study highlights the efficacy of MDA in generating high-quality genome assemblies from limited amounts of input DNA. Also, the CADECT pipeline effectively mitigated the impact of concatemeric sequences, enabling the assembly of contiguous sequences even in cases where the input genomic DNA was degraded. These results have significant implications for the study of organisms that are challenging to culture in vitro, such as Cryptosporidium, and for expediting critical results in clinical settings with limited quantities of available genomic DNA.
Collapse
|
5
|
Genomic and virulence analysis of in vitro cultured Cryptosporidium parvum. PLoS Pathog 2024; 20:e1011992. [PMID: 38416794 PMCID: PMC10927135 DOI: 10.1371/journal.ppat.1011992] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2023] [Revised: 03/11/2024] [Accepted: 01/22/2024] [Indexed: 03/01/2024] Open
Abstract
Recent advances in the in vitro cultivation of Cryptosporidium parvum using hollow fiber bioreactor technology (HFB) have permitted continuous growth of parasites that complete all life cycle stages. The method provides access to all stages of the parasite and provides a method for non-animal production of oocysts for use in clinical trials. Here we examined the effect of long-term (>20 months) in vitro culture on virulence-factors, genome conservation, and in vivo pathogenicity of the host by in vitro cultured parasites. We find low-level sequence variation that is consistent with that observed in calf-passaged parasites. Further using a calf model infection, oocysts obtained from the HFB caused diarrhea of the same volume, duration and oocyst shedding intensity as in vivo passaged parasites.
Collapse
|
6
|
VEuPathDB: the eukaryotic pathogen, vector and host bioinformatics resource center in 2023. Nucleic Acids Res 2024; 52:D808-D816. [PMID: 37953350 PMCID: PMC10767879 DOI: 10.1093/nar/gkad1003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Revised: 10/09/2023] [Accepted: 10/19/2023] [Indexed: 11/14/2023] Open
Abstract
The Eukaryotic Pathogen, Vector and Host Informatics Resource (VEuPathDB, https://veupathdb.org) is a Bioinformatics Resource Center funded by the National Institutes of Health with additional funding from the Wellcome Trust. VEuPathDB supports >600 organisms that comprise invertebrate vectors, eukaryotic pathogens (protists and fungi) and relevant free-living or non-pathogenic species or hosts. Since 2004, VEuPathDB has analyzed omics data from the public domain using contemporary bioinformatic workflows, including orthology predictions via OrthoMCL, and integrated the analysis results with analysis tools, visualizations, and advanced search capabilities. The unique data mining platform coupled with >3000 pre-analyzed data sets facilitates the exploration of pertinent omics data in support of hypothesis driven research. Comparisons are easily made across data sets, data types and organisms. A Galaxy workspace offers the opportunity for the analysis of private large-scale datasets and for porting to VEuPathDB for comparisons with integrated data. The MapVEu tool provides a platform for exploration of spatially resolved data such as vector surveillance and insecticide resistance monitoring. To address the growing body of omics data and advances in laboratory techniques, VEuPathDB has added several new data types, searches and features, improved the Galaxy workspace environment, redesigned the MapVEu interface and updated the infrastructure to accommodate these changes.
Collapse
|
7
|
Genetic crosses within and between species of Cryptosporidium. Proc Natl Acad Sci U S A 2024; 121:e2313210120. [PMID: 38147547 PMCID: PMC10769859 DOI: 10.1073/pnas.2313210120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2023] [Accepted: 11/12/2023] [Indexed: 12/28/2023] Open
Abstract
Parasites and their hosts are engaged in reciprocal coevolution that balances competing mechanisms of virulence, resistance, and evasion. This often leads to host specificity, but genomic reassortment between different strains can enable parasites to jump host barriers and conquer new niches. In the apicomplexan parasite Cryptosporidium, genetic exchange has been hypothesized to play a prominent role in adaptation to humans. The sexual lifecycle of the parasite provides a potential mechanism for such exchange; however, the boundaries of Cryptosporidium sex are currently undefined. To explore this experimentally, we established a model for genetic crosses. Drug resistance was engineered using a mutated phenylalanyl tRNA synthetase gene and marking strains with this and the previously used Neo transgene enabled selection of recombinant progeny. This is highly efficient, and genomic recombination is evident and can be continuously monitored in real time by drug resistance, flow cytometry, and PCR mapping. Using this approach, multiple loci can now be modified with ease. We demonstrate that essential genes can be ablated by crossing a Cre recombinase driver strain with floxed strains. We further find that genetic crosses are also feasible between species. Crossing Cryptosporidium parvum, a parasite of cattle and humans, and Cryptosporidium tyzzeri a mouse parasite resulted in progeny with a recombinant genome derived from both species that continues to vigorously replicate sexually. These experiments have important fundamental and translational implications for the evolution of Cryptosporidium and open the door to reverse- and forward-genetic analysis of parasite biology and host specificity.
Collapse
|
8
|
Massive invasion of organellar DNA drives nuclear genome evolution in Toxoplasma. Proc Natl Acad Sci U S A 2023; 120:e2308569120. [PMID: 37917792 PMCID: PMC10636329 DOI: 10.1073/pnas.2308569120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2023] [Accepted: 09/26/2023] [Indexed: 11/04/2023] Open
Abstract
Toxoplasma gondii is a zoonotic protist pathogen that infects up to one third of the human population. This apicomplexan parasite contains three genome sequences: nuclear (65 Mb); plastid organellar, ptDNA (35 kb); and mitochondrial organellar, mtDNA (5.9 kb of non-repetitive sequence). We find that the nuclear genome contains a significant amount of NUMTs (nuclear integrants of mitochondrial DNA) and NUPTs (nuclear integrants of plastid DNA) that are continuously acquired and represent a significant source of intraspecific genetic variation. NUOT (nuclear DNA of organellar origin) accretion has generated 1.6% of the extant T. gondii ME49 nuclear genome-the highest fraction ever reported in any organism. NUOTs are primarily found in organisms that retain the non-homologous end-joining repair pathway. Significant movement of organellar DNA was experimentally captured via amplicon sequencing of a CRISPR-induced double-strand break in non-homologous end-joining repair competent, but not ku80 mutant, Toxoplasma parasites. Comparisons with Neospora caninum, a species that diverged from Toxoplasma ~28 mya, revealed that the movement and fixation of five NUMTs predates the split of the two genera. This unexpected level of NUMT conservation suggests evolutionary constraint for cellular function. Most NUMT insertions reside within (60%) or nearby genes (23% within 1.5 kb), and reporter assays indicate that some NUMTs have the ability to function as cis-regulatory elements modulating gene expression. Together, these findings portray a role for organellar sequence insertion in dynamically shaping the genomic architecture and likely contributing to adaptation and phenotypic changes in this important human pathogen.
Collapse
|
9
|
Genetic crosses within and between species of Cryptosporidium. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.04.551960. [PMID: 37577700 PMCID: PMC10418217 DOI: 10.1101/2023.08.04.551960] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/15/2023]
Abstract
Parasites and their hosts are engaged in rapid coevolution that balances competing mechanisms of virulence, resistance, and evasion. This often leads to host specificity, but genomic reassortment between different strains can enable parasites to jump host barriers and conquer new niches. In the apicomplexan parasite Cryptosporidium genetic exchange has been hypothesized to play a prominent role in adaptation to humans. The sexual lifecycle of the parasite provides a potential mechanism for such exchange; however, the boundaries of Cryptosporidium sex are currently undefined. To explore this experimentally, we established a model for genetic crosses. Drug resistance was engineered using a mutated phenylalanyl tRNA synthetase gene and marking strains with this and the previously used Neo transgene enabled selection of recombinant progeny. This is highly efficient, and genomic recombination is evident and can be continuously monitored in real time by drug resistance, flow cytometry, and PCR mapping. Using this approach multiple loci can now be modified with ease. We demonstrate that essential genes can be ablated by crossing a Cre recombinase driver strain with floxed strains. We further find that genetic crosses are also feasible between species. Crossing C. parvum, a parasite of cattle and humans, and C. tyzzeri a mouse parasite resulted in progeny with a recombinant genome derived from both species that continues to vigorously replicate sexually. These experiments have important fundamental and translational implications for the evolution of Cryptosporidium and open the door to reverse- and forward- genetic analysis of parasite biology and host specificity.
Collapse
|
10
|
New T2T assembly of Cryptosporidium parvum IOWA annotated with reference genome gene identifiers. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.06.13.544219. [PMID: 37398184 PMCID: PMC10312629 DOI: 10.1101/2023.06.13.544219] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/04/2023]
Abstract
Cryptosporidium parvum is a significant pathogen causing gastrointestinal infections in humans and animals, that is spread through the ingestion of contaminated food and water. Despite its global impact on public health, generating a C. parvum genome sequence has always been challenging due to a lack of in vitro cultivation systems and challenging sub-telomeric gene families. A gapless telomere to telomere genome assembly has been created for Cryptosporidium parvum IOWA obtained from Bunch Grass Farms, named here as CpBGF. There are 8 chromosomes that total 9,259,183 bp. The new hybrid assembly which was generated with Illumina and Oxford Nanopore resolves complex sub-telomeric regions of chromosomes 1, 7 and 8. To facilitate ease of use and consistency with the literature, whenever possible, chromosomes have been oriented and genes in this annotation have been given the same gene IDs used in the current reference genome sequence generated in 2004. The annotation of this assembly utilized considerable RNA expression evidence, thus, untranslated regions, long noncoding RNAs and antisense RNAs are annotated. The CpBGF genome assembly serves as a valuable resource for understanding the biology, pathogenesis, and transmission of C. parvum, and it facilitates the development of diagnostics, drugs, and vaccines against cryptosporidiosis.
Collapse
|
11
|
Massive invasion of organellar DNA drives nuclear genome evolution in Toxoplasma. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.22.539837. [PMID: 37293002 PMCID: PMC10245829 DOI: 10.1101/2023.05.22.539837] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Toxoplasma gondii is a zoonotic protist pathogen that infects up to 1/3 of the human population. This apicomplexan parasite contains three genome sequences: nuclear (63 Mb); plastid organellar, ptDNA (35 kb); and mitochondrial organellar, mtDNA (5.9 kb of non-repetitive sequence). We find that the nuclear genome contains a significant amount of NUMTs (nuclear DNA of mitochondrial origin) and NUPTs (nuclear DNA of plastid origin) that are continuously acquired and represent a significant source of intraspecific genetic variation. NUOT (nuclear DNA of organellar origin) accretion has generated 1.6% of the extant T. gondii ME49 nuclear genome; the highest fraction ever reported in any organism. NUOTs are primarily found in organisms that retain the non-homologous end-joining repair pathway. Significant movement of organellar DNA was experimentally captured via amplicon sequencing of a CRISPR-induced double-strand break in non-homologous end-joining repair competent, but not ku80 mutant, Toxoplasma parasites. Comparisons with Neospora caninum, a species that diverged from Toxoplasma ~28 MY ago, revealed that the movement and fixation of 5 NUMTs predates the split of the two genera. This unexpected level of NUMT conservation suggests evolutionary constraint for cellular function. Most NUMT insertions reside within (60%) or nearby genes (23% within 1.5 kb) and reporter assays indicate that some NUMTs have the ability to function as cis-regulatory elements modulating gene expression. Together these findings portray a role for organellar sequence insertion in dynamically shaping the genomic architecture and likely contributing to adaptation and phenotypic changes in this important human pathogen.
Collapse
|
12
|
Malaria disrupts the rhesus macaque gut microbiome. Front Cell Infect Microbiol 2023; 12:1058926. [PMID: 36710962 PMCID: PMC9880479 DOI: 10.3389/fcimb.2022.1058926] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2022] [Accepted: 12/12/2022] [Indexed: 01/14/2023] Open
Abstract
Previous studies have suggested that a relationship exists between severity and transmissibility of malaria and variations in the gut microbiome, yet only limited information exists on the temporal dynamics of the gut microbial community during a malarial infection. Here, using a rhesus macaque model of relapsing malaria, we investigate how malaria affects the gut microbiome. In this study, we performed 16S sequencing on DNA isolated from rectal swabs of rhesus macaques over the course of an experimental malarial infection with Plasmodium cynomolgi and analyzed gut bacterial taxa abundance across primary and relapsing infections. We also performed metabolomics on blood plasma from the animals at the same timepoints and investigated changes in metabolic pathways over time. Members of Proteobacteria (family Helicobacteraceae) increased dramatically in relative abundance in the animal's gut microbiome during peak infection while Firmicutes (family Lactobacillaceae and Ruminococcaceae), Bacteroidetes (family Prevotellaceae) and Spirochaetes amongst others decreased compared to baseline levels. Alpha diversity metrics indicated decreased microbiome diversity at the peak of parasitemia, followed by restoration of diversity post-treatment. Comparison with healthy subjects suggested that the rectal microbiome during acute malaria is enriched with commensal bacteria typically found in the healthy animal's mucosa. Significant changes in the tryptophan-kynurenine immunomodulatory pathway were detected at peak infection with P. cynomolgi, a finding that has been described previously in the context of P. vivax infections in humans. During relapses, which have been shown to be associated with less inflammation and clinical severity, we observed minimal disruption to the gut microbiome, despite parasites being present. Altogether, these data suggest that the metabolic shift occurring during acute infection is associated with a concomitant shift in the gut microbiome, which is reversed post-treatment.
Collapse
|
13
|
TriTrypDB: An integrated functional genomics resource for kinetoplastida. PLoS Negl Trop Dis 2023; 17:e0011058. [PMID: 36656904 PMCID: PMC9888696 DOI: 10.1371/journal.pntd.0011058] [Citation(s) in RCA: 16] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2022] [Revised: 01/31/2023] [Accepted: 12/23/2022] [Indexed: 01/20/2023] Open
Abstract
Parasitic diseases caused by kinetoplastid parasites are a burden to public health throughout tropical and subtropical regions of the world. TriTrypDB (https://tritrypdb.org) is a free online resource for data mining of genomic and functional data from these kinetoplastid parasites and is part of the VEuPathDB Bioinformatics Resource Center (https://veupathdb.org). As of release 59, TriTrypDB hosts 83 kinetoplastid genomes, nine of which, including Trypanosoma brucei brucei TREU927, Trypanosoma cruzi CL Brener and Leishmania major Friedlin, undergo manual curation by integrating information from scientific publications, high-throughput assays and user submitted comments. TriTrypDB also integrates transcriptomic, proteomic, epigenomic, population-level and isolate data, functional information from genome-wide RNAi knock-down and fluorescent tagging, and results from automated bioinformatics analysis pipelines. TriTrypDB offers a user-friendly web interface embedded with a genome browser, search strategy system and bioinformatics tools to support custom in silico experiments that leverage integrated data. A Galaxy workspace enables users to analyze their private data (e.g., RNA-sequencing, variant calling, etc.) and explore their results privately in the context of publicly available information in the database. The recent addition of an annotation platform based on Apollo enables users to provide both functional and structural changes that will appear as 'community annotations' immediately and, pending curatorial review, will be integrated into the official genome annotation.
Collapse
|
14
|
MaHPIC malaria systems biology data from Plasmodium cynomolgi sporozoite longitudinal infections in macaques. Sci Data 2022; 9:722. [PMID: 36433985 PMCID: PMC9700667 DOI: 10.1038/s41597-022-01755-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2022] [Accepted: 10/10/2022] [Indexed: 11/27/2022] Open
Abstract
Plasmodium cynomolgi causes zoonotic malarial infections in Southeast Asia and this parasite species is important as a model for Plasmodium vivax and Plasmodium ovale. Each of these species produces hypnozoites in the liver, which can cause relapsing infections in the blood. Here we present methods and data generated from iterative longitudinal systems biology infection experiments designed and performed by the Malaria Host-Pathogen Interaction Center (MaHPIC) to delve deeper into the biology, pathogenesis, and immune responses of P. cynomolgi in the Macaca mulatta host. Infections were initiated by sporozoite inoculation. Blood and bone marrow samples were collected at defined timepoints for biological and computational experiments and integrative analyses revolving around primary illness, relapse illness, and subsequent disease and immune response patterns. Parasitological, clinical, haematological, immune response, and -omic datasets (transcriptomics, proteomics, metabolomics, and lipidomics) including metadata and computational results have been deposited in public repositories. The scope and depth of these datasets are unprecedented in studies of malaria, and they are projected to be a F.A.I.R., reliable data resource for decades.
Collapse
|
15
|
Plasmodium knowlesi Cytoadhesion Involves SICA Variant Proteins. Front Cell Infect Microbiol 2022; 12:888496. [PMID: 35811680 PMCID: PMC9260704 DOI: 10.3389/fcimb.2022.888496] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2022] [Accepted: 04/22/2022] [Indexed: 11/13/2022] Open
Abstract
Plasmodium knowlesi poses a health threat throughout Southeast Asian communities and currently causes most cases of malaria in Malaysia. This zoonotic parasite species has been studied in Macaca mulatta (rhesus monkeys) as a model for severe malarial infections, chronicity, and antigenic variation. The phenomenon of Plasmodium antigenic variation was first recognized during rhesus monkey infections. Plasmodium-encoded variant proteins were first discovered in this species and found to be expressed at the surface of infected erythrocytes, and then named the Schizont-Infected Cell Agglutination (SICA) antigens. SICA expression was shown to be spleen dependent, as SICA expression is lost after P. knowlesi is passaged in splenectomized rhesus. Here we present data from longitudinal P. knowlesi infections in rhesus with the most comprehensive analysis to date of clinical parameters and infected red blood cell sequestration in the vasculature of tissues from 22 organs. Based on the histopathological analysis of 22 tissue types from 11 rhesus monkeys, we show a comparative distribution of parasitized erythrocytes and the degree of margination of the infected erythrocytes with the endothelium. Interestingly, there was a significantly higher burden of parasites in the gastrointestinal tissues, and extensive margination of the parasites along the endothelium, which may help explain gastrointestinal symptoms frequently reported by patients with P. knowlesi malarial infections. Moreover, this margination was not observed in splenectomized rhesus that were infected with parasites not expressing the SICA proteins. This work provides data that directly supports the view that a subpopulation of P. knowlesi parasites cytoadheres and sequesters, likely via SICA variant antigens acting as ligands. This process is akin to the cytoadhesive function of the related variant antigen proteins, namely Erythrocyte Membrane Protein-1, expressed by Plasmodium falciparum.
Collapse
|
16
|
Small and intermediate size structural RNAs in the unicellular parasite Cryptosporidium parvum as revealed by sRNA-seq and comparative genomics. Microb Genom 2022; 8. [PMID: 35536609 PMCID: PMC9465071 DOI: 10.1099/mgen.0.000821] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022] Open
Abstract
Small and intermediate-size noncoding RNAs (sRNAs and is-ncRNAs) have been shown to play important regulatory roles in the development of several eukaryotic organisms. However, they have not been thoroughly explored in Cryptosporidium parvum, an obligate zoonotic protist parasite responsible for the diarrhoeal disease cryptosporidiosis. Using Illumina sequencing of a small RNA library, a systematic identification of novel small and is-ncRNAs was performed in C. parvum excysted sporozoites. A total of 79 novel is-ncRNA candidates, including antisense, intergenic and intronic is-ncRNAs, were identified, including 7 new small nucleolar RNAs (snoRNAs). Expression of select novel is-ncRNAs was confirmed by RT-PCR. Phylogenetic conservation was analysed using covariance models (CMs) in related Cryptosporidium and apicomplexan parasite genome sequences. A potential new type of small ncRNA derived from tRNA fragments was observed. Overall, a deep profiling analysis of novel is-ncRNAs in C. parvum and related species revealed structural features and conservation of these novel is-ncRNAs. Covariance models can be used to detect is-ncRNA genes in other closely related parasites. These findings provide important new sequences for additional functional characterization of novel is-ncRNAs in the protist pathogen C. parvum.
Collapse
|
17
|
VEuPathDB: the eukaryotic pathogen, vector and host bioinformatics resource center. Nucleic Acids Res 2022; 50:D898-D911. [PMID: 34718728 PMCID: PMC8728164 DOI: 10.1093/nar/gkab929] [Citation(s) in RCA: 186] [Impact Index Per Article: 93.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2020] [Revised: 09/21/2021] [Accepted: 10/04/2021] [Indexed: 11/13/2022] Open
Abstract
The Eukaryotic Pathogen, Vector and Host Informatics Resource (VEuPathDB, https://veupathdb.org) represents the 2019 merger of VectorBase with the EuPathDB projects. As a Bioinformatics Resource Center funded by the National Institutes of Health, with additional support from the Welllcome Trust, VEuPathDB supports >500 organisms comprising invertebrate vectors, eukaryotic pathogens (protists and fungi) and relevant free-living or non-pathogenic species or hosts. Designed to empower researchers with access to Omics data and bioinformatic analyses, VEuPathDB projects integrate >1700 pre-analysed datasets (and associated metadata) with advanced search capabilities, visualizations, and analysis tools in a graphic interface. Diverse data types are analysed with standardized workflows including an in-house OrthoMCL algorithm for predicting orthology. Comparisons are easily made across datasets, data types and organisms in this unique data mining platform. A new site-wide search facilitates access for both experienced and novice users. Upgraded infrastructure and workflows support numerous updates to the web interface, tools, searches and strategies, and Galaxy workspace where users can privately analyse their own data. Forthcoming upgrades include cloud-ready application architecture, expanded support for the Galaxy workspace, tools for interrogating host-pathogen interactions, and improved interactions with affiliated databases (ClinEpiDB, MicrobiomeDB) and other scientific resources, and increased interoperability with the Bacterial & Viral BRC.
Collapse
|
18
|
Abstract
BACKGROUND Kra monkeys (Macaca fascicularis), a natural host of Plasmodium knowlesi, control parasitaemia caused by this parasite species and escape death without treatment. Knowledge of the disease progression and resilience in kra monkeys will aid the effective use of this species to study mechanisms of resilience to malaria. This longitudinal study aimed to define clinical, physiological and pathological changes in kra monkeys infected with P. knowlesi, which could explain their resilient phenotype. METHODS Kra monkeys (n = 15, male, young adults) were infected intravenously with cryopreserved P. knowlesi sporozoites and the resulting parasitaemias were monitored daily. Complete blood counts, reticulocyte counts, blood chemistry and physiological telemetry data (n = 7) were acquired as described prior to infection to establish baseline values and then daily after inoculation for up to 50 days. Bone marrow aspirates, plasma samples, and 22 tissue samples were collected at specific time points to evaluate longitudinal clinical, physiological and pathological effects of P. knowlesi infections during acute and chronic infections. RESULTS As expected, the kra monkeys controlled acute infections and remained with low-level, persistent parasitaemias without anti-malarial intervention. Unexpectedly, early in the infection, fevers developed, which ultimately returned to baseline, as well as mild to moderate thrombocytopenia, and moderate to severe anaemia. Mathematical modelling and the reticulocyte production index indicated that the anaemia was largely due to the removal of uninfected erythrocytes and not impaired production of erythrocytes. Mild tissue damage was observed, and tissue parasite load was associated with tissue damage even though parasite accumulation in the tissues was generally low. CONCLUSIONS Kra monkeys experimentally infected with P. knowlesi sporozoites presented with multiple clinical signs of malaria that varied in severity among individuals. Overall, the animals shared common mechanisms of resilience characterized by controlling parasitaemia 3-5 days after patency, and controlling fever, coupled with physiological and bone marrow responses to compensate for anaemia. Together, these responses likely minimized tissue damage while supporting the establishment of chronic infections, which may be important for transmission in natural endemic settings. These results provide new foundational insights into malaria pathogenesis and resilience in kra monkeys, which may improve understanding of human infections.
Collapse
|
19
|
Challenges for Cryptosporidium Population Studies. Genes (Basel) 2021; 12:894. [PMID: 34200631 PMCID: PMC8229070 DOI: 10.3390/genes12060894] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2021] [Revised: 05/28/2021] [Accepted: 06/04/2021] [Indexed: 12/31/2022] Open
Abstract
Cryptosporidiosis is ranked sixth in the list of the most important food-borne parasites globally, and it is an important contributor to mortality in infants and the immunosuppressed. Recently, the number of genome sequences available for this parasite has increased drastically. The majority of the sequences are derived from population studies of Cryptosporidium parvum and Cryptosporidium hominis, the most important species causing disease in humans. Work with this parasite is challenging since it lacks an optimal, prolonged, in vitro culture system, which accurately reproduces the in vivo life cycle. This obstacle makes the cloning of isolates nearly impossible. Thus, patient isolates that are sequenced represent a population or, at times, mixed infections. Oocysts, the lifecycle stage currently used for sequencing, must be considered a population even if the sequence is derived from single-cell sequencing of a single oocyst because each oocyst contains four haploid meiotic progeny (sporozoites). Additionally, the community does not yet have a set of universal markers for strain typing that are distributed across all chromosomes. These variables pose challenges for population studies and require careful analyses to avoid biased interpretation. This review presents an overview of existing population studies, challenges, and potential solutions to facilitate future population analyses.
Collapse
|
20
|
A novel fragmented mitochondrial genome in the protist pathogen Toxoplasma gondii and related tissue coccidia. Genome Res 2021; 31:852-865. [PMID: 33906963 PMCID: PMC8092004 DOI: 10.1101/gr.266403.120] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2020] [Accepted: 03/09/2021] [Indexed: 12/19/2022]
Abstract
Mitochondrial genome content and structure vary widely across the eukaryotic tree of life, with protists displaying extreme examples. Apicomplexan and dinoflagellate protists have evolved highly reduced mitochondrial genome sequences, mtDNA, consisting of only three cytochrome genes and fragmented rRNA genes. Here, we report the independent evolution of fragmented cytochrome genes in Toxoplasma and related tissue coccidia and evolution of a novel genome architecture consisting minimally of 21 sequence blocks (SBs) totaling 5.9 kb that exist as nonrandom concatemers. Single-molecule Nanopore reads consisting entirely of SBs ranging from 0.1 to 23.6 kb reveal both whole and fragmented cytochrome genes. Full-length cytochrome transcripts including a divergent coxIII are detected. The topology of the mitochondrial genome remains an enigma. Analysis of a cob point mutation reveals that homoplasmy of SBs is maintained. Tissue coccidia are important pathogens of man and animals, and the mitochondrion represents an important therapeutic target. The mtDNA sequence has been elucidated, but a definitive genome architecture remains elusive.
Collapse
|
21
|
Analysis of Long Non-Coding RNA in Cryptosporidium parvum Reveals Significant Stage-Specific Antisense Transcription. Front Cell Infect Microbiol 2021; 10:608298. [PMID: 33520737 PMCID: PMC7840661 DOI: 10.3389/fcimb.2020.608298] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2020] [Accepted: 11/26/2020] [Indexed: 12/13/2022] Open
Abstract
Cryptosporidium is a protist parasite that has been identified as the second leading cause of moderate to severe diarrhea in children younger than two and a significant cause of mortality worldwide. Cryptosporidium has a complex, obligate, intracellular but extra cytoplasmic lifecycle in a single host. How genes are regulated in this parasite remains largely unknown. Long non-coding RNAs (lncRNAs) play critical regulatory roles, including gene expression across a broad range of organisms. Cryptosporidium lncRNAs have been reported to enter the host cell nucleus and affect the host response. However, no systematic study of lncRNAs in Cryptosporidium has been conducted to identify additional lncRNAs. In this study, we analyzed a C. parvum in vitro strand-specific RNA-seq developmental time series covering both asexual and sexual stages to identify lncRNAs associated with parasite development. In total, we identified 396 novel lncRNAs, mostly antisense, with 86% being differentially expressed. Surprisingly, nearly 10% of annotated mRNAs have an antisense transcript. lncRNAs occur most often at the 3' end of their corresponding sense mRNA. Putative lncRNA regulatory regions were identified and many appear to encode bidirectional promoters. A positive correlation between lncRNA and upstream mRNA expression was observed. Evolutionary conservation and expression of lncRNA candidates was observed between C. parvum, C. hominis and C. baileyi. Ten C. parvum protein-encoding genes with antisense transcripts have P. falciparum orthologs that also have antisense transcripts. Three C. parvum lncRNAs with exceptional properties (e.g., intron splicing) were experimentally validated using RT-PCR and RT-qPCR. This initial characterization of the C. parvum non-coding transcriptome facilitates further investigations into the roles of lncRNAs in parasite development and host-pathogen interactions.
Collapse
|
22
|
Strain-specific genome evolution in Trypanosoma cruzi, the agent of Chagas disease. PLoS Pathog 2021; 17:e1009254. [PMID: 33508020 PMCID: PMC7872254 DOI: 10.1371/journal.ppat.1009254] [Citation(s) in RCA: 41] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2020] [Revised: 02/09/2021] [Accepted: 12/22/2020] [Indexed: 12/16/2022] Open
Abstract
The protozoan Trypanosoma cruzi almost invariably establishes life-long infections in humans and other mammals, despite the development of potent host immune responses that constrain parasite numbers. The consistent, decades-long persistence of T. cruzi in human hosts arises at least in part from the remarkable level of genetic diversity in multiple families of genes encoding the primary target antigens of anti-parasite immune responses. However, the highly repetitive nature of the genome-largely a result of these same extensive families of genes-have prevented a full understanding of the extent of gene diversity and its maintenance in T. cruzi. In this study, we have combined long-read sequencing and proximity ligation mapping to generate very high-quality assemblies of two T. cruzi strains representing the apparent ancestral lineages of the species. These assemblies reveal not only the full repertoire of the members of large gene families in the two strains, demonstrating extreme diversity within and between isolates, but also provide evidence of the processes that generate and maintain that diversity, including extensive gene amplification, dispersion of copies throughout the genome and diversification via recombination and in situ mutations. Gene amplification events also yield significant copy number variations in a substantial number of genes presumably not required for or involved in immune evasion, thus forming a second level of strain-dependent variation in this species. The extreme genome flexibility evident in T. cruzi also appears to create unique challenges with respect to preserving core genome functions and gene expression that sets this species apart from related kinetoplastids.
Collapse
|
23
|
Resistance to Some But Not Other Dimeric Lindenane Sesquiterpenoid Esters Is Mediated by Mutations in a Plasmodium falciparum Esterase. ACS Infect Dis 2020; 6:2994-3003. [PMID: 32970404 PMCID: PMC11075783 DOI: 10.1021/acsinfecdis.0c00487] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Unique lindenane sesquiterpenoid dimers from Chloranthecae spp. were recently identified with promising in vitro antiplasmodial activity and potentially novel mechanisms of action. To gain mechanistic insights to this new class of natural products, in vitro selection of Plasmodium falciparum resistance to the most active antiplasmodial compound, chlorajaponilide C, was explored. In all selected resistant clones, the half-maximal effective concentration (EC50) of chlorajaponilide C increased >250-fold, and whole genome sequencing revealed mutations in the recently discovered P. falciparum prodrug activation and resistance esterase (PfPARE). Chlorajaponilide C was highly potent (mean EC50 = 1.6 nM, n = 34) against fresh Ugandan P. falciparum isolates. The analysis of the structure-resistance relationships revealed that in vitro potency of a subset of lindenane sesquiterpenoid dimers was not mediated by PfPARE mutations. Thus, chlorajaponilide C, but not some related compounds, required parasite esterase activity for in vitro potency, and those compounds serve as the foundation for development of potent and selective antimalarials.
Collapse
|
24
|
Noncoding RNAs in Apicomplexan Parasites: An Update. Trends Parasitol 2020; 36:835-849. [DOI: 10.1016/j.pt.2020.07.006] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2020] [Revised: 06/26/2020] [Accepted: 07/18/2020] [Indexed: 12/16/2022]
|
25
|
ClinEpiDB: an open-access clinical epidemiology database resource encouraging online exploration of complex studies. Gates Open Res 2020; 3:1661. [PMID: 32047873 PMCID: PMC6993508 DOI: 10.12688/gatesopenres.13087.2] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/25/2020] [Indexed: 11/20/2022] Open
Abstract
The concept of open data has been gaining traction as a mechanism to increase data use, ensure that data are preserved over time, and accelerate discovery. While epidemiology data sets are increasingly deposited in databases and repositories, barriers to access still remain.
ClinEpiDB was constructed as an open-access online resource for clinical and epidemiologic studies by leveraging the extensive web toolkit and infrastructure of the Eukaryotic Pathogen Database Resources (EuPathDB; a collection of databases covering 170+ eukaryotic pathogens, relevant related species, and select hosts) combined with a unified semantic web framework. Here we present an intuitive point-and-click website that allows users to visualize and subset data directly in the ClinEpiDB browser and immediately explore potential associations. Supporting study documentation aids contextualization, and data can be downloaded for advanced analyses. By facilitating access and interrogation of high-quality, large-scale data sets, ClinEpiDB aims to spur collaboration and discovery that improves global health.
Collapse
|
26
|
ClinEpiDB: an open-access clinical epidemiology database resource encouraging online exploration of complex studies. Gates Open Res 2020; 3:1661. [PMID: 32047873 PMCID: PMC6993508 DOI: 10.12688/gatesopenres.13087.1] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/25/2020] [Indexed: 01/25/2024] Open
Abstract
The concept of open data has been gaining traction as a mechanism to increase data use, ensure that data are preserved over time, and accelerate discovery. While epidemiology data sets are increasingly deposited in databases and repositories, barriers to access still remain. ClinEpiDB was constructed as an open-access online resource for clinical and epidemiologic studies by leveraging the extensive web toolkit and infrastructure of the Eukaryotic Pathogen Database Resources (EuPathDB; a collection of databases covering 170+ eukaryotic pathogens, relevant related species, and select hosts) combined with a unified semantic web framework. Here we present an intuitive point-and-click website that allows users to visualize and subset data directly in the ClinEpiDB browser and immediately explore potential associations. Supporting study documentation aids contextualization, and data can be downloaded for advanced analyses. By facilitating access and interrogation of high-quality, large-scale data sets, ClinEpiDB aims to spur collaboration and discovery that improves global health.
Collapse
|
27
|
Update on Cryptosporidium spp.: highlights from the Seventh International Giardia and Cryptosporidium Conference. ACTA ACUST UNITED AC 2020; 27:14. [PMID: 32167464 PMCID: PMC7069357 DOI: 10.1051/parasite/2020011] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2020] [Accepted: 03/02/2020] [Indexed: 01/23/2023]
Abstract
While cryptosporidiosis is recognized as being among the most common causes of human parasitic diarrhea in the world, there is currently limited knowledge on Cryptosporidium infection mechanisms, incomplete codification of diagnostic methods, and a need for additional therapeutic options. In response, the Seventh International Giardia and Cryptosporidium Conference (IGCC 2019) was hosted from 23 to 26 June 2019, at the Rouen Normandy University, France. This trusted event brought together an international delegation of researchers to synthesize recent advances and identify key research questions and knowledge gaps. The program of the interdisciplinary conference included all aspects of host-parasite relationships from basic research to applications to human and veterinary medicine, and environmental issues associated with waterborne parasites and their epidemiological consequences. In relation to Cryptosporidium and cryptosporidiosis, the primary research areas for which novel findings and the most impressive communications were presented and discussed included: Cryptosporidium in environmental waters, seafood, and fresh produce; Animal epidemiology; Human cryptosporidiosis and epidemiology; Genomes and genomic evolution encompassing: Comparative genomics of Cryptosporidium spp., Genomic insights into biology, Acquiring and utilizing genome sequences, Genetic manipulation; Host-parasite interaction (immunology, microbiome); and Diagnosis and treatment. High quality presentations discussed at the conference reflected decisive progress and identified new opportunities that will engage investigators and funding agencies to spur future research in a “one health” approach to improve basic knowledge and the clinical and public health management of zoonotic cryptosporidiosis.
Collapse
|
28
|
Abstract
Cryptosporidium has historically been a difficult organism to work with, and molecular genomic data for this important pathogen have typically lagged behind other prominent protist pathogens. CryptoDB ( http://cryptodb.org/ ) was launched in 2004 following the appearance of draft genome sequences for both C. parvum and C. hominis. CryptoDB merged with the EuPathDB Bioinformatics Resource Center family of databases ( https://eupathdb.org ) and has been maintained and updated regularly since its establishment. These resources are freely available, are web-based, and permit users to analyze their own sequence data in the context of reference genome sequences in our user workspaces. Advances in technology have greatly facilitated Cryptosporidium research in the last several years greatly enhancing and extending the data and types of data available for this genus. Currently, 13 genome sequences are available for 9 species of Cryptosporidium as well as the distantly related Gregarina niphandrodes and two free-living alveolate outgroups of the Apicomplexa, Chromera velia and Vitrella brassicaformis. Recent years have seen several new genome sequences for both existing and new Cryptosporidium species as well as transcriptomics, proteomics, SNP, and isolate population surveys. This chapter introduces the extensive data mining and visualization capabilities of the EuPathDB software platform and introduces the data types and tools that are currently available for Cryptosporidium. Key features are demonstrated with Cryptosporidium-relevant examples and explanations.
Collapse
|
29
|
Correction to: Accessing Cryptosporidium Omic and Isolate Data via CryptoDB.org. Methods Mol Biol 2020; 2052:C1. [PMID: 32124401 DOI: 10.1007/978-1-4939-9748-0_22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Correction to: Chapter 10 in: Susanne Warrenfeltz, Jessica C. Kissinger, and On Behalf of the EuPathDB Team.
Collapse
|
30
|
Distinct amino acid and lipid perturbations characterize acute versus chronic malaria. JCI Insight 2019; 4:125156. [PMID: 31045574 DOI: 10.1172/jci.insight.125156] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2018] [Accepted: 04/02/2019] [Indexed: 12/21/2022] Open
Abstract
Chronic malaria is a major public health problem and significant challenge for disease eradication efforts. Despite its importance, the biological factors underpinning chronic malaria are not fully understood. Recent studies have shown that host metabolic state can influence malaria pathogenesis and transmission, but its role in chronicity is not known. Here, with the goal of identifying distinct modifications in the metabolite profiles of acute versus chronic malaria, metabolomics was performed on plasma from Plasmodium-infected humans and nonhuman primates with a range of parasitemias and clinical signs. In rhesus macaques infected with Plasmodium coatneyi, significant alterations in amines, carnitines, and lipids were detected during a high parasitemic acute phase and many of these reverted to baseline levels once a low parasitemic chronic phase was established. Plasmodium gene expression, studied in parallel in the macaques, revealed transcriptional changes in amine, fatty acid, lipid and energy metabolism genes, as well as variant antigen genes. Furthermore, a common set of amines, carnitines, and lipids distinguished acute from chronic malaria in plasma from human Plasmodium falciparum cases. In summary, distinct host-parasite metabolic environments have been uncovered that characterize acute versus chronic malaria, providing insights into the underlying host-parasite biology of malaria disease progression.
Collapse
|
31
|
FungiDB: An Integrated Bioinformatic Resource for Fungi and Oomycetes. J Fungi (Basel) 2018; 4:jof4010039. [PMID: 30152809 PMCID: PMC5872342 DOI: 10.3390/jof4010039] [Citation(s) in RCA: 186] [Impact Index Per Article: 31.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2018] [Revised: 03/07/2018] [Accepted: 03/15/2018] [Indexed: 02/06/2023] Open
Abstract
FungiDB (fungidb.org) is a free online resource for data mining and functional genomics analysis for fungal and oomycete species. FungiDB is part of the Eukaryotic Pathogen Genomics Database Resource (EuPathDB, eupathdb.org) platform that integrates genomic, transcriptomic, proteomic, and phenotypic datasets, and other types of data for pathogenic and nonpathogenic, free-living and parasitic organisms. FungiDB is one of the largest EuPathDB databases containing nearly 100 genomes obtained from GenBank, Aspergillus Genome Database (AspGD), The Broad Institute, Joint Genome Institute (JGI), Ensembl, and other sources. FungiDB offers a user-friendly web interface with embedded bioinformatics tools that support custom in silico experiments that leverage FungiDB-integrated data. In addition, a Galaxy-based workspace enables users to generate custom pipelines for large-scale data analysis (e.g., RNA-Seq, variant calling, etc.). This review provides an introduction to the FungiDB resources and focuses on available features, tools, and queries and how they can be used to mine data across a diverse range of integrated FungiDB datasets and records.
Collapse
|
32
|
Assembly of highly repetitive genomes using short reads: the genome of discrete typing unit III Trypanosoma cruzi strain 231. Microb Genom 2018; 4. [PMID: 29442617 PMCID: PMC5989580 DOI: 10.1099/mgen.0.000156] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Next-generation sequencing (NGS) methods are low-cost high-throughput technologies that produce thousands to millions of sequence reads. Despite the high number of raw sequence reads, their short length, relative to Sanger, PacBio or Nanopore reads, complicates the assembly of genomic repeats. Many genome tools are available, but the assembly of highly repetitive genome sequences using only NGS short reads remains challenging. Genome assembly of organisms responsible for important neglected diseases such as Trypanosoma cruzi, the aetiological agent of Chagas disease, is known to be challenging because of their repetitive nature. Only three of six recognized discrete typing units (DTUs) of the parasite have their draft genomes published and therefore genome evolution analyses in the taxon are limited. In this study, we developed a computational workflow to assemble highly repetitive genomes via a combination of de novo and reference-based assembly strategies to better overcome the intrinsic limitations of each, based on Illumina reads. The highly repetitive genome of the human-infecting parasite T. cruzi 231 strain was used as a test subject. The combined-assembly approach shown in this study benefits from the reference-based assembly ability to resolve highly repetitive sequences and from the de novo capacity to assemble genome-specific regions, improving the quality of the assembly. The acceptable confidence obtained by analyzing our results showed that our combined approach is an attractive option to assemble highly repetitive genomes with NGS short reads. Phylogenomic analysis including the 231 strain, the first representative of DTU III whose genome was sequenced, was also performed and provides new insights into T. cruzi genome evolution.
Collapse
|
33
|
MicrobiomeDB: a systems biology platform for integrating, mining and analyzing microbiome experiments. Nucleic Acids Res 2018; 46:D684-D691. [PMID: 29106667 PMCID: PMC5753346 DOI: 10.1093/nar/gkx1027] [Citation(s) in RCA: 32] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2017] [Revised: 10/15/2017] [Accepted: 10/17/2017] [Indexed: 12/20/2022] Open
Abstract
MicrobiomeDB (http://microbiomeDB.org) is a data discovery and analysis platform that empowers researchers to fully leverage experimental variables to interrogate microbiome datasets. MicrobiomeDB was developed in collaboration with the Eukaryotic Pathogens Bioinformatics Resource Center (http://EuPathDB.org) and leverages the infrastructure and user interface of EuPathDB, which allows users to construct in silico experiments using an intuitive graphical 'strategy' approach. The current release of the database integrates microbial census data with sample details for nearly 14 000 samples originating from human, animal and environmental sources, including over 9000 samples from healthy human subjects in the Human Microbiome Project (http://portal.ihmpdcc.org/). Query results can be statistically analyzed and graphically visualized via interactive web applications launched directly in the browser, providing insight into microbial community diversity and allowing users to identify taxa associated with any experimental covariate.
Collapse
|
34
|
PacBio assembly of a Plasmodium knowlesi genome sequence with Hi-C correction and manual annotation of the SICAvar gene family. Parasitology 2018; 145:71-84. [PMID: 28720171 PMCID: PMC5798397 DOI: 10.1017/s0031182017001329] [Citation(s) in RCA: 31] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2017] [Revised: 06/09/2017] [Accepted: 06/20/2017] [Indexed: 12/20/2022]
Abstract
Plasmodium knowlesi has risen in importance as a zoonotic parasite that has been causing regular episodes of malaria throughout South East Asia. The P. knowlesi genome sequence generated in 2008 highlighted and confirmed many similarities and differences in Plasmodium species, including a global view of several multigene families, such as the large SICAvar multigene family encoding the variant antigens known as the schizont-infected cell agglutination proteins. However, repetitive DNA sequences are the bane of any genome project, and this and other Plasmodium genome projects have not been immune to the gaps, rearrangements and other pitfalls created by these genomic features. Today, long-read PacBio and chromatin conformation technologies are overcoming such obstacles. Here, based on the use of these technologies, we present a highly refined de novo P. knowlesi genome sequence of the Pk1(A+) clone. This sequence and annotation, referred to as the 'MaHPIC Pk genome sequence', includes manual annotation of the SICAvar gene family with 136 full-length members categorized as type I or II. This sequence provides a framework that will permit a better understanding of the SICAvar repertoire, selective pressures acting on this gene family and mechanisms of antigenic variation in this species and other pathogens.
Collapse
|
35
|
Enrichment of Cryptosporidium parvum from in vitro culture as measured by total RNA and subsequent sequence analysis. Mol Biochem Parasitol 2017; 220:5-9. [PMID: 29292212 DOI: 10.1016/j.molbiopara.2017.12.004] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2017] [Revised: 12/19/2017] [Accepted: 12/22/2017] [Indexed: 02/01/2023]
Abstract
Cryptosporidium parvum is an apicomplexan parasite that infects a wide range of hosts including humans. Due to the parasite's quasi-intracellular, intermembrane location on the host cell, it is difficult to purify parasites from in vitro and in vivo infections for molecular studies. We have developed a method to greatly enrich in vitro C. parvum merozoites from host cells. The efficiency of the protocol was assessed with C. parvum (KSU-1 isolate) parasites of different developmental stages isolated following a synchronized infection of HCT-8 host cells. Total RNA was extracted from the samples and used to evaluate the quantity of host cell contamination in enriched parasite fractions. The quality of the RNA was verified using an Agilent BioAnalyzer. cDNA libraries of RNA isolated from 24 and 48 h C. parvum in vitro preparations isolated via this protocol were sequenced at the Broad Institute via an NIH Microbial Sequencing (GSCID) Contract. Cryptosporidium sequences comprised 30% of the cDNA reads, demonstrating significant enrichment.
Collapse
|
36
|
Correction to: Integrative analysis associates monocytes with insufficient erythropoiesis during acute Plasmodium cynomolgi malaria in rhesus macaques. Malar J 2017; 16:486. [PMID: 29202752 PMCID: PMC5715518 DOI: 10.1186/s12936-017-2134-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2017] [Accepted: 11/27/2017] [Indexed: 11/18/2022] Open
|
37
|
Integrative analysis associates monocytes with insufficient erythropoiesis during acute Plasmodium cynomolgi malaria in rhesus macaques. Malar J 2017; 16:384. [PMID: 28938907 PMCID: PMC5610412 DOI: 10.1186/s12936-017-2029-z] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2017] [Accepted: 09/12/2017] [Indexed: 01/06/2023] Open
Abstract
Background Mild to severe anaemia is a common complication of malaria that is caused in part by insufficient erythropoiesis in the bone marrow. This study used systems biology to evaluate the transcriptional and alterations in cell populations in the bone marrow during Plasmodium cynomolgi infection of rhesus macaques (a model of Plasmodium vivax malaria) that may affect erythropoiesis. Results An appropriate erythropoietic response did not occur to compensate for anaemia during acute cynomolgi malaria despite an increase in erythropoietin levels. During this period, there were significant perturbations in the bone marrow transcriptome. In contrast, relapses did not induce anaemia and minimal changes in the bone marrow transcriptome were detected. The differentially expressed genes during acute infection were primarily related to ongoing inflammatory responses with significant contributions from Type I and Type II Interferon transcriptional signatures. These were associated with increased frequency of intermediate and non-classical monocytes. Recruitment and/or expansion of these populations was correlated with a decrease in the erythroid progenitor population during acute infection, suggesting that monocyte-associated inflammation may have contributed to anaemia. The decrease in erythroid progenitors was associated with downregulation of genes regulated by GATA1 and GATA2, two master regulators of erythropoiesis, providing a potential molecular basis for these findings. Conclusions These data suggest the possibility that malarial anaemia may be driven by monocyte-associated disruption of GATA1/GATA2 function in erythroid progenitors resulting in insufficient erythropoiesis during acute infection. Electronic supplementary material The online version of this article (doi:10.1186/s12936-017-2029-z) contains supplementary material, which is available to authorized users.
Collapse
|
38
|
EuPathDB: the eukaryotic pathogen genomics database resource. Nucleic Acids Res 2016; 45:D581-D591. [PMID: 27903906 PMCID: PMC5210576 DOI: 10.1093/nar/gkw1105] [Citation(s) in RCA: 140] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2016] [Accepted: 10/28/2016] [Indexed: 12/26/2022] Open
Abstract
The Eukaryotic Pathogen Genomics Database Resource (EuPathDB, http://eupathdb.org) is a collection of databases covering 170+ eukaryotic pathogens (protists & fungi), along with relevant free-living and non-pathogenic species, and select pathogen hosts. To facilitate the discovery of meaningful biological relationships, the databases couple preconfigured searches with visualization and analysis tools for comprehensive data mining via intuitive graphical interfaces and APIs. All data are analyzed with the same workflows, including creation of gene orthology profiles, so data are easily compared across data sets, data types and organisms. EuPathDB is updated with numerous new analysis tools, features, data sets and data types. New tools include GO, metabolic pathway and word enrichment analyses plus an online workspace for analysis of personal, non-public, large-scale data. Expanded data content is mostly genomic and functional genomic data while new data types include protein microarray, metabolic pathways, compounds, quantitative proteomics, copy number variation, and polysomal transcriptomics. New features include consistent categorization of searches, data sets and genome browser tracks; redesigned gene pages; effective integration of alternative transcripts; and a EuPathDB Galaxy instance for private analyses of a user's data. Forthcoming upgrades include user workspaces for private integration of data with existing EuPathDB data and improved integration and presentation of host–pathogen interactions.
Collapse
|
39
|
Cryptosporidium hominis gene catalog: a resource for the selection of novel Cryptosporidium vaccine candidates. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2016; 2016:baw137. [PMID: 28095366 PMCID: PMC5070614 DOI: 10.1093/database/baw137] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/07/2016] [Revised: 09/08/2016] [Accepted: 09/09/2016] [Indexed: 01/19/2023]
Abstract
Human cryptosporidiosis, caused primarily by Cryptosporidium hominis and a subset of Cryptosporidium parvum, is a major cause of moderate-to-severe diarrhea in children under 5 years of age in developing countries and can lead to nutritional stunting and death. Cryptosporidiosis is particularly severe and potentially lethal in immunocompromised hosts. Biological and technical challenges have impeded traditional vaccinology approaches to identify novel targets for the development of vaccines against C. hominis, the predominant species associated with human disease. We deemed that the existence of genomic resources for multiple species in the genus, including a much-improved genome assembly and annotation for C. hominis, makes a reverse vaccinology approach feasible. To this end, we sought to generate a searchable online resource, termed C. hominis gene catalog, which registers all C. hominis genes and their properties relevant for the identification and prioritization of candidate vaccine antigens, including physical attributes, properties related to antigenic potential and expression data. Using bioinformatic approaches, we identified ∼400 C. hominis genes containing properties typical of surface-exposed antigens, such as predicted glycosylphosphatidylinositol (GPI)-anchor motifs, multiple transmembrane motifs and/or signal peptides targeting the encoded protein to the secretory pathway. This set can be narrowed further, e.g. by focusing on potential GPI-anchored proteins lacking homologs in the human genome, but with homologs in the other Cryptosporidium species for which genomic data are available, and with low amino acid polymorphism. Additional selection criteria related to recombinant expression and purification include minimizing predicted post-translation modifications and potential disulfide bonds. Forty proteins satisfying these criteria were selected from 3745 proteins in the updated C. hominis annotation. The immunogenic potential of a few of these is currently being tested. Database URL:http://cryptogc.igs.umaryland.edu
Collapse
|
40
|
Plasmodium cynomolgi infections in rhesus macaques display clinical and parasitological features pertinent to modelling vivax malaria pathology and relapse infections. Malar J 2016; 15:451. [PMID: 27590312 PMCID: PMC5010691 DOI: 10.1186/s12936-016-1480-6] [Citation(s) in RCA: 35] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2016] [Accepted: 08/10/2016] [Indexed: 11/10/2022] Open
Abstract
Background Plasmodium vivax infections in humans or in new world monkeys pose research challenges that necessitate the use of alternative model systems. Plasmodium cynomolgi is a closely related species that shares genetic and biological characteristics with P. vivax, including relapses. Here, the haematological dynamics and clinical presentation of sporozoite-initiated P. cynomolgi infections in Macaca mulatta (rhesus macaques) are evaluated over a 100-day period. Methods Five M. mulatta were inoculated with 2000 P. cynomolgi B strain sporozoites. Parasitological and haematological data were collected daily to study the clinical presentations of primary infections and relapses. Peripheral blood and bone marrow aspirates were collected at specific time points during infection for future and retrospective systems biology analyses. Results Patent infections were observed between days 10 and 12, and the acute, primary infection consisted of parasitaemias ranging from 269,962 to 1,214,842 parasites/µl (4.42–19.5 % parasitaemia). All animals presented with anaemia, ranging from moderate (7–10 g/dl) to severe (<7 g/dl), based on peripheral haemoglobin concentrations. Minimum haemoglobin levels coincided with the clearance of parasites and peripheral reticulocytosis was evident at this time. Mild thrombocytopaenia (<150,000 platelets/µl) was observed in all animals, but unlike haemoglobin, platelets were lowest whenever peripheral parasitaemia peaked. The animals’ conditions were classified as non-severe, severe or lethal (in one case) based upon their clinical presentation. The lethal phenotype presented uniquely with an exceptionally high parasitaemia (19.5 %) and lack of a modest reticulocyte release, which was observed in the other animals prior to acute manifestations. One or two relapses were observed in the four surviving animals, and these were characterized by significantly lower parasitaemias and minimal changes in clinical parameters compared to pre-infection values. Conclusions Rhesus macaque infections initiated by P. cynomolgi B strain sporozoites recapitulated pathology of human malaria, including anaemia and thrombocytopaenia, with inter-individual differences in disease severity. Importantly, this study provides an in-depth assessment of clinical and parasitological data, and shows that unlike the primary infections, the relapses did not cause clinical malaria. Notably, this body of research has provided experimental plans, large accessible datasets, and blood and bone marrow samples pertinent for ongoing and iterative systems biology investigations. Electronic supplementary material The online version of this article (doi:10.1186/s12936-016-1480-6) contains supplementary material, which is available to authorized users.
Collapse
|
41
|
High-Quality Genome Assembly and Annotation for Plasmodium coatneyi, Generated Using Single-Molecule Real-Time PacBio Technology. GENOME ANNOUNCEMENTS 2016; 4:e00883-16. [PMID: 27587810 PMCID: PMC5009967 DOI: 10.1128/genomea.00883-16] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 07/01/2016] [Accepted: 07/06/2016] [Indexed: 01/18/2023]
Abstract
Plasmodium coatneyi is a protozoan parasite species that causes simian malaria and is an excellent model for studying disease caused by the human malaria parasite, P. falciparum Here we report the complete (nontelomeric) genome sequence of P. coatneyi Hackeri generated by the application of only Pacific Biosciences RS II (PacBio RS II) single-molecule real-time (SMRT) high-resolution sequence technology and assembly using the Hierarchical Genome Assembly Process (HGAP). This is the first Plasmodium genome sequence reported to use only PacBio technology. This approach has proven to be superior to short-read only approaches for this species.
Collapse
|
42
|
Annotated draft genome sequences of three species of Cryptosporidium: Cryptosporidium meleagridis isolate UKMEL1, C. baileyi isolate TAMU-09Q1 and C. hominis isolates TU502_2012 and UKH1. Pathog Dis 2016; 74:ftw080. [PMID: 27519257 PMCID: PMC5407061 DOI: 10.1093/femspd/ftw080] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/20/2016] [Indexed: 02/06/2023] Open
Abstract
Human cryptosporidiosis is caused primarily by Cryptosporidium hominis, C. parvum and C. meleagridis. To accelerate research on parasites in the genus Cryptosporidium, we generated annotated, draft genome sequences of human C. hominis isolates TU502_2012 and UKH1, C. meleagridis UKMEL1, also isolated from a human patient, and the avian parasite C. baileyi TAMU-09Q1. The annotation of the genome sequences relied in part on RNAseq data generated from the oocyst stage of both C. hominis and C. baileyi. The genome assembly of C. hominis is significantly more complete and less fragmented than that available previously, which enabled the generation of a much-improved gene set for this species, with an increase in average gene length of 500 bp relative to the protein-encoding genes in the 2004 C. hominis annotation. Our results reveal that the genomes of C. hominis and C. parvum are very similar in both gene density and average gene length. These data should prove a valuable resource for the Cryptosporidium research community. The release of the draft genome sequence, and corresponding annotation, of Cryptosporidium baileyi, C. hominis isolates TU502_2012 and UKH1, and C. meleagridis, will accelerate research on Cryptosporidium parasites.
Collapse
|
43
|
An evolutionary approach to identify potentially protective B cell epitopes involved in naturally acquired immunity to malaria and the role of EBA-175 in protection amongst denizens of Bolifamba, Cameroon. Malar J 2016; 15:281. [PMID: 27207101 PMCID: PMC4875671 DOI: 10.1186/s12936-016-1337-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2015] [Accepted: 05/11/2016] [Indexed: 11/17/2022] Open
Abstract
Background The search for a vaccine against malaria caused by Plasmodium falciparum has lasted for more than 100 years, with considerable progress in the identification of a number of vaccine candidates. The post-genomic era offers new opportunities for an expedited search using rational vaccine design and prioritization of key B-cell epitopes involved in natural acquired immunity. Methods Malaria vaccine candidate genes that have reached clinical trial were searched on an evolutionary relationship tree, to determine their level of lineage-specificity. Ten other genes with similar protein features and level of lineage specificity to the vaccine candidates were randomly selected, and computationally evaluated for the presence of B-cell epitopes. The protein fragment with maximum probability of putative epitopes were synthesized and used in an ELISA experiment to determine the presence of antibodies to these peptides, in the serum of malaria patients and healthy malaria uninfected inhabitants from a malaria endemic region (Bolifamba), alongside with a vaccine candidate EBA-175. Results Two peptide fragments of 25 and 30 amino acid length from PF3D7_1233400 and PF3D7_1437500 respectively, coded as PF4-123 and PF4-143 were shown to contain B-cell epitope(s). Total IgG antibodies to these peptides were not significantly different between sick and healthy participants, but cytophilic antibodies to these peptides were significantly higher in healthy participants (p < 0.03). Total IgG to the vaccine candidate EBA-175 was significantly higher in sick participants than in healthy participants, likewise cytophilic antibodies (p < 0.04). Antibodies to the peptides PF4-123 and PF4-143 correlated negatively (p = 0.025 and 0.008 and r = −0.291 and −0.345, respectively) to parasite load. Total IgG antibodies to EBA-175 showed a negative correlation to parasite load (r = −0.144), which was not significant (p = 0.276). Duration of stay in Bolifamba also negatively correlated with parasite load (p = 0.026, r = −0.419) and total IgG to PF4-143 was significantly associated with prolonged duration of stay in the locality of Bolifamba, Cameroon (p = 0.006, r = 0.361). Conclusions The present study has identified two genes PF3D7_1233400 and PF3D7_1437500 containing peptide fragment (PF4-123 and PF4-143) with B-cell epitopes that are correlated with naturally acquired immunity to malaria. A pipeline has been developed for rapid identification of other B-cell epitopes involved in naturally acquired immunity.
Collapse
|
44
|
Abstract
The phylogenetic relationships among hemosporidian parasites, including the origin of Plasmodium falciparum, the most virulent malaria parasite of humans, have been heavily debated for decades. Studies based on multiple-gene sequences have helped settle many of these controversial phylogenetic issues. However, denser taxon sampling and genome-wide analyses are needed to confidently resolve the evolutionay relationships among hemosporidian parasites. Genome sequences of several Plasmodium parasites are available but only for species infecting primates and rodents. To root the phylogenetic tree of Plasmodium, genomic data from related parasites of birds or reptiles are required. Here, we use a novel approach to isolate parasite DNA from microgametes and describe the first genome of a bird parasite in the sister genus to Plasmodium, Haemoproteus tartakovskyi. Similar to Plasmodium parasites, H. tartakovskyi has a small genome (23.2 Mb, 5,990 genes) and a GC content (25.4%) closer to P. falciparum (19.3%) than to Plasmodium vivax (42.3%). Combined with novel transcriptome sequences of the bird parasite Plasmodium ashfordi, our phylogenomic analyses of 1,302 orthologous genes demonstrate that mammalian-infecting malaria parasites are monophyletic, thus rejecting the repeatedly proposed hypothesis that the ancestor of Laverania parasites originated from a secondary host shift from birds to humans. Genes and genomic features previously found to be shared between P. falciparum and bird malaria parasites, but absent in other mammal malaria parasites, are therefore signatures of maintained ancestral states. We foresee that the genome of H. tartakovskyi will open new directions for comparative evolutionary analyses of malarial adaptive traits.
Collapse
|
45
|
Mathematical model of susceptibility, resistance, and resilience in the within-host dynamics between a Plasmodium parasite and the immune system. Math Biosci 2015; 270:213-23. [PMID: 26505135 DOI: 10.1016/j.mbs.2015.10.003] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2015] [Revised: 10/02/2015] [Accepted: 10/07/2015] [Indexed: 11/16/2022]
Abstract
We developed a coupled age-structured partial differential equation model to capture the disease dynamics during blood-stage malaria. The addition of age structure for the parasite population, with respect to previous models, allows us to better characterize the interaction between the malaria parasite and red blood cells during infection. Here we prove that the system we propose is well-posed and there exist at least two global states. We further demonstrate that the numerical simulation of the system coincides with clinically observed outcomes of primary and secondary malaria infection. The well-posedness of this system guarantees that the behavior of the model remains smooth, bounded, and continuously dependent on initial conditions; calibration with clinical data will constrain domains of parameters and variables to physiological ranges.
Collapse
|
46
|
Metabolomics in the fight against malaria. Mem Inst Oswaldo Cruz 2015; 109:589-97. [PMID: 25185001 PMCID: PMC4156452 DOI: 10.1590/0074-0276140043] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2014] [Accepted: 03/11/2014] [Indexed: 02/06/2023] Open
Abstract
Metabolomics uses high-resolution mass spectrometry to provide a chemical fingerprint of thousands of metabolites present in cells, tissues or body fluids. Such metabolic phenotyping has been successfully used to study various biologic processes and disease states. High-resolution metabolomics can shed new light on the intricacies of host-parasite interactions in each stage of the Plasmodium life cycle and the downstream ramifications on the host's metabolism, pathogenesis and disease. Such data can become integrated with other large datasets generated using top-down systems biology approaches and be utilised by computational biologists to develop and enhance models of malaria pathogenesis relevant for identifying new drug targets or intervention strategies. Here, we focus on the promise of metabolomics to complement systems biology approaches in the quest for novel interventions in the fight against malaria. We introduce the Malaria Host-Pathogen Interaction Center (MaHPIC), a new systems biology research coalition. A primary goal of the MaHPIC is to generate systems biology datasets relating to human and non-human primate (NHP) malaria parasites and their hosts making these openly available from an online relational database. Metabolomic data from NHP infections and clinical malaria infections from around the world will comprise a unique global resource.
Collapse
|
47
|
From genome-scale data to models of infectious disease: A Bayesian network-based strategy to drive model development. Math Biosci 2015; 270:156-68. [PMID: 26093035 DOI: 10.1016/j.mbs.2015.06.006] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2014] [Revised: 06/04/2015] [Accepted: 06/08/2015] [Indexed: 11/25/2022]
Abstract
High-throughput, genome-scale data present a unique opportunity to link host to pathogen on a molecular level. Forging such connections will help drive the development of mathematical models to better understand and predict both pathogen behavior and the epidemiology of infectious diseases, including malaria. However, the datasets that can aid in identifying these links and models are vast and not amenable to simple, reductionist, and univariate analyses. These datasets require data mining in order to identify the truly important measurements that best describe clinical and molecular observations. Moreover, these datasets typically have relatively few samples due to experimental limitations (particularly for human studies or in vivo animal experiments), making data mining extremely difficult. Here, after first providing a brief overview of common strategies for data reduction and identification of relationships between variables for inclusion in mathematical models, we present a new generalized strategy for performing these data reduction and relationship inference tasks. Our approach emphasizes the importance of robustness when using data to drive model development, particularly when using genome-scale, small-sample in vivo data. We identify the use of appropriate feature reduction combined with data permutations and subsampling strategies as being critical to enable increasingly robust results from network inference using high-dimensional, low-observation data.
Collapse
|
48
|
Comparative transcriptomics and metabolomics in a rhesus macaque drug administration study. Front Cell Dev Biol 2014; 2:54. [PMID: 25453034 PMCID: PMC4233942 DOI: 10.3389/fcell.2014.00054] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2014] [Accepted: 09/08/2014] [Indexed: 01/02/2023] Open
Abstract
We describe a multi-omic approach to understanding the effects that the anti-malarial drug pyrimethamine has on immune physiology in rhesus macaques (Macaca mulatta). Whole blood and bone marrow (BM) RNA-Seq and plasma metabolome profiles (each with over 15,000 features) have been generated for five naïve individuals at up to seven timepoints before, during and after three rounds of drug administration. Linear modeling and Bayesian network analyses are both considered, alongside investigations of the impact of statistical modeling strategies on biological inference. Individual macaques were found to be a major source of variance for both omic data types, and factoring individuals into subsequent modeling increases power to detect temporal effects. A major component of the whole blood transcriptome follows the BM with a time-delay, while other components of variation are unique to each compartment. We demonstrate that pyrimethamine administration does impact both compartments throughout the experiment, but very limited perturbation of transcript or metabolite abundance was observed following each round of drug exposure. New insights into the mode of action of the drug are presented in the context of pyrimethamine's predicted effect on suppression of cell division and metabolism in the immune system.
Collapse
|
49
|
The Cryptosporidium parvum ApiAP2 gene family: insights into the evolution of apicomplexan AP2 regulatory systems. Nucleic Acids Res 2014; 42:8271-84. [PMID: 24957599 PMCID: PMC4117751 DOI: 10.1093/nar/gku500] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2014] [Revised: 05/15/2014] [Accepted: 05/19/2014] [Indexed: 01/13/2023] Open
Abstract
We provide the first comprehensive analysis of any transcription factor family in Cryptosporidium, a basal-branching apicomplexan that is the second leading cause of infant diarrhea globally. AP2 domain-containing proteins have evolved to be the major regulatory family in the phylum to the exclusion of canonical regulators. We show that apicomplexan and perkinsid AP2 domains cluster distinctly from other chromalveolate AP2s. Protein-binding specificity assays of C. parvum AP2 domains combined with motif conservation upstream of co-regulated gene clusters allowed the construction of putative AP2 regulons across the in vitro life cycle. Orthologous Apicomplexan AP2 (ApiAP2) expression has been rearranged relative to the malaria parasite P. falciparum, suggesting ApiAP2 network rewiring during evolution. C. hominis orthologs of putative C. parvum ApiAP2 proteins and target genes show greater than average variation. C. parvum AP2 domains display reduced binding diversity relative to P. falciparum, with multiple domains binding the 5'-TGCAT-3', 5'-CACACA-3' and G-box motifs (5'-G[T/C]GGGG-3'). Many overrepresented motifs in C. parvum upstream regions are not AP2 binding motifs. We propose that C. parvum is less reliant on ApiAP2 regulators in part because it utilizes E2F/DP1 transcription factors. C. parvum may provide clues to the ancestral state of apicomplexan transcriptional regulation, pre-AP2 domination.
Collapse
|
50
|
Abstract
High throughput sequencing has accelerated the determination of genome sequences for thousands of human infectious disease pathogens and dozens of their vectors. The scale and scope of these data are enabling genotype-phenotype association studies to identify genetic determinants of pathogen virulence and drug/insecticide resistance, and phylogenetic studies to track the origin and spread of disease outbreaks. To maximize the utility of genomic sequences for these purposes, it is essential that metadata about the pathogen/vector isolate characteristics be collected and made available in organized, clear, and consistent formats. Here we report the development of the GSCID/BRC Project and Sample Application Standard, developed by representatives of the Genome Sequencing Centers for Infectious Diseases (GSCIDs), the Bioinformatics Resource Centers (BRCs) for Infectious Diseases, and the U.S. National Institute of Allergy and Infectious Diseases (NIAID), part of the National Institutes of Health (NIH), informed by interactions with numerous collaborating scientists. It includes mapping to terms from other data standards initiatives, including the Genomic Standards Consortium's minimal information (MIxS) and NCBI's BioSample/BioProjects checklists and the Ontology for Biomedical Investigations (OBI). The standard includes data fields about characteristics of the organism or environmental source of the specimen, spatial-temporal information about the specimen isolation event, phenotypic characteristics of the pathogen/vector isolated, and project leadership and support. By modeling metadata fields into an ontology-based semantic framework and reusing existing ontologies and minimum information checklists, the application standard can be extended to support additional project-specific data fields and integrated with other data represented with comparable standards. The use of this metadata standard by all ongoing and future GSCID sequencing projects will provide a consistent representation of these data in the BRC resources and other repositories that leverage these data, allowing investigators to identify relevant genomic sequences and perform comparative genomics analyses that are both statistically meaningful and biologically relevant.
Collapse
|