1
|
Basenko EY, Shanmugasundram A, Böhme U, Starns D, Wilkinson PA, Davison HR, Crouch K, Maslen G, Harb OS, Amos B, McDowell MA, Kissinger JC, Roos DS, Jones A. What is new in FungiDB: a web-based bioinformatics platform for omics-scale data analysis for fungal and oomycete species. Genetics 2024; 227:iyae035. [PMID: 38529759 DOI: 10.1093/genetics/iyae035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2023] [Accepted: 02/15/2024] [Indexed: 03/27/2024] Open
Abstract
FungiDB (https://fungidb.org) serves as a valuable online resource that seamlessly integrates genomic and related large-scale data for a wide range of fungal and oomycete species. As an integral part of the VEuPathDB Bioinformatics Resource Center (https://veupathdb.org), FungiDB continually integrates both published and unpublished data addressing various aspects of fungal biology. Established in early 2011, the database has evolved to support 674 datasets. The datasets include over 300 genomes spanning various taxa (e.g. Ascomycota, Basidiomycota, Blastocladiomycota, Chytridiomycota, Mucoromycota, as well as Albuginales, Peronosporales, Pythiales, and Saprolegniales). In addition to genomic assemblies and annotation, over 300 extra datasets encompassing diverse information, such as expression and variation data, are also available. The resource also provides an intuitive web-based interface, facilitating comprehensive approaches to data mining and visualization. Users can test their hypotheses and navigate through omics-scale datasets using a built-in search strategy system. Moreover, FungiDB offers capabilities for private data analysis via the integrated VEuPathDB Galaxy platform. FungiDB also permits genome improvements by capturing expert knowledge through the User Comments system and the Apollo genome annotation editor for structural and functional gene curation. FungiDB facilitates data exploration and analysis and contributes to advancing research efforts by capturing expert knowledge for fungal and oomycete species.
Collapse
Affiliation(s)
- Evelina Y Basenko
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7BE, UK
| | - Achchuthan Shanmugasundram
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7BE, UK
- Genomics England Limited, London E14 5AB, UK
| | - Ulrike Böhme
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7BE, UK
| | - David Starns
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7BE, UK
| | - Paul A Wilkinson
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7BE, UK
| | - Helen R Davison
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7BE, UK
| | - Kathryn Crouch
- School of Infection and Immunity, University of Glasgow, Glasgow G12 8QQ, UK
| | | | - Omar S Harb
- University of Pennsylvania, Philadelphia, PA 19104, USA
| | | | | | | | - David S Roos
- University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Andrew Jones
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7BE, UK
| |
Collapse
|
2
|
Alvarez-Jarreta J, Amos B, Aurrecoechea C, Bah S, Barba M, Barreto A, Basenko EY, Belnap R, Blevins A, Böhme U, Brestelli J, Brown S, Callan D, Campbell LI, Christophides GK, Crouch K, Davison HR, DeBarry JD, Demko R, Doherty R, Duan Y, Dundore W, Dyer S, Falke D, Fischer S, Gajria B, Galdi D, Giraldo-Calderón GI, Harb OS, Harper E, Helb D, Howington C, Hu S, Humphrey J, Iodice J, Jones A, Judkins J, Kelly SA, Kissinger JC, Kittur N, Kwon DK, Lamoureux K, Li W, Lodha D, MacCallum RM, Maslen G, McDowell MA, Myers J, Nural MV, Roos DS, Rund SSC, Shanmugasundram A, Sitnik V, Spruill D, Starns D, Tomko SS, Wang H, Warrenfeltz S, Wieck R, Wilkinson PA, Zheng J. VEuPathDB: the eukaryotic pathogen, vector and host bioinformatics resource center in 2023. Nucleic Acids Res 2024; 52:D808-D816. [PMID: 37953350 PMCID: PMC10767879 DOI: 10.1093/nar/gkad1003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Revised: 10/09/2023] [Accepted: 10/19/2023] [Indexed: 11/14/2023] Open
Abstract
The Eukaryotic Pathogen, Vector and Host Informatics Resource (VEuPathDB, https://veupathdb.org) is a Bioinformatics Resource Center funded by the National Institutes of Health with additional funding from the Wellcome Trust. VEuPathDB supports >600 organisms that comprise invertebrate vectors, eukaryotic pathogens (protists and fungi) and relevant free-living or non-pathogenic species or hosts. Since 2004, VEuPathDB has analyzed omics data from the public domain using contemporary bioinformatic workflows, including orthology predictions via OrthoMCL, and integrated the analysis results with analysis tools, visualizations, and advanced search capabilities. The unique data mining platform coupled with >3000 pre-analyzed data sets facilitates the exploration of pertinent omics data in support of hypothesis driven research. Comparisons are easily made across data sets, data types and organisms. A Galaxy workspace offers the opportunity for the analysis of private large-scale datasets and for porting to VEuPathDB for comparisons with integrated data. The MapVEu tool provides a platform for exploration of spatially resolved data such as vector surveillance and insecticide resistance monitoring. To address the growing body of omics data and advances in laboratory techniques, VEuPathDB has added several new data types, searches and features, improved the Galaxy workspace environment, redesigned the MapVEu interface and updated the infrastructure to accommodate these changes.
Collapse
Affiliation(s)
| | - Beatrice Amos
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, UK
| | | | - Saikou Bah
- School of Infection and Immunity, University of Glasgow, Glasgow, UK
| | | | - Ana Barreto
- University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Evelina Y Basenko
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, UK
| | | | - Ann Blevins
- University of Pennsylvania School of Veterinary Medicine, Philadelphia, PA 19104, USA
| | | | | | - Stuart Brown
- University of Pennsylvania, Philadelphia, PA 19104, USA
| | | | | | | | - Kathryn Crouch
- School of Infection and Immunity, University of Glasgow, Glasgow, UK
| | - Helen R Davison
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, UK
| | | | - Richard Demko
- University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Ryan Doherty
- University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Yikun Duan
- University of Pennsylvania, Philadelphia, PA 19104, USA
| | | | - Sarah Dyer
- European Bioinformatics Institute, Hinxton CB10 1SD, UK
| | - Dave Falke
- University of Georgia, Athens, GA 30602, USA
| | - Steve Fischer
- University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Bindu Gajria
- University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Daniel Galdi
- University of Pennsylvania, Philadelphia, PA 19104, USA
| | | | - Omar S Harb
- University of Pennsylvania, Philadelphia, PA 19104, USA
| | | | - Danica Helb
- University of Pennsylvania, Philadelphia, PA 19104, USA
| | | | - Sufen Hu
- University of Pennsylvania, Philadelphia, PA 19104, USA
| | | | - John Iodice
- University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Andrew Jones
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, UK
| | - John Judkins
- University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Sarah A Kelly
- Imperial College London, South Kensington, London SW7 2BU, UK
| | | | | | - Dae Kun Kwon
- University of Notre Dame, Notre Dame, IN 46556, USA
| | | | - Wei Li
- University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Disha Lodha
- European Bioinformatics Institute, Hinxton CB10 1SD, UK
| | | | - Gareth Maslen
- Imperial College London, South Kensington, London SW7 2BU, UK
| | | | - Jeremy Myers
- University of Pennsylvania, Philadelphia, PA 19104, USA
| | | | - David S Roos
- University of Pennsylvania, Philadelphia, PA 19104, USA
| | | | - Achchuthan Shanmugasundram
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, UK
- Genomics England Limited, London E14 5AB, UK
| | - Vasily Sitnik
- European Bioinformatics Institute, Hinxton CB10 1SD, UK
| | | | - David Starns
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, UK
| | | | | | | | - Robert Wieck
- University of Notre Dame, Notre Dame, IN 46556, USA
| | - Paul A Wilkinson
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, UK
| | - Jie Zheng
- University of Pennsylvania, Philadelphia, PA 19104, USA
| |
Collapse
|
3
|
Shanmugasundram A, Starns D, Böhme U, Amos B, Wilkinson PA, Harb OS, Warrenfeltz S, Kissinger JC, McDowell MA, Roos DS, Crouch K, Jones AR. TriTrypDB: An integrated functional genomics resource for kinetoplastida. PLoS Negl Trop Dis 2023; 17:e0011058. [PMID: 36656904 PMCID: PMC9888696 DOI: 10.1371/journal.pntd.0011058] [Citation(s) in RCA: 16] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2022] [Revised: 01/31/2023] [Accepted: 12/23/2022] [Indexed: 01/20/2023] Open
Abstract
Parasitic diseases caused by kinetoplastid parasites are a burden to public health throughout tropical and subtropical regions of the world. TriTrypDB (https://tritrypdb.org) is a free online resource for data mining of genomic and functional data from these kinetoplastid parasites and is part of the VEuPathDB Bioinformatics Resource Center (https://veupathdb.org). As of release 59, TriTrypDB hosts 83 kinetoplastid genomes, nine of which, including Trypanosoma brucei brucei TREU927, Trypanosoma cruzi CL Brener and Leishmania major Friedlin, undergo manual curation by integrating information from scientific publications, high-throughput assays and user submitted comments. TriTrypDB also integrates transcriptomic, proteomic, epigenomic, population-level and isolate data, functional information from genome-wide RNAi knock-down and fluorescent tagging, and results from automated bioinformatics analysis pipelines. TriTrypDB offers a user-friendly web interface embedded with a genome browser, search strategy system and bioinformatics tools to support custom in silico experiments that leverage integrated data. A Galaxy workspace enables users to analyze their private data (e.g., RNA-sequencing, variant calling, etc.) and explore their results privately in the context of publicly available information in the database. The recent addition of an annotation platform based on Apollo enables users to provide both functional and structural changes that will appear as 'community annotations' immediately and, pending curatorial review, will be integrated into the official genome annotation.
Collapse
Affiliation(s)
- Achchuthan Shanmugasundram
- Department of Biochemistry and Systems Biology, Institute of Integrative, Systems and Molecular Biology, University of Liverpool, Liverpool, United Kingdom
| | - David Starns
- Department of Biochemistry and Systems Biology, Institute of Integrative, Systems and Molecular Biology, University of Liverpool, Liverpool, United Kingdom
| | - Ulrike Böhme
- Department of Biochemistry and Systems Biology, Institute of Integrative, Systems and Molecular Biology, University of Liverpool, Liverpool, United Kingdom
| | - Beatrice Amos
- Department of Biochemistry and Systems Biology, Institute of Integrative, Systems and Molecular Biology, University of Liverpool, Liverpool, United Kingdom
| | - Paul A. Wilkinson
- Department of Biochemistry and Systems Biology, Institute of Integrative, Systems and Molecular Biology, University of Liverpool, Liverpool, United Kingdom
| | - Omar S. Harb
- Department of Biology, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
| | - Susanne Warrenfeltz
- Center for Tropical & Emerging Global Diseases, Department of Genetics, Institute of Bioinformatics, University of Georgia, Athens, Georgia, United States of America
| | - Jessica C. Kissinger
- Center for Tropical & Emerging Global Diseases, Department of Genetics, Institute of Bioinformatics, University of Georgia, Athens, Georgia, United States of America
| | - Mary Ann McDowell
- Department of Biological Sciences, Eck Institute for Global Health, University of Notre Dame, Notre Dame, Indiana, United States of America
| | - David S. Roos
- Department of Biology, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
| | - Kathryn Crouch
- School of Infection and Immunity, University of Glasgow, Glasgow, United Kingdom
| | - Andrew R. Jones
- Department of Biochemistry and Systems Biology, Institute of Integrative, Systems and Molecular Biology, University of Liverpool, Liverpool, United Kingdom
| |
Collapse
|
4
|
Amos B, Aurrecoechea C, Barba M, Barreto A, Basenko E, Bażant W, Belnap R, Blevins AS, Böhme U, Brestelli J, Brunk BP, Caddick M, Callan D, Campbell L, Christensen M, Christophides G, Crouch K, Davis K, DeBarry J, Doherty R, Duan Y, Dunn M, Falke D, Fisher S, Flicek P, Fox B, Gajria B, Giraldo-Calderón GI, Harb OS, Harper E, Hertz-Fowler C, Hickman M, Howington C, Hu S, Humphrey J, Iodice J, Jones A, Judkins J, Kelly SA, Kissinger JC, Kwon DK, Lamoureux K, Lawson D, Li W, Lies K, Lodha D, Long J, MacCallum RM, Maslen G, McDowell MA, Nabrzyski J, Roos DS, Rund SC, Schulman S, Shanmugasundram A, Sitnik V, Spruill D, Starns D, Stoeckert C, Tomko SS, Wang H, Warrenfeltz S, Wieck R, Wilkinson PA, Xu L, Zheng J. VEuPathDB: the eukaryotic pathogen, vector and host bioinformatics resource center. Nucleic Acids Res 2022; 50:D898-D911. [PMID: 34718728 PMCID: PMC8728164 DOI: 10.1093/nar/gkab929] [Citation(s) in RCA: 186] [Impact Index Per Article: 93.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2020] [Revised: 09/21/2021] [Accepted: 10/04/2021] [Indexed: 11/13/2022] Open
Abstract
The Eukaryotic Pathogen, Vector and Host Informatics Resource (VEuPathDB, https://veupathdb.org) represents the 2019 merger of VectorBase with the EuPathDB projects. As a Bioinformatics Resource Center funded by the National Institutes of Health, with additional support from the Welllcome Trust, VEuPathDB supports >500 organisms comprising invertebrate vectors, eukaryotic pathogens (protists and fungi) and relevant free-living or non-pathogenic species or hosts. Designed to empower researchers with access to Omics data and bioinformatic analyses, VEuPathDB projects integrate >1700 pre-analysed datasets (and associated metadata) with advanced search capabilities, visualizations, and analysis tools in a graphic interface. Diverse data types are analysed with standardized workflows including an in-house OrthoMCL algorithm for predicting orthology. Comparisons are easily made across datasets, data types and organisms in this unique data mining platform. A new site-wide search facilitates access for both experienced and novice users. Upgraded infrastructure and workflows support numerous updates to the web interface, tools, searches and strategies, and Galaxy workspace where users can privately analyse their own data. Forthcoming upgrades include cloud-ready application architecture, expanded support for the Galaxy workspace, tools for interrogating host-pathogen interactions, and improved interactions with affiliated databases (ClinEpiDB, MicrobiomeDB) and other scientific resources, and increased interoperability with the Bacterial & Viral BRC.
Collapse
Affiliation(s)
- Beatrice Amos
- Institute of Systems, Molecular & Integrative Biology, University of Liverpool, Liverpool L69 7ZB, UK
| | - Cristina Aurrecoechea
- Center for Tropical & Emerging Global Diseases, University of Georgia, Athens, GA 30602, USA
| | - Matthieu Barba
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Ana Barreto
- Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
- Department of Genetics, School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Evelina Y Basenko
- Institute of Systems, Molecular & Integrative Biology, University of Liverpool, Liverpool L69 7ZB, UK
| | - Wojciech Bażant
- Wellcome Centre for Integrative Parasitology, University of Glasgow, Glasgow G12 8TA, UK
| | - Robert Belnap
- Center for Tropical & Emerging Global Diseases, University of Georgia, Athens, GA 30602, USA
| | - Ann S Blevins
- Department of Pathology, School of Veterinary Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Ulrike Böhme
- Institute of Systems, Molecular & Integrative Biology, University of Liverpool, Liverpool L69 7ZB, UK
| | - John Brestelli
- Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
- Department of Genetics, School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Brian P Brunk
- Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Mark Caddick
- Institute of Systems, Molecular & Integrative Biology, University of Liverpool, Liverpool L69 7ZB, UK
| | - Danielle Callan
- Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
- Department of Genetics, School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Lahcen Campbell
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Mikkel B Christensen
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - George K Christophides
- Department of Life Sciences, Imperial College London, South Kensington Campus, London SW7 2AZ, UK
| | - Kathryn Crouch
- Wellcome Centre for Integrative Parasitology, University of Glasgow, Glasgow G12 8TA, UK
| | - Kristina Davis
- Center for Research Computing, University of Notre Dame, Notre Dame, IN 46556, USA
| | - Jeremy DeBarry
- Center for Tropical & Emerging Global Diseases, University of Georgia, Athens, GA 30602, USA
| | - Ryan Doherty
- Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
- Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Yikun Duan
- Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
- Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Michael Dunn
- Center for Research Computing, University of Notre Dame, Notre Dame, IN 46556, USA
| | - Dave Falke
- Center for Tropical & Emerging Global Diseases, University of Georgia, Athens, GA 30602, USA
| | - Steve Fisher
- Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
- Department of Genetics, School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Paul Flicek
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Brett Fox
- Center for Research Computing, University of Notre Dame, Notre Dame, IN 46556, USA
| | - Bindu Gajria
- Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Gloria I Giraldo-Calderón
- Department of Biological Sciences, Eck Institute for Global Health, University of Notre Dame, Notre Dame, IN 46556, USA
- Departamento de Ciencias Biológicas y Departamento de Ciencias Básicas Médicas, Universidad Icesi, Calle 18 No. 122-135, Cali, Colombia
| | - Omar S Harb
- Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Elizabeth Harper
- Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Christiane Hertz-Fowler
- Institute of Systems, Molecular & Integrative Biology, University of Liverpool, Liverpool L69 7ZB, UK
| | - Mark J Hickman
- Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Connor Howington
- Center for Research Computing, University of Notre Dame, Notre Dame, IN 46556, USA
| | - Sufen Hu
- Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
- Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Jay Humphrey
- Center for Tropical & Emerging Global Diseases, University of Georgia, Athens, GA 30602, USA
| | - John Iodice
- Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
- Department of Genetics, School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Andrew Jones
- Institute of Systems, Molecular & Integrative Biology, University of Liverpool, Liverpool L69 7ZB, UK
| | - John Judkins
- Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
- Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Sarah A Kelly
- Department of Life Sciences, Imperial College London, South Kensington Campus, London SW7 2AZ, UK
| | - Jessica C Kissinger
- Center for Tropical & Emerging Global Diseases, University of Georgia, Athens, GA 30602, USA
- Department of Genetics, University of Georgia, Athens, GA 30602, USA
- Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA
| | - Dae Kun Kwon
- Department of Civil & Environmental Engineering & Earth Sciences, University of Notre Dame, Notre Dame, IN 46556, USA
| | - Kristopher Lamoureux
- Center for Tropical & Emerging Global Diseases, University of Georgia, Athens, GA 30602, USA
| | - Daniel Lawson
- Department of Life Sciences, Imperial College London, South Kensington Campus, London SW7 2AZ, UK
| | - Wei Li
- Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
- Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Kallie Lies
- Center for Research Computing, University of Notre Dame, Notre Dame, IN 46556, USA
| | - Disha Lodha
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Jamie Long
- Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Robert M MacCallum
- Department of Life Sciences, Imperial College London, South Kensington Campus, London SW7 2AZ, UK
| | - Gareth Maslen
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Mary Ann McDowell
- Department of Biological Sciences, Eck Institute for Global Health, University of Notre Dame, Notre Dame, IN 46556, USA
| | - Jaroslaw Nabrzyski
- Center for Research Computing, University of Notre Dame, Notre Dame, IN 46556, USA
| | - David S Roos
- Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Samuel S C Rund
- Department of Biological Sciences, Eck Institute for Global Health, University of Notre Dame, Notre Dame, IN 46556, USA
| | | | | | - Vasily Sitnik
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Drew Spruill
- Center for Tropical & Emerging Global Diseases, University of Georgia, Athens, GA 30602, USA
| | - David Starns
- Institute of Systems, Molecular & Integrative Biology, University of Liverpool, Liverpool L69 7ZB, UK
| | - Christian J Stoeckert
- Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
- Department of Genetics, School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Sheena Shah Tomko
- Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Haiming Wang
- Center for Tropical & Emerging Global Diseases, University of Georgia, Athens, GA 30602, USA
| | - Susanne Warrenfeltz
- Center for Tropical & Emerging Global Diseases, University of Georgia, Athens, GA 30602, USA
| | - Robert Wieck
- Center for Research Computing, University of Notre Dame, Notre Dame, IN 46556, USA
| | - Paul A Wilkinson
- Institute of Systems, Molecular & Integrative Biology, University of Liverpool, Liverpool L69 7ZB, UK
| | - Lin Xu
- Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
- Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Jie Zheng
- Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
- Department of Genetics, School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| |
Collapse
|
5
|
Gossmann TI, Shanmugasundram A, Börno S, Duvaux L, Lemaire C, Kuhl H, Klages S, Roberts LD, Schade S, Gostner JM, Hildebrand F, Vowinckel J, Bichet C, Mülleder M, Calvani E, Zelezniak A, Griffin JL, Bork P, Allaine D, Cohas A, Welch JJ, Timmermann B, Ralser M. Ice-Age Climate Adaptations Trap the Alpine Marmot in a State of Low Genetic Diversity. Curr Biol 2019; 29:1712-1720.e7. [PMID: 31080084 PMCID: PMC6538971 DOI: 10.1016/j.cub.2019.04.020] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2018] [Revised: 02/16/2019] [Accepted: 04/09/2019] [Indexed: 12/30/2022]
Abstract
Some species responded successfully to prehistoric changes in climate [1, 2], while others failed to adapt and became extinct [3]. The factors that determine successful climate adaptation remain poorly understood. We constructed a reference genome and studied physiological adaptations in the Alpine marmot (Marmota marmota), a large ground-dwelling squirrel exquisitely adapted to the "ice-age" climate of the Pleistocene steppe [4, 5]. Since the disappearance of this habitat, the rodent persists in large numbers in the high-altitude Alpine meadow [6, 7]. Genome and metabolome showed evidence of adaptation consistent with cold climate, affecting white adipose tissue. Conversely, however, we found that the Alpine marmot has levels of genetic variation that are among the lowest for mammals, such that deleterious mutations are less effectively purged. Our data rule out typical explanations for low diversity, such as high levels of consanguineous mating, or a very recent bottleneck. Instead, ancient demographic reconstruction revealed that genetic diversity was lost during the climate shifts of the Pleistocene and has not recovered, despite the current high population size. We attribute this slow recovery to the marmot's adaptive life history. The case of the Alpine marmot reveals a complicated relationship between climatic changes, genetic diversity, and conservation status. It shows that species of extremely low genetic diversity can be very successful and persist over thousands of years, but also that climate-adapted life history can trap a species in a persistent state of low genetic diversity.
Collapse
Affiliation(s)
- Toni I Gossmann
- University of Sheffield, Department of Animal and Plant Sciences, Sheffield S10 2TN, UK; Bielefeld University, Department of Animal Behaviour, 33501 Bielefeld, Germany
| | - Achchuthan Shanmugasundram
- Molecular Biology of Metabolism Laboratory, The Francis Crick Institute, 1 Midland Road, London NW1 1AT, UK; Centre for Genomic Research, Institute of Integrative Biology, University of Liverpool, Biosciences Building, Crown Street, Liverpool L69 7ZB, UK
| | - Stefan Börno
- Max Planck Institute for Molecular Genetics, Sequencing Core Facility, Ihnestrasse 73, 14195 Berlin, Germany
| | - Ludovic Duvaux
- IRHS, Université d'Angers, INRA, Agrocampus-Ouest, SFR 4207 QuaSaV, 49071 Beaucouzé, France; BIOGECO, INRA, Université de Bordeaux, 69 Route d'Arcachon, 33612 Cestas, France
| | - Christophe Lemaire
- IRHS, Université d'Angers, INRA, Agrocampus-Ouest, SFR 4207 QuaSaV, 49071 Beaucouzé, France
| | - Heiner Kuhl
- Max Planck Institute for Molecular Genetics, Sequencing Core Facility, Ihnestrasse 73, 14195 Berlin, Germany; Department of Ecophysiology and Aquaculture, Leibniz-Institute of Freshwater Ecology and Inland Fisheries, 12587 Berlin, Germany
| | - Sven Klages
- Max Planck Institute for Molecular Genetics, Sequencing Core Facility, Ihnestrasse 73, 14195 Berlin, Germany
| | - Lee D Roberts
- Department of Biochemistry and Cambridge Systems Biology Centre, University of Cambridge, 80 Tennis Court Road, Cambridge CB2 1GA, UK; Leeds Institute of Cardiovascular and Metabolic Medicine, University of Leeds, Leeds LS2 9JT, UK
| | - Sophia Schade
- Max Planck Institute for Molecular Genetics, Sequencing Core Facility, Ihnestrasse 73, 14195 Berlin, Germany
| | - Johanna M Gostner
- Division of Medical Biochemistry, Medical University of Innsbruck, 6020 Innsbruck, Austria
| | - Falk Hildebrand
- European Molecular Biology Laboratory (EMBL), 69117 Heidelberg, Germany; Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK; Gut Health and Microbes Programme, Quadram Institute, Norwich Research Park, Norwich NR4 7UQ, UK
| | - Jakob Vowinckel
- Department of Biochemistry and Cambridge Systems Biology Centre, University of Cambridge, 80 Tennis Court Road, Cambridge CB2 1GA, UK
| | | | - Michael Mülleder
- Department of Biochemistry and Cambridge Systems Biology Centre, University of Cambridge, 80 Tennis Court Road, Cambridge CB2 1GA, UK; Department of Biochemistry, Charitè, Am Chariteplatz 1, 10117 Berlin, Germany
| | - Enrica Calvani
- Molecular Biology of Metabolism Laboratory, The Francis Crick Institute, 1 Midland Road, London NW1 1AT, UK; Department of Biochemistry and Cambridge Systems Biology Centre, University of Cambridge, 80 Tennis Court Road, Cambridge CB2 1GA, UK
| | - Aleksej Zelezniak
- Molecular Biology of Metabolism Laboratory, The Francis Crick Institute, 1 Midland Road, London NW1 1AT, UK; Department of Biology and Biological Engineering, Chalmers University of Technology, 412 96 Göteborg, Sweden; Science for Life Laboratory, KTH - Royal Institute of Technology, Stockholm 171 65, Sweden
| | - Julian L Griffin
- Department of Biochemistry and Cambridge Systems Biology Centre, University of Cambridge, 80 Tennis Court Road, Cambridge CB2 1GA, UK
| | - Peer Bork
- European Molecular Biology Laboratory (EMBL), 69117 Heidelberg, Germany; Max-Delbrück-Centre for Molecular Medicine, 13092 Berlin, Germany; Molecular Medicine Partnership Unit, 69120 Heidelberg, Germany
| | - Dominique Allaine
- Université de Lyon, F-69000, Lyon; Université Lyon 1; CNRS, UMR 5558, Laboratoire de Biométrie et Biologie Evolutive, 69622 Villeurbanne, France
| | - Aurélie Cohas
- Université de Lyon, F-69000, Lyon; Université Lyon 1; CNRS, UMR 5558, Laboratoire de Biométrie et Biologie Evolutive, 69622 Villeurbanne, France
| | - John J Welch
- Department of Genetics, University of Cambridge, Cambridge CB2 3EH, UK
| | - Bernd Timmermann
- Max Planck Institute for Molecular Genetics, Sequencing Core Facility, Ihnestrasse 73, 14195 Berlin, Germany
| | - Markus Ralser
- Molecular Biology of Metabolism Laboratory, The Francis Crick Institute, 1 Midland Road, London NW1 1AT, UK; Department of Biochemistry and Cambridge Systems Biology Centre, University of Cambridge, 80 Tennis Court Road, Cambridge CB2 1GA, UK; Department of Biochemistry, Charitè, Am Chariteplatz 1, 10117 Berlin, Germany.
| |
Collapse
|
6
|
Basenko EY, Pulman JA, Shanmugasundram A, Harb OS, Crouch K, Starns D, Warrenfeltz S, Aurrecoechea C, Stoeckert CJ, Kissinger JC, Roos DS, Hertz-Fowler C. FungiDB: An Integrated Bioinformatic Resource for Fungi and Oomycetes. J Fungi (Basel) 2018; 4:jof4010039. [PMID: 30152809 PMCID: PMC5872342 DOI: 10.3390/jof4010039] [Citation(s) in RCA: 186] [Impact Index Per Article: 31.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2018] [Revised: 03/07/2018] [Accepted: 03/15/2018] [Indexed: 02/06/2023] Open
Abstract
FungiDB (fungidb.org) is a free online resource for data mining and functional genomics analysis for fungal and oomycete species. FungiDB is part of the Eukaryotic Pathogen Genomics Database Resource (EuPathDB, eupathdb.org) platform that integrates genomic, transcriptomic, proteomic, and phenotypic datasets, and other types of data for pathogenic and nonpathogenic, free-living and parasitic organisms. FungiDB is one of the largest EuPathDB databases containing nearly 100 genomes obtained from GenBank, Aspergillus Genome Database (AspGD), The Broad Institute, Joint Genome Institute (JGI), Ensembl, and other sources. FungiDB offers a user-friendly web interface with embedded bioinformatics tools that support custom in silico experiments that leverage FungiDB-integrated data. In addition, a Galaxy-based workspace enables users to generate custom pipelines for large-scale data analysis (e.g., RNA-Seq, variant calling, etc.). This review provides an introduction to the FungiDB resources and focuses on available features, tools, and queries and how they can be used to mine data across a diverse range of integrated FungiDB datasets and records.
Collapse
Affiliation(s)
- Evelina Y Basenko
- Centre for Genomic Research, Institute of Integrative Biology, University of Liverpool, Liverpool L69 7ZB, UK.
| | - Jane A Pulman
- Centre for Genomic Research, Institute of Integrative Biology, University of Liverpool, Liverpool L69 7ZB, UK.
| | - Achchuthan Shanmugasundram
- Centre for Genomic Research, Institute of Integrative Biology, University of Liverpool, Liverpool L69 7ZB, UK.
| | - Omar S Harb
- Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA.
| | - Kathryn Crouch
- Wellcome Trust Centre for Molecular Parasitology, Glasgow G12 8TA, UK.
| | - David Starns
- Centre for Genomic Research, Institute of Integrative Biology, University of Liverpool, Liverpool L69 7ZB, UK.
| | - Susanne Warrenfeltz
- Center for Tropical and Emerging Global Diseases, Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA.
| | - Cristina Aurrecoechea
- Center for Tropical and Emerging Global Diseases, Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA.
| | | | - Jessica C Kissinger
- Center for Tropical and Emerging Global Diseases, Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA.
| | - David S Roos
- Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA.
| | - Christiane Hertz-Fowler
- Centre for Genomic Research, Institute of Integrative Biology, University of Liverpool, Liverpool L69 7ZB, UK.
| |
Collapse
|
7
|
Mello LV, Tregilgas L, Cowley G, Gupta A, Makki F, Jhutty A, Shanmugasundram A. 'Students-as-partners' scheme enhances postgraduate students' employability skills while addressing gaps in bioinformatics education. High Educ Pedagog 2017; 2:43-57. [PMID: 29098185 PMCID: PMC5632996 DOI: 10.1080/23752696.2017.1339287] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/29/2016] [Revised: 05/29/2017] [Accepted: 06/04/2017] [Indexed: 06/07/2023]
Abstract
Teaching bioinformatics is a longstanding challenge for educators who need to demonstrate to students how skills developed in the classroom may be applied to real world research. This study employed an action research methodology which utilised student-staff partnership and peer-learning. It was centred on the experiences of peer-facilitators, students who had previously taken a postgraduate bioinformatics module, and had applied knowledge and skills gained from it to their own research. It aimed to demonstrate to peer-receivers, current students, how bioinformatics could be used in their own research while developing peer-facilitators' teaching and mentoring skills. This student-centred approach was well received by the peer-receivers, who claimed to have gained improved understanding of bioinformatics and its relevance to research. Equally, peer-facilitators also developed a better understanding of the subject and appreciated that the activity was a rare and invaluable opportunity to develop their teaching and mentoring skills, enhancing their employability.
Collapse
Affiliation(s)
| | - Luke Tregilgas
- Arthritis Research UK Centre for Integrated Research into Musculoskeletal Ageing (CIMA), Department of Musculoskeletal Biology, Institute of Ageing and Chronic Disease, University of Liverpool, Liverpool, UK
| | - Gwen Cowley
- Institute of Integrative Biology, University of Liverpool, Liverpool, UK
| | - Anshul Gupta
- Institute of Integrative Biology, University of Liverpool, Liverpool, UK
| | - Fatima Makki
- Institute of Integrative Biology, University of Liverpool, Liverpool, UK
| | - Anjeet Jhutty
- Institute of Integrative Biology, University of Liverpool, Liverpool, UK
| | | |
Collapse
|
8
|
Aurrecoechea C, Barreto A, Basenko EY, Brestelli J, Brunk BP, Cade S, Crouch K, Doherty R, Falke D, Fischer S, Gajria B, Harb OS, Heiges M, Hertz-Fowler C, Hu S, Iodice J, Kissinger JC, Lawrence C, Li W, Pinney DF, Pulman JA, Roos DS, Shanmugasundram A, Silva-Franco F, Steinbiss S, Stoeckert CJ, Spruill D, Wang H, Warrenfeltz S, Zheng J. EuPathDB: the eukaryotic pathogen genomics database resource. Nucleic Acids Res 2016; 45:D581-D591. [PMID: 27903906 PMCID: PMC5210576 DOI: 10.1093/nar/gkw1105] [Citation(s) in RCA: 140] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2016] [Accepted: 10/28/2016] [Indexed: 12/26/2022] Open
Abstract
The Eukaryotic Pathogen Genomics Database Resource (EuPathDB, http://eupathdb.org) is a collection of databases covering 170+ eukaryotic pathogens (protists & fungi), along with relevant free-living and non-pathogenic species, and select pathogen hosts. To facilitate the discovery of meaningful biological relationships, the databases couple preconfigured searches with visualization and analysis tools for comprehensive data mining via intuitive graphical interfaces and APIs. All data are analyzed with the same workflows, including creation of gene orthology profiles, so data are easily compared across data sets, data types and organisms. EuPathDB is updated with numerous new analysis tools, features, data sets and data types. New tools include GO, metabolic pathway and word enrichment analyses plus an online workspace for analysis of personal, non-public, large-scale data. Expanded data content is mostly genomic and functional genomic data while new data types include protein microarray, metabolic pathways, compounds, quantitative proteomics, copy number variation, and polysomal transcriptomics. New features include consistent categorization of searches, data sets and genome browser tracks; redesigned gene pages; effective integration of alternative transcripts; and a EuPathDB Galaxy instance for private analyses of a user's data. Forthcoming upgrades include user workspaces for private integration of data with existing EuPathDB data and improved integration and presentation of host–pathogen interactions.
Collapse
Affiliation(s)
- Cristina Aurrecoechea
- Center for Tropical & Emerging Global Diseases, University of Georgia, Athens, GA 30602, USA
| | - Ana Barreto
- Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA.,Department of Genetics, School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Evelina Y Basenko
- Center for Tropical & Emerging Global Diseases, University of Georgia, Athens, GA 30602, USA
| | - John Brestelli
- Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA.,Department of Genetics, School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Brian P Brunk
- Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA.,Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Shon Cade
- Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Kathryn Crouch
- Wellcome Trust Centre for Molecular Parasitology, University of Glasgow, Glasgow G12 8TA, UK
| | - Ryan Doherty
- Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA.,Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Dave Falke
- Center for Tropical & Emerging Global Diseases, University of Georgia, Athens, GA 30602, USA
| | - Steve Fischer
- Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA.,Department of Genetics, School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Bindu Gajria
- Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA.,Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Omar S Harb
- Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA .,Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Mark Heiges
- Center for Tropical & Emerging Global Diseases, University of Georgia, Athens, GA 30602, USA
| | | | - Sufen Hu
- Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA.,Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - John Iodice
- Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA.,Department of Genetics, School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Jessica C Kissinger
- Center for Tropical & Emerging Global Diseases, University of Georgia, Athens, GA 30602, USA.,Department of Genetics, University of Georgia, Athens, GA 30602, USA.,Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA
| | - Cris Lawrence
- Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA.,Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Wei Li
- Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA.,Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Deborah F Pinney
- Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA.,Department of Genetics, School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Jane A Pulman
- Centre for Genomic Research, Institute of Integrative Biology, University of Liverpool, Liverpool L69 7ZB, UK
| | - David S Roos
- Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Achchuthan Shanmugasundram
- Centre for Genomic Research, Institute of Integrative Biology, University of Liverpool, Liverpool L69 7ZB, UK
| | - Fatima Silva-Franco
- Centre for Genomic Research, Institute of Integrative Biology, University of Liverpool, Liverpool L69 7ZB, UK
| | - Sascha Steinbiss
- Wellcome Trust Sanger Institute, Parasite Genomics, Wellcome Trust Genome Campus, Hinxton CB10 1SA, UK
| | - Christian J Stoeckert
- Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA.,Department of Genetics, School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Drew Spruill
- Center for Tropical & Emerging Global Diseases, University of Georgia, Athens, GA 30602, USA
| | - Haiming Wang
- Center for Tropical & Emerging Global Diseases, University of Georgia, Athens, GA 30602, USA
| | - Susanne Warrenfeltz
- Center for Tropical & Emerging Global Diseases, University of Georgia, Athens, GA 30602, USA
| | - Jie Zheng
- Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA.,Department of Genetics, School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| |
Collapse
|
9
|
Krishna R, Xia D, Sanderson S, Shanmugasundram A, Vermont S, Bernal A, Daniel-Naguib G, Ghali F, Brunk BP, Roos DS, Wastling JM, Jones AR. A large-scale proteogenomics study of apicomplexan pathogens-Toxoplasma gondii and Neospora caninum. Proteomics 2015; 15:2618-28. [PMID: 25867681 PMCID: PMC4692086 DOI: 10.1002/pmic.201400553] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2014] [Revised: 02/09/2015] [Accepted: 04/09/2015] [Indexed: 01/08/2023]
Abstract
Proteomics data can supplement genome annotation efforts, for example being used to confirm gene models or correct gene annotation errors. Here, we present a large-scale proteogenomics study of two important apicomplexan pathogens: Toxoplasma gondii and Neospora caninum. We queried proteomics data against a panel of official and alternate gene models generated directly from RNASeq data, using several newly generated and some previously published MS datasets for this meta-analysis. We identified a total of 201 996 and 39 953 peptide-spectrum matches for T. gondii and N. caninum, respectively, at a 1% peptide FDR threshold. This equated to the identification of 30 494 distinct peptide sequences and 2921 proteins (matches to official gene models) for T. gondii, and 8911 peptides/1273 proteins for N. caninum following stringent protein-level thresholding. We have also identified 289 and 140 loci for T. gondii and N. caninum, respectively, which mapped to RNA-Seq-derived gene models used in our analysis and apparently absent from the official annotation (release 10 from EuPathDB) of these species. We present several examples in our study where the RNA-Seq evidence can help in correction of the current gene model and can help in discovery of potential new genes. The findings of this study have been integrated into the EuPathDB. The data have been deposited to the ProteomeXchange with identifiers PXD000297and PXD000298.
Collapse
Affiliation(s)
- Ritesh Krishna
- Institute of Integrative Biology, University of Liverpool, Liverpool, Merseyside, UK.,Institute of Infection and Global Health, University of Liverpool, Liverpool, Merseyside, UK
| | - Dong Xia
- Institute of Infection and Global Health, University of Liverpool, Liverpool, Merseyside, UK
| | - Sanya Sanderson
- Institute of Infection and Global Health, University of Liverpool, Liverpool, Merseyside, UK
| | - Achchuthan Shanmugasundram
- Institute of Integrative Biology, University of Liverpool, Liverpool, Merseyside, UK.,Institute of Infection and Global Health, University of Liverpool, Liverpool, Merseyside, UK
| | - Sarah Vermont
- Institute of Infection and Global Health, University of Liverpool, Liverpool, Merseyside, UK
| | - Axel Bernal
- Department of Biology, University of Pennsylvania, Philadelphia, PA, USA
| | | | - Fawaz Ghali
- Institute of Integrative Biology, University of Liverpool, Liverpool, Merseyside, UK
| | - Brian P Brunk
- Department of Biology, University of Pennsylvania, Philadelphia, PA, USA
| | - David S Roos
- Department of Biology, University of Pennsylvania, Philadelphia, PA, USA
| | - Jonathan M Wastling
- Institute of Infection and Global Health, University of Liverpool, Liverpool, Merseyside, UK
| | - Andrew R Jones
- Institute of Integrative Biology, University of Liverpool, Liverpool, Merseyside, UK
| |
Collapse
|
10
|
Shanmugasundram A, Gonzalez-Galarza FF, Wastling JM, Vasieva O, Jones AR. An integrated approach to understand apicomplexan metabolism from their genomes. BMC Bioinformatics 2014. [PMCID: PMC4071867 DOI: 10.1186/1471-2105-15-s3-a3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
|
11
|
Shanmugasundram A, Gonzalez-Galarza FF, Wastling JM, Vasieva O, Jones AR. Library of Apicomplexan Metabolic Pathways: a manually curated database for metabolic pathways of apicomplexan parasites. Nucleic Acids Res 2012. [PMID: 23193253 PMCID: PMC3531055 DOI: 10.1093/nar/gks1139] [Citation(s) in RCA: 55] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The Library of Apicomplexan Metabolic Pathways (LAMP, http://www.llamp.net) is a web database that provides near complete mapping from genes to the central metabolic functions for some of the prominent intracellular parasites of the phylum Apicomplexa. This phylum includes the causative agents of malaria, toxoplasmosis and theileriosis-diseases with a huge economic and social impact. A number of apicomplexan genomes have been sequenced, but the accurate annotation of gene function remains challenging. We have adopted an approach called metabolic reconstruction, in which genes are systematically assigned to functions within pathways/networks for Toxoplasma gondii, Neospora caninum, Cryptosporidium and Theileria species, and Babesia bovis. Several functions missing from pathways have been identified, where the corresponding gene for an essential process appears to be absent from the current genome annotation. For each species, LAMP contains interactive diagrams of each pathway, hyperlinked to external resources and annotated with detailed information, including the sources of evidence used. We have also developed a section to highlight the overall metabolic capabilities of each species, such as the ability to synthesize or the dependence on the host for a particular metabolite. We expect this new database will become a valuable resource for fundamental and applied research on the Apicomplexa.
Collapse
Affiliation(s)
- Achchuthan Shanmugasundram
- Department of Functional and Comparative Genomics, Institute of Integrative Biology, University of Liverpool, Biosciences Building, Crown Street, Liverpool L69 7ZB, UK.
| | | | | | | | | |
Collapse
|