1
|
Gupta P, Elser J, Hooks E, D’Eustachio P, Jaiswal P, Naithani S. Plant Reactome Knowledgebase: empowering plant pathway exploration and OMICS data analysis. Nucleic Acids Res 2024; 52:D1538-D1547. [PMID: 37986220 PMCID: PMC10767815 DOI: 10.1093/nar/gkad1052] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2023] [Revised: 10/20/2023] [Accepted: 10/23/2023] [Indexed: 11/22/2023] Open
Abstract
Plant Reactome (https://plantreactome.gramene.org) is a freely accessible, comprehensive plant pathway knowledgebase. It provides curated reference pathways from rice (Oryza sativa) and gene-orthology-based pathway projections to 129 additional species, spanning single-cell photoautotrophs, non-vascular plants, and higher plants, thus encompassing a wide-ranging taxonomic diversity. Currently, Plant Reactome houses a collection of 339 reference pathways, covering metabolic and transport pathways, hormone signaling, genetic regulations of developmental processes, and intricate transcriptional networks that orchestrate a plant's response to abiotic and biotic stimuli. Beyond being a mere repository, Plant Reactome serves as a dynamic data discovery platform. Users can analyze and visualize omics data, such as gene expression, gene-gene interaction, proteome, and metabolome data, all within the rich context of plant pathways. Plant Reactome is dedicated to fostering data interoperability, upholding global data standards, and embracing the tenets of the Findable, Accessible, Interoperable and Re-usable (FAIR) data policy.
Collapse
Affiliation(s)
- Parul Gupta
- Department of Botany & Plant Pathology, Oregon State University, Corvallis, OR 97331, USA
| | - Justin Elser
- Department of Botany & Plant Pathology, Oregon State University, Corvallis, OR 97331, USA
| | - Elizabeth Hooks
- Department of Botany & Plant Pathology, Oregon State University, Corvallis, OR 97331, USA
| | | | - Pankaj Jaiswal
- Department of Botany & Plant Pathology, Oregon State University, Corvallis, OR 97331, USA
| | - Sushma Naithani
- Department of Botany & Plant Pathology, Oregon State University, Corvallis, OR 97331, USA
| |
Collapse
|
2
|
Gupta P, Geniza M, Elser J, Al-Bader N, Baschieri R, Phillips JL, Haq E, Preece J, Naithani S, Jaiswal P. Reference genome of the nutrition-rich orphan crop chia ( Salvia hispanica) and its implications for future breeding. FRONTIERS IN PLANT SCIENCE 2023; 14:1272966. [PMID: 38162307 PMCID: PMC10757625 DOI: 10.3389/fpls.2023.1272966] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/04/2023] [Accepted: 10/23/2023] [Indexed: 01/03/2024]
Abstract
Chia (Salvia hispanica L.) is one of the most popular nutrition-rich foods and pseudocereal crops of the family Lamiaceae. Chia seeds are a rich source of proteins, polyunsaturated fatty acids (PUFAs), dietary fibers, and antioxidants. In this study, we present the assembly of the chia reference genome, which spans 303.6 Mb and encodes 48,090 annotated protein-coding genes. Our analysis revealed that ~42% of the chia genome harbors repetitive content, and identified ~3 million single nucleotide polymorphisms (SNPs) and 15,380 simple sequence repeat (SSR) marker sites. By investigating the chia transcriptome, we discovered that ~44% of the genes undergo alternative splicing with a higher frequency of intron retention events. Additionally, we identified chia genes associated with important nutrient content and quality traits, such as the biosynthesis of PUFAs and seed mucilage fiber (dietary fiber) polysaccharides. Notably, this is the first report of in-silico annotation of a plant genome for protein-derived small bioactive peptides (biopeptides) associated with improving human health. To facilitate further research and translational applications of this valuable orphan crop, we have developed the Salvia genomics database (SalviaGDB), accessible at https://salviagdb.org.
Collapse
Affiliation(s)
- Parul Gupta
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR, United States
| | - Matthew Geniza
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR, United States
- Molecular and Cellular Biology Graduate Program, Oregon State University, Corvallis, OR, United States
| | - Justin Elser
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR, United States
| | - Noor Al-Bader
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR, United States
- Molecular and Cellular Biology Graduate Program, Oregon State University, Corvallis, OR, United States
| | - Rachel Baschieri
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR, United States
| | - Jeremy Levi Phillips
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR, United States
| | - Ebaad Haq
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR, United States
| | - Justin Preece
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR, United States
| | - Sushma Naithani
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR, United States
| | - Pankaj Jaiswal
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR, United States
| |
Collapse
|
3
|
Naithani S, Deng CH, Sahu SK, Jaiswal P. Exploring Pan-Genomes: An Overview of Resources and Tools for Unraveling Structure, Function, and Evolution of Crop Genes and Genomes. Biomolecules 2023; 13:1403. [PMID: 37759803 PMCID: PMC10527062 DOI: 10.3390/biom13091403] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Revised: 08/29/2023] [Accepted: 09/12/2023] [Indexed: 09/29/2023] Open
Abstract
The availability of multiple sequenced genomes from a single species made it possible to explore intra- and inter-specific genomic comparisons at higher resolution and build clade-specific pan-genomes of several crops. The pan-genomes of crops constructed from various cultivars, accessions, landraces, and wild ancestral species represent a compendium of genes and structural variations and allow researchers to search for the novel genes and alleles that were inadvertently lost in domesticated crops during the historical process of crop domestication or in the process of extensive plant breeding. Fortunately, many valuable genes and alleles associated with desirable traits like disease resistance, abiotic stress tolerance, plant architecture, and nutrition qualities exist in landraces, ancestral species, and crop wild relatives. The novel genes from the wild ancestors and landraces can be introduced back to high-yielding varieties of modern crops by implementing classical plant breeding, genomic selection, and transgenic/gene editing approaches. Thus, pan-genomic represents a great leap in plant research and offers new avenues for targeted breeding to mitigate the impact of global climate change. Here, we summarize the tools used for pan-genome assembly and annotations, web-portals hosting plant pan-genomes, etc. Furthermore, we highlight a few discoveries made in crops using the pan-genomic approach and future potential of this emerging field of study.
Collapse
Affiliation(s)
- Sushma Naithani
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR 97331, USA;
| | - Cecilia H. Deng
- Molecular & Digital Breeing Group, New Cultivar Innovation, The New Zealand Institute for Plant and Food Research Limited, Private Bag 92169, Auckland 1142, New Zealand;
| | - Sunil Kumar Sahu
- State Key Laboratory of Agricultural Genomics, Key Laboratory of Genomics, Ministry of Agriculture, BGI Research, Shenzhen 518083, China;
| | - Pankaj Jaiswal
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR 97331, USA;
| |
Collapse
|
4
|
pSBVB: A Versatile Simulation Tool To Evaluate Genomic Selection in Polyploid Species. G3-GENES GENOMES GENETICS 2019; 9:327-334. [PMID: 30573468 PMCID: PMC6385978 DOI: 10.1534/g3.118.200942] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Genomic Selection (GS) is the procedure whereby molecular information is used to predict complex phenotypes and it is standard in many animal and plant breeding schemes. However, only a small number of studies have been reported in horticultural crops, and in polyploid species in particular. In this paper, we have developed a versatile forward simulation tool, called polyploid Sequence Based Virtual Breeding (pSBVB), to evaluate GS strategies in polyploids; pSBVB is an efficient gene dropping software that can simulate any number of complex phenotypes, allowing a very flexible modeling of phenotypes suited to polyploids. As input, it takes genotype data from the founder population, which can vary from single nucleotide polymorphisms (SNP) chips up to sequence, a list of causal variants for every trait and their heritabilities, and the pedigree. Recombination rates between homeologous chromosomes can be specified, so that both allo- and autopolyploid species can be considered. The program outputs phenotype and genotype data for all individuals in the pedigree. Optionally, it can produce several genomic relationship matrices that consider exact or approximate genotype values. pSBVB can therefore be used to evaluate GS strategies in polyploid species (say varying SNP density, genetic architecture or population size, among other factors), or to optimize experimental designs for association studies. We illustrate pSBVB with SNP data from tetraploid potato and partial sequence data from octoploid strawberry, and we show that GS is a promising breeding strategy for polyploid species but that the actual advantage critically depends on the underlying genetic architecture. Source code, examples and a complete manual are freely available in GitHub https://github.com/lauzingaretti/pSBVB.
Collapse
|
5
|
Naithani S, Gupta P, Preece J, Garg P, Fraser V, Padgitt-Cobb LK, Martin M, Vining K, Jaiswal P. Involving community in genes and pathway curation. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2019; 2019:5289625. [PMID: 30649295 PMCID: PMC6334007 DOI: 10.1093/database/bay146] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/08/2018] [Accepted: 12/11/2018] [Indexed: 12/25/2022]
Abstract
Biocuration plays a crucial role in building databases and complex systems-level platforms required for processing, annotating and analyzing ‘Big Data’ in biology. However, biocuration efforts cannot keep pace with a dramatic increase in the production of omics data; this presents one of the bottlenecks in genomics. In two pathway curation jamborees, Plant Reactome curators tested strategies for introducing researchers to pathway curation tools, harnessing biologists’ expertise in curating plant pathways and developing a network of community biocurators. We summarize the strategy, workflow and outcomes of these exercises, and discuss the role of community biocuration in advancing databases and genomic resources.
Collapse
Affiliation(s)
- Sushma Naithani
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR, USA
| | - Parul Gupta
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR, USA
| | - Justin Preece
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR, USA
| | - Priyanka Garg
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR, USA
| | - Valerie Fraser
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR, USA.,Molecular and Cellular Biology Graduate Program, Oregon State University, Corvallis, OR, USA
| | | | - Matthew Martin
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR, USA
| | - Kelly Vining
- Department of Horticulture, Oregon State University, Corvallis, OR, USA
| | - Pankaj Jaiswal
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR, USA
| |
Collapse
|
6
|
Edger PP, VanBuren R, Colle M, Poorten TJ, Wai CM, Niederhuth CE, Alger EI, Ou S, Acharya CB, Wang J, Callow P, McKain MR, Shi J, Collier C, Xiong Z, Mower JP, Slovin JP, Hytönen T, Jiang N, Childs KL, Knapp SJ. Single-molecule sequencing and optical mapping yields an improved genome of woodland strawberry (Fragaria vesca) with chromosome-scale contiguity. Gigascience 2018; 7:1-7. [PMID: 29253147 PMCID: PMC5801600 DOI: 10.1093/gigascience/gix124] [Citation(s) in RCA: 154] [Impact Index Per Article: 25.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2017] [Accepted: 11/30/2017] [Indexed: 12/18/2022] Open
Abstract
Background Although draft genomes are available for most agronomically important plant species, the majority are incomplete, highly fragmented, and often riddled with assembly and scaffolding errors. These assembly issues hinder advances in tool development for functional genomics and systems biology. Findings Here we utilized a robust, cost-effective approach to produce high-quality reference genomes. We report a near-complete genome of diploid woodland strawberry (Fragaria vesca) using single-molecule real-time sequencing from Pacific Biosciences (PacBio). This assembly has a contig N50 length of ∼7.9 million base pairs (Mb), representing a ∼300-fold improvement of the previous version. The vast majority (>99.8%) of the assembly was anchored to 7 pseudomolecules using 2 sets of optical maps from Bionano Genomics. We obtained ∼24.96 Mb of sequence not present in the previous version of the F. vesca genome and produced an improved annotation that includes 1496 new genes. Comparative syntenic analyses uncovered numerous, large-scale scaffolding errors present in each chromosome in the previously published version of the F. vesca genome. Conclusions Our results highlight the need to improve existing short-read based reference genomes. Furthermore, we demonstrate how genome quality impacts commonly used analyses for addressing both fundamental and applied biological questions.
Collapse
Affiliation(s)
- Patrick P Edger
- Department of Horticulture, Ecology, Evolutionary Biology, and Behavior, Department of Plant Biology, and Center for Genomics Enabled Plant Science, Michigan State University, East Lansing, Michigan, 48823.,Ecology, Evolutionary Biology, and Behavior, Department of Plant Biology, and Center for Genomics Enabled Plant Science, Michigan State University, East Lansing, Michigan, 48823
| | - Robert VanBuren
- Department of Horticulture, Ecology, Evolutionary Biology, and Behavior, Department of Plant Biology, and Center for Genomics Enabled Plant Science, Michigan State University, East Lansing, Michigan, 48823
| | - Marivi Colle
- Department of Horticulture, Ecology, Evolutionary Biology, and Behavior, Department of Plant Biology, and Center for Genomics Enabled Plant Science, Michigan State University, East Lansing, Michigan, 48823
| | - Thomas J Poorten
- Department of Plant Sciences, University of California - Davis, Davis, California, 95616
| | - Ching Man Wai
- Department of Horticulture, Ecology, Evolutionary Biology, and Behavior, Department of Plant Biology, and Center for Genomics Enabled Plant Science, Michigan State University, East Lansing, Michigan, 48823
| | - Chad E Niederhuth
- Department of Genetics, University of Georgia, Athens, Georgia, 30602
| | - Elizabeth I Alger
- Department of Horticulture, Ecology, Evolutionary Biology, and Behavior, Department of Plant Biology, and Center for Genomics Enabled Plant Science, Michigan State University, East Lansing, Michigan, 48823
| | - Shujun Ou
- Department of Horticulture, Ecology, Evolutionary Biology, and Behavior, Department of Plant Biology, and Center for Genomics Enabled Plant Science, Michigan State University, East Lansing, Michigan, 48823.,Ecology, Evolutionary Biology, and Behavior, Department of Plant Biology, and Center for Genomics Enabled Plant Science, Michigan State University, East Lansing, Michigan, 48823
| | - Charlotte B Acharya
- Department of Plant Sciences, University of California - Davis, Davis, California, 95616
| | - Jie Wang
- Department of Plant Biology, and Center for Genomics Enabled Plant Science, Michigan State University, East Lansing, Michigan, 48823
| | - Pete Callow
- Department of Horticulture, Ecology, Evolutionary Biology, and Behavior, Department of Plant Biology, and Center for Genomics Enabled Plant Science, Michigan State University, East Lansing, Michigan, 48823
| | - Michael R McKain
- Donald Danforth Plant Science Center, St. Louis, Missouri, 63132
| | - Jinghua Shi
- Bionano Genomics, San Diego, California, 92121
| | | | - Zhiyong Xiong
- Potato Engineering and Technology Research Center, Inner Mongolia University, Hohhot, 010021, China
| | - Jeffrey P Mower
- Center for Plant Science Innovation, University of Nebraska, Lincoln, Nebraska, 68588
| | - Janet P Slovin
- USDA/ARS, Genetic Improvement of Fruits and Vegetables Laboratory, Beltsville, Maryland, 20705
| | - Timo Hytönen
- Department of Agricultural Sciences, Viikki Plant Science Centre, University of Helsinki, Helsinki, 00014, Finland
| | - Ning Jiang
- Department of Horticulture, Ecology, Evolutionary Biology, and Behavior, Department of Plant Biology, and Center for Genomics Enabled Plant Science, Michigan State University, East Lansing, Michigan, 48823.,Ecology, Evolutionary Biology, and Behavior, Department of Plant Biology, and Center for Genomics Enabled Plant Science, Michigan State University, East Lansing, Michigan, 48823
| | - Kevin L Childs
- Department of Plant Biology, and Center for Genomics Enabled Plant Science, Michigan State University, East Lansing, Michigan, 48823.,Center for Genomics Enabled Plant Science, Michigan State University, East Lansing, Michigan, 48823
| | - Steven J Knapp
- Department of Plant Sciences, University of California - Davis, Davis, California, 95616
| |
Collapse
|
7
|
Foerster H, Bombarely A, Battey JND, Sierro N, Ivanov NV, Mueller LA. SolCyc: a database hub at the Sol Genomics Network (SGN) for the manual curation of metabolic networks in Solanum and Nicotiana specific databases. Database (Oxford) 2018; 2018:4995113. [PMID: 29762652 PMCID: PMC5946812 DOI: 10.1093/database/bay035] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2017] [Revised: 03/13/2018] [Accepted: 03/15/2018] [Indexed: 01/20/2023]
Abstract
Database URL https://solgenomics.net/tools/solcyc/.
Collapse
Affiliation(s)
- Hartmut Foerster
- Boyce Thompson Institute, 533 Tower Road, Ithaca, New York, 14853-1801, USA
| | - Aureliano Bombarely
- Department of Horticulture, Virginia Polytechnic Institute and State University, 220 Ag Quad Lane, Blacksburg, VA 24061, USA
| | - James N D Battey
- PMI R&D, Philip Morris Products S.A (Part of Philip Morris International group of companies), Quai Jeanrenaud 6, Neuchâtel CH-2000, Switzerland
| | - Nicolas Sierro
- PMI R&D, Philip Morris Products S.A (Part of Philip Morris International group of companies), Quai Jeanrenaud 6, Neuchâtel CH-2000, Switzerland
| | - Nikolai V Ivanov
- PMI R&D, Philip Morris Products S.A (Part of Philip Morris International group of companies), Quai Jeanrenaud 6, Neuchâtel CH-2000, Switzerland
| | - Lukas A Mueller
- Boyce Thompson Institute, 533 Tower Road, Ithaca, New York, 14853-1801, USA
| |
Collapse
|
8
|
Pathway Analysis and Omics Data Visualization Using Pathway Genome Databases: FragariaCyc, a Case Study. Methods Mol Biol 2016. [PMID: 27987175 DOI: 10.1007/978-1-4939-6658-5_14] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/29/2023]
Abstract
The species-specific plant Pathway Genome Databases (PGDBs) based on the BioCyc platform provide a conceptual model of the cellular metabolic network of an organism. Such frameworks allow analysis of the genome-scale expression data to understand changes in the overall metabolisms of an organism (or organs, tissues, and cells) in response to various extrinsic (e.g. developmental and differentiation) and/or extrinsic signals (e.g. pathogens and abiotic stresses) from the surrounding environment. Using FragariaCyc, a pathway database for the diploid strawberry Fragaria vesca, we show (1) the basic navigation across a PGDB; (2) a case study of pathway comparison across plant species; and (3) an example of RNA-Seq data analysis using Omics Viewer tool. The protocols described here generally apply to other Pathway Tools-based PGDBs.
Collapse
|