1
|
van den Brandt A, Jonkheer EM, van Workum DJM, van de Wetering H, Smit S, Vilanova A. PanVA: Pangenomic Variant Analysis. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS 2024; 30:4895-4909. [PMID: 37267130 DOI: 10.1109/tvcg.2023.3282364] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
Genomics researchers increasingly use multiple reference genomes to comprehensively explore genetic variants underlying differences in detectable characteristics between organisms. Pangenomes allow for an efficient data representation of multiple related genomes and their associated metadata. However, current visual analysis approaches for exploring these complex genotype-phenotype relationships are often based on single reference approaches or lack adequate support for interpreting the variants in the genomic context with heterogeneous (meta)data. This design study introduces PanVA, a visual analytics design for pangenomic variant analysis developed with the active participation of genomics researchers. The design uniquely combines tailored visual representations with interactions such as sorting, grouping, and aggregation, allowing users to navigate and explore different perspectives on complex genotype-phenotype relations. Through evaluation in the context of plants and pathogen research, we show that PanVA helps researchers explore variants in genes and generate hypotheses about their role in phenotypic variation.
Collapse
|
2
|
Bayer PE, Petereit J, Durant É, Monat C, Rouard M, Hu H, Chapman B, Li C, Cheng S, Batley J, Edwards D. Wheat Panache: A pangenome graph database representing presence-absence variation across sixteen bread wheat genomes. THE PLANT GENOME 2022; 15:e20221. [PMID: 35644986 DOI: 10.1002/tpg2.20221] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/04/2022] [Accepted: 04/11/2022] [Indexed: 06/15/2023]
Abstract
Bread wheat (Triticum aestivum L.) is one of humanity's most important staple crops, characterized by a large and complex genome with a high level of gene presence-absence variation (PAV) between cultivars, hampering genomic approaches for crop improvement. With the growing global population and the increasing impact of climate change on crop yield, there is an urgent need to apply genomic approaches to accelerate wheat breeding. With recent advances in DNA sequencing technology, a growing number of high-quality reference genomes are becoming available, reflecting the genetic content of a diverse range of cultivars. However, information on the presence or absence of genomic regions has been hard to visualize and interrogate because of the size of these genomes and the lack of suitable bioinformatics tools. To address this limitation, we have produced a wheat pangenome graph maintained within an online database to facilitate interrogation and comparison of wheat cultivar genomes. The database allows users to visualize regions of the pangenome to assess PAV between bread wheat genomes.
Collapse
Affiliation(s)
- Philipp E Bayer
- School of Biological Sciences, The Univ. of Western Australia, Perth, 6009, Australia
| | - Jakob Petereit
- School of Biological Sciences, The Univ. of Western Australia, Perth, 6009, Australia
| | - Éloi Durant
- DIADE, Univ. of Montpellier, CIRAD, IRD, Montpellier, 34830, France
- Syngenta Seeds S.A.S., 12 chemin de l'Hobit, Saint-Sauveur, 31790, France
- Bioversity International, Parc Scientifique Agropolis II, Montpellier, 34397, France
- French Institute of Bioinformatics (IFB)-South Green Bioinformatics Platform, Bioversity, CIRAD, INRAE, IRD, Montpellier, 34398, France
| | - Cécile Monat
- Syngenta Seeds S.A.S., 12 chemin de l'Hobit, Saint-Sauveur, 31790, France
| | - Mathieu Rouard
- Bioversity International, Parc Scientifique Agropolis II, Montpellier, 34397, France
- French Institute of Bioinformatics (IFB)-South Green Bioinformatics Platform, Bioversity, CIRAD, INRAE, IRD, Montpellier, 34398, France
| | - Haifei Hu
- Western Crop Genetics Alliance, Murdoch Univ., 90 South Street, Murdoch, 6150, Australia
| | - Brett Chapman
- Western Crop Genetics Alliance, Murdoch Univ., 90 South Street, Murdoch, 6150, Australia
| | - Chengdao Li
- Western Crop Genetics Alliance, Murdoch Univ., 90 South Street, Murdoch, 6150, Australia
| | - Shifeng Cheng
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Jacqueline Batley
- School of Biological Sciences, The Univ. of Western Australia, Perth, 6009, Australia
| | - David Edwards
- School of Biological Sciences, The Univ. of Western Australia, Perth, 6009, Australia
| |
Collapse
|
3
|
Brown AV, Conners SI, Huang W, Wilkey AP, Grant D, Weeks NT, Cannon SB, Graham MA, Nelson RT. A new decade and new data at SoyBase, the USDA-ARS soybean genetics and genomics database. Nucleic Acids Res 2021; 49:D1496-D1501. [PMID: 33264401 PMCID: PMC7778910 DOI: 10.1093/nar/gkaa1107] [Citation(s) in RCA: 31] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2020] [Revised: 10/18/2020] [Accepted: 11/23/2020] [Indexed: 01/15/2023] Open
Abstract
SoyBase, a USDA genetic and genomics database, holds professionally curated soybean genetic and genomic data, which is integrated and made accessible to researchers and breeders. The site holds several reference genome assemblies, as well as genetic maps, thousands of mapped traits, expression and epigenetic data, pedigree information, and extensive variant and genotyping data sets. SoyBase displays include genetic, genomic, and epigenetic maps of the soybean genome. Gene expression data is presented in the genome viewer as heat maps and pictorial and tabular displays in gene report pages. Millions of sequence variants have been added, representing variations across various collections of cultivars. This variant data is explorable using new interactive tools to visualize the distribution of those variants across the genome, between selected accessions. SoyBase holds several reference-quality soybean genome assemblies, accessible via various query tools and browsers, including a new visualization system for exploring the soybean pan-genome. SoyBase also serves as a nexus of announcements pertinent to the greater soybean research community. The database also includes a soybean-specific anatomic and biochemical trait ontology. The database can be accessed at https://soybase.org.
Collapse
Affiliation(s)
- Anne V Brown
- USDA-ARS Corn Insects and Crop Genetics Research Unit, Ames, IA, USA
| | - Shawn I Conners
- USDA-ARS Corn Insects and Crop Genetics Research Unit, Ames, IA, USA
| | - Wei Huang
- USDA-ARS Corn Insects and Crop Genetics Research Unit, Ames, IA, USA
| | - Andrew P Wilkey
- ORISE Fellow USDA-ARS Corn Insects and Crop Genetics Research Unit, Ames, IA, USA
| | - David Grant
- USDA-ARS Corn Insects and Crop Genetics Research Unit, Ames, IA, USA
- Department of Agronomy, Iowa State University, Ames, IA, USA
| | - Nathan T Weeks
- USDA-ARS Corn Insects and Crop Genetics Research Unit, Ames, IA, USA
| | - Steven B Cannon
- USDA-ARS Corn Insects and Crop Genetics Research Unit, Ames, IA, USA
| | - Michelle A Graham
- USDA-ARS Corn Insects and Crop Genetics Research Unit, Ames, IA, USA
| | - Rex T Nelson
- USDA-ARS Corn Insects and Crop Genetics Research Unit, Ames, IA, USA
| |
Collapse
|
4
|
Samy JKA, Rognli OA, Kovi MR. ForageGrassBase: molecular resource for the forage grass meadow fescue (Festuca pratensis Huds.). DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2020; 2020:5856901. [PMID: 32539086 PMCID: PMC7294657 DOI: 10.1093/database/baaa046] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/03/2019] [Revised: 12/24/2019] [Accepted: 05/21/2020] [Indexed: 11/14/2022]
Abstract
Meadow fescue (Festuca pratensis Huds.) is one of the most important forage grasses in temperate regions. It is a diploid (2n = 14) outbreeding species that belongs to the genus Festuca. Together with Lolium perenne, they are the most important genera of forage grasses. Meadow fescue has very high quality of yield with good winter survival and persistency. However, extensive genomic resources for meadow fescue have not become available so far. To address this lack of comprehensive publicly available datasets, we have developed functionally annotated draft genome sequences of two meadow fescue genotypes, ‘HF7/2’ and ‘B14/16’, and constructed the platform ForageGrassBase, available at http://foragegrass.org/, for data visualization, download and querying. This is the first open-access platform that provides extensive genomic resources related to this forage grass species. The current database provides the most up-to-date draft genome sequence along with structural and functional annotations for genes that can be accessed using Genome Browser (GBrowse), along with comparative genomic alignments to Arabidopsis, L. perenne, barley, rice, Brachypodium and maize genomes. We have integrated homologous search tool BLAST also for the users to analyze their data. Combined, GBrowse, BLAST and downloadable data gives a user-friendly access to meadow fescue genomic resources. To our knowledge, ForageGrassBase is the first genome database dedicated to forage grasses. The current forage grass database provides valuable resources for a range of research fields related to meadow fescue and other forage crop species, as well as for plant research communities in general. The genome database can be accessed at http://foragegrass.org.
Collapse
Affiliation(s)
- Jeevan Karloss Antony Samy
- Department of Plant Sciences (IPV), Faculty of Biosciences (BIOVIT), Norwegian University of Life Sciences (NMBU), Ås, Akershus 1432, Norway
| | - Odd Arne Rognli
- Department of Plant Sciences (IPV), Faculty of Biosciences (BIOVIT), Norwegian University of Life Sciences (NMBU), Ås, Akershus 1432, Norway
| | - Mallikarjuna Rao Kovi
- Department of Plant Sciences (IPV), Faculty of Biosciences (BIOVIT), Norwegian University of Life Sciences (NMBU), Ås, Akershus 1432, Norway
| |
Collapse
|
5
|
Cho KT, Portwood JL, Gardiner JM, Harper LC, Lawrence-Dill CJ, Friedberg I, Andorf CM. MaizeDIG: Maize Database of Images and Genomes. FRONTIERS IN PLANT SCIENCE 2019; 10:1050. [PMID: 31555312 PMCID: PMC6724615 DOI: 10.3389/fpls.2019.01050] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/15/2019] [Accepted: 07/29/2019] [Indexed: 05/23/2023]
Abstract
Background: An organism can be described by its observable features (phenotypes) and the genes and genomic information (genotypes) that cause these phenotypes. For many decades, researchers have tried to find relationships between genotypes and phenotypes, and great strides have been made. However, improved methods and tools for discovering and visualizing these phenotypic relationships are still needed. The maize genetics and genomics database (MaizeGDB, www.maizegdb.org) provides an array of useful resources for diverse data types including thousands of images related to mutant phenotypes in Zea mays ssp. mays (maize). To integrate mutant phenotype images with genomics information, we implemented and enhanced the web-based software package BioDIG (Biological Database of Images and Genomes). Findings: We developed a genotype-phenotype database for maize called MaizeDIG. MaizeDIG has several enhancements over the original BioDIG package. MaizeDIG, which supports multiple reference genome assemblies, is seamlessly integrated with genome browsers to accommodate custom tracks showing tagged mutant phenotypes images in their genomic context and allows for custom tagging of images to highlight the phenotype. This is accomplished through an updated interface allowing users to create image-to-gene links and is accessible via the image search tool. Conclusions: We have created a user-friendly and extensible web-based resource called MaizeDIG. MaizeDIG is preloaded with 2,396 images that are available on genome browsers for 10 different maize reference genomes. Approximately 90 images of classically defined maize genes have been manually annotated. MaizeDIG is available at http://maizedig.maizegdb.org/. The code is free and open source and can be found at https://github.com/Maize-Genetics-and-Genomics-Database/maizedig.
Collapse
Affiliation(s)
- Kyoung Tak Cho
- Department of Computer Science, Iowa State University, Ames, IA, United States
| | - John L. Portwood
- USDA-ARS Corn Insects and Crop Genetics Research Unit, Iowa State University, Ames, IA, United States
| | - Jack M. Gardiner
- Division of Animal Sciences, University of Missouri, Columbia, MO, United States
| | - Lisa C. Harper
- USDA-ARS Corn Insects and Crop Genetics Research Unit, Iowa State University, Ames, IA, United States
| | - Carolyn J. Lawrence-Dill
- Department of Genetics, Development and Cell Biology, Iowa State University, Ames, IA, United States
- Department of Agronomy, Iowa State University, Ames, IA, United States
- Bioinformatics and Computational Biology Graduate Program, Iowa State University, Ames, IA, United States
| | - Iddo Friedberg
- Bioinformatics and Computational Biology Graduate Program, Iowa State University, Ames, IA, United States
- Department of Veterinary Microbiology and Preventive Medicine, Iowa State University, Ames, IA, United States
| | - Carson M. Andorf
- USDA-ARS Corn Insects and Crop Genetics Research Unit, Iowa State University, Ames, IA, United States
| |
Collapse
|
6
|
Song S, Tian D, Li C, Tang B, Dong L, Xiao J, Bao Y, Zhao W, He H, Zhang Z. Genome Variation Map: a data repository of genome variations in BIG Data Center. Nucleic Acids Res 2019; 46:D944-D949. [PMID: 29069473 PMCID: PMC5753358 DOI: 10.1093/nar/gkx986] [Citation(s) in RCA: 42] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2017] [Accepted: 10/11/2017] [Indexed: 11/16/2022] Open
Abstract
The Genome Variation Map (GVM; http://bigd.big.ac.cn/gvm/) is a public data repository of genome variations. As a core resource in the BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, GVM dedicates to collect, integrate and visualize genome variations for a wide range of species, accepts submissions of different types of genome variations from all over the world and provides free open access to all publicly available data in support of worldwide research activities. Unlike existing related databases, GVM features integration of a large number of genome variations for a broad diversity of species including human, cultivated plants and domesticated animals. Specifically, the current implementation of GVM not only houses a total of ∼4.9 billion variants for 19 species including chicken, dog, goat, human, poplar, rice and tomato, but also incorporates 8669 individual genotypes and 13 262 manually curated high-quality genotype-to-phenotype associations for non-human species. In addition, GVM provides friendly intuitive web interfaces for data submission, browse, search and visualization. Collectively, GVM serves as an important resource for archiving genomic variation data, helpful for better understanding population genetic diversity and deciphering complex mechanisms associated with different phenotypes.
Collapse
Affiliation(s)
- Shuhui Song
- BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Dongmei Tian
- BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Cuiping Li
- BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Bixia Tang
- BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,University of Chinese Academy of Sciences, Beijing 100049, China
| | - Lili Dong
- BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Jingfa Xiao
- BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,University of Chinese Academy of Sciences, Beijing 100049, China.,Collaborative Innovation Center of Genetics and Development, Fudan University, Shanghai 200438, China
| | - Yiming Bao
- BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Wenming Zhao
- BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Hang He
- School of Life Sciences, Peking University, Beijing 100871, China
| | - Zhang Zhang
- BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,University of Chinese Academy of Sciences, Beijing 100049, China.,Collaborative Innovation Center of Genetics and Development, Fudan University, Shanghai 200438, China
| |
Collapse
|
7
|
Nusrat S, Harbig T, Gehlenborg N. Tasks, Techniques, and Tools for Genomic Data Visualization. COMPUTER GRAPHICS FORUM : JOURNAL OF THE EUROPEAN ASSOCIATION FOR COMPUTER GRAPHICS 2019; 38:781-805. [PMID: 31768085 PMCID: PMC6876635 DOI: 10.1111/cgf.13727] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/06/2023]
Abstract
Genomic data visualization is essential for interpretation and hypothesis generation as well as a valuable aid in communicating discoveries. Visual tools bridge the gap between algorithmic approaches and the cognitive skills of investigators. Addressing this need has become crucial in genomics, as biomedical research is increasingly data-driven and many studies lack well-defined hypotheses. A key challenge in data-driven research is to discover unexpected patterns and to formulate hypotheses in an unbiased manner in vast amounts of genomic and other associated data. Over the past two decades, this has driven the development of numerous data visualization techniques and tools for visualizing genomic data. Based on a comprehensive literature survey, we propose taxonomies for data, visualization, and tasks involved in genomic data visualization. Furthermore, we provide a comprehensive review of published genomic visualization tools in the context of the proposed taxonomies.
Collapse
Affiliation(s)
- S Nusrat
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - T Harbig
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - N Gehlenborg
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| |
Collapse
|
8
|
Stam R, Münsterkötter M, Pophaly SD, Fokkens L, Sghyer H, Güldener U, Hückelhoven R, Hess M. A New Reference Genome Shows the One-Speed Genome Structure of the Barley Pathogen Ramularia collo-cygni. Genome Biol Evol 2018; 10:3243-3249. [PMID: 30371775 PMCID: PMC6301796 DOI: 10.1093/gbe/evy240] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/26/2018] [Indexed: 01/17/2023] Open
Abstract
Ramularia leaf spot has recently emerged as a major threat to barley production world-wide, causing 25% yield loss in many barley growing regions. Here, we provide a new reference genome of the causal agent, the Dothideomycete Ramularia collo-cygni. The assembly of 32 Mb consists of 78 scaffolds. We used RNA-seq to identify 11,622 genes of which 1,303 and 282 are coding for predicted secreted proteins and putative effectors respectively. The pathogen separated from its nearest sequenced relative, Zymoseptoria tritici ∼27 Ma. We calculated the divergence of the two species on protein level and see remarkably high synonymous and nonsynonymous divergence. Unlike in many other plant pathogens, the comparisons of transposable elements and gene distributions, show a very homogeneous genome for R. collo-cygni. We see no evidence for higher selective pressure on putative effectors or other secreted proteins and repetitive sequences are spread evenly across the scaffolds. These findings could be associated to the predominantly endophytic life-style of the pathogen. We hypothesize that R. collo-cygni only recently became pathogenic and that therefore its genome does not yet show the typical pathogen characteristics. Because of its high scaffold length and improved CDS annotations, our new reference sequence provides a valuable resource for the community for future comparative genomics and population genetics studies.
Collapse
Affiliation(s)
- Remco Stam
- Chair of Phytopathology, School of Life Sciences Weihenstephan, Technische University Munich, Germany
| | - Martin Münsterkötter
- Functional Genomics and Bioinformatics, Research Centre for Forestry and Wood Industry, University of Sopron, Hungary.,Institute of Bioinformatics and Systems Biology, Helmholtz Centre Munich, Germany
| | - Saurabh Dilip Pophaly
- Section of Population Genetics, School of Life Sciences Weihenstephan, Technische Universität München, Germany.,Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Sweden and Division of Evolutionary Biology, Faculty of Biology II, Ludwig-Maximilians-Universität München, Germany
| | - Like Fokkens
- Molecular Plant Pathology, Swammerdam Institute for Life Sciences, University of Amsterdam, The Netherlands
| | - Hind Sghyer
- Chair of Phytopathology, School of Life Sciences Weihenstephan, Technische University Munich, Germany
| | - Ulrich Güldener
- Department of Bioinformatics, School of Life Sciences Weihenstephan, Technische University Munich, Germany
| | - Ralph Hückelhoven
- Chair of Phytopathology, School of Life Sciences Weihenstephan, Technische University Munich, Germany
| | - Michael Hess
- Chair of Phytopathology, School of Life Sciences Weihenstephan, Technische University Munich, Germany
| |
Collapse
|
9
|
Howe DG, Blake JA, Bradford YM, Bult CJ, Calvi BR, Engel SR, Kadin JA, Kaufman TC, Kishore R, Laulederkind SJF, Lewis SE, Moxon SAT, Richardson JE, Smith C. Model organism data evolving in support of translational medicine. Lab Anim (NY) 2018; 47:277-289. [PMID: 30224793 PMCID: PMC6322546 DOI: 10.1038/s41684-018-0150-4] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2018] [Accepted: 08/13/2018] [Indexed: 02/07/2023]
Abstract
Model organism databases (MODs) have been collecting and integrating biomedical research data for 30 years and were designed to meet specific needs of each model organism research community. The contributions of model organism research to understanding biological systems would be hard to overstate. Modern molecular biology methods and cost reductions in nucleotide sequencing have opened avenues for direct application of model organism research to elucidating mechanisms of human diseases. Thus, the mandate for model organism research and databases has now grown to include facilitating use of these data in translational applications. Challenges in meeting this opportunity include the distribution of research data across many databases and websites, a lack of data format standards for some data types, and sustainability of scale and cost for genomic database resources like MODs. The issues of widely distributed data and application of data standards are some of the challenges addressed by FAIR (Findable, Accessible, Interoperable, and Re-usable) data principles. The Alliance of Genome Resources is now moving to address these challenges by bringing together expertly curated research data from fly, mouse, rat, worm, yeast, zebrafish, and the Gene Ontology consortium. Centralized multi-species data access, integration, and format standardization will lower the data utilization barrier in comparative genomics and translational applications and will provide a framework in which sustainable scale and cost can be addressed. This article presents a brief historical perspective on how the Alliance model organisms are complementary and how they have already contributed to understanding the etiology of human diseases. In addition, we discuss four challenges for using data from MODs in translational applications and how the Alliance is working to address them, in part by applying FAIR data principles. Ultimately, combined data from these animal models are more powerful than the sum of the parts.
Collapse
Affiliation(s)
- Douglas G Howe
- The Institute of Neuroscience, University of Oregon, Eugene, OR, USA.
| | | | - Yvonne M Bradford
- The Institute of Neuroscience, University of Oregon, Eugene, OR, USA
| | | | - Brian R Calvi
- Department of Biology, Indiana University, Bloomington, IN, USA
| | - Stacia R Engel
- Department of Genetics, Stanford University, Palo Alto, CA, USA
| | | | | | - Ranjana Kishore
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, USA
| | - Stanley J F Laulederkind
- Department of Biomedical Engineering, Medical College of Wisconsin and Marquette University, Milwaukee, WI, USA
| | - Suzanna E Lewis
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Sierra A T Moxon
- The Institute of Neuroscience, University of Oregon, Eugene, OR, USA
| | | | | |
Collapse
|
10
|
Using TropGeneDB: A Database Containing Data on Molecular Markers, QTLs, Maps, Genotypes, and Phenotypes for Tropical Crops. Methods Mol Biol 2018; 1533:161-172. [PMID: 27987169 DOI: 10.1007/978-1-4939-6658-5_8] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
TropGeneDB ( http://tropgenedb.cirad.fr ) is a web database that manages genomic, genetic, and phenotypic information on tropical crops. It is organized on a crop basis with currently nine public modules: banana, cocoa, coconut, coffee, cotton, oil palm, rice, rubber tree, and sugarcane. TropGeneDB contains data on molecular markers, quantitative trait loci (QTLs), genetic and physical maps, genotyping and phenotyping studies, and information on genetic resources (geographic origin, parentage, collection). Crop-specific web interfaces have been designed to allow quick consultations as well as personalized complex queries.
Collapse
|
11
|
Advances in Sequencing and Resequencing in Crop Plants. ADVANCES IN BIOCHEMICAL ENGINEERING/BIOTECHNOLOGY 2018. [PMID: 29516115 DOI: 10.1007/10_2017_46] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
DNA sequencing technologies have changed the face of biological research over the last 20 years. From reference genomes to population level resequencing studies, these technologies have made significant contributions to our understanding of plant biology and evolution. As the technologies have increased in power, the breadth and complexity of the questions that can be asked has increased. Along with this, the challenges of managing unprecedented quantities of sequence data are mounting. This chapter describes a few aspects of the journey so far and looks forward to what may lie ahead. Graphical Abstract.
Collapse
|
12
|
Reiser L, Subramaniam S, Li D, Huala E. Using the
Arabidopsis
Information Resource (TAIR) to Find Information About
Arabidopsis
Genes. ACTA ACUST UNITED AC 2017; 60:1.11.1-1.11.45. [DOI: 10.1002/cpbi.36] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Affiliation(s)
| | | | - Donghui Li
- Phoenix Bioinformatics Fremont California
| | - Eva Huala
- Phoenix Bioinformatics Fremont California
| |
Collapse
|
13
|
Genome expansion and lineage-specific genetic innovations in the forest pathogenic fungi Armillaria. Nat Ecol Evol 2017; 1:1931-1941. [DOI: 10.1038/s41559-017-0347-8] [Citation(s) in RCA: 98] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2017] [Accepted: 09/19/2017] [Indexed: 12/17/2022]
|
14
|
Niehaus EM, Kim HK, Münsterkötter M, Janevska S, Arndt B, Kalinina SA, Houterman PM, Ahn IP, Alberti I, Tonti S, Kim DW, Sieber CMK, Humpf HU, Yun SH, Güldener U, Tudzynski B. Comparative genomics of geographically distant Fusarium fujikuroi isolates revealed two distinct pathotypes correlating with secondary metabolite profiles. PLoS Pathog 2017; 13:e1006670. [PMID: 29073267 PMCID: PMC5675463 DOI: 10.1371/journal.ppat.1006670] [Citation(s) in RCA: 42] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2017] [Revised: 11/07/2017] [Accepted: 09/26/2017] [Indexed: 12/21/2022] Open
Abstract
Fusarium fujikuroi causes bakanae ("foolish seedling") disease of rice which is characterized by hyper-elongation of seedlings resulting from production of gibberellic acids (GAs) by the fungus. This plant pathogen is also known for production of harmful mycotoxins, such as fusarins, fusaric acid, apicidin F and beauvericin. Recently, we generated the first de novo genome sequence of F. fujikuroi strain IMI 58289 combined with extensive transcriptional, epigenetic, proteomic and chemical product analyses. GA production was shown to provide a selective advantage during infection of the preferred host plant rice. Here, we provide genome sequences of eight additional F. fujikuroi isolates from distant geographic regions. The isolates differ in the size of chromosomes, most likely due to variability of subtelomeric regions, the type of asexual spores (microconidia and/or macroconidia), and the number and expression of secondary metabolite gene clusters. Whilst most of the isolates caused the typical bakanae symptoms, one isolate, B14, caused stunting and early withering of infected seedlings. In contrast to the other isolates, B14 produced no GAs but high amounts of fumonisins during infection on rice. Furthermore, it differed from the other isolates by the presence of three additional polyketide synthase (PKS) genes (PKS40, PKS43, PKS51) and the absence of the F. fujikuroi-specific apicidin F (NRPS31) gene cluster. Analysis of additional field isolates confirmed the strong correlation between the pathotype (bakanae or stunting/withering), and the ability to produce either GAs or fumonisins. Deletion of the fumonisin and fusaric acid-specific PKS genes in B14 reduced the stunting/withering symptoms, whereas deletion of the PKS51 gene resulted in elevated symptom development. Phylogenetic analyses revealed two subclades of F. fujikuroi strains according to their pathotype and secondary metabolite profiles.
Collapse
Affiliation(s)
- Eva-Maria Niehaus
- Institute of Biology and Biotechnology of Plants, Molecular Biology and Biotechnology of Fungi, Westfälische Wilhelms-Universität Münster, Münster, Germany
| | - Hee-Kyoung Kim
- Department of Medical Biotechnology, Soonchunhyang University, Asan, Republic of Korea
| | - Martin Münsterkötter
- Institute of Bioinformatics and Systems Biology, Helmholtz Zentrum München, German Research Center for Environmental Health (GmbH), Neuherberg, Germany
| | - Slavica Janevska
- Institute of Biology and Biotechnology of Plants, Molecular Biology and Biotechnology of Fungi, Westfälische Wilhelms-Universität Münster, Münster, Germany
| | - Birgit Arndt
- Institute of Food Chemistry, Westfälische Wilhelms-Universität Münster, Corrensstraße 45, Münster, Germany
| | - Svetlana A. Kalinina
- Institute of Food Chemistry, Westfälische Wilhelms-Universität Münster, Corrensstraße 45, Münster, Germany
| | - Petra M. Houterman
- University of Amsterdam, Swammerdam Institute for Life Sciences, Plant Pathology, Amsterdam, The Netherlands
| | - Il-Pyung Ahn
- Department of Agricultural Biotechnology, National Institute of Agricultural Sciences, Rural Development Administration, Wanju, Republic of Korea
| | - Ilaria Alberti
- CREA-CIN Sede di Rovigo, Viale Giovanni Amendola, 82, 45100 Rovigo, Italy
| | - Stefano Tonti
- CREA-SCS Sede di Bologna, Via di Corticella, 133, 40128 Bologna, Italy
| | - Da-Woon Kim
- Department of Medical Biotechnology, Soonchunhyang University, Asan, Republic of Korea
| | - Christian M. K. Sieber
- Department of Energy Joint Genome Institute, University of California, Walnut Creek, Berkeley, California
| | - Hans-Ulrich Humpf
- Institute of Food Chemistry, Westfälische Wilhelms-Universität Münster, Corrensstraße 45, Münster, Germany
| | - Sung-Hwan Yun
- Department of Medical Biotechnology, Soonchunhyang University, Asan, Republic of Korea
- * E-mail: (BT); (UG); (SY)
| | - Ulrich Güldener
- Institute of Bioinformatics and Systems Biology, Helmholtz Zentrum München, German Research Center for Environmental Health (GmbH), Neuherberg, Germany
- Chair of Genome-oriented Bioinformatics, TUM School of Life Sciences Weihenstephan, Technical University of Munich, Freising, Germany
- * E-mail: (BT); (UG); (SY)
| | - Bettina Tudzynski
- Institute of Biology and Biotechnology of Plants, Molecular Biology and Biotechnology of Fungi, Westfälische Wilhelms-Universität Münster, Münster, Germany
- * E-mail: (BT); (UG); (SY)
| |
Collapse
|
15
|
Abstract
Background The constant progress in sequencing technology leads to ever increasing amounts of genomic data. In the light of current evidence transposable elements (TEs for short) are becoming useful tools for learning about the evolution of host genome. Therefore the software for genome-wide detection and analysis of TEs is of great interest. Results Here we describe the computational tool for mining, classifying and storing TEs from newly sequenced genomes. This is an online, web-based, user-friendly service, enabling users to upload their own genomic data, and perform de-novo searches for TEs. The detected TEs are automatically analyzed, compared to reference databases, annotated, clustered into families, and stored in TEs repository. Also, the genome-wide nesting structure of found elements are detected and analyzed by new method for inferring evolutionary history of TEs. We illustrate the functionality of our tool by performing a full-scale analyses of TE landscape in Medicago truncatula genome. Conclusions TRANScendence is an effective tool for the de-novo annotation and classification of transposable elements in newly-acquired genomes. Its streamlined interface makes it well-suited for evolutionary studies.
Collapse
|
16
|
Yamazaki T, Ichihara K, Suzuki R, Oshima K, Miyamura S, Kuwano K, Toyoda A, Suzuki Y, Sugano S, Hattori M, Kawano S. Genomic structure and evolution of the mating type locus in the green seaweed Ulva partita. Sci Rep 2017; 7:11679. [PMID: 28916791 PMCID: PMC5601483 DOI: 10.1038/s41598-017-11677-0] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2016] [Accepted: 08/29/2017] [Indexed: 01/08/2023] Open
Abstract
The evolution of sex chromosomes and mating loci in organisms with UV systems of sex/mating type determination in haploid phases via genes on UV chromosomes is not well understood. We report the structure of the mating type (MT) locus and its evolutionary history in the green seaweed Ulva partita, which is a multicellular organism with an isomorphic haploid-diploid life cycle and mating type determination in the haploid phase. Comprehensive comparison of a total of 12.0 and 16.6 Gb of genomic next-generation sequencing data for mt- and mt+ strains identified highly rearranged MT loci of 1.0 and 1.5 Mb in size and containing 46 and 67 genes, respectively, including 23 gametologs. Molecular evolutionary analyses suggested that the MT loci diverged over a prolonged period in the individual mating types after their establishment in an ancestor. A gene encoding an RWP-RK domain-containing protein was found in the mt- MT locus but was not an ortholog of the chlorophycean mating type determination gene MID. Taken together, our results suggest that the genomic structure and its evolutionary history in the U. partita MT locus are similar to those on other UV chromosomes and that the MT locus genes are quite different from those of Chlorophyceae.
Collapse
Affiliation(s)
- Tomokazu Yamazaki
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, University of Tokyo, Kashiwa, Chiba, Japan
| | - Kensuke Ichihara
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, University of Tokyo, Kashiwa, Chiba, Japan
| | - Ryogo Suzuki
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, University of Tokyo, Kashiwa, Chiba, Japan
| | - Kenshiro Oshima
- Department of Medical Genome Sciences, Graduate School of Frontier Sciences, University of Tokyo, Kashiwa, Chiba, Japan
| | - Shinichi Miyamura
- Faculty of Life and Environmental Sciences, University of Tsukuba, Tsukuba, Ibaraki, Japan
| | - Kazuyoshi Kuwano
- Graduate School of Fisheries and Environmental Sciences, Nagasaki University, Nagasaki, Japan
| | - Atsushi Toyoda
- Center for Information Biology, National Institute of Genetics, Shizuoka, Japan
| | - Yutaka Suzuki
- Department of Medical Genome Sciences, Graduate School of Frontier Sciences, University of Tokyo, Kashiwa, Chiba, Japan
| | - Sumio Sugano
- Department of Medical Genome Sciences, Graduate School of Frontier Sciences, University of Tokyo, Kashiwa, Chiba, Japan
| | - Masahira Hattori
- Department of Medical Genome Sciences, Graduate School of Frontier Sciences, University of Tokyo, Kashiwa, Chiba, Japan
- Graduate School of Advanced Science and Engineering, Waseda University, Tokyo, Japan
| | - Shigeyuki Kawano
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, University of Tokyo, Kashiwa, Chiba, Japan.
| |
Collapse
|
17
|
Samy JKA, Mulugeta TD, Nome T, Sandve SR, Grammes F, Kent MP, Lien S, Våge DI. SalmoBase: an integrated molecular data resource for Salmonid species. BMC Genomics 2017. [PMID: 28651544 PMCID: PMC5485693 DOI: 10.1186/s12864-017-3877-1] [Citation(s) in RCA: 36] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022] Open
Abstract
Background Salmonids are ray-finned fishes which constitute 11 genera and at least 70 species including Atlantic salmon, whitefishes, graylings, rainbow trout, and char. The common ancestor of all Salmonidae experienced a whole genome duplication (WGD) ~80 million years ago, resulting in an autotetraploid genome. Genomic rediplodization is still going on in salmonid species, providing an unique system for studying evolutionary consequences of whole genome duplication. In recent years, high quality genome sequences of Atlantic salmon and Rainbow trout has been established, due to their scientific and commercial values. In this paper we introduce SalmoBase (http://www.salmobase.org/), a tool for making molecular resources for salmonids public available in a framework of visualizations and analytic tools. Results SalmoBase has been developed as a part of the ELIXIR.NO project. Currently, SalmoBase contains molecular resources for Atlantic salmon and Rainbow trout. Data can be accessed through BLAST, Genome Browser (GBrowse), Genetic Variation Browser (GVBrowse) and Gene Expression Browser (GEBrowse). Conclusions To the best of our knowledge, SalmoBase is the first database which integrates salmonids data and allow users to study salmonids in an integrated framework. The database and its tools (e.g., comparative genomics tools, synteny browsers) will be expanded as additional public resources describing other Salmonidae genomes become available.
Collapse
Affiliation(s)
- Jeevan Karloss Antony Samy
- Centre for Integrative Genetics (CIGENE), Department of Animal and Aquacultural Sciences (IHA), Faculty of Biosciences (BIOVIT), Norwegian University of Life Sciences (NMBU), 1432, Ås, Akershus, Norway.
| | - Teshome Dagne Mulugeta
- Centre for Integrative Genetics (CIGENE), Department of Animal and Aquacultural Sciences (IHA), Faculty of Biosciences (BIOVIT), Norwegian University of Life Sciences (NMBU), 1432, Ås, Akershus, Norway
| | - Torfinn Nome
- Centre for Integrative Genetics (CIGENE), Department of Animal and Aquacultural Sciences (IHA), Faculty of Biosciences (BIOVIT), Norwegian University of Life Sciences (NMBU), 1432, Ås, Akershus, Norway
| | - Simen Rød Sandve
- Centre for Integrative Genetics (CIGENE), Department of Animal and Aquacultural Sciences (IHA), Faculty of Biosciences (BIOVIT), Norwegian University of Life Sciences (NMBU), 1432, Ås, Akershus, Norway
| | - Fabian Grammes
- Centre for Integrative Genetics (CIGENE), Department of Animal and Aquacultural Sciences (IHA), Faculty of Biosciences (BIOVIT), Norwegian University of Life Sciences (NMBU), 1432, Ås, Akershus, Norway
| | - Matthew Peter Kent
- Centre for Integrative Genetics (CIGENE), Department of Animal and Aquacultural Sciences (IHA), Faculty of Biosciences (BIOVIT), Norwegian University of Life Sciences (NMBU), 1432, Ås, Akershus, Norway
| | - Sigbjørn Lien
- Centre for Integrative Genetics (CIGENE), Department of Animal and Aquacultural Sciences (IHA), Faculty of Biosciences (BIOVIT), Norwegian University of Life Sciences (NMBU), 1432, Ås, Akershus, Norway
| | - Dag Inge Våge
- Centre for Integrative Genetics (CIGENE), Department of Animal and Aquacultural Sciences (IHA), Faculty of Biosciences (BIOVIT), Norwegian University of Life Sciences (NMBU), 1432, Ås, Akershus, Norway
| |
Collapse
|
18
|
Wei X, Gong H, Yu J, Liu P, Wang L, Zhang Y, Zhang X. SesameFG: an integrated database for the functional genomics of sesame. Sci Rep 2017; 7:2342. [PMID: 28539606 PMCID: PMC5443765 DOI: 10.1038/s41598-017-02586-3] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2016] [Accepted: 04/12/2017] [Indexed: 11/09/2022] Open
Abstract
Sesame (Sesamum indicum L.) has high oil content, a small diploid genome and a short growth period, making it an attractive species for genetic studies on oilseed crops. With the advancement of next-generation sequencing technology, genomics and functional genomics research of sesame has developed quickly in the last few years, and large amounts of data have been generated. However, these results are distributed in many different publications, and there is a lack of integration. To promote functional genomics research of sesame, we collected genetic information combined with comprehensive phenotypic information and integrated them in the web-based database named SesameFG. The current version of SesameFG contains phenotypic information on agronomic traits of 705 sesame accessions, de novo assembled genomes of three sesame varieties, massive numbers of identified SNPs, gene expression profiles of five tissues, gene families, candidate genes for the important agronomic traits and genomic-SSR markers. All phenotypic and genotypic information in SesameFG is available for online queries and can be downloaded freely. SesameFG provides useful search functions and data mining tools, including Genome Browser and local BLAST services. SesameFG is freely accessible at http://ncgr.ac.cn/SesameFG/. SesameFG provides valuable resources and tools for functional genomics research and the molecular breeding of sesame.
Collapse
Affiliation(s)
- Xin Wei
- Key Laboratory of Biology and Genetic Improvement of Oil Crops, Ministry of Agriculture of People's Republic of China, Oilcrops Research Institute, Chinese Academy of Agricultural Sciences, Wuhan, 430062, China
| | - Hao Gong
- National Center for Gene Research, Collaborative Innovation Center for Genetics and Development, Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, 200233, China
| | - Jingyin Yu
- Key Laboratory of Biology and Genetic Improvement of Oil Crops, Ministry of Agriculture of People's Republic of China, Oilcrops Research Institute, Chinese Academy of Agricultural Sciences, Wuhan, 430062, China
| | - Pan Liu
- Key Laboratory of Biology and Genetic Improvement of Oil Crops, Ministry of Agriculture of People's Republic of China, Oilcrops Research Institute, Chinese Academy of Agricultural Sciences, Wuhan, 430062, China
| | - Linhai Wang
- Key Laboratory of Biology and Genetic Improvement of Oil Crops, Ministry of Agriculture of People's Republic of China, Oilcrops Research Institute, Chinese Academy of Agricultural Sciences, Wuhan, 430062, China
| | - Yanxin Zhang
- Key Laboratory of Biology and Genetic Improvement of Oil Crops, Ministry of Agriculture of People's Republic of China, Oilcrops Research Institute, Chinese Academy of Agricultural Sciences, Wuhan, 430062, China
| | - Xiurong Zhang
- Key Laboratory of Biology and Genetic Improvement of Oil Crops, Ministry of Agriculture of People's Republic of China, Oilcrops Research Institute, Chinese Academy of Agricultural Sciences, Wuhan, 430062, China.
| |
Collapse
|
19
|
Carrere S, Gouzy J. myGenomeBrowser: building and sharing your own genome browser. Bioinformatics 2017; 33:1255-1257. [PMID: 28011789 PMCID: PMC5408841 DOI: 10.1093/bioinformatics/btw800] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2016] [Accepted: 12/13/2016] [Indexed: 01/08/2023] Open
Abstract
myGenomeBrowser is a web-based environment that provides biologists with a way to build, query and share their genome browsers. This tool, that builds on JBrowse, is designed to give users more autonomy while simplifying and minimizing intervention from system administrators. We have extended genome browser basic features to allow users to query, analyze and share their data. Availability and implementation: myGenomeBrowser is freely available at https://bbric-pipelines.toulouse.inra.fr/myGenomeBrowser and includes tutorial screencasts. Source code and installation instructions can be found at https://framagit.org/BBRIC/myGenomeBrowser. myGenomeBrowser is open-source and mainly implemented in Perl, JavaScript, Apache and Docker.
Collapse
Affiliation(s)
- Sébastien Carrere
- LIPM, Université de Toulouse, INRA, CNRS, Castanet-Tolosan, France
- To whom correspondence should be addressed.
| | - Jérôme Gouzy
- LIPM, Université de Toulouse, INRA, CNRS, Castanet-Tolosan, France
| |
Collapse
|
20
|
Hane JK, Ming Y, Kamphuis LG, Nelson MN, Garg G, Atkins CA, Bayer PE, Bravo A, Bringans S, Cannon S, Edwards D, Foley R, Gao L, Harrison MJ, Huang W, Hurgobin B, Li S, Liu C, McGrath A, Morahan G, Murray J, Weller J, Jian J, Singh KB. A comprehensive draft genome sequence for lupin (Lupinus angustifolius), an emerging health food: insights into plant-microbe interactions and legume evolution. PLANT BIOTECHNOLOGY JOURNAL 2017; 15:318-330. [PMID: 27557478 PMCID: PMC5316927 DOI: 10.1111/pbi.12615] [Citation(s) in RCA: 108] [Impact Index Per Article: 15.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/04/2016] [Revised: 08/12/2016] [Accepted: 08/20/2016] [Indexed: 05/18/2023]
Abstract
Lupins are important grain legume crops that form a critical part of sustainable farming systems, reducing fertilizer use and providing disease breaks. It has a basal phylogenetic position relative to other crop and model legumes and a high speciation rate. Narrow-leafed lupin (NLL; Lupinus angustifolius L.) is gaining popularity as a health food, which is high in protein and dietary fibre but low in starch and gluten-free. We report the draft genome assembly (609 Mb) of NLL cultivar Tanjil, which has captured >98% of the gene content, sequences of additional lines and a dense genetic map. Lupins are unique among legumes and differ from most other land plants in that they do not form mycorrhizal associations. Remarkably, we find that NLL has lost all mycorrhiza-specific genes, but has retained genes commonly required for mycorrhization and nodulation. In addition, the genome also provided candidate genes for key disease resistance and domestication traits. We also find evidence of a whole-genome triplication at around 25 million years ago in the genistoid lineage leading to Lupinus. Our results will support detailed studies of legume evolution and accelerate lupin breeding programmes.
Collapse
Affiliation(s)
- James K. Hane
- CSIRO AgricultureWembleyWAAustralia
- Department of Environment and AgricultureCCDM BioinformaticsCentre for Crop and Disease ManagementCurtin UniversityBentleyWAAustralia
- Curtin Institute for ComputationCurtin UniversityBentleyWAAustralia
| | - Yao Ming
- Department of Plant and Animal Genome ResearchBeijing Genome InstituteShenzhenChina
| | - Lars G. Kamphuis
- CSIRO AgricultureWembleyWAAustralia
- UWA Institute of AgricultureUniversity of Western AustraliaCrawleyWAAustralia
| | - Matthew N. Nelson
- UWA Institute of AgricultureUniversity of Western AustraliaCrawleyWAAustralia
- School of Plant BiologyUniversity of Western AustraliaCrawleyWAAustralia
- Present address: Royal Botanic Gardens KewNatural Capital and Plant HealthArdinglyRH17 6TNUK
| | | | - Craig A. Atkins
- UWA Institute of AgricultureUniversity of Western AustraliaCrawleyWAAustralia
- School of Plant BiologyUniversity of Western AustraliaCrawleyWAAustralia
| | - Philipp E. Bayer
- School of Plant BiologyUniversity of Western AustraliaCrawleyWAAustralia
| | - Armando Bravo
- Boyce Thompson Institute for Plant ResearchIthacaNYUSA
| | | | - Steven Cannon
- USDA‐ARS Corn Insects and Crop Genetics Research UnitCrop Genome Informatics LabIowa State UniversityAmesIAUSA
- Department of AgronomyIowa State UniversityAmesIAUSA
| | - David Edwards
- School of Plant BiologyUniversity of Western AustraliaCrawleyWAAustralia
- University of QueenslandBrisbaneQldAustralia
| | | | | | | | - Wei Huang
- Department of AgronomyIowa State UniversityAmesIAUSA
| | - Bhavna Hurgobin
- School of Plant BiologyUniversity of Western AustraliaCrawleyWAAustralia
- University of QueenslandBrisbaneQldAustralia
| | - Sean Li
- Data61CSIROCanberraACTAustralia
| | | | | | - Grant Morahan
- Centre for Diabetes ResearchUniversity of Western AustraliaCrawleyWAAustralia
| | | | - James Weller
- School of Biological SciencesUniversity of TasmaniaHobartTASAustralia
| | - Jianbo Jian
- Department of Plant and Animal Genome ResearchBeijing Genome InstituteShenzhenChina
| | - Karam B. Singh
- CSIRO AgricultureWembleyWAAustralia
- UWA Institute of AgricultureUniversity of Western AustraliaCrawleyWAAustralia
| |
Collapse
|
21
|
Knaus BJ, Grünwald NJ. vcfr: a package to manipulate and visualize variant call format data in R. Mol Ecol Resour 2017; 17:44-53. [PMID: 27401132 DOI: 10.1101/041277] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2016] [Revised: 05/24/2016] [Accepted: 05/31/2016] [Indexed: 05/20/2023]
Abstract
Software to call single-nucleotide polymorphisms or related genetic variants has converged on the variant call format (VCF) as the output format of choice. This has created a need for tools to work with VCF files. While an increasing number of software exists to read VCF data, many only extract the genotypes without including the data associated with each genotype that describes its quality. We created the r package vcfr to address this issue. We developed a VCF file exploration tool implemented in the r language because r provides an interactive experience and an environment that is commonly used for genetic data analysis. Functions to read and write VCF files into r as well as functions to extract portions of the data and to plot summary statistics of the data are implemented. vcfr further provides the ability to visualize how various parameterizations of the data affect the results. Additional tools are included to integrate sequence (fasta) and annotation data (GFF) for visualization of genomic regions such as chromosomes. Conversion functions translate data from the vcfr data structure to formats used by other r genetics packages. Computationally intensive functions are implemented in C++ to improve performance. Use of these tools is intended to facilitate VCF data exploration, including intuitive methods for data quality control and easy export to other r packages for further analysis. vcfr thus provides essential, novel tools currently not available in r.
Collapse
Affiliation(s)
- Brian J Knaus
- Horticultural Crops Research Unit, USDA-ARS, Corvallis, OR, 97330, USA
| | - Niklaus J Grünwald
- Horticultural Crops Research Unit, USDA-ARS, Corvallis, OR, 97330, USA
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR, 97331, USA
| |
Collapse
|
22
|
Abstract
The dynamic structure and functions of genomes are being revealed simultaneously with the progress of technologies and researches in genomics. Evidence indicating genome regional characteristics (genome annotations in a broad sense) provide the basis for further analyses. Target listing and screening can be effectively performed in silico using such data. Here, we describe steps to obtain publicly available genome annotations or to construct new annotations based on your own analyses, as well as an overview of the types of available genome annotations and corresponding resources.
Collapse
Affiliation(s)
- Imad Abugessaisa
- Division of Genomic Technologies, RIKEN Center for Life Science Technologies, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa, 230-0045, Japan
| | - Takeya Kasukawa
- Division of Genomic Technologies, RIKEN Center for Life Science Technologies, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa, 230-0045, Japan
| | - Hideya Kawaji
- Division of Genomic Technologies, RIKEN Center for Life Science Technologies, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa, 230-0045, Japan. .,RIKEN Preventive Medicine and Diagnosis Innovation Program, 2-1, Hirosawa, Wako, Saitama, 351-0198, Japan. .,Preventive Medicine and Applied Genomics Unit, RIKEN Advanced Center for Computing and Communication, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa, 230-0045, Japan.
| |
Collapse
|
23
|
Nakaya A, Ichihara H, Asamizu E, Shirasawa S, Nakamura Y, Tabata S, Hirakawa H. Plant Genome DataBase Japan (PGDBj). Methods Mol Biol 2017; 1533:45-77. [PMID: 27987164 DOI: 10.1007/978-1-4939-6658-5_3] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
A portal website that integrates a variety of information related to genomes of model and crop plants from databases (DBs) and the literature was generated. This website, named the Plant Genome DataBase Japan (PGDBj, http://pgdbj. jp/en/ ), is comprised of three component DBs and a cross-search engine which provides a seamless search over their contents. One of the three component DBs is the Ortholog DB, which provides gene cluster information based on the amino acid sequence similarity. Over 1,000,000 amino acid sequences of 40 Viridiplantae species were collected from the public DNA DBs, and plant genome DBs such as TAIR and RAP-DB were subjected to reciprocal BLAST searches for clustering. Another component DB is the Plant Resource DB for genomic- and bio-resources. This DB also integrates the SABRE DB, which provides cDNA and genome sequence resources maintained in the RIKEN BioResource Center and National BioResource Projects Japan. The third component DB of PGDBj is the DNA Marker DB, which manually or automatically collects curated information on DNA markers, quantitative trait loci (QTL), and related genetic linkage maps, from the literature and external DBs. By combining these component DBs and a cross-search engine, PGDBj serves as a useful platform to study genetic systems for both fundamental and applied researches for a wide range of plant species.
Collapse
Affiliation(s)
- Akihiro Nakaya
- Department of Genome Informatics, Graduate School of Medicine, Osaka University, 1-1 Yamadaoka, Suita, Osaka, 565-0871, Japan
| | - Hisako Ichihara
- Department of Technology Development, Kazusa DNA Research Institute, 2-6-7 Kazusa-kamatari, Kisarazu, Chiba, 292-0818, Japan
| | - Erika Asamizu
- Department of Plant Life Sciences, Faculty of Agriculture, Ryukoku University, 1-5 Yokotani, Seta Oe-cho, Otsu, Shiga, 520-2194, Japan
| | - Sachiko Shirasawa
- Department of Technology Development, Kazusa DNA Research Institute, 2-6-7 Kazusa-kamatari, Kisarazu, Chiba, 292-0818, Japan
| | - Yasukazu Nakamura
- Department of Technology Development, Kazusa DNA Research Institute, 2-6-7 Kazusa-kamatari, Kisarazu, Chiba, 292-0818, Japan
| | - Satoshi Tabata
- Department of Technology Development, Kazusa DNA Research Institute, 2-6-7 Kazusa-kamatari, Kisarazu, Chiba, 292-0818, Japan
| | - Hideki Hirakawa
- Department of Technology Development, Kazusa DNA Research Institute, 2-6-7 Kazusa-kamatari, Kisarazu, Chiba, 292-0818, Japan.
| |
Collapse
|
24
|
Niehaus EM, Münsterkötter M, Proctor RH, Brown DW, Sharon A, Idan Y, Oren-Young L, Sieber CM, Novák O, Pěnčík A, Tarkowská D, Hromadová K, Freeman S, Maymon M, Elazar M, Youssef SA, El-Shabrawy ESM, Shalaby ABA, Houterman P, Brock NL, Burkhardt I, Tsavkelova EA, Dickschat JS, Galuszka P, Güldener U, Tudzynski B. Comparative "Omics" of the Fusarium fujikuroi Species Complex Highlights Differences in Genetic Potential and Metabolite Synthesis. Genome Biol Evol 2016; 8:3574-3599. [PMID: 28040774 PMCID: PMC5203792 DOI: 10.1093/gbe/evw259] [Citation(s) in RCA: 95] [Impact Index Per Article: 11.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/21/2016] [Indexed: 11/14/2022] Open
Abstract
Species of the Fusarium fujikuroi species complex (FFC) cause a wide spectrum of often devastating diseases on diverse agricultural crops, including coffee, fig, mango, maize, rice, and sugarcane. Although species within the FFC are difficult to distinguish by morphology, and their genes often share 90% sequence similarity, they can differ in host plant specificity and life style. FFC species can also produce structurally diverse secondary metabolites (SMs), including the mycotoxins fumonisins, fusarins, fusaric acid, and beauvericin, and the phytohormones gibberellins, auxins, and cytokinins. The spectrum of SMs produced can differ among closely related species, suggesting that SMs might be determinants of host specificity. To date, genomes of only a limited number of FFC species have been sequenced. Here, we provide draft genome sequences of three more members of the FFC: a single isolate of F. mangiferae, the cause of mango malformation, and two isolates of F. proliferatum, one a pathogen of maize and the other an orchid endophyte. We compared these genomes to publicly available genome sequences of three other FFC species. The comparisons revealed species-specific and isolate-specific differences in the composition and expression (in vitro and in planta) of genes involved in SM production including those for phytohormome biosynthesis. Such differences have the potential to impact host specificity and, as in the case of F. proliferatum, the pathogenic versus endophytic life style.
Collapse
Affiliation(s)
- Eva-Maria Niehaus
- Institut für Biologie und Biotechnologie der Pflanzen, Molecular Biology and Biotechnology of Fungi, Westfälische Wilhelms-Universität Münster, Münster, Germany
| | - Martin Münsterkötter
- Institute of Bioinformatics and Systems Biology, Helmholtz Zentrum München, German Research Center for Environmental Health (GmbH), Neuherberg, Germany
| | - Robert H Proctor
- United States Department of Agriculture, National Center for Agricultural Utilization Research, Peoria, Illinois
| | - Daren W Brown
- United States Department of Agriculture, National Center for Agricultural Utilization Research, Peoria, Illinois
| | - Amir Sharon
- Department of Molecular Biology and Ecology of Plants, Tel Aviv University, Tel Aviv, Israel
| | - Yifat Idan
- Department of Molecular Biology and Ecology of Plants, Tel Aviv University, Tel Aviv, Israel
| | - Liat Oren-Young
- Department of Molecular Biology and Ecology of Plants, Tel Aviv University, Tel Aviv, Israel
| | - Christian M Sieber
- Department of Energy Joint Genome Institute, University of California, Walnut Creek, Berkeley, California
| | - Ondřej Novák
- Centre of the Region Hana for Biotechnological and Agricultural Research, Palacky University, Olomouc, Czech Republic
| | - Aleš Pěnčík
- Centre of the Region Hana for Biotechnological and Agricultural Research, Palacky University, Olomouc, Czech Republic
| | - Danuše Tarkowská
- Centre of the Region Hana for Biotechnological and Agricultural Research, Palacky University, Olomouc, Czech Republic
| | - Kristýna Hromadová
- Centre of the Region Hana for Biotechnological and Agricultural Research, Palacky University, Olomouc, Czech Republic
| | - Stanley Freeman
- Department of Plant Pathology and Weed Research, Agricultural Research Organization (ARO), The Volcani Center, Bet-Dagan, Israel
| | - Marcel Maymon
- Department of Plant Pathology and Weed Research, Agricultural Research Organization (ARO), The Volcani Center, Bet-Dagan, Israel
| | - Meirav Elazar
- Department of Plant Pathology and Weed Research, Agricultural Research Organization (ARO), The Volcani Center, Bet-Dagan, Israel
| | - Sahar A Youssef
- Plant Pathology Research Institute, Agricultural Research Center, Giza, Egypt
| | | | | | - Petra Houterman
- University of Amsterdam, Swammerdam Institute for Life Sciences, Plant Pathology, Amsterdam, The Netherlands
| | - Nelson L Brock
- Rheinische Friedrich-Wilhelms-Universität Bonn, Kekulé-Institut für Organische Chemie und Biochemie, Germany
| | - Immo Burkhardt
- Rheinische Friedrich-Wilhelms-Universität Bonn, Kekulé-Institut für Organische Chemie und Biochemie, Germany
| | - Elena A Tsavkelova
- Department of Microbiology Faculty of Biology, Lomonosov Moscow State University, Moscow, Russia
| | - Jeroen S Dickschat
- Rheinische Friedrich-Wilhelms-Universität Bonn, Kekulé-Institut für Organische Chemie und Biochemie, Germany
| | - Petr Galuszka
- Centre of the Region Hana for Biotechnological and Agricultural Research, Palacky University, Olomouc, Czech Republic
| | - Ulrich Güldener
- Department of Genome-oriented Bioinformatics, Wissenschaftszentrum Weihenstephan, Technische Universität München, Maximus-von-Imhof-Forum 3, Freising, Germany
| | - Bettina Tudzynski
- Institut für Biologie und Biotechnologie der Pflanzen, Molecular Biology and Biotechnology of Fungi, Westfälische Wilhelms-Universität Münster, Münster, Germany
| |
Collapse
|
25
|
Penselin D, Münsterkötter M, Kirsten S, Felder M, Taudien S, Platzer M, Ashelford K, Paskiewicz KH, Harrison RJ, Hughes DJ, Wolf T, Shelest E, Graap J, Hoffmann J, Wenzel C, Wöltje N, King KM, Fitt BDL, Güldener U, Avrova A, Knogge W. Comparative genomics to explore phylogenetic relationship, cryptic sexual potential and host specificity of Rhynchosporium species on grasses. BMC Genomics 2016; 17:953. [PMID: 27875982 PMCID: PMC5118889 DOI: 10.1186/s12864-016-3299-5] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2016] [Accepted: 11/15/2016] [Indexed: 01/07/2023] Open
Abstract
BACKGROUND The Rhynchosporium species complex consists of hemibiotrophic fungal pathogens specialized to different sweet grass species including the cereal crops barley and rye. A sexual stage has not been described, but several lines of evidence suggest the occurrence of sexual reproduction. Therefore, a comparative genomics approach was carried out to disclose the evolutionary relationship of the species and to identify genes demonstrating the potential for a sexual cycle. Furthermore, due to the evolutionary very young age of the five species currently known, this genus appears to be well-suited to address the question at the molecular level of how pathogenic fungi adapt to their hosts. RESULTS The genomes of the different Rhynchosporium species were sequenced, assembled and annotated using ab initio gene predictors trained on several fungal genomes as well as on Rhynchosporium expressed sequence tags. Structures of the rDNA regions and genome-wide single nucleotide polymorphisms provided a hypothesis for intra-genus evolution. Homology screening detected core meiotic genes along with most genes crucial for sexual recombination in ascomycete fungi. In addition, a large number of cell wall-degrading enzymes that is characteristic for hemibiotrophic and necrotrophic fungi infecting monocotyledonous hosts were found. Furthermore, the Rhynchosporium genomes carry a repertoire of genes coding for polyketide synthases and non-ribosomal peptide synthetases. Several of these genes are missing from the genome of the closest sequenced relative, the poplar pathogen Marssonina brunnea, and are possibly involved in adaptation to the grass hosts. Most importantly, six species-specific genes coding for protein effectors were identified in R. commune. Their deletion yielded mutants that grew more vigorously in planta than the wild type. CONCLUSION Both cryptic sexuality and secondary metabolites may have contributed to host adaptation. Most importantly, however, the growth-retarding activity of the species-specific effectors suggests that host adaptation of R. commune aims at extending the biotrophic stage at the expense of the necrotrophic stage of pathogenesis. Like other apoplastic fungi Rhynchosporium colonizes the intercellular matrix of host leaves relatively slowly without causing symptoms, reminiscent of the development of endophytic fungi. Rhynchosporium may therefore become an object for studying the mutualism-parasitism transition.
Collapse
Affiliation(s)
- Daniel Penselin
- Department of Stress and Developmental Biology, Leibniz Institute of Plant Biochemistry, Halle/Saale, Germany
| | - Martin Münsterkötter
- Institute of Bioinformatics and Systems Biology, Helmholtz Zentrum München, Neuherberg, Germany
| | - Susanne Kirsten
- Department of Stress and Developmental Biology, Leibniz Institute of Plant Biochemistry, Halle/Saale, Germany
| | - Marius Felder
- Genomic Analysis, Leibniz Institute on Aging, Fritz Lipmann Institute, Jena, Germany
| | - Stefan Taudien
- Genomic Analysis, Leibniz Institute on Aging, Fritz Lipmann Institute, Jena, Germany
| | - Matthias Platzer
- Genomic Analysis, Leibniz Institute on Aging, Fritz Lipmann Institute, Jena, Germany
| | - Kevin Ashelford
- Institute of Medical Genetics, Cardiff University, Cardiff, UK
| | | | | | - David J. Hughes
- Applied Bioinformatics, Rothamsted Research, Harpenden, Hertfordshire UK
| | - Thomas Wolf
- Systems Biology and Bioinformatics, Leibniz Institute for Natural Product Research and Infection Biology, Hans Knöll Institute, Jena, Germany
| | - Ekaterina Shelest
- Systems Biology and Bioinformatics, Leibniz Institute for Natural Product Research and Infection Biology, Hans Knöll Institute, Jena, Germany
| | - Jenny Graap
- Department of Stress and Developmental Biology, Leibniz Institute of Plant Biochemistry, Halle/Saale, Germany
| | - Jan Hoffmann
- Department of Stress and Developmental Biology, Leibniz Institute of Plant Biochemistry, Halle/Saale, Germany
| | - Claudia Wenzel
- Department of Stress and Developmental Biology, Leibniz Institute of Plant Biochemistry, Halle/Saale, Germany ,Present address: Food Quality and Nutrition, Agroscope, Bern, Switzerland
| | - Nadine Wöltje
- Department of Stress and Developmental Biology, Leibniz Institute of Plant Biochemistry, Halle/Saale, Germany
| | - Kevin M. King
- Biological Chemistry and Crop Protection, Rothamsted Research, Harpenden, Hertfordshire UK
| | - Bruce D. L. Fitt
- Biological and Environmental Sciences, University of Hertfordshire, Hatfield, Hertfordshire UK
| | - Ulrich Güldener
- Department of Genome-Oriented Bioinformatics, Technische Universität München, Wissenschaftszentrum Weihenstephan, Freising, Germany
| | - Anna Avrova
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, Scotland
| | - Wolfgang Knogge
- Department of Stress and Developmental Biology, Leibniz Institute of Plant Biochemistry, Halle/Saale, Germany
| |
Collapse
|
26
|
Rabe F, Bosch J, Stirnberg A, Guse T, Bauer L, Seitner D, Rabanal FA, Czedik-Eysenberg A, Uhse S, Bindics J, Genenncher B, Navarrete F, Kellner R, Ekker H, Kumlehn J, Vogel JP, Gordon SP, Marcel TC, Münsterkötter M, Walter MC, Sieber CMK, Mannhaupt G, Güldener U, Kahmann R, Djamei A. A complete toolset for the study of Ustilago bromivora and Brachypodium sp. as a fungal-temperate grass pathosystem. eLife 2016; 5:e20522. [PMID: 27835569 PMCID: PMC5106213 DOI: 10.7554/elife.20522] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2016] [Accepted: 10/12/2016] [Indexed: 11/18/2022] Open
Abstract
Due to their economic relevance, the study of plant pathogen interactions is of importance. However, elucidating these interactions and their underlying molecular mechanisms remains challenging since both host and pathogen need to be fully genetically accessible organisms. Here we present milestones in the establishment of a new biotrophic model pathosystem: Ustilago bromivora and Brachypodium sp. We provide a complete toolset, including an annotated fungal genome and methods for genetic manipulation of the fungus and its host plant. This toolset will enable researchers to easily study biotrophic interactions at the molecular level on both the pathogen and the host side. Moreover, our research on the fungal life cycle revealed a mating type bias phenomenon. U. bromivora harbors a haplo-lethal allele that is linked to one mating type region. As a result, the identified mating type bias strongly promotes inbreeding, which we consider to be a potential speciation driver.
Collapse
Affiliation(s)
- Franziska Rabe
- Gregor Mendel Institute, Austrian Academy of Sciences, Vienna Biocenter, Vienna, Austria
- Max Planck Institute for Terrestrial Microbiology, Marburg, Germany
| | - Jason Bosch
- Gregor Mendel Institute, Austrian Academy of Sciences, Vienna Biocenter, Vienna, Austria
| | - Alexandra Stirnberg
- Gregor Mendel Institute, Austrian Academy of Sciences, Vienna Biocenter, Vienna, Austria
| | - Tilo Guse
- Gregor Mendel Institute, Austrian Academy of Sciences, Vienna Biocenter, Vienna, Austria
| | - Lisa Bauer
- Gregor Mendel Institute, Austrian Academy of Sciences, Vienna Biocenter, Vienna, Austria
| | - Denise Seitner
- Gregor Mendel Institute, Austrian Academy of Sciences, Vienna Biocenter, Vienna, Austria
| | - Fernando A Rabanal
- Gregor Mendel Institute, Austrian Academy of Sciences, Vienna Biocenter, Vienna, Austria
| | | | - Simon Uhse
- Gregor Mendel Institute, Austrian Academy of Sciences, Vienna Biocenter, Vienna, Austria
| | - Janos Bindics
- Gregor Mendel Institute, Austrian Academy of Sciences, Vienna Biocenter, Vienna, Austria
| | - Bianca Genenncher
- Gregor Mendel Institute, Austrian Academy of Sciences, Vienna Biocenter, Vienna, Austria
| | - Fernando Navarrete
- Gregor Mendel Institute, Austrian Academy of Sciences, Vienna Biocenter, Vienna, Austria
| | - Ronny Kellner
- Max Planck Institute for Plant Breeding Research, Cologne, Germany
| | - Heinz Ekker
- Vienna Biocenter Core Facilities GmbH, Vienna, Austria
| | - Jochen Kumlehn
- Leibniz-Institut für Pflanzengenetik und Kulturpflanzenforschung, Gatersleben, Germany
| | - John P Vogel
- DOE Joint Genome Institute, California, United States
| | - Sean P Gordon
- DOE Joint Genome Institute, California, United States
| | - Thierry C Marcel
- INRA UMR BIOGER, AgroParisTech, Université Paris-Saclay, Thiverval-Grignon, France
| | - Martin Münsterkötter
- Institute of Bioinformatics and Systems Biology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
| | - Mathias C Walter
- Department of Genome-oriented Bioinformatics, Wissenschaftszentrum Weihenstephan, Technische Universität München, Freising, Germany
| | - Christian MK Sieber
- Institute of Bioinformatics and Systems Biology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
| | - Gertrud Mannhaupt
- Max Planck Institute for Terrestrial Microbiology, Marburg, Germany
- Institute of Bioinformatics and Systems Biology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
| | - Ulrich Güldener
- Institute of Bioinformatics and Systems Biology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
- Department of Genome-oriented Bioinformatics, Wissenschaftszentrum Weihenstephan, Technische Universität München, Freising, Germany
| | - Regine Kahmann
- Max Planck Institute for Terrestrial Microbiology, Marburg, Germany
| | - Armin Djamei
- Gregor Mendel Institute, Austrian Academy of Sciences, Vienna Biocenter, Vienna, Austria
- Max Planck Institute for Terrestrial Microbiology, Marburg, Germany
| |
Collapse
|
27
|
Olsson S, Seoane-Zonjic P, Bautista R, Claros MG, González-Martínez SC, Scotti I, Scotti-Saintagne C, Hardy OJ, Heuertz M. Development of genomic tools in a widespread tropical tree, Symphonia globulifera L.f.: a new low-coverage draft genome, SNP and SSR markers. Mol Ecol Resour 2016; 17:614-630. [PMID: 27718316 DOI: 10.1111/1755-0998.12605] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2016] [Revised: 09/30/2016] [Accepted: 10/04/2016] [Indexed: 01/08/2023]
Abstract
Population genetic studies in tropical plants are often challenging because of limited information on taxonomy, phylogenetic relationships and distribution ranges, scarce genomic information and logistic challenges in sampling. We describe a strategy to develop robust and widely applicable genetic markers based on a modest development of genomic resources in the ancient tropical tree species Symphonia globulifera L.f. (Clusiaceae), a keystone species in African and Neotropical rainforests. We provide the first low-coverage (11X) fragmented draft genome sequenced on an individual from Cameroon, covering 1.027 Gbp or 67.5% of the estimated genome size. Annotation of 565 scaffolds (7.57 Mbp) resulted in the prediction of 1046 putative genes (231 of them containing a complete open reading frame) and 1523 exact simple sequence repeats (SSRs, microsatellites). Aligning a published transcriptome of a French Guiana population against this draft genome produced 923 high-quality single nucleotide polymorphisms. We also preselected genic SSRs in silico that were conserved and polymorphic across a wide geographical range, thus reducing marker development tests on rare DNA samples. Of 23 SSRs tested, 19 amplified and 18 were successfully genotyped in four S. globulifera populations from South America (Brazil and French Guiana) and Africa (Cameroon and São Tomé island, FST = 0.34). Most loci showed only population-specific deviations from Hardy-Weinberg proportions, pointing to local population effects (e.g. null alleles). The described genomic resources are valuable for evolutionary studies in Symphonia and for comparative studies in plants. The methods are especially interesting for widespread tropical or endangered taxa with limited DNA availability.
Collapse
Affiliation(s)
- Sanna Olsson
- Department of Forest Ecology and Genetics, INIA Forest Research Centre (INIA-CIFOR), Carretera de A Coruña km 7.5, E-28040, Madrid, Spain
| | - Pedro Seoane-Zonjic
- Departamento de Biología Molecular y Bioquímica, and Plataforma Andaluza de Bioinformática, Universidad de Málaga, calle Severo Ochoa 34, E-29590, Campanillas, Málaga, Spain
| | - Rocío Bautista
- Departamento de Biología Molecular y Bioquímica, and Plataforma Andaluza de Bioinformática, Universidad de Málaga, calle Severo Ochoa 34, E-29590, Campanillas, Málaga, Spain
| | - M Gonzalo Claros
- Departamento de Biología Molecular y Bioquímica, and Plataforma Andaluza de Bioinformática, Universidad de Málaga, calle Severo Ochoa 34, E-29590, Campanillas, Málaga, Spain
| | - Santiago C González-Martínez
- Department of Forest Ecology and Genetics, INIA Forest Research Centre (INIA-CIFOR), Carretera de A Coruña km 7.5, E-28040, Madrid, Spain.,UMR1202 BioGeCo, INRA, Univ. Bordeaux, 69 route d'Arcachon, F-33610, Cestas, France
| | - Ivan Scotti
- INRA, UR629 URFM, Ecologie des Forêts Méditerranéennes, Site Agroparc, Domaine Saint Paul, F-84914, Avignon Cedex 9, France
| | - Caroline Scotti-Saintagne
- INRA, UR629 URFM, Ecologie des Forêts Méditerranéennes, Site Agroparc, Domaine Saint Paul, F-84914, Avignon Cedex 9, France
| | - Olivier J Hardy
- Faculté des Sciences, Evolutionary Biology and Ecology, Université Libre de Bruxelles, Av. F.D. Roosevelt 50, CP 160/12, B-1050, Brussels, Belgium
| | - Myriam Heuertz
- Department of Forest Ecology and Genetics, INIA Forest Research Centre (INIA-CIFOR), Carretera de A Coruña km 7.5, E-28040, Madrid, Spain.,UMR1202 BioGeCo, INRA, Univ. Bordeaux, 69 route d'Arcachon, F-33610, Cestas, France.,Faculté des Sciences, Evolutionary Biology and Ecology, Université Libre de Bruxelles, Av. F.D. Roosevelt 50, CP 160/12, B-1050, Brussels, Belgium
| |
Collapse
|
28
|
Abstract
An integrated database with a variety of Web-based systems named WheatGenome.info hosting wheat genome and genomic data has been developed to support wheat research and crop improvement. The resource includes multiple Web-based applications, which are implemented as a variety of Web-based systems. These include a GBrowse2-based wheat genome viewer with BLAST search portal, TAGdb for searching wheat second generation genome sequence data, wheat autoSNPdb, links to wheat genetic maps using CMap and CMap3D, and a wheat genome Wiki to allow interaction between diverse wheat genome sequencing activities. This portal provides links to a variety of wheat genome resources hosted at other research organizations. This integrated database aims to accelerate wheat genome research and is freely accessible via the web interface at http://www.wheatgenome.info/ .
Collapse
|
29
|
Jung S, Lee T, Ficklin S, Yu J, Cheng CH, Main D. Chado use case: storing genomic, genetic and breeding data of Rosaceae and Gossypium crops in Chado. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2016; 2016:baw010. [PMID: 26989146 PMCID: PMC4795932 DOI: 10.1093/database/baw010] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/25/2015] [Accepted: 01/21/2016] [Indexed: 11/12/2022]
Abstract
The Genome Database for Rosaceae (GDR) and CottonGen are comprehensive online data repositories that provide access to integrated genomic, genetic and breeding data through search, visualization and analysis tools for Rosaceae crops and Gossypium (cotton). These online databases use Chado, an open-source, generic and ontology-driven database schema for biological data, as the primary data storage platform. Chado is highly normalized and uses ontologies to indicate the 'types' of data. Therefore, Chado is flexible such that it has been used to house genomic, genetic and breeding data for GDR and CottonGen. These data include whole genome sequence and annotation, transcripts, molecular markers, genetic maps, Quantitative Trait Loci, Mendelian Trait Loci, traits, germplasm, pedigrees, large scale phenotypic and genotypic data, ontologies and publications. We provide information about how to store these types of data in Chado using GDR and CottonGen as examples sites that were converted from an older legacy infrastructure. Database URL: GDR (www.rosaceae.org), CottonGen (www.cottongen.org).
Collapse
Affiliation(s)
- Sook Jung
- Department of Horticulture, Washington State University Pullman, WA, USA
| | - Taein Lee
- Department of Horticulture, Washington State University Pullman, WA, USA
| | - Stephen Ficklin
- Department of Horticulture, Washington State University Pullman, WA, USA
| | - Jing Yu
- Department of Horticulture, Washington State University Pullman, WA, USA
| | - Chun-Huai Cheng
- Department of Horticulture, Washington State University Pullman, WA, USA
| | - Dorrie Main
- Department of Horticulture, Washington State University Pullman, WA, USA
| |
Collapse
|
30
|
Seol YJ, Lee TH, Park DS, Kim CK. NABIC: A New Access Portal to Search, Visualize, and Share Agricultural Genomics Data. Evol Bioinform Online 2016; 12:51-8. [PMID: 26848255 PMCID: PMC4737523 DOI: 10.4137/ebo.s34493] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2015] [Revised: 12/15/2015] [Accepted: 12/20/2015] [Indexed: 01/12/2023] Open
Abstract
The National Agricultural Biotechnology Information Center developed an access portal to search, visualize, and share agricultural genomics data with a focus on South Korean information and resources. The portal features an agricultural biotechnology database containing a wide range of omics data from public and proprietary sources. We collected 28.4 TB of data from 162 agricultural organisms, with 10 types of omics data comprising next-generation sequencing sequence read archive, genome, gene, nucleotide, DNA chip, expressed sequence tag, interactome, protein structure, molecular marker, and single-nucleotide polymorphism datasets. Our genomic resources contain information on five animals, seven plants, and one fungus, which is accessed through a genome browser. We also developed a data submission and analysis system as a web service, with easy-to-use functions and cutting-edge algorithms, including those for handling next-generation sequencing data.
Collapse
Affiliation(s)
- Young-Joo Seol
- Genomics Division, National Academy of Agricultural Science (NAAS), Jeonju, Korea
| | - Tae-Ho Lee
- Genomics Division, National Academy of Agricultural Science (NAAS), Jeonju, Korea
| | - Dong-Suk Park
- Genomics Division, National Academy of Agricultural Science (NAAS), Jeonju, Korea
| | - Chang-Kug Kim
- Genomics Division, National Academy of Agricultural Science (NAAS), Jeonju, Korea
| |
Collapse
|
31
|
Brozovic M, Martin C, Dantec C, Dauga D, Mendez M, Simion P, Percher M, Laporte B, Scornavacca C, Di Gregorio A, Fujiwara S, Gineste M, Lowe EK, Piette J, Racioppi C, Ristoratore F, Sasakura Y, Takatori N, Brown TC, Delsuc F, Douzery E, Gissi C, McDougall A, Nishida H, Sawada H, Swalla BJ, Yasuo H, Lemaire P. ANISEED 2015: a digital framework for the comparative developmental biology of ascidians. Nucleic Acids Res 2016; 44:D808-18. [PMID: 26420834 PMCID: PMC4702943 DOI: 10.1093/nar/gkv966] [Citation(s) in RCA: 52] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2015] [Accepted: 09/14/2015] [Indexed: 11/24/2022] Open
Abstract
Ascidians belong to the tunicates, the sister group of vertebrates and are recognized model organisms in the field of embryonic development, regeneration and stem cells. ANISEED is the main information system in the field of ascidian developmental biology. This article reports the development of the system since its initial publication in 2010. Over the past five years, we refactored the system from an initial custom schema to an extended version of the Chado schema and redesigned all user and back end interfaces. This new architecture was used to improve and enrich the description of Ciona intestinalis embryonic development, based on an improved genome assembly and gene model set, refined functional gene annotation, and anatomical ontologies, and a new collection of full ORF cDNAs. The genomes of nine ascidian species have been sequenced since the release of the C. intestinalis genome. In ANISEED 2015, all nine new ascidian species can be explored via dedicated genome browsers, and searched by Blast. In addition, ANISEED provides full functional gene annotation, anatomical ontologies and some gene expression data for the six species with highest quality genomes. ANISEED is publicly available at: http://www.aniseed.cnrs.fr.
Collapse
Affiliation(s)
- Matija Brozovic
- Centre de Recherches de Biochimie Macromoléculaire (CRBM), UMR5237, CNRS-Université de Montpellier, 1919 route de Mende, F-34090 Montpellier, France
| | - Cyril Martin
- Centre de Recherches de Biochimie Macromoléculaire (CRBM), UMR5237, CNRS-Université de Montpellier, 1919 route de Mende, F-34090 Montpellier, France
| | - Christelle Dantec
- Centre de Recherches de Biochimie Macromoléculaire (CRBM), UMR5237, CNRS-Université de Montpellier, 1919 route de Mende, F-34090 Montpellier, France
| | - Delphine Dauga
- Institut de Biologie du Développement de Marseille (IBDM), UMR7288 CNRS-Aix Marseille Université, Parc Scientifique de Luminy, Case 907, F-13288 Marseille Cedex 9, France Bioself Communication, 28 rue de la Bibliothèque, F-13001 Marseille, France
| | - Mickaël Mendez
- Centre de Recherches de Biochimie Macromoléculaire (CRBM), UMR5237, CNRS-Université de Montpellier, 1919 route de Mende, F-34090 Montpellier, France
| | - Paul Simion
- Institut des Sciences de l'Evolution de Montpellier (ISE-M), UMR 5554 CNRS-IRD-Université de Montpellier, F-34090 Montpellier, France
| | - Madeline Percher
- Centre de Recherches de Biochimie Macromoléculaire (CRBM), UMR5237, CNRS-Université de Montpellier, 1919 route de Mende, F-34090 Montpellier, France
| | - Baptiste Laporte
- Institut de Biologie du Développement de Marseille (IBDM), UMR7288 CNRS-Aix Marseille Université, Parc Scientifique de Luminy, Case 907, F-13288 Marseille Cedex 9, France
| | - Céline Scornavacca
- Institut des Sciences de l'Evolution de Montpellier (ISE-M), UMR 5554 CNRS-IRD-Université de Montpellier, F-34090 Montpellier, France
| | - Anna Di Gregorio
- Department of Basic Science and Craniofacial Biology New York University College of Dentistry, 345 E 24th Street, New York, NY 10010, USA
| | - Shigeki Fujiwara
- Department of Applied Science, Kochi University, Kochi-shi, Kochi 780-8520, Japan
| | - Mathieu Gineste
- Centre de Recherches de Biochimie Macromoléculaire (CRBM), UMR5237, CNRS-Université de Montpellier, 1919 route de Mende, F-34090 Montpellier, France
| | - Elijah K Lowe
- Department of Biology and Evolution of Marine Organisms, Stazione Zoologica Anton Dohrn, Villa Comunale, I-80121 Napoli, Italy BEACON Center for the Study of Evolution in Action, Michigan State University, East Lansing, Michigan, USA
| | - Jacques Piette
- Centre de Recherches de Biochimie Macromoléculaire (CRBM), UMR5237, CNRS-Université de Montpellier, 1919 route de Mende, F-34090 Montpellier, France
| | - Claudia Racioppi
- Center for Developmental Genetics, Department of Biology, New York University, New York, NY 10003, USA
| | - Filomena Ristoratore
- Department of Biology and Evolution of Marine Organisms, Stazione Zoologica Anton Dohrn, Villa Comunale, I-80121 Napoli, Italy
| | - Yasunori Sasakura
- Shimoda Marine Research Center, University of Tsukuba, Shimoda, Shizuoka 415-0025, Japan
| | - Naohito Takatori
- Developmental Biology Laboratory, Department of Biological Sciences, School of Science and Engineering, Tokyo Metropolitan University, 1-1 Minamioosawa, Hachiooji, Tokyo 192-0397, Japan Department of Biological Sciences, Graduate School of Science, Osaka University, 1-1 Machikaneyama-cho, Toyonaka, Osaka 560-0043, Japan
| | - Titus C Brown
- Population Health and Reproduction, UC Davis, Davis, CA 95616, USA
| | - Frédéric Delsuc
- Institut des Sciences de l'Evolution de Montpellier (ISE-M), UMR 5554 CNRS-IRD-Université de Montpellier, F-34090 Montpellier, France
| | - Emmanuel Douzery
- Institut des Sciences de l'Evolution de Montpellier (ISE-M), UMR 5554 CNRS-IRD-Université de Montpellier, F-34090 Montpellier, France
| | - Carmela Gissi
- Dipartimento di Bioscienze, Università degli Studi di Milano, Via Celoria 26, Milano 20133, Italy
| | - Alex McDougall
- Sorbonne Universités, Université Pierre et Marie Curie, CNRS, Laboratoire de Biologie du Développement de Villefranche-sur-mer, Observatoire Océanologique, F-06230 Villefranche-sur-mer, France
| | - Hiroki Nishida
- Department of Biological Sciences, Graduate School of Science, Osaka University, 1-1 Machikaneyama-cho, Toyonaka, Osaka 560-0043, Japan
| | - Hitoshi Sawada
- Sugashima Marine Biological Laboratory, Graduate School of Science, Nagoya University, 429-63 Sugashima, Toba 517-0004, Japan
| | - Billie J Swalla
- Friday Harbor Laboratories, 620 University Road, Friday Harbor, WA 98250-9299, USA
| | - Hitoyoshi Yasuo
- Sorbonne Universités, Université Pierre et Marie Curie, CNRS, Laboratoire de Biologie du Développement de Villefranche-sur-mer, Observatoire Océanologique, F-06230 Villefranche-sur-mer, France
| | - Patrick Lemaire
- Centre de Recherches de Biochimie Macromoléculaire (CRBM), UMR5237, CNRS-Université de Montpellier, 1919 route de Mende, F-34090 Montpellier, France Institut de Biologie du Développement de Marseille (IBDM), UMR7288 CNRS-Aix Marseille Université, Parc Scientifique de Luminy, Case 907, F-13288 Marseille Cedex 9, France
| |
Collapse
|
32
|
Campbell MS, Yandell M. An Introduction to Genome Annotation. CURRENT PROTOCOLS IN BIOINFORMATICS 2015; 52:4.1.1-4.1.17. [PMID: 26678385 DOI: 10.1002/0471250953.bi0401s52] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
Genome projects have evolved from large international undertakings to tractable endeavors for a single lab. Accurate genome annotation is critical for successful genomic, genetic, and molecular biology experiments. These annotations can be generated using a number of approaches and available software tools. This unit describes methods for genome annotation and a number of software tools commonly used in gene annotation.
Collapse
Affiliation(s)
- Michael S Campbell
- Eccles Institute of Human Genetics, University of Utah, Salt Lake City, Utah
| | - Mark Yandell
- Eccles Institute of Human Genetics, University of Utah, Salt Lake City, Utah.,USTAR Center for Genetic Discovery, University of Utah, Salt Lake City, Utah
| |
Collapse
|
33
|
Wang X, Wu J, Liang J, Cheng F, Wang X. Brassica database (BRAD) version 2.0: integrating and mining Brassicaceae species genomic resources. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2015; 2015:bav093. [PMID: 26589635 PMCID: PMC4653866 DOI: 10.1093/database/bav093] [Citation(s) in RCA: 48] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/04/2015] [Accepted: 08/31/2015] [Indexed: 11/15/2022]
Abstract
The Brassica database (BRAD) was built initially to assist users apply Brassica rapa and Arabidopsis thaliana genomic data efficiently to their research. However, many Brassicaceae genomes have been sequenced and released after its construction. These genomes are rich resources for comparative genomics, gene annotation and functional evolutionary studies of Brassica crops. Therefore, we have updated BRAD to version 2.0 (V2.0). In BRAD V2.0, 11 more Brassicaceae genomes have been integrated into the database, namely those of Arabidopsis lyrata, Aethionema arabicum, Brassica oleracea, Brassica napus, Camelina sativa, Capsella rubella, Leavenworthia alabamica, Sisymbrium irio and three extremophiles Schrenkiella parvula, Thellungiella halophila and Thellungiella salsuginea. BRAD V2.0 provides plots of syntenic genomic fragments between pairs of Brassicaceae species, from the level of chromosomes to genomic blocks. The Generic Synteny Browser (GBrowse_syn), a module of the Genome Browser (GBrowse), is used to show syntenic relationships between multiple genomes. Search functions for retrieving syntenic and non-syntenic orthologs, as well as their annotation and sequences are also provided. Furthermore, genome and annotation information have been imported into GBrowse so that all functional elements can be visualized in one frame. We plan to continually update BRAD by integrating more Brassicaceae genomes into the database. Database URL:http://brassicadb.org/brad/
Collapse
Affiliation(s)
- Xiaobo Wang
- Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Jian Wu
- Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Jianli Liang
- Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Feng Cheng
- Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Xiaowu Wang
- Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| |
Collapse
|
34
|
Dash S, Campbell JD, Cannon EKS, Cleary AM, Huang W, Kalberer SR, Karingula V, Rice AG, Singh J, Umale PE, Weeks NT, Wilkey AP, Farmer AD, Cannon SB. Legume information system (LegumeInfo.org): a key component of a set of federated data resources for the legume family. Nucleic Acids Res 2015; 44:D1181-8. [PMID: 26546515 PMCID: PMC4702835 DOI: 10.1093/nar/gkv1159] [Citation(s) in RCA: 96] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2015] [Accepted: 10/19/2015] [Indexed: 12/02/2022] Open
Abstract
Legume Information System (LIS), at http://legumeinfo.org, is a genomic data portal (GDP) for the legume family. LIS provides access to genetic and genomic information for major crop and model legumes. With more than two-dozen domesticated legume species, there are numerous specialists working on particular species, and also numerous GDPs for these species. LIS has been redesigned in the last three years both to better integrate data sets across the crop and model legumes, and to better accommodate specialized GDPs that serve particular legume species. To integrate data sets, LIS provides genome and map viewers, holds synteny mappings among all sequenced legume species and provides a set of gene families to allow traversal among orthologous and paralogous sequences across the legumes. To better accommodate other specialized GDPs, LIS uses open-source GMOD components where possible, and advocates use of common data templates, formats, schemas and interfaces so that data collected by one legume research community are accessible across all legume GDPs, through similar interfaces and using common APIs. This federated model for the legumes is managed as part of the ‘Legume Federation’ project (accessible via http://legumefederation.org), which can be thought of as an umbrella project encompassing LIS and other legume GDPs.
Collapse
Affiliation(s)
- Sudhansu Dash
- National Center for Genome Resources, Santa Fe, NM 87505, USA
| | | | | | - Alan M Cleary
- National Center for Genome Resources, Santa Fe, NM 87505, USA Dept. of Computer Science, Montana State University, Bozeman, MT 59715, USA
| | - Wei Huang
- Dept. of Agronomy, Iowa State University, Ames, IA 50011, USA
| | - Scott R Kalberer
- USDA-ARS Corn Insects and Crop Genetics Research Unit, Crop Genome Informatics Lab, Iowa State University, Ames, IA 50011, USA
| | - Vijay Karingula
- Dept. of Agronomy, Iowa State University, Ames, IA 50011, USA
| | - Alex G Rice
- National Center for Genome Resources, Santa Fe, NM 87505, USA
| | - Jugpreet Singh
- ORISE Fellow, USDA-Agricultural Research Service, Corn Insects and Crop Genetics Research Unit, Ames, IA 50011, USA
| | - Pooja E Umale
- National Center for Genome Resources, Santa Fe, NM 87505, USA
| | - Nathan T Weeks
- USDA-ARS Corn Insects and Crop Genetics Research Unit, Crop Genome Informatics Lab, Iowa State University, Ames, IA 50011, USA
| | - Andrew P Wilkey
- Dept. of Agronomy, Iowa State University, Ames, IA 50011, USA
| | - Andrew D Farmer
- National Center for Genome Resources, Santa Fe, NM 87505, USA
| | - Steven B Cannon
- Dept. of Agronomy, Iowa State University, Ames, IA 50011, USA USDA-ARS Corn Insects and Crop Genetics Research Unit, Crop Genome Informatics Lab, Iowa State University, Ames, IA 50011, USA
| |
Collapse
|
35
|
Pulvirenti A, Giugno R, Distefano R, Pigola G, Mongiovi M, Giudice G, Vendramin V, Lombardo A, Cattonaro F, Ferro A. A knowledge base for Vitis vinifera functional analysis. BMC SYSTEMS BIOLOGY 2015; 9 Suppl 3:S5. [PMID: 26050794 PMCID: PMC4464603 DOI: 10.1186/1752-0509-9-s3-s5] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
Background Vitis vinifera (Grapevine) is the most important fruit species in the modern world. Wine and table grapes sales contribute significantly to the economy of major wine producing countries. The most relevant goals in wine production concern quality and safety. In order to significantly improve the achievement of these objectives and to gain biological knowledge about cultivars, a genomic approach is the most reliable strategy. The recent grapevine genome sequencing offers the opportunity to study the potential roles of genes and microRNAs in fruit maturation and other physiological and pathological processes. Although several systems allowing the analysis of plant genomes have been reported, none of them has been designed specifically for the functional analysis of grapevine genomes of cultivars under environmental stress in connection with microRNA data. Description Here we introduce a novel knowledge base, called BIOWINE, designed for the functional analysis of Vitis vinifera genomes of cultivars present in Sicily. The system allows the analysis of RNA-seq experiments of two different cultivars, namely Nero d'Avola and Nerello Mascalese. Samples were taken under different climatic conditions of phenological phases, diseases, and geographic locations. The BIOWINE web interface is equipped with data analysis modules for grapevine genomes. In particular users may analyze the current genome assembly together with the RNA-seq data through a customized version of GBrowse. The web interface allows users to perform gene set enrichment by exploiting third-party databases. Conclusions BIOWINE is a knowledge base implementing a set of bioinformatics tools for the analysis of grapevine genomes. The system aims to increase our understanding of the grapevine varieties and species of Sicilian products focusing on adaptability to different climatic conditions, phenological phases, diseases, and geographic locations.
Collapse
|
36
|
Darde TA, Sallou O, Becker E, Evrard B, Monjeaud C, Le Bras Y, Jégou B, Collin O, Rolland AD, Chalmel F. The ReproGenomics Viewer: an integrative cross-species toolbox for the reproductive science community. Nucleic Acids Res 2015; 43:W109-16. [PMID: 25883147 PMCID: PMC4489245 DOI: 10.1093/nar/gkv345] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2015] [Accepted: 04/06/2015] [Indexed: 12/23/2022] Open
Abstract
We report the development of the ReproGenomics Viewer (RGV), a multi- and cross-species working environment for the visualization, mining and comparison of published omics data sets for the reproductive science community. The system currently embeds 15 published data sets related to gametogenesis from nine model organisms. Data sets have been curated and conveniently organized into broad categories including biological topics, technologies, species and publications. RGV's modular design for both organisms and genomic tools enables users to upload and compare their data with that from the data sets embedded in the system in a cross-species manner. The RGV is freely available at http://rgv.genouest.org.
Collapse
Affiliation(s)
- Thomas A Darde
- Inserm U1085-Irset, Université de Rennes 1, F-35042 Rennes, France Institut de Recherche en Informatique et Systèmes Aléatoires (IRISA/INRIA) - GenOuest platform, Université de Rennes 1, F-35042 Rennes, France
| | - Olivier Sallou
- Institut de Recherche en Informatique et Systèmes Aléatoires (IRISA/INRIA) - GenOuest platform, Université de Rennes 1, F-35042 Rennes, France
| | | | - Bertrand Evrard
- Inserm U1085-Irset, Université de Rennes 1, F-35042 Rennes, France
| | - Cyril Monjeaud
- Institut de Recherche en Informatique et Systèmes Aléatoires (IRISA/INRIA) - GenOuest platform, Université de Rennes 1, F-35042 Rennes, France
| | - Yvan Le Bras
- Institut de Recherche en Informatique et Systèmes Aléatoires (IRISA/INRIA) - GenOuest platform, Université de Rennes 1, F-35042 Rennes, France
| | - Bernard Jégou
- Inserm U1085-Irset, Université de Rennes 1, F-35042 Rennes, France Ecole des Hautes Études en Santé Publique, Avenue du Professeur Léon-Bernard, F-35043 Rennes, France
| | - Olivier Collin
- Institut de Recherche en Informatique et Systèmes Aléatoires (IRISA/INRIA) - GenOuest platform, Université de Rennes 1, F-35042 Rennes, France
| | | | - Frédéric Chalmel
- Inserm U1085-Irset, Université de Rennes 1, F-35042 Rennes, France
| |
Collapse
|
37
|
Jaschob D, Davis TN, Riffle M. Mason: a JavaScript web site widget for visualizing and comparing annotated features in nucleotide or protein sequences. BMC Res Notes 2015; 8:70. [PMID: 25884379 PMCID: PMC4354989 DOI: 10.1186/s13104-015-1009-z] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2014] [Accepted: 02/10/2015] [Indexed: 12/29/2022] Open
Abstract
BACKGROUND Sequence feature annotations (e.g., protein domain boundaries, binding sites, and secondary structure predictions) are an essential part of biological research. Annotations are widely used by scientists during research and experimental design, and are frequently the result of biological studies. A generalized and simple means of disseminating and visualizing these data via the web would be of value to the research community. FINDINGS Mason is a web site widget designed to visualize and compare annotated features of one or more nucleotide or protein sequence. Annotated features may be of virtually any type, ranging from annotating transcription binding sites or exons and introns in DNA to secondary structure or domain boundaries in proteins. Mason is simple to use and easy to integrate into web sites. Mason has a highly dynamic and configurable interface supporting multiple sets of annotations per sequence, overlapping regions, customization of interface and user-driven events (e.g., clicks and text to appear for tooltips). It is written purely in JavaScript and SVG, requiring no 3(rd) party plugins or browser customization. CONCLUSIONS Mason is a solution for dissemination of sequence annotation data on the web. It is highly flexible, customizable, simple to use, and is designed to be easily integrated into web sites. Mason is open source and freely available at https://github.com/yeastrc/mason.
Collapse
Affiliation(s)
- Daniel Jaschob
- Department of Biochemistry, University of Washington, UW Box 357350, 1705 NE Pacific St, Seattle, WA, 98195-7350, USA.
| | - Trisha N Davis
- Department of Biochemistry, University of Washington, UW Box 357350, 1705 NE Pacific St, Seattle, WA, 98195-7350, USA.
| | - Michael Riffle
- Department of Biochemistry, University of Washington, UW Box 357350, 1705 NE Pacific St, Seattle, WA, 98195-7350, USA. .,Department of Genome Sciences, University of Washington, UW Box 357350, 1705 NE Pacific St, Seattle, WA, 98195-7350, USA.
| |
Collapse
|
38
|
Developing genome-wide microsatellite markers of bamboo and their applications on molecular marker assisted taxonomy for accessions in the genus Phyllostachys. Sci Rep 2015; 5:8018. [PMID: 25620112 PMCID: PMC4306134 DOI: 10.1038/srep08018] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2014] [Accepted: 12/29/2014] [Indexed: 01/02/2023] Open
Abstract
Morphology-based taxonomy via exiguously reproductive organ has severely limitation on bamboo taxonomy, mainly owing to infrequent and unpredictable flowering events of bamboo. Here, we present the first genome-wide analysis and application of microsatellites based on the genome of moso bamboo (Phyllostachys edulis) to assist bamboo taxonomy. Of identified 127,593 microsatellite repeat-motifs, the primers of 1,451 microsatellites were designed and 1,098 markers were physically mapped on the genome of moso bamboo. A total of 917 markers were successfully validated in 9 accessions with ~39.8% polymorphic potential. Retrieved from validated microsatellite markers, 23 markers were selected for polymorphic analysis among 78 accessions and 64 alleles were detected with an average of 2.78 alleles per primers. The cluster result indicated the majority of the accessions were consistent with their current taxonomic classification, confirming the suitability and effectiveness of the developed microsatellite markers. The variations of microsatellite marker in different species were confirmed by sequencing and in silico comparative genome mapping were investigated. Lastly, a bamboo microsatellites database (http://www.bamboogdb.org/ssr) was implemented to browse and search large information of bamboo microsatellites. Consequently, our results of microsatellite marker development are valuable for assisting bamboo taxonomy and investigating genomic studies in bamboo and related grass species.
Collapse
|
39
|
Misra G, Priya P, Bandhiwal N, Bareja N, Jain M, Bhatia S, Chattopadhyay D, Tyagi AK, Yadav G. The chickpea genomic web resource: visualization and analysis of the desi-type Cicer arietinum nuclear genome for comparative exploration of legumes. BMC PLANT BIOLOGY 2014; 14:315. [PMID: 25518738 PMCID: PMC4307184 DOI: 10.1186/s12870-014-0315-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/16/2014] [Accepted: 10/30/2014] [Indexed: 06/04/2023]
Abstract
BACKGROUND Availability of the draft nuclear genome sequences of small-seeded desi-type legume crop Cicer arietinum has provided an opportunity for investigating unique chickpea genomic features and evaluation of their biological significance. The increasing number of legume genome sequences also presents a challenge for developing reliable and information-driven bioinformatics applications suitable for comparative exploration of this important class of crop plants. RESULTS The Chickpea Genomic Web Resource (CGWR) is an implementation of a suite of web-based applications dedicated to chickpea genome visualization and comparative analysis, based on next generation sequencing and assembly of Cicer arietinum desi-type genotype ICC4958. CGWR has been designed and configured for mapping, scanning and browsing the significant chickpea genomic features in view of the important existing and potential roles played by the various legume genome projects in mutant mapping and cloning. It also enables comparative informatics of ICC4958 DNA sequence analysis with other wild and cultivated genotypes of chickpea, various other leguminous species as well as several non-leguminous model plants, to enable investigations into evolutionary processes that shape legume genomes. CONCLUSIONS CGWR is an online database offering a comprehensive visual and functional genomic analysis of the chickpea genome, along with customized maps and gene-clustering options. It is also the only plant based web resource supporting display and analysis of nucleosome positioning patterns in the genome. The usefulness of CGWR has been demonstrated with discoveries of biological significance made using this server. The CGWR is compatible with all available operating systems and browsers, and is available freely under the open source license at http://www.nipgr.res.in/CGWR/home.php.
Collapse
Affiliation(s)
- Gopal Misra
- National Institute of Plant Genome Research (NIPGR), Aruna Asaf Ali Marg, New Delhi, 110067 India
| | - Piyush Priya
- National Institute of Plant Genome Research (NIPGR), Aruna Asaf Ali Marg, New Delhi, 110067 India
| | - Nitesh Bandhiwal
- National Institute of Plant Genome Research (NIPGR), Aruna Asaf Ali Marg, New Delhi, 110067 India
| | - Neha Bareja
- National Institute of Plant Genome Research (NIPGR), Aruna Asaf Ali Marg, New Delhi, 110067 India
| | - Mukesh Jain
- National Institute of Plant Genome Research (NIPGR), Aruna Asaf Ali Marg, New Delhi, 110067 India
| | - Sabhyata Bhatia
- National Institute of Plant Genome Research (NIPGR), Aruna Asaf Ali Marg, New Delhi, 110067 India
| | - Debasis Chattopadhyay
- National Institute of Plant Genome Research (NIPGR), Aruna Asaf Ali Marg, New Delhi, 110067 India
| | - Akhilesh K Tyagi
- National Institute of Plant Genome Research (NIPGR), Aruna Asaf Ali Marg, New Delhi, 110067 India
| | - Gitanjali Yadav
- National Institute of Plant Genome Research (NIPGR), Aruna Asaf Ali Marg, New Delhi, 110067 India
| |
Collapse
|
40
|
Wang L, Yu J, Li D, Zhang X. Sinbase: An Integrated Database to Study Genomics, Genetics and Comparative Genomics in Sesamum indicum. ACTA ACUST UNITED AC 2014; 56:e2. [DOI: 10.1093/pcp/pcu175] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
|
41
|
Wodke JAH, Alibés A, Cozzuto L, Hermoso A, Yus E, Lluch-Senar M, Serrano L, Roma G. MyMpn: a database for the systems biology model organism Mycoplasma pneumoniae. Nucleic Acids Res 2014; 43:D618-23. [PMID: 25378328 PMCID: PMC4383923 DOI: 10.1093/nar/gku1105] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
MyMpn (http://mympn.crg.eu) is an online resource devoted to studying the human pathogen Mycoplasma pneumoniae, a minimal bacterium causing lower respiratory tract infections. Due to its small size, its ability to grow in vitro, and the amount of data produced over the past decades, M. pneumoniae is an interesting model organisms for the development of systems biology approaches for unicellular organisms. Our database hosts a wealth of omics-scale datasets generated by hundreds of experimental and computational analyses. These include data obtained from gene expression profiling experiments, gene essentiality studies, protein abundance profiling, protein complex analysis, metabolic reactions and network modeling, cell growth experiments, comparative genomics and 3D tomography. In addition, the intuitive web interface provides access to several visualization and analysis tools as well as to different data search options. The availability and—even more relevant—the accessibility of properly structured and organized data are of up-most importance when aiming to understand the biology of an organism on a global scale. Therefore, MyMpn constitutes a unique and valuable new resource for the large systems biology and microbiology community.
Collapse
Affiliation(s)
- Judith A H Wodke
- EMBL/CRG Systems Biology Research Unit, Centre for Genomic Regulation (CRG), Dr. Aiguader 88, 08003 Barcelona, Spain Universitat Pompeu Fabra (UPF), Dr. Aiguader 88, 08003 Barcelona, Spain Theoretical Biophysics, Humboldt-Universitt zu Berlin, Invalidenstr 42, 10115 Berlin, Germany
| | - Andreu Alibés
- Universitat Pompeu Fabra (UPF), Dr. Aiguader 88, 08003 Barcelona, Spain CRG Bioinformatics Facility, Centre for Genomic Regulation (CRG), Dr. Aiguader 88, 08003 Barcelona, Spain
| | - Luca Cozzuto
- Universitat Pompeu Fabra (UPF), Dr. Aiguader 88, 08003 Barcelona, Spain CRG Bioinformatics Facility, Centre for Genomic Regulation (CRG), Dr. Aiguader 88, 08003 Barcelona, Spain
| | - Antonio Hermoso
- Universitat Pompeu Fabra (UPF), Dr. Aiguader 88, 08003 Barcelona, Spain CRG Bioinformatics Facility, Centre for Genomic Regulation (CRG), Dr. Aiguader 88, 08003 Barcelona, Spain
| | - Eva Yus
- EMBL/CRG Systems Biology Research Unit, Centre for Genomic Regulation (CRG), Dr. Aiguader 88, 08003 Barcelona, Spain Universitat Pompeu Fabra (UPF), Dr. Aiguader 88, 08003 Barcelona, Spain
| | - Maria Lluch-Senar
- EMBL/CRG Systems Biology Research Unit, Centre for Genomic Regulation (CRG), Dr. Aiguader 88, 08003 Barcelona, Spain Universitat Pompeu Fabra (UPF), Dr. Aiguader 88, 08003 Barcelona, Spain
| | - Luis Serrano
- EMBL/CRG Systems Biology Research Unit, Centre for Genomic Regulation (CRG), Dr. Aiguader 88, 08003 Barcelona, Spain Universitat Pompeu Fabra (UPF), Dr. Aiguader 88, 08003 Barcelona, Spain Institució Catalana de Recerca i Estudis Avançats (ICREA), Pg. Lluis Companys 23, 08010 Barcelona, Spain
| | - Guglielmo Roma
- Universitat Pompeu Fabra (UPF), Dr. Aiguader 88, 08003 Barcelona, Spain CRG Bioinformatics Facility, Centre for Genomic Regulation (CRG), Dr. Aiguader 88, 08003 Barcelona, Spain
| |
Collapse
|
42
|
Shimoyama M, De Pons J, Hayman GT, Laulederkind SJF, Liu W, Nigam R, Petri V, Smith JR, Tutaj M, Wang SJ, Worthey E, Dwinell M, Jacob H. The Rat Genome Database 2015: genomic, phenotypic and environmental variations and disease. Nucleic Acids Res 2014; 43:D743-50. [PMID: 25355511 PMCID: PMC4383884 DOI: 10.1093/nar/gku1026] [Citation(s) in RCA: 167] [Impact Index Per Article: 16.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open
Abstract
The Rat Genome Database (RGD, http://rgd.mcw.edu) provides the most comprehensive data repository and informatics platform related to the laboratory rat, one of the most important model organisms for disease studies. RGD maintains and updates datasets for genomic elements such as genes, transcripts and increasingly in recent years, sequence variations, as well as map positions for multiple assemblies and sequence information. Functional annotations for genomic elements are curated from published literature, submitted by researchers and integrated from other public resources. Complementing the genomic data catalogs are those associated with phenotypes and disease, including strains, QTL and experimental phenotype measurements across hundreds of strains. Data are submitted by researchers, acquired through bulk data pipelines or curated from published literature. Innovative software tools provide users with an integrated platform to query, mine, display and analyze valuable genomic and phenomic datasets for discovery and enhancement of their own research. This update highlights recent developments that reflect an increasing focus on: (i) genomic variation, (ii) phenotypes and diseases, (iii) data related to the environment and experimental conditions and (iv) datasets and software tools that allow the user to explore and analyze the interactions among these and their impact on disease.
Collapse
Affiliation(s)
- Mary Shimoyama
- Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, WI 53226, USA Department of Surgery, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Jeff De Pons
- Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - G Thomas Hayman
- Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | | | - Weisong Liu
- Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Rajni Nigam
- Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Victoria Petri
- Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Jennifer R Smith
- Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Marek Tutaj
- Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Shur-Jen Wang
- Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Elizabeth Worthey
- Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, WI 53226, USA Department of Pediatrics, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Melinda Dwinell
- Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, WI 53226, USA Department of Physiology, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Howard Jacob
- Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, WI 53226, USA Department of Physiology, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| |
Collapse
|
43
|
Abstract
Genome-wide association studies (GWAS) have become popular as an approach for the identification of large numbers of phenotype-associated variants. However, differences in genetic architecture and environmental factors mean that the effect of variants can vary across populations. Understanding population genetic diversity is valuable for the investigation of possible population specific and independent effects of variants. EvoSNP-DB aims to provide information regarding genetic diversity among East Asian populations, including Chinese, Japanese, and Korean. Non-redundant SNPs (1.6 million) were genotyped in 54 Korean trios (162 samples) and were compared with 4 million SNPs from HapMap phase II populations. EvoSNP-DB provides two user interfaces for data query and visualization, and integrates scores of genetic diversity (Fst and VarLD) at the level of SNPs, genes, and chromosome regions. EvoSNP-DB is a web-based application that allows users to navigate and visualize measurements of population genetic differences in an interactive manner, and is available online at [http://biomi.cdc.go.kr/EvoSNP/]. [BMB Reports 2013; 46(8): 416-421]
Collapse
Affiliation(s)
- Young Uk Kim
- Division of Bio-Medical Informatics, Center for Genome Science, National Institute of Health, Cheongwon 363-951, Korea
| | | | | | | |
Collapse
|
44
|
Thanki AS, Jimenez RC, Kaithakottil GG, Corpas M, Davey RP. wigExplorer, a BioJS component to visualise wig data. F1000Res 2014; 3:53. [PMID: 27781080 PMCID: PMC5054804 DOI: 10.12688/f1000research.3-53.v3] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 08/05/2016] [Indexed: 11/26/2022] Open
Abstract
Summary: wigExplorer is a BioJS component whose main purpose is to provide a platform for visualisation of wig-formatted data. Wig files are extensively used by genome browsers such as the UCSC Genome Browser.
wigExplorer follows the BioJS standard specification, requiring a simple configuration and installation.
wigExplorer provides an easy way to navigate the visible region of the canvas and allows interaction with other components via predefined events. Availability: http://biojs.io/d/biojs-vis-wigexplorer;http://dx.doi.org/10.5281/zenodo.8516
Collapse
Affiliation(s)
- Anil S Thanki
- Earlham Institute, Norwich Research Park, Norwich, NR4 7UH, UK
| | - Rafael C Jimenez
- European Bioinformatics Institute, Hinxton, Cambridge, CB10 1SD, UK
| | | | - Manuel Corpas
- Earlham Institute, Norwich Research Park, Norwich, NR4 7UH, UK
| | - Robert P Davey
- Earlham Institute, Norwich Research Park, Norwich, NR4 7UH, UK
| |
Collapse
|
45
|
Asamizu E, Ichihara H, Nakaya A, Nakamura Y, Hirakawa H, Ishii T, Tamura T, Fukami-Kobayashi K, Nakajima Y, Tabata S. Plant Genome DataBase Japan (PGDBj): a portal website for the integration of plant genome-related databases. PLANT & CELL PHYSIOLOGY 2014; 55:e8. [PMID: 24363285 PMCID: PMC3894704 DOI: 10.1093/pcp/pct189] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/08/2023]
Abstract
The Plant Genome DataBase Japan (PGDBj, http://pgdbj.jp/?ln=en) is a portal website that aims to integrate plant genome-related information from databases (DBs) and the literature. The PGDBj is comprised of three component DBs and a cross-search engine, which provides a seamless search over the contents of the DBs. The three DBs are as follows. (i) The Ortholog DB, providing gene cluster information based on the amino acid sequence similarity. Over 500,000 amino acid sequences of 20 Viridiplantae species were subjected to reciprocal BLAST searches and clustered. Sequences from plant genome DBs (e.g. TAIR10 and RAP-DB) were also included in the cluster with a direct link to the original DB. (ii) The Plant Resource DB, integrating the SABRE DB, which provides cDNA and genome sequence resources accumulated and maintained in the RIKEN BioResource Center and National BioResource Projects. (iii) The DNA Marker DB, providing manually or automatically curated information of DNA markers, quantitative trait loci and related linkage maps, from the literature and external DBs. As the PGDBj targets various plant species, including model plants, algae, and crops important as food, fodder and biofuel, researchers in the field of basic biology as well as a wide range of agronomic fields are encouraged to perform searches using DNA sequences, gene names, traits and phenotypes of interest. The PGDBj will return the search results from the component DBs and various types of linked external DBs.
Collapse
Affiliation(s)
- Erika Asamizu
- Department of Plant Genome Research, Kazusa DNA Research Institute, 2-6-7 Kazusa-kamatari, Kisarazu, Chiba, 292-0818 Japan
| | - Hisako Ichihara
- Department of Plant Genome Research, Kazusa DNA Research Institute, 2-6-7 Kazusa-kamatari, Kisarazu, Chiba, 292-0818 Japan
| | - Akihiro Nakaya
- Center for Transdisciplinary Research, Niigata University, 1-757 Asahimachi-dori, Chuo-ku, Niigata, 951-8585 Japan
| | - Yasukazu Nakamura
- Department of Plant Genome Research, Kazusa DNA Research Institute, 2-6-7 Kazusa-kamatari, Kisarazu, Chiba, 292-0818 Japan
| | - Hideki Hirakawa
- Department of Plant Genome Research, Kazusa DNA Research Institute, 2-6-7 Kazusa-kamatari, Kisarazu, Chiba, 292-0818 Japan
| | - Takahiro Ishii
- Department of Plant Genome Research, Kazusa DNA Research Institute, 2-6-7 Kazusa-kamatari, Kisarazu, Chiba, 292-0818 Japan
| | - Takuro Tamura
- LINE Co., Ltd., 5-201 Kandamatsunaga-cho, Tokyo, 101-0023 Japan
| | | | - Yukari Nakajima
- Department of Plant Genome Research, Kazusa DNA Research Institute, 2-6-7 Kazusa-kamatari, Kisarazu, Chiba, 292-0818 Japan
| | - Satoshi Tabata
- Department of Plant Genome Research, Kazusa DNA Research Institute, 2-6-7 Kazusa-kamatari, Kisarazu, Chiba, 292-0818 Japan
- *Corresponding author: Fax: +81-438-52-3918; E-mail,
| |
Collapse
|
46
|
Klein CC, Alves JMP, Serrano MG, Buck GA, Vasconcelos ATR, Sagot MF, Teixeira MMG, Camargo EP, Motta MCM. Biosynthesis of vitamins and cofactors in bacterium-harbouring trypanosomatids depends on the symbiotic association as revealed by genomic analyses. PLoS One 2013; 8:e79786. [PMID: 24260300 PMCID: PMC3833962 DOI: 10.1371/journal.pone.0079786] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2013] [Accepted: 09/25/2013] [Indexed: 12/03/2022] Open
Abstract
Some non-pathogenic trypanosomatids maintain a mutualistic relationship with a betaproteobacterium of the Alcaligenaceae family. Intensive nutritional exchanges have been reported between the two partners, indicating that these protozoa are excellent biological models to study metabolic co-evolution. We previously sequenced and herein investigate the entire genomes of five trypanosomatids which harbor a symbiotic bacterium (SHTs for Symbiont-Haboring Trypanosomatids) and the respective bacteria (TPEs for Trypanosomatid Proteobacterial Endosymbiont), as well as two trypanosomatids without symbionts (RTsfor Regular Trypanosomatids), for the presence of genes of the classical pathways for vitamin biosynthesis. Our data show that genes for the biosynthetic pathways of thiamine, biotin, and nicotinic acid are absent from all trypanosomatid genomes. This is in agreement with the absolute growth requirement for these vitamins in all protozoa of the family. Also absent from the genomes of RTs are the genes for the synthesis of pantothenic acid, folic acid, riboflavin, and vitamin B6. This is also in agreement with the available data showing that RTs are auxotrophic for these essential vitamins. On the other hand, SHTs are autotrophic for such vitamins. Indeed, all the genes of the corresponding biosynthetic pathways were identified, most of them in the symbiont genomes, while a few genes, mostly of eukaryotic origin, were found in the host genomes. The only exceptions to the latter are: the gene coding for the enzyme ketopantoate reductase (EC:1.1.1.169) which is related instead to the Firmicutes bacteria; and two other genes, one involved in the salvage pathway of pantothenic acid and the other in the synthesis of ubiquinone, that are related to Gammaproteobacteria. Their presence in trypanosomatids may result from lateral gene transfer. Taken together, our results reinforce the idea that the low nutritional requirement of SHTs is associated with the presence of the symbiotic bacterium, which contains most genes for vitamin production.
Collapse
Affiliation(s)
- Cecilia C. Klein
- BAMBOO Team, INRIA Grenoble-Rhône-Alpes, Villeurbanne, France
- Laboratoire Biométrie et Biologie Evolutive, Université de Lyon, Université Lyon 1, CNRS, UMR5558, Villeurbanne, France
- Laboratório Nacional de Computação Científica, Petrópolis, Rio de Janeiro, Brazil
- * E-mail: (CCK); (MCMM)
| | - João M. P. Alves
- Department of Parasitology, Institute of Biomedical Sciences, University of São Paulo, São Paulo, Brazil
| | - Myrna G. Serrano
- Virginia Commonwealth University, Richmond, Virginia, United States of America
| | - Gregory A. Buck
- Virginia Commonwealth University, Richmond, Virginia, United States of America
| | | | - Marie-France Sagot
- BAMBOO Team, INRIA Grenoble-Rhône-Alpes, Villeurbanne, France
- Laboratoire Biométrie et Biologie Evolutive, Université de Lyon, Université Lyon 1, CNRS, UMR5558, Villeurbanne, France
| | - Marta M. G. Teixeira
- Department of Parasitology, Institute of Biomedical Sciences, University of São Paulo, São Paulo, Brazil
| | - Erney P. Camargo
- Department of Parasitology, Institute of Biomedical Sciences, University of São Paulo, São Paulo, Brazil
| | - Maria Cristina M. Motta
- Instituto de Biofísica Carlos Chagas Filho, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil
- * E-mail: (CCK); (MCMM)
| |
Collapse
|
47
|
Jung S, Ficklin SP, Lee T, Cheng CH, Blenda A, Zheng P, Yu J, Bombarely A, Cho I, Ru S, Evans K, Peace C, Abbott AG, Mueller LA, Olmstead MA, Main D. The Genome Database for Rosaceae (GDR): year 10 update. Nucleic Acids Res 2013; 42:D1237-44. [PMID: 24225320 PMCID: PMC3965003 DOI: 10.1093/nar/gkt1012] [Citation(s) in RCA: 154] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023] Open
Abstract
The Genome Database for Rosaceae (GDR, http:/www.rosaceae.org), the long-standing central repository and data mining resource for Rosaceae research, has been enhanced with new genomic, genetic and breeding data, and improved functionality. Whole genome sequences of apple, peach and strawberry are available to browse or download with a range of annotations, including gene model predictions, aligned transcripts, repetitive elements, polymorphisms, mapped genetic markers, mapped NCBI Rosaceae genes, gene homologs and association of InterPro protein domains, GO terms and Kyoto Encyclopedia of Genes and Genomes pathway terms. Annotated sequences can be queried using search interfaces and visualized using GBrowse. New expressed sequence tag unigene sets are available for major genera, and Pathway data are available through FragariaCyc, AppleCyc and PeachCyc databases. Synteny among the three sequenced genomes can be viewed using GBrowse_Syn. New markers, genetic maps and extensively curated qualitative/Mendelian and quantitative trait loci are available. Phenotype and genotype data from breeding projects and genetic diversity projects are also included. Improved search pages are available for marker, trait locus, genetic diversity and publication data. New search tools for breeders enable selection comparison and assistance with breeding decision making.
Collapse
Affiliation(s)
- Sook Jung
- Department of Horticulture, Washington State University, Pullman, WA 99164-6414, USA, Department of Genetics and Biochemistry, Clemson University, Clemson, SC 29634, USA, Boyce Thompson Institute for Plant Research, Tower Road, Ithaca, NY 14853, USA, Department of Computer Science, Saginaw Valley State University, University Center, MI 48710, USA and Horticultural Sciences Department, University of Florida, Gainesville, FL 32611, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
48
|
Sanderson LA, Ficklin SP, Cheng CH, Jung S, Feltus FA, Bett KE, Main D. Tripal v1.1: a standards-based toolkit for construction of online genetic and genomic databases. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2013; 2013:bat075. [PMID: 24163125 PMCID: PMC3808541 DOI: 10.1093/database/bat075] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
Tripal is an open-source freely available toolkit for construction of online genomic and genetic databases. It aims to facilitate development of community-driven biological websites by integrating the GMOD Chado database schema with Drupal, a popular website creation and content management software. Tripal provides a suite of tools for interaction with a Chado database and display of content therein. The tools are designed to be generic to support the various ways in which data may be stored in Chado. Previous releases of Tripal have supported organisms, genomic libraries, biological stocks, stock collections and genomic features, their alignments and annotations. Also, Tripal and its extension modules provided loaders for commonly used file formats such as FASTA, GFF, OBO, GAF, BLAST XML, KEGG heir files and InterProScan XML. Default generic templates were provided for common views of biological data, which could be customized using an open Application Programming Interface to change the way data are displayed. Here, we report additional tools and functionality that are part of release v1.1 of Tripal. These include (i) a new bulk loader that allows a site curator to import data stored in a custom tab delimited format; (ii) full support of every Chado table for Drupal Views (a powerful tool allowing site developers to construct novel displays and search pages); (iii) new modules including ‘Feature Map’, ‘Genetic’, ‘Publication’, ‘Project’, ‘Contact’ and the ‘Natural Diversity’ modules. Tutorials, mailing lists, download and set-up instructions, extension modules and other documentation can be found at the Tripal website located at http://tripal.info. Database URL: http://tripal.info/
Collapse
Affiliation(s)
- Lacey-Anne Sanderson
- Department of Plant Sciences, University of Saskatchewan. Saskatoon, SK Canada, Department of Horticulture, Washington State University. Pullman, WA, USA and Department of Genetics and Biochemistry, Clemson University. Clemson, SC, USA
| | | | | | | | | | | | | |
Collapse
|
49
|
Yu J, Zhao M, Wang X, Tong C, Huang S, Tehrim S, Liu Y, Hua W, Liu S. Bolbase: a comprehensive genomics database for Brassica oleracea. BMC Genomics 2013; 14:664. [PMID: 24079801 PMCID: PMC3849793 DOI: 10.1186/1471-2164-14-664] [Citation(s) in RCA: 86] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2013] [Accepted: 09/25/2013] [Indexed: 11/10/2022] Open
Abstract
Background Brassica oleracea is a morphologically diverse species in the family Brassicaceae and contains a group of nutrition-rich vegetable crops, including common heading cabbage, cauliflower, broccoli, kohlrabi, kale, Brussels sprouts. This diversity along with its phylogenetic membership in a group of three diploid and three tetraploid species, and the recent availability of genome sequences within Brassica provide an unprecedented opportunity to study intra- and inter-species divergence and evolution in this species and its close relatives. Description We have developed a comprehensive database, Bolbase, which provides access to the B. oleracea genome data and comparative genomics information. The whole genome of B. oleracea is available, including nine fully assembled chromosomes and 1,848 scaffolds, with 45,758 predicted genes, 13,382 transposable elements, and 3,581 non-coding RNAs. Comparative genomics information is available, including syntenic regions among B. oleracea, Brassica rapa and Arabidopsis thaliana, synonymous (Ks) and non-synonymous (Ka) substitution rates between orthologous gene pairs, gene families or clusters, and differences in quantity, category, and distribution of transposable elements on chromosomes. Bolbase provides useful search and data mining tools, including a keyword search, a local BLAST server, and a customized GBrowse tool, which can be used to extract annotations of genome components, identify similar sequences and visualize syntenic regions among species. Users can download all genomic data and explore comparative genomics in a highly visual setting. Conclusions Bolbase is the first resource platform for the B. oleracea genome and for genomic comparisons with its relatives, and thus it will help the research community to better study the function and evolution of Brassica genomes as well as enhance molecular breeding research. This database will be updated regularly with new features, improvements to genome annotation, and new genomic sequences as they become available. Bolbase is freely available at http://ocri-genomics.org/bolbase.
Collapse
Affiliation(s)
- Jingyin Yu
- The Key Laboratory of Oil Crops Biology and Genetic Breeding, the Ministry of Agriculture, Oil Crops Research Institute, the Chinese Academy of Agricultural Sciences, Wuhan 430062, China.
| | | | | | | | | | | | | | | | | |
Collapse
|
50
|
Sakurai T, Mochida K, Yoshida T, Akiyama K, Ishitani M, Seki M, Shinozaki K. Genome-wide discovery and information resource development of DNA polymorphisms in cassava. PLoS One 2013; 8:e74056. [PMID: 24040164 PMCID: PMC3770675 DOI: 10.1371/journal.pone.0074056] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2013] [Accepted: 07/29/2013] [Indexed: 01/06/2023] Open
Abstract
Cassava (Manihot esculenta Crantz) is an important crop that provides food security and income generation in many tropical countries, and is known for its adaptability to various environmental conditions. Its draft genome sequence and many expressed sequence tags are now publicly available, allowing the development of cassava polymorphism information. Here, we describe the genome-wide discovery of cassava DNA polymorphisms. Using the alignment of predicted transcribed sequences from the cassava draft genome sequence and ESTs from GenBank, we discovered 10,546 single-nucleotide polymorphisms and 647 insertions and deletions. To facilitate molecular marker development for cassava, we designed 9,316 PCR primer pairs to amplify the genomic region around each DNA polymorphism. Of the discovered SNPs, 62.7% occurred in protein-coding regions. Disease-resistance genes were found to have a significantly higher ratio of nonsynonymous-to-synonymous substitutions. We identified 24 read-through (changes of a stop codon to a coding codon) and 38 premature stop (changes of a coding codon to a stop codon) single-nucleotide polymorphisms, and found that the 5 gene ontology terms in biological process were significantly different in genes with read-through single-nucleotide polymorphisms compared with all cassava genes. All data on the discovered DNA polymorphisms were organized into the Cassava Online Archive database, which is available at http://cassava.psc.riken.jp/.
Collapse
Affiliation(s)
- Tetsuya Sakurai
- RIKEN Center for Sustainable Resource Science, Tsurumi-ku, Yokohama, Kanagawa, Japan
| | - Keiichi Mochida
- RIKEN Center for Sustainable Resource Science, Tsurumi-ku, Yokohama, Kanagawa, Japan
- RIKEN Biomass Engineering Program, Tsurumi-ku, Yokohama, Kanagawa, Japan
- Kihara Institute for Biological Research, Yokohama City University, Totsuka-ku, Yokohama, Kanagawa, Japan
| | - Takuhiro Yoshida
- RIKEN Center for Sustainable Resource Science, Tsurumi-ku, Yokohama, Kanagawa, Japan
| | - Kenji Akiyama
- RIKEN Center for Sustainable Resource Science, Tsurumi-ku, Yokohama, Kanagawa, Japan
| | - Manabu Ishitani
- Agrobiodiversity Research Area, International Center for Tropical Agriculture (CIAT), Cali, Colombia
| | - Motoaki Seki
- RIKEN Center for Sustainable Resource Science, Tsurumi-ku, Yokohama, Kanagawa, Japan
- Kihara Institute for Biological Research, Yokohama City University, Totsuka-ku, Yokohama, Kanagawa, Japan
| | - Kazuo Shinozaki
- RIKEN Center for Sustainable Resource Science, Tsurumi-ku, Yokohama, Kanagawa, Japan
- RIKEN Biomass Engineering Program, Tsurumi-ku, Yokohama, Kanagawa, Japan
| |
Collapse
|