1
|
da Silva Pescador G, Baia Amaral D, Varberg JM, Zhang Y, Hao Y, Florens L, Bazzini AA. Protein profiling of zebrafish embryos unmasks regulatory layers during early embryogenesis. Cell Rep 2024; 43:114769. [PMID: 39302832 DOI: 10.1016/j.celrep.2024.114769] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2024] [Revised: 07/22/2024] [Accepted: 08/30/2024] [Indexed: 09/22/2024] Open
Abstract
The maternal-to-zygotic transition is crucial in embryonic development, marked by the degradation of maternally provided mRNAs and initiation of zygotic gene expression. However, the changes occurring at the protein level during this transition remain unclear. Here, we conducted protein profiling throughout zebrafish embryogenesis using quantitative mass spectrometry, integrating transcriptomics and translatomics datasets. Our data show that, unlike RNA changes, protein changes are less dynamic. Further, increases in protein levels correlate with mRNA translation, whereas declines in protein levels do not, suggesting active protein degradation processes. Interestingly, proteins from pure zygotic genes are present at fertilization, challenging existing mRNA-based gene classifications. As a proof of concept, we utilized CRISPR-Cas13d to target znf281b mRNA, a gene whose protein significantly accumulates within the first 2 h post-fertilization, demonstrating its crucial role in development. Consequently, our protein profiling, coupled with CRISPR-Cas13d, offers a complementary approach to unraveling maternal factor function during embryonic development.
Collapse
Affiliation(s)
| | | | - Joseph M Varberg
- Stowers Institute for Medical Research, Kansas City, MO 64110, USA
| | - Ying Zhang
- Stowers Institute for Medical Research, Kansas City, MO 64110, USA
| | - Yan Hao
- Stowers Institute for Medical Research, Kansas City, MO 64110, USA
| | - Laurence Florens
- Stowers Institute for Medical Research, Kansas City, MO 64110, USA
| | - Ariel A Bazzini
- Stowers Institute for Medical Research, Kansas City, MO 64110, USA; Department of Molecular and Integrative Physiology, University of Kansas School of Medicine, Kansas City, KS 66160, USA.
| |
Collapse
|
2
|
Harrison MC, Opulente DA, Wolters JF, Shen XX, Zhou X, Groenewald M, Hittinger CT, Rokas A, LaBella AL. Exploring Saccharomycotina Yeast Ecology Through an Ecological Ontology Framework. Yeast 2024; 41:615-628. [PMID: 39295298 PMCID: PMC11522959 DOI: 10.1002/yea.3981] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2024] [Revised: 08/26/2024] [Accepted: 09/03/2024] [Indexed: 09/21/2024] Open
Abstract
Yeasts in the subphylum Saccharomycotina are found across the globe in disparate ecosystems. A major aim of yeast research is to understand the diversity and evolution of ecological traits, such as carbon metabolic breadth, insect association, and cactophily. This includes studying aspects of ecological traits like genetic architecture or association with other phenotypic traits. Genomic resources in the Saccharomycotina have grown rapidly. Ecological data, however, are still limited for many species, especially those only known from species descriptions where usually only a limited number of strains are studied. Moreover, ecological information is recorded in natural language format limiting high throughput computational analysis. To address these limitations, we developed an ontological framework for the analysis of yeast ecology. A total of 1,088 yeast strains were added to the Ontology of Yeast Environments (OYE) and analyzed in a machine-learning framework to connect genotype to ecology. This framework is flexible and can be extended to additional isolates, species, or environmental sequencing data. Widespread adoption of OYE would greatly aid the study of macroecology in the Saccharomycotina subphylum.
Collapse
Affiliation(s)
- Marie-Claire Harrison
- Department of Biological Sciences, Vanderbilt University, Nashville, Tennessee, USA
- Evolutionary Studies Initiative, Vanderbilt University, Nashville, Tennessee, USA
| | - Dana A. Opulente
- Department of Biology, Villanova University, Villanova, Pennsylvania, USA
- Laboratory of Genetics, DOE Great Lakes Bioenergy Research Center, Center for Genomic Science Innovation, Wisconsin Energy Institute, J. F. Crow Institute for the Study of Evolution, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - John F. Wolters
- Laboratory of Genetics, DOE Great Lakes Bioenergy Research Center, Center for Genomic Science Innovation, Wisconsin Energy Institute, J. F. Crow Institute for the Study of Evolution, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Xing-Xing Shen
- Centre for Evolutionary and Organismal Biology, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
| | - Xiaofan Zhou
- Guangdong Province Key Laboratory of Microbial Signals and Disease Control, Integrative Microbiology Research Center, South China Agricultural University, Guangzhou, China
| | | | - Chris Todd Hittinger
- Laboratory of Genetics, DOE Great Lakes Bioenergy Research Center, Center for Genomic Science Innovation, Wisconsin Energy Institute, J. F. Crow Institute for the Study of Evolution, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Antonis Rokas
- Department of Biological Sciences, Vanderbilt University, Nashville, Tennessee, USA
- Evolutionary Studies Initiative, Vanderbilt University, Nashville, Tennessee, USA
| | - Abigail Leavitt LaBella
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Kannapolis, North Carolina, USA
- Center for Computational Intelligence to Predict Health and Environmental Risks (CIPHER), University of North Carolina at Charlotte, Charlotte, North Carolina, USA
| |
Collapse
|
3
|
Napoli AJ, Laderwager S, Zoodsma JD, Biju B, Mucollari O, Schubel SK, Aprea C, Sayed A, Morgan K, Napoli A, Flanagan S, Wollmuth LP, Sirotkin HI. Developmental loss of NMDA receptors results in supernumerary forebrain neurons through delayed maturation of transit-amplifying neuroblasts. Sci Rep 2024; 14:3395. [PMID: 38336823 PMCID: PMC10858180 DOI: 10.1038/s41598-024-53910-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2023] [Accepted: 02/06/2024] [Indexed: 02/12/2024] Open
Abstract
Developmental neurogenesis is a tightly regulated spatiotemporal process with its dysregulation implicated in neurodevelopmental disorders. NMDA receptors are glutamate-gated ion channels that are widely expressed in the early nervous system, yet their contribution to neurogenesis is poorly understood. Notably, a variety of mutations in genes encoding NMDA receptor subunits are associated with neurodevelopmental disorders. To rigorously define the role of NMDA receptors in developmental neurogenesis, we used a mutant zebrafish line (grin1-/-) that lacks all NMDA receptors yet survives to 10 days post-fertilization, offering the opportunity to study post-embryonic neurodevelopment in the absence of NMDA receptors. Focusing on the forebrain, we find that these fish have a progressive supernumerary neuron phenotype confined to the telencephalon at the end of embryonic neurogenesis, but which extends to all forebrain regions during postembryonic neurogenesis. This enhanced neuron population does not arise directly from increased numbers or mitotic activity of radial glia cells, the principal neural stem cells. Rather, it stems from a lack of timely maturation of transit-amplifying neuroblasts into post-mitotic neurons, as indicated by a decrease in expression of the ontogenetically-expressed chloride transporter, KCC2. Pharmacological blockade with MK-801 recapitulates the grin1-/- supernumerary neuron phenotype, indicating a requirement for ionotropic signaling. Thus, NMDA receptors are required for suppression of indirect, transit amplifying cell-driven neurogenesis by promoting maturational termination of mitosis. Loss of suppression results in neuronal overpopulation that can fundamentally change brain circuitry and may be a key factor in pathogenesis of neurodevelopmental disorders caused by NMDA receptor dysfunction.
Collapse
Affiliation(s)
- Amalia J Napoli
- Department of Neurobiology and Behavior, Stony Brook University, Stony Brook, NY, 11794-5230, USA
| | - Stephanie Laderwager
- Department of Neurobiology and Behavior, Stony Brook University, Stony Brook, NY, 11794-5230, USA
- Graduate Program in Neuroscience, Stony Brook University, Stony Brook, NY, 11794-5230, USA
| | - Josiah D Zoodsma
- Department of Neurobiology and Behavior, Stony Brook University, Stony Brook, NY, 11794-5230, USA
| | - Bismi Biju
- Department of Neurobiology and Behavior, Stony Brook University, Stony Brook, NY, 11794-5230, USA
| | - Olgerta Mucollari
- Department of Neurobiology and Behavior, Stony Brook University, Stony Brook, NY, 11794-5230, USA
| | - Sarah K Schubel
- Department of Neurobiology and Behavior, Stony Brook University, Stony Brook, NY, 11794-5230, USA
| | - Christieann Aprea
- Department of Neurobiology and Behavior, Stony Brook University, Stony Brook, NY, 11794-5230, USA
| | - Aaliya Sayed
- Department of Neurobiology and Behavior, Stony Brook University, Stony Brook, NY, 11794-5230, USA
| | - Kiele Morgan
- Department of Neurobiology and Behavior, Stony Brook University, Stony Brook, NY, 11794-5230, USA
| | - Annelysia Napoli
- Department of Neurobiology and Behavior, Stony Brook University, Stony Brook, NY, 11794-5230, USA
| | - Stephanie Flanagan
- Department of Neurobiology and Behavior, Stony Brook University, Stony Brook, NY, 11794-5230, USA
| | - Lonnie P Wollmuth
- Department of Neurobiology and Behavior, Stony Brook University, Stony Brook, NY, 11794-5230, USA
- Department of Biochemistry and Cell Biology, Stony Brook University, Stony Brook, NY, 11794-5215, USA
- Center for Nervous System Disorders, Stony Brook University, Stony Brook, NY, 11794-5230, USA
| | - Howard I Sirotkin
- Department of Neurobiology and Behavior, Stony Brook University, Stony Brook, NY, 11794-5230, USA.
| |
Collapse
|
4
|
Clarke JL, Cooper LD, Poelchau MF, Berardini TZ, Elser J, Farmer AD, Ficklin S, Kumari S, Laporte MA, Nelson RT, Sadohara R, Selby P, Thessen AE, Whitehead B, Sen TZ. Data sharing and ontology use among agricultural genetics, genomics, and breeding databases and resources of the Agbiodata Consortium. Database (Oxford) 2023; 2023:baad076. [PMID: 37971715 PMCID: PMC10653126 DOI: 10.1093/database/baad076] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2023] [Accepted: 10/17/2023] [Indexed: 11/19/2023]
Abstract
Over the last couple of decades, there has been a rapid growth in the number and scope of agricultural genetics, genomics and breeding databases and resources. The AgBioData Consortium (https://www.agbiodata.org/) currently represents 44 databases and resources (https://www.agbiodata.org/databases) covering model or crop plant and animal GGB data, ontologies, pathways, genetic variation and breeding platforms (referred to as 'databases' throughout). One of the goals of the Consortium is to facilitate FAIR (Findable, Accessible, Interoperable, and Reusable) data management and the integration of datasets which requires data sharing, along with structured vocabularies and/or ontologies. Two AgBioData working groups, focused on Data Sharing and Ontologies, respectively, conducted a Consortium-wide survey to assess the current status and future needs of the members in those areas. A total of 33 researchers responded to the survey, representing 37 databases. Results suggest that data-sharing practices by AgBioData databases are in a fairly healthy state, but it is not clear whether this is true for all metadata and data types across all databases; and that, ontology use has not substantially changed since a similar survey was conducted in 2017. Based on our evaluation of the survey results, we recommend (i) providing training for database personnel in a specific data-sharing techniques, as well as in ontology use; (ii) further study on what metadata is shared, and how well it is shared among databases; (iii) promoting an understanding of data sharing and ontologies in the stakeholder community; (iv) improving data sharing and ontologies for specific phenotypic data types and formats; and (v) lowering specific barriers to data sharing and ontology use, by identifying sustainability solutions, and the identification, promotion, or development of data standards. Combined, these improvements are likely to help AgBioData databases increase development efforts towards improved ontology use, and data sharing via programmatic means. Database URL https://www.agbiodata.org/databases.
Collapse
Affiliation(s)
- Jennifer L Clarke
- Department of Statistics and Department of Food Science and Technology, University of Nebraska–Lincoln, 340 Hardin Hall North Wing, Lincoln, NE 68583, USA
| | - Laurel D Cooper
- Department of Botany and Plant Pathology, Oregon State University, 2503 Cordley Hall, Corvallis, OR 97331, USA
| | - Monica F Poelchau
- USDA, Agricultural Research Service, National Agricultural Library, 10301 Baltimore Ave, Beltsville 20705, USA
| | - Tanya Z Berardini
- The Arabidopsis Information Resource and Phoenix Bioinformatic, 39899 Balentine Drive, Suite 200, Newark, CA, USA
| | - Justin Elser
- Department of Botany and Plant Pathology, Oregon State University, 2503 Cordley Hall, Corvallis, OR 97331, USA
| | - Andrew D Farmer
- National Center for Genome Resources, 2935 Rodeo Park Dr. E., Santa Fe, NM 87505, USA
| | - Stephen Ficklin
- Department of Horticulture, Washington State University, 249 Clark Hall, PO Box 646414, Pullman, WA 99164, USA
| | - Sunita Kumari
- Cold Spring Harbor Laboratory, One Bungtown Road, Cold Spring Harbor, NY 11724, USA
| | - Marie-Angélique Laporte
- Digital Inclusion, Bioversity International, Parc Scientifique Agropolis II, 1990 Bd de la Lironde, Montpellier 34397, France
| | - Rex T Nelson
- USDA, Agricultural Research Service, Corn Insects and Crop Genetics Research Unit, Iowa State University, 716 Farmhouse Lane, Ames, IA 50011, USA
| | - Rie Sadohara
- Department of Plant, Soil, and Microbial Sciences, Michigan State University, 1066 Bogue St, East Lansing, MI 48824, USA
| | - Peter Selby
- School of Integrative Plant Science, College of Agriculture and Life Sciences, Cornell University, 215 Garden Avenue, Ithaca, NY 14850, USA
| | - Anne E Thessen
- Department of Biomedical Informatics, University of Colorado Anschutz, 1890 N. Revere Court, Mailstop F600, Aurora CO 80045, USA
| | - Brandon Whitehead
- Data Science and Informatics, Manaaki Whenua—Landcare Research, Ltd., Riddet Road, Massey University, Palmerston North 4472, New Zealand
| | - Taner Z Sen
- USDA, Agricultural Research Service, Crop Improvement Genetics Research Unit, Western Regional Research Center, 800 Buchanan St, Albany 94710, USA
- Department of Bioengineering, University of California, 306 Stanley Hall, Berkeley, CA 94720, USA
| |
Collapse
|
5
|
Bradford YM, Van Slyke CE, Howe DG, Fashena D, Frazer K, Martin R, Paddock H, Pich C, Ramachandran S, Ruzicka L, Singer A, Taylor R, Tseng WC, Westerfield M. From multiallele fish to nonstandard environments, how ZFIN assigns phenotypes, human disease models, and gene expression annotations to genes. Genetics 2023; 224:iyad032. [PMID: 36864549 PMCID: PMC10158835 DOI: 10.1093/genetics/iyad032] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2022] [Accepted: 02/13/2023] [Indexed: 03/04/2023] Open
Abstract
Danio rerio is a model organism used to investigate vertebrate development. Manipulation of the zebrafish genome and resultant gene products by mutation or targeted knockdown has made the zebrafish a good system for investigating gene function, providing a resource to investigate genetic contributors to phenotype and human disease. Phenotypic outcomes can be the result of gene mutation, targeted knockdown of gene products, manipulation of experimental conditions, or any combination thereof. Zebrafish have been used in various genetic and chemical screens to identify genetic and environmental contributors to phenotype and disease outcomes. The Zebrafish Information Network (ZFIN, zfin.org) is the central repository for genetic, genomic, and phenotypic data that result from research using D. rerio. Here we describe how ZFIN annotates phenotype, expression, and disease model data across various experimental designs, how we computationally determine wild-type gene expression, the phenotypic gene, and how these results allow us to propagate gene expression, phenotype, and disease model data to the correct gene, or gene related entity.
Collapse
Affiliation(s)
- Yvonne M Bradford
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - Ceri E Van Slyke
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - Douglas G Howe
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - David Fashena
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - Ken Frazer
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - Ryan Martin
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - Holly Paddock
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - Christian Pich
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | | | - Leyla Ruzicka
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - Amy Singer
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - Ryan Taylor
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - Wei-Chia Tseng
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - Monte Westerfield
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| |
Collapse
|
6
|
Gosline SJC, Kim DN, Pande P, Thomas DG, Truong L, Hoffman P, Barton M, Loftus J, Moran A, Hampton S, Dowson S, Franklin L, Degnan D, Anderson L, Thessen A, Tanguay RL, Anderson KA, Waters KM. The Superfund Research Program Analytics Portal: linking environmental chemical exposure to biological phenotypes. Sci Data 2023; 10:151. [PMID: 36944655 PMCID: PMC10030892 DOI: 10.1038/s41597-023-02021-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2022] [Accepted: 02/14/2023] [Indexed: 03/23/2023] Open
Abstract
The OSU/PNNL Superfund Research Program (SRP) represents a longstanding collaboration to quantify Polycyclic Aromatic Hydrocarbons (PAHs) at various superfund sites in the Pacific Northwest and assess their potential impact on human health. To link the chemical measurements to biological activity, we describe the use of the zebrafish as a high-throughput developmental toxicity model that provides quantitative measurements of the exposure to chemicals. Toward this end, we have linked over 150 PAHs found at Superfund sites to the effect of these same chemicals in zebrafish, creating a rich dataset that links environmental exposure to biological response. To quantify this response, we have implemented a dose-response modelling pipeline to calculate benchmark dose parameters which enable potency comparison across over 500 chemicals and 12 of the phenotypes measured in zebrafish. We provide a rich dataset for download and analysis as well as a web portal that provides public access to this dataset via an interactive web site designed to support exploration and re-use of these data by the scientific community at http://srp.pnnl.gov .
Collapse
Affiliation(s)
| | - Doo Nam Kim
- Pacific Northwest National Laboratory, Richland, WA, USA
| | - Paritosh Pande
- Pacific Northwest National Laboratory, Richland, WA, USA
| | | | | | | | | | - Joseph Loftus
- Pacific Northwest National Laboratory, Richland, WA, USA
| | - Addy Moran
- Pacific Northwest National Laboratory, Richland, WA, USA
| | - Shawn Hampton
- Pacific Northwest National Laboratory, Richland, WA, USA
| | - Scott Dowson
- Pacific Northwest National Laboratory, Richland, WA, USA
| | | | - David Degnan
- Pacific Northwest National Laboratory, Richland, WA, USA
| | | | - Anne Thessen
- University of Colorado Anschutz Medical School, Denver, CO, USA
| | | | | | - Katrina M Waters
- Pacific Northwest National Laboratory, Richland, WA, USA.
- Oregon State University, Corvallis, WA, USA.
| |
Collapse
|
7
|
Zahn N, James-Zorn C, Ponferrada VG, Adams DS, Grzymkowski J, Buchholz DR, Nascone-Yoder NM, Horb M, Moody SA, Vize PD, Zorn AM. Normal Table of Xenopus development: a new graphical resource. Development 2022; 149:dev200356. [PMID: 35833709 PMCID: PMC9445888 DOI: 10.1242/dev.200356] [Citation(s) in RCA: 46] [Impact Index Per Article: 23.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2021] [Accepted: 03/17/2022] [Indexed: 12/26/2022]
Abstract
Normal tables of development are essential for studies of embryogenesis, serving as an important resource for model organisms, including the frog Xenopus laevis. Xenopus has long been used to study developmental and cell biology, and is an increasingly important model for human birth defects and disease, genomics, proteomics and toxicology. Scientists utilize Nieuwkoop and Faber's classic 'Normal Table of Xenopus laevis (Daudin)' and accompanying illustrations to enable experimental reproducibility and reuse the illustrations in new publications and teaching. However, it is no longer possible to obtain permission for these copyrighted illustrations. We present 133 new, high-quality illustrations of X. laevis development from fertilization to metamorphosis, with additional views that were not available in the original collection. All the images are available on Xenbase, the Xenopus knowledgebase (http://www.xenbase.org/entry/zahn.do), for download and reuse under an attributable, non-commercial creative commons license. Additionally, we have compiled a 'Landmarks Table' of key morphological features and marker gene expression that can be used to distinguish stages quickly and reliably (https://www.xenbase.org/entry/landmarks-table.do). This new open-access resource will facilitate Xenopus research and teaching in the decades to come.
Collapse
Affiliation(s)
| | - Christina James-Zorn
- Xenbase, Division of Developmental Biology, Cincinnati Children's Hospital Research Foundation, 3333 Burnet Ave, Cincinnati, OH 45229, USA
| | - Virgilio G. Ponferrada
- Xenbase, Division of Developmental Biology, Cincinnati Children's Hospital Research Foundation, 3333 Burnet Ave, Cincinnati, OH 45229, USA
| | - Dany S. Adams
- Lucell Diagnostics Inc, 16 Stearns Street, Cambridge, MA 02138, USA
| | - Julia Grzymkowski
- Department of Molecular Biomedical Sciences, College of Veterinary Medicine, North Carolina State University, Raleigh, NC 27695, USA
| | - Daniel R. Buchholz
- Department of Biology Sciences, University of Cincinnati, Cincinnati, OH 45221, USA
| | - Nanette M. Nascone-Yoder
- Department of Molecular Biomedical Sciences, College of Veterinary Medicine, North Carolina State University, Raleigh, NC 27695, USA
| | - Marko Horb
- National Xenopus Resource, Marine Biological Laboratory, Woods Hole, MA 02543, USA
| | - Sally A. Moody
- Department of Anatomy and Cell Biology, George Washington University Medical Center, Washington, DC 20037, USA
| | - Peter D. Vize
- Xenbase, Department of Biological Science, University of Calgary, Calgary, Alberta T2N 1N4, Canada
| | - Aaron M. Zorn
- Xenbase, Division of Developmental Biology, Cincinnati Children's Hospital Research Foundation, 3333 Burnet Ave, Cincinnati, OH 45229, USA
- Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH 45229, USA
| |
Collapse
|
8
|
Alghamdi SM, Schofield PN, Hoehndorf R. How much do model organism phenotypes contribute to the computational identification of human disease genes? Dis Model Mech 2022; 15:275986. [PMID: 35758016 PMCID: PMC9366895 DOI: 10.1242/dmm.049441] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2021] [Accepted: 06/13/2022] [Indexed: 12/04/2022] Open
Abstract
Computing phenotypic similarity helps identify new disease genes and diagnose rare diseases. Genotype–phenotype data from orthologous genes in model organisms can compensate for lack of human data and increase genome coverage. In the past decade, cross-species phenotype comparisons have proven valuble, and several ontologies have been developed for this purpose. The relative contribution of different model organisms to computational identification of disease-associated genes is not fully explored. We used phenotype ontologies to semantically relate phenotypes resulting from loss-of-function mutations in model organisms to disease-associated phenotypes in humans. Semantic machine learning methods were used to measure the contribution of different model organisms to the identification of known human gene–disease associations. We found that mouse genotype–phenotype data provided the most important dataset in the identification of human disease genes by semantic similarity and machine learning over phenotype ontologies. Other model organisms' data did not improve identification over that obtained using the mouse alone, and therefore did not contribute significantly to this task. Our work impacts on the development of integrated phenotype ontologies, as well as for the use of model organism phenotypes in human genetic variant interpretation. This article has an associated First Person interview with the first author of the paper. Editor's choice: We investigated the use of model organism phenotypes in the computational identification of disease genes, identifying several data biases and concluding that mouse model phenotypes contribute most to computational disease gene identification.
Collapse
Affiliation(s)
- Sarah M Alghamdi
- Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology, 4700 KAUST, 23955 Thuwal, Saudi Arabia
| | - Paul N Schofield
- Department of Physiology, Development & Neuroscience, University of Cambridge, Downing Street, CB2 3EG, Cambridge, UK
| | - Robert Hoehndorf
- Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology, 4700 KAUST, 23955 Thuwal, Saudi Arabia
| |
Collapse
|
9
|
Agapite J, Albou LP, Aleksander SA, Alexander M, Anagnostopoulos AV, Antonazzo G, Argasinska J, Arnaboldi V, Attrill H, Becerra A, Bello SM, Blake JA, Blodgett O, Bradford YM, Bult CJ, Cain S, Calvi BR, Carbon S, Chan J, Chen WJ, Michael Cherry J, Cho J, Christie KR, Crosby MA, Davis P, da Veiga Beltrame E, De Pons JL, D’Eustachio P, Diamantakis S, Dolan ME, dos Santos G, Douglass E, Dunn B, Eagle A, Ebert D, Engel SR, Fashena D, Foley S, Frazer K, Gao S, Gibson AC, Gondwe F, Goodman J, Sian Gramates L, Grove CA, Hale P, Harris T, Thomas Hayman G, Hill DP, Howe DG, Howe KL, Hu Y, Jha S, Kadin JA, Kaufman TC, Kalita P, Karra K, Kishore R, Kwitek AE, Laulederkind SJF, Lee R, Longden I, Luypaert M, MacPherson KA, Martin R, Marygold SJ, Matthews B, McAndrews MS, Millburn G, Miyasato S, Motenko H, Moxon S, Muller HM, Mungall CJ, Muruganujan A, Mushayahama T, Nalabolu HS, Nash RS, Ng P, Nuin P, Paddock H, Paulini M, Perrimon N, Pich C, Quinton-Tulloch M, Raciti D, Ramachandran S, Richardson JE, Gelbart SR, Ruzicka L, Schaper K, Schindelman G, Shimoyama M, Simison M, Shaw DR, Shrivatsav A, Singer A, Skrzypek M, Smith CM, Smith CL, Smith JR, Stein L, Sternberg PW, Tabone CJ, Thomas PD, Thorat K, Thota J, Toro S, Tomczuk M, Trovisco V, Tutaj MA, Tutaj M, Urbano JM, Van Auken K, Van Slyke CE, Wang Q, Wang SJ, Weng S, Westerfield M, Williams G, Wilming LG, Wong ED, Wright A, Yook K, Zarowiecki M, Zhou P, Zytkovicz M. Harmonizing model organism data in the Alliance of Genome Resources. Genetics 2022; 220:iyac022. [PMID: 35380658 PMCID: PMC8982023 DOI: 10.1093/genetics/iyac022] [Citation(s) in RCA: 55] [Impact Index Per Article: 27.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2021] [Accepted: 01/26/2022] [Indexed: 02/06/2023] Open
Abstract
The Alliance of Genome Resources (the Alliance) is a combined effort of 7 knowledgebase projects: Saccharomyces Genome Database, WormBase, FlyBase, Mouse Genome Database, the Zebrafish Information Network, Rat Genome Database, and the Gene Ontology Resource. The Alliance seeks to provide several benefits: better service to the various communities served by these projects; a harmonized view of data for all biomedical researchers, bioinformaticians, clinicians, and students; and a more sustainable infrastructure. The Alliance has harmonized cross-organism data to provide useful comparative views of gene function, gene expression, and human disease relevance. The basis of the comparative views is shared calls of orthology relationships and the use of common ontologies. The key types of data are alleles and variants, gene function based on gene ontology annotations, phenotypes, association to human disease, gene expression, protein-protein and genetic interactions, and participation in pathways. The information is presented on uniform gene pages that allow facile summarization of information about each gene in each of the 7 organisms covered (budding yeast, roundworm Caenorhabditis elegans, fruit fly, house mouse, zebrafish, brown rat, and human). The harmonized knowledge is freely available on the alliancegenome.org portal, as downloadable files, and by APIs. We expect other existing and emerging knowledge bases to join in the effort to provide the union of useful data and features that each knowledge base currently provides.
Collapse
|
10
|
Fisher ME, Segerdell E, Matentzoglu N, Nenni MJ, Fortriede JD, Chu S, Pells TJ, Osumi-Sutherland D, Chaturvedi P, James-Zorn C, Sundararaj N, Lotay VS, Ponferrada V, Wang DZ, Kim E, Agalakov S, Arshinoff BI, Karimi K, Vize PD, Zorn AM. The Xenopus phenotype ontology: bridging model organism phenotype data to human health and development. BMC Bioinformatics 2022; 23:99. [PMID: 35317743 PMCID: PMC8939077 DOI: 10.1186/s12859-022-04636-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2021] [Accepted: 03/08/2022] [Indexed: 11/10/2022] Open
Abstract
Background Ontologies of precisely defined, controlled vocabularies are essential to curate the results of biological experiments such that the data are machine searchable, can be computationally analyzed, and are interoperable across the biomedical research continuum. There is also an increasing need for methods to interrelate phenotypic data easily and accurately from experiments in animal models with human development and disease. Results Here we present the Xenopus phenotype ontology (XPO) to annotate phenotypic data from experiments in Xenopus, one of the major vertebrate model organisms used to study gene function in development and disease. The XPO implements design patterns from the Unified Phenotype Ontology (uPheno), and the principles outlined by the Open Biological and Biomedical Ontologies (OBO Foundry) to maximize interoperability with other species and facilitate ongoing ontology management. Constructed in Web Ontology Language (OWL) the XPO combines the existing uPheno library of ontology design patterns with additional terms from the Xenopus Anatomy Ontology (XAO), the Phenotype and Trait Ontology (PATO) and the Gene Ontology (GO). The integration of these different ontologies into the XPO enables rich phenotypic curation, whilst the uPheno bridging axioms allows phenotypic data from Xenopus experiments to be related to phenotype data from other model organisms and human disease. Moreover, the simple post-composed uPheno design patterns facilitate ongoing XPO development as the generation of new terms and classes of terms can be substantially automated. Conclusions The XPO serves as an example of current best practices to help overcome many of the inherent challenges in harmonizing phenotype data between different species. The XPO currently consists of approximately 22,000 terms and is being used to curate phenotypes by Xenbase, the Xenopus Model Organism Knowledgebase, forming a standardized corpus of genotype–phenotype data that can be directly related to other uPheno compliant resources. Supplementary Information The online version contains supplementary material available at 10.1186/s12859-022-04636-8.
Collapse
Affiliation(s)
- Malcolm E Fisher
- Division of Developmental Biology, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, USA
| | - Erik Segerdell
- Division of Developmental Biology, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, USA
| | - Nicolas Matentzoglu
- Monarch Initiative, London, UK.,Semanticly Ltd, London, UK.,European Bioinformatics Institute (EMBL-EBI), Hinxton, UK
| | - Mardi J Nenni
- Division of Developmental Biology, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, USA
| | - Joshua D Fortriede
- Division of Developmental Biology, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, USA
| | - Stanley Chu
- Department of Biological Science, University of Calgary, Calgary, AB, Canada
| | - Troy J Pells
- Department of Biological Science, University of Calgary, Calgary, AB, Canada
| | | | - Praneet Chaturvedi
- Division of Developmental Biology, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, USA
| | - Christina James-Zorn
- Division of Developmental Biology, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, USA
| | - Nivitha Sundararaj
- Division of Developmental Biology, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, USA
| | - Vaneet S Lotay
- Department of Biological Science, University of Calgary, Calgary, AB, Canada
| | - Virgilio Ponferrada
- Division of Developmental Biology, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, USA
| | - Dong Zhuo Wang
- Department of Biological Science, University of Calgary, Calgary, AB, Canada
| | - Eugene Kim
- Department of Biological Science, University of Calgary, Calgary, AB, Canada
| | - Sergei Agalakov
- Department of Biological Science, University of Calgary, Calgary, AB, Canada
| | - Bradley I Arshinoff
- Department of Biological Science, University of Calgary, Calgary, AB, Canada
| | - Kamran Karimi
- Department of Biological Science, University of Calgary, Calgary, AB, Canada
| | - Peter D Vize
- Department of Biological Science, University of Calgary, Calgary, AB, Canada
| | - Aaron M Zorn
- Division of Developmental Biology, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, USA.
| |
Collapse
|
11
|
Lachowicz J, Niedziałek K, Rostkowska E, Szopa A, Świąder K, Szponar J, Serefko A. Zebrafish as an Animal Model for Testing Agents with Antidepressant Potential. Life (Basel) 2021; 11:life11080792. [PMID: 34440536 PMCID: PMC8401799 DOI: 10.3390/life11080792] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2021] [Revised: 08/01/2021] [Accepted: 08/03/2021] [Indexed: 12/28/2022] Open
Abstract
Depression is a serious mental disease that, according to statistics, affects 320 million people worldwide. Additionally, a current situation related to the COVID-19 pandemic has led to a significant deterioration of mental health in people around the world. So far, rodents have been treated as basic animal models used in studies on this disease, but in recent years, Danio rerio has emerged as a new organism that might serve well in preclinical experiments. Zebrafish have a lot of advantages, such as a quick reproductive cycle, transparent body during the early developmental stages, high genetic and physiological homology to humans, and low costs of maintenance. Here, we discuss the potential of the zebrafish model to be used in behavioral studies focused on testing agents with antidepressant potential.
Collapse
Affiliation(s)
- Joanna Lachowicz
- Student’s Scientific Circle at Laboratory of Preclinical Testing, Medical University of Lublin, Chodźki 1, 20-093 Lublin, Poland; (J.L.); (K.N.)
| | - Karolina Niedziałek
- Student’s Scientific Circle at Laboratory of Preclinical Testing, Medical University of Lublin, Chodźki 1, 20-093 Lublin, Poland; (J.L.); (K.N.)
| | | | - Aleksandra Szopa
- Laboratory of Preclinical Testing, Chair and Department of Applied and Social Pharmacy, Medical University of Lublin, Chodźki 1, 20-093 Lublin, Poland
- Correspondence: (A.S.); (A.S.)
| | - Katarzyna Świąder
- Chair and Department of Applied and Social Pharmacy, Medical University of Lublin, Chodźki 1, 20-093 Lublin, Poland;
| | - Jarosław Szponar
- Clinical Department of Toxicology and Cardiology, Medical University of Lublin, Chodźki 1, 20-093 Lublin, Poland;
- Toxicology Clinic, Stefan Wyszyński Regional Specialist Hospital in Lublin, Al. Kraśnicka 100, 20-718 Lublin, Poland
| | - Anna Serefko
- Laboratory of Preclinical Testing, Chair and Department of Applied and Social Pharmacy, Medical University of Lublin, Chodźki 1, 20-093 Lublin, Poland
- Correspondence: (A.S.); (A.S.)
| |
Collapse
|
12
|
Nowotarski SH, Davies EL, Robb SMC, Ross EJ, Matentzoglu N, Doddihal V, Mir M, McClain M, Sánchez Alvarado A. Planarian Anatomy Ontology: a resource to connect data within and across experimental platforms. Development 2021; 148:271068. [PMID: 34318308 PMCID: PMC8353266 DOI: 10.1242/dev.196097] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2020] [Accepted: 06/28/2021] [Indexed: 12/23/2022]
Abstract
As the planarian research community expands, the need for an interoperable data organization framework for tool building has become increasingly apparent. Such software would streamline data annotation and enhance cross-platform and cross-species searchability. We created the Planarian Anatomy Ontology (PLANA), an extendable relational framework of defined Schmidtea mediterranea (Smed) anatomical terms used in the field. At publication, PLANA contains over 850 terms describing Smed anatomy from subcellular to system levels across all life cycle stages, in intact animals and regenerating body fragments. Terms from other anatomy ontologies were imported into PLANA to promote interoperability and comparative anatomy studies. To demonstrate the utility of PLANA as a tool for data curation, we created resources for planarian embryogenesis, including a staging series and molecular fate-mapping atlas, and the Planarian Anatomy Gene Expression database, which allows retrieval of a variety of published transcript/gene expression data associated with PLANA terms. As an open-source tool built using FAIR (findable, accessible, interoperable, reproducible) principles, our strategy for continued curation and versioning of PLANA also provides a platform for community-led growth and evolution of this resource. Summary: Description of the construction of an anatomy ontology tool for planaria with examples of its potential use to curate and mine data across multiple experimental platforms.
Collapse
Affiliation(s)
- Stephanie H Nowotarski
- Stowers Institute for Medical Research, Kansas City, MO 64110, USA.,Howard Hughes Medical Institute, Chevy Chase, MD 20815, USA
| | - Erin L Davies
- Stowers Institute for Medical Research, Kansas City, MO 64110, USA.,Center for Cancer Research, National Cancer Institute, National Institutes of Health, Frederick, MD 21702, USA
| | - Sofia M C Robb
- Stowers Institute for Medical Research, Kansas City, MO 64110, USA
| | - Eric J Ross
- Stowers Institute for Medical Research, Kansas City, MO 64110, USA.,Howard Hughes Medical Institute, Chevy Chase, MD 20815, USA
| | - Nicolas Matentzoglu
- European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Viraj Doddihal
- Stowers Institute for Medical Research, Kansas City, MO 64110, USA
| | - Mol Mir
- Stowers Institute for Medical Research, Kansas City, MO 64110, USA
| | - Melainia McClain
- Stowers Institute for Medical Research, Kansas City, MO 64110, USA
| | - Alejandro Sánchez Alvarado
- Stowers Institute for Medical Research, Kansas City, MO 64110, USA.,Howard Hughes Medical Institute, Chevy Chase, MD 20815, USA
| |
Collapse
|
13
|
Yi W, Rücklin M, Poelmann RE, Aldridge DC, Richardson MK. Normal stages of embryonic development of a brood parasite, the rosy bitterling Rhodeus ocellatus (Teleostei: Cypriniformes). J Morphol 2021; 282:783-819. [PMID: 33583089 PMCID: PMC8252481 DOI: 10.1002/jmor.21335] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2020] [Revised: 02/10/2021] [Accepted: 02/10/2021] [Indexed: 12/14/2022]
Abstract
Bitterlings, a group of freshwater teleosts, provide a fascinating example among vertebrates of the evolution of brood parasitism. Their eggs are laid inside the gill chamber of their freshwater mussel hosts where they develop as brood parasites. Studies of the embryonic development of bitterlings are crucial in deciphering the evolution of their distinct early life-history. Here, we have studied 255 embryos and larvae of the rosy bitterling (Rhodeus ocellatus) using in vitro fertilization and X-ray microtomography (microCT). We describe 11 pre-hatching and 13 post-hatching developmental stages spanning the first 14 days of development, from fertilization to the free-swimming stage. In contrast to previous developmental studies of various bitterling species, the staging system we describe is character-based and therefore more compatible with the widely-used stages described for zebrafish. Our bitterling data provide new insights into to the polarity of the chorion, and into notochord vacuolization and yolk sac extension in relation to body straightening. This study represents the first application of microCT scanning to bitterling development and provides one of the most detailed systematic descriptions of development in any teleost. Our staging series will be an important tool for heterochrony analysis and other comparative studies of teleost development, and may provide insight into the co-evolution of brood parasitism.
Collapse
Affiliation(s)
- Wenjing Yi
- Institute of BiologyUniversity of Leiden, Sylvius LaboratoryLeidenthe Netherlands
- The Key Laboratory of Aquatic Biodiversity and Conservation, Institute of HydrobiologyChinese Academy of SciencesHubeiChina
| | - Martin Rücklin
- Vertebrate Evolution, Development and EcologyNaturalis Biodiversity CenterLeidenThe Netherlands
| | - Robert E. Poelmann
- Institute of BiologyUniversity of Leiden, Sylvius LaboratoryLeidenthe Netherlands
| | | | | |
Collapse
|
14
|
Bertrand S, Carvalho JE, Dauga D, Matentzoglu N, Daric V, Yu JK, Schubert M, Escrivá H. The Ontology of the Amphioxus Anatomy and Life Cycle (AMPHX). Front Cell Dev Biol 2021; 9:668025. [PMID: 33981708 PMCID: PMC8107275 DOI: 10.3389/fcell.2021.668025] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2021] [Accepted: 03/31/2021] [Indexed: 11/13/2022] Open
Abstract
An ontology is a computable representation of the different parts of an organism and its different developmental stages as well as the relationships between them. The ontology of model organisms is therefore a fundamental tool for a multitude of bioinformatics and comparative analyses. The cephalochordate amphioxus is a marine animal representing the earliest diverging evolutionary lineage of chordates. Furthermore, its morphology, its anatomy and its genome can be considered as prototypes of the chordate phylum. For these reasons, amphioxus is a very important animal model for evolutionary developmental biology studies aimed at understanding the origin and diversification of vertebrates. Here, we have constructed an amphioxus ontology (AMPHX) which combines anatomical and developmental terms and includes the relationships between these terms. AMPHX will be used to annotate amphioxus gene expression patterns as well as phenotypes. We encourage the scientific community to adopt this amphioxus ontology and send recommendations for future updates and improvements.
Collapse
Affiliation(s)
- Stephanie Bertrand
- CNRS, Biologie Intégrative des Organismes Marins, Sorbonne Université, Paris, France
| | - João E. Carvalho
- CNRS, Laboratoire de Biologie du Développement de Villefranche-sur-Mer, Institut de la Mer de Villefranche, Sorbonne Université, Paris, France
| | | | | | - Vladimir Daric
- CNRS, Biologie Intégrative des Organismes Marins, Sorbonne Université, Paris, France
| | - Jr-Kai Yu
- Institute of Cellular and Organismic Biology, Academia Sinica, Taipei City, Taiwan
- Marine Research Station, Institute of Cellular and Organismic Biology, Academia Sinica, Yilan, Taiwan
| | - Michael Schubert
- CNRS, Laboratoire de Biologie du Développement de Villefranche-sur-Mer, Institut de la Mer de Villefranche, Sorbonne Université, Paris, France
| | - Hector Escrivá
- CNRS, Biologie Intégrative des Organismes Marins, Sorbonne Université, Paris, France
| |
Collapse
|
15
|
Bastian FB, Roux J, Niknejad A, Comte A, Fonseca Costa SS, de Farias TM, Moretti S, Parmentier G, de Laval VR, Rosikiewicz M, Wollbrett J, Echchiki A, Escoriza A, Gharib WH, Gonzales-Porta M, Jarosz Y, Laurenczy B, Moret P, Person E, Roelli P, Sanjeev K, Seppey M, Robinson-Rechavi M. The Bgee suite: integrated curated expression atlas and comparative transcriptomics in animals. Nucleic Acids Res 2021; 49:D831-D847. [PMID: 33037820 PMCID: PMC7778977 DOI: 10.1093/nar/gkaa793] [Citation(s) in RCA: 98] [Impact Index Per Article: 32.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2020] [Revised: 08/24/2020] [Accepted: 09/15/2020] [Indexed: 01/24/2023] Open
Abstract
Bgee is a database to retrieve and compare gene expression patterns in multiple animal species, produced by integrating multiple data types (RNA-Seq, Affymetrix, in situ hybridization, and EST data). It is based exclusively on curated healthy wild-type expression data (e.g., no gene knock-out, no treatment, no disease), to provide a comparable reference of normal gene expression. Curation includes very large datasets such as GTEx (re-annotation of samples as ‘healthy’ or not) as well as many small ones. Data are integrated and made comparable between species thanks to consistent data annotation and processing, and to calls of presence/absence of expression, along with expression scores. As a result, Bgee is capable of detecting the conditions of expression of any single gene, accommodating any data type and species. Bgee provides several tools for analyses, allowing, e.g., automated comparisons of gene expression patterns within and between species, retrieval of the prefered conditions of expression of any gene, or enrichment analyses of conditions with expression of sets of genes. Bgee release 14.1 includes 29 animal species, and is available at https://bgee.org/ and through its Bioconductor R package BgeeDB.
Collapse
Affiliation(s)
- Frederic B Bastian
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Julien Roux
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Anne Niknejad
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Aurélie Comte
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Sara S Fonseca Costa
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Tarcisio Mendes de Farias
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Sébastien Moretti
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Gilles Parmentier
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Valentine Rech de Laval
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Marta Rosikiewicz
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Julien Wollbrett
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Amina Echchiki
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Angélique Escoriza
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Walid H Gharib
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Mar Gonzales-Porta
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Yohan Jarosz
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Balazs Laurenczy
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Philippe Moret
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Emilie Person
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Patrick Roelli
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Komal Sanjeev
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Mathieu Seppey
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Marc Robinson-Rechavi
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| |
Collapse
|
16
|
Thessen AE, Walls RL, Vogt L, Singer J, Warren R, Buttigieg PL, Balhoff JP, Mungall CJ, McGuinness DL, Stucky BJ, Yoder MJ, Haendel MA. Transforming the study of organisms: Phenomic data models and knowledge bases. PLoS Comput Biol 2020; 16:e1008376. [PMID: 33232313 PMCID: PMC7685442 DOI: 10.1371/journal.pcbi.1008376] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
Abstract
The rapidly decreasing cost of gene sequencing has resulted in a deluge of genomic data from across the tree of life; however, outside a few model organism databases, genomic data are limited in their scientific impact because they are not accompanied by computable phenomic data. The majority of phenomic data are contained in countless small, heterogeneous phenotypic data sets that are very difficult or impossible to integrate at scale because of variable formats, lack of digitization, and linguistic problems. One powerful solution is to represent phenotypic data using data models with precise, computable semantics, but adoption of semantic standards for representing phenotypic data has been slow, especially in biodiversity and ecology. Some phenotypic and trait data are available in a semantic language from knowledge bases, but these are often not interoperable. In this review, we will compare and contrast existing ontology and data models, focusing on nonhuman phenotypes and traits. We discuss barriers to integration of phenotypic data and make recommendations for developing an operationally useful, semantically interoperable phenotypic data ecosystem.
Collapse
Affiliation(s)
- Anne E. Thessen
- Environmental and Molecular Toxicology, Oregon State University, Corvallis, Oregon, United States of America
- Ronin Institute for Independent Scholarship, Monclair, New Jersey, United States of America
| | - Ramona L. Walls
- Bio5 Institute, University of Arizona, Tucson, Arizona, United States of America
| | - Lars Vogt
- TIB Leibniz Information Centre for Science and Technology, Hannover, Germany
| | | | | | - Pier Luigi Buttigieg
- Alfred-Wegener-Institut, Helmholtz-Zentrum für Polar- und Meeresforschung, Bremerhaven, Germany
| | - James P. Balhoff
- Renaissance Computing Institute, University of North Carolina, Chapel Hill, North Carolina, United States of America
| | - Christopher J. Mungall
- Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, California, United States of America
| | | | - Brian J. Stucky
- Florida Museum of Natural History, University of Florida, Gainesville, Florida, United States of America
| | - Matthew J. Yoder
- Illinois Natural History Survey, Champaign, Illinois, United States of America
| | - Melissa A. Haendel
- Environmental and Molecular Toxicology, Oregon State University, Corvallis, Oregon, United States of America
| |
Collapse
|
17
|
Hotta K, Dauga D, Manni L. The ontology of the anatomy and development of the solitary ascidian Ciona: the swimming larva and its metamorphosis. Sci Rep 2020; 10:17916. [PMID: 33087765 PMCID: PMC7578030 DOI: 10.1038/s41598-020-73544-9] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2020] [Accepted: 09/14/2020] [Indexed: 02/07/2023] Open
Abstract
Ciona robusta (Ciona intestinalis type A), a model organism for biological studies, belongs to ascidians, the main class of tunicates, which are the closest relatives of vertebrates. In Ciona, a project on the ontology of both development and anatomy is ongoing for several years. Its goal is to standardize a resource relating each anatomical structure to developmental stages. Today, the ontology is codified until the hatching larva stage. Here, we present its extension throughout the swimming larva stages, the metamorphosis, until the juvenile stages. For standardizing the developmental ontology, we acquired different time-lapse movies, confocal microscope images and histological serial section images for each developmental event from the hatching larva stage (17.5 h post fertilization) to the juvenile stage (7 days post fertilization). Combining these data, we defined 12 new distinct developmental stages (from Stage 26 to Stage 37), in addition to the previously defined 26 stages, referred to embryonic development. The new stages were grouped into four Periods named: Adhesion, Tail Absorption, Body Axis Rotation, and Juvenile. To build the anatomical ontology, 203 anatomical entities were identified, defined according to the literature, and annotated, taking advantage from the high resolution and the complementary information obtained from confocal microscopy and histology. The ontology describes the anatomical entities in hierarchical levels, from the cell level (cell lineage) to the tissue/organ level. Comparing the number of entities during development, we found two rounds on entity increase: in addition to the one occurring after fertilization, there is a second one during the Body Axis Rotation Period, when juvenile structures appear. Vice versa, one-third of anatomical entities associated with the embryo/larval life were significantly reduced at the beginning of metamorphosis. Data was finally integrated within the web-based resource "TunicAnatO", which includes a number of anatomical images and a dictionary with synonyms. This ontology will allow the standardization of data underpinning an accurate annotation of gene expression and the comprehension of mechanisms of differentiation. It will help in understanding the emergence of elaborated structures during both embryogenesis and metamorphosis, shedding light on tissue degeneration and differentiation occurring at metamorphosis.
Collapse
Affiliation(s)
- Kohji Hotta
- Department of Biosciences and Informatics, Faculty of Science and Technology, Keio University, Kouhoku-ku, Yokohama, 223-8522, Japan.
| | - Delphine Dauga
- Bioself Communication, 28 rue de la bibliotheque, 13001, Marseille, France
| | - Lucia Manni
- Department of Biology, University of Padova, Via Ugo Bassi 58/B, 35121, Padova, Italy.
| |
Collapse
|
18
|
Fernando PC, Mabee PM, Zeng E. Integration of anatomy ontology data with protein-protein interaction networks improves the candidate gene prediction accuracy for anatomical entities. BMC Bioinformatics 2020; 21:442. [PMID: 33028186 PMCID: PMC7542696 DOI: 10.1186/s12859-020-03773-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2020] [Accepted: 09/22/2020] [Indexed: 01/04/2023] Open
Abstract
Background Identification of genes responsible for anatomical entities is a major requirement in many fields including developmental biology, medicine, and agriculture. Current wet lab techniques used for this purpose, such as gene knockout, are high in resource and time consumption. Protein–protein interaction (PPI) networks are frequently used to predict disease genes for humans and gene candidates for molecular functions, but they are rarely used to predict genes for anatomical entities. Moreover, PPI networks suffer from network quality issues, which can be a limitation for their usage in predicting candidate genes. Therefore, we developed an integrative framework to improve the candidate gene prediction accuracy for anatomical entities by combining existing experimental knowledge about gene-anatomical entity relationships with PPI networks using anatomy ontology annotations. We hypothesized that this integration improves the quality of the PPI networks by reducing the number of false positive and false negative interactions and is better optimized to predict candidate genes for anatomical entities. We used existing Uberon anatomical entity annotations for zebrafish and mouse genes to construct gene networks by calculating semantic similarity between the genes. These anatomy-based gene networks were semantic networks, as they were constructed based on the anatomy ontology annotations that were obtained from the experimental data in the literature. We integrated these anatomy-based gene networks with mouse and zebrafish PPI networks retrieved from the STRING database and compared the performance of their network-based candidate gene predictions. Results According to evaluations of candidate gene prediction performance tested under four different semantic similarity calculation methods (Lin, Resnik, Schlicker, and Wang), the integrated networks, which were semantically improved PPI networks, showed better performances by having higher area under the curve values for receiver operating characteristic and precision-recall curves than PPI networks for both zebrafish and mouse. Conclusion Integration of existing experimental knowledge about gene-anatomical entity relationships with PPI networks via anatomy ontology improved the candidate gene prediction accuracy and optimized them for predicting candidate genes for anatomical entities.
Collapse
Affiliation(s)
- Pasan C Fernando
- Department of Biology, University of South Dakota, Vermillion, SD, USA.
| | - Paula M Mabee
- Department of Biology, University of South Dakota, Vermillion, SD, USA.,National Ecological Observatory Network, Battelle Memorial Institute, 1685 38th St., Suite 100, Boulder, CO, 80301, USA
| | - Erliang Zeng
- Division of Biostatistics and Computational Biology, College of Dentistry, University of Iowa, Iowa City, IA, USA. .,Department of Preventive and Community Dentistry, College of Dentistry, University of Iowa, Iowa City, IA, USA. .,Department of Biostatistics, College of Public Health, University of Iowa, Iowa City, IA, USA. .,Department of Biomedical Engineering, College of Engineering, University of Iowa, Iowa City, IA, USA.
| |
Collapse
|
19
|
Shefchek KA, Harris NL, Gargano M, Matentzoglu N, Unni D, Brush M, Keith D, Conlin T, Vasilevsky N, Zhang XA, Balhoff JP, Babb L, Bello SM, Blau H, Bradford Y, Carbon S, Carmody L, Chan LE, Cipriani V, Cuzick A, Della Rocca M, Dunn N, Essaid S, Fey P, Grove C, Gourdine JP, Hamosh A, Harris M, Helbig I, Hoatlin M, Joachimiak M, Jupp S, Lett KB, Lewis SE, McNamara C, Pendlington ZM, Pilgrim C, Putman T, Ravanmehr V, Reese J, Riggs E, Robb S, Roncaglia P, Seager J, Segerdell E, Similuk M, Storm AL, Thaxon C, Thessen A, Jacobsen JOB, McMurry JA, Groza T, Köhler S, Smedley D, Robinson PN, Mungall CJ, Haendel MA, Munoz-Torres MC, Osumi-Sutherland D. The Monarch Initiative in 2019: an integrative data and analytic platform connecting phenotypes to genotypes across species. Nucleic Acids Res 2020; 48:D704-D715. [PMID: 31701156 PMCID: PMC7056945 DOI: 10.1093/nar/gkz997] [Citation(s) in RCA: 143] [Impact Index Per Article: 35.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2019] [Revised: 10/09/2019] [Accepted: 10/14/2019] [Indexed: 12/14/2022] Open
Abstract
In biology and biomedicine, relating phenotypic outcomes with genetic variation and environmental factors remains a challenge: patient phenotypes may not match known diseases, candidate variants may be in genes that haven’t been characterized, research organisms may not recapitulate human or veterinary diseases, environmental factors affecting disease outcomes are unknown or undocumented, and many resources must be queried to find potentially significant phenotypic associations. The Monarch Initiative (https://monarchinitiative.org) integrates information on genes, variants, genotypes, phenotypes and diseases in a variety of species, and allows powerful ontology-based search. We develop many widely adopted ontologies that together enable sophisticated computational analysis, mechanistic discovery and diagnostics of Mendelian diseases. Our algorithms and tools are widely used to identify animal models of human disease through phenotypic similarity, for differential diagnostics and to facilitate translational research. Launched in 2015, Monarch has grown with regards to data (new organisms, more sources, better modeling); new API and standards; ontologies (new Mondo unified disease ontology, improvements to ontologies such as HPO and uPheno); user interface (a redesigned website); and community development. Monarch data, algorithms and tools are being used and extended by resources such as GA4GH and NCATS Translator, among others, to aid mechanistic discovery and diagnostics.
Collapse
Affiliation(s)
- Kent A Shefchek
- Center for Genome Research and Biocomputing, Environmental and Molecular Toxicology, Oregon State University, Corvallis, OR 97331, USA
| | - Nomi L Harris
- Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA 94710, USA
| | - Michael Gargano
- The Jackson Laboratory For Genomic Medicine, Farmington, CT 06032, USA
| | - Nicolas Matentzoglu
- European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Deepak Unni
- Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA 94710, USA
| | - Matthew Brush
- Oregon Clinical and Translational Research Institute, Oregon Health & Science University, Portland, OR 97239, USA
| | - Daniel Keith
- Center for Genome Research and Biocomputing, Environmental and Molecular Toxicology, Oregon State University, Corvallis, OR 97331, USA
| | - Tom Conlin
- Center for Genome Research and Biocomputing, Environmental and Molecular Toxicology, Oregon State University, Corvallis, OR 97331, USA
| | - Nicole Vasilevsky
- Oregon Clinical and Translational Research Institute, Oregon Health & Science University, Portland, OR 97239, USA
| | | | - James P Balhoff
- Renaissance Computing Institute at UNC, Chapel Hill, NC 27517, USA
| | - Larry Babb
- Broad Institute, Cambridge, MA 02142, USA
| | | | - Hannah Blau
- The Jackson Laboratory For Genomic Medicine, Farmington, CT 06032, USA
| | - Yvonne Bradford
- Institute of Neuroscience, University of Oregon, Eugene, OR 97401, USA
| | - Seth Carbon
- Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA 94710, USA
| | - Leigh Carmody
- The Jackson Laboratory For Genomic Medicine, Farmington, CT 06032, USA
| | - Lauren E Chan
- College of Public Health and Human Sciences, Oregon State University, Corvallis, OR 97331, USA
| | - Valentina Cipriani
- William Harvey Research Institute, Barts & The London School of Medicine & Dentistry, Queen Mary University of London, Charterhouse Square, London EC1M 6BQ, UK
| | | | - Maria Della Rocca
- Office of Rare Diseases Research (ORDR), National Center for Advancing Translational Sciences (NCATS), National Institutes of Health (NIH), Bethesda, MD 20892, USA
| | - Nathan Dunn
- Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA 94710, USA
| | - Shahim Essaid
- Oregon Clinical and Translational Research Institute, Oregon Health & Science University, Portland, OR 97239, USA
| | - Petra Fey
- dictyBase, Center for Genetic Medicine, Northwestern University, Chicago, IL 60611, USA
| | - Chris Grove
- California Institute of Technology, Pasadena, CA 91125, USA
| | - Jean-Phillipe Gourdine
- Oregon Clinical and Translational Research Institute, Oregon Health & Science University, Portland, OR 97239, USA
| | - Ada Hamosh
- McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University, Baltimore, MD 21205, USA
| | | | - Ingo Helbig
- Division of Neurology, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA.,Department of Biomedical and Health Informatics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA.,Department of Neuropediatrics, Christian-Albrechts-University of Kiel, 24105 Kiel, Germany.,Department of Neurology, University of Pennsylvania, Perelman School of Medicine, Philadelphia, PA 19104, USA
| | - Maureen Hoatlin
- Department of Biochemistry and Molecular Biology, Oregon Health & Science University, Portland, OR 97239, USA
| | - Marcin Joachimiak
- Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA 94710, USA
| | - Simon Jupp
- European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Kenneth B Lett
- Center for Genome Research and Biocomputing, Environmental and Molecular Toxicology, Oregon State University, Corvallis, OR 97331, USA
| | - Suzanna E Lewis
- Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA 94710, USA
| | | | - Zoë M Pendlington
- European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | | | - Tim Putman
- Center for Genome Research and Biocomputing, Environmental and Molecular Toxicology, Oregon State University, Corvallis, OR 97331, USA
| | - Vida Ravanmehr
- The Jackson Laboratory For Genomic Medicine, Farmington, CT 06032, USA
| | - Justin Reese
- Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA 94710, USA
| | - Erin Riggs
- Autism & Developmental Medicine Institute, Geisinger, Danville, PA 17837, USA
| | - Sofia Robb
- Stowers Institute for Medical Research, Kansas City, MO 64110, USA
| | - Paola Roncaglia
- European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | | | - Erik Segerdell
- Xenbase, Cincinnati Children's Hospital, Cincinnati, OH 45229, USA
| | - Morgan Similuk
- National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD 20892, USA
| | - Andrea L Storm
- Office of Rare Diseases Research (ORDR), National Center for Advancing Translational Sciences (NCATS), National Institutes of Health (NIH), Bethesda, MD 20892, USA
| | - Courtney Thaxon
- University of North Carolina Medical School, University of North Carolina at Chapel Hill, Chapel Hill, NC 27516, USA
| | - Anne Thessen
- Center for Genome Research and Biocomputing, Environmental and Molecular Toxicology, Oregon State University, Corvallis, OR 97331, USA
| | - Julius O B Jacobsen
- William Harvey Research Institute, Barts & The London School of Medicine & Dentistry, Queen Mary University of London, Charterhouse Square, London EC1M 6BQ, UK
| | - Julie A McMurry
- College of Public Health and Human Sciences, Oregon State University, Corvallis, OR 97331, USA
| | | | - Sebastian Köhler
- Institute for Medical Genetics and Human Genetics, Charité-Universitätsmedizin Berlin, Augustenburger Platz 1, 13353 Berlin, Germany
| | - Damian Smedley
- William Harvey Research Institute, Barts & The London School of Medicine & Dentistry, Queen Mary University of London, Charterhouse Square, London EC1M 6BQ, UK
| | - Peter N Robinson
- The Jackson Laboratory For Genomic Medicine, Farmington, CT 06032, USA
| | - Christopher J Mungall
- Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA 94710, USA
| | - Melissa A Haendel
- Center for Genome Research and Biocomputing, Environmental and Molecular Toxicology, Oregon State University, Corvallis, OR 97331, USA.,Oregon Clinical and Translational Research Institute, Oregon Health & Science University, Portland, OR 97239, USA
| | - Monica C Munoz-Torres
- Center for Genome Research and Biocomputing, Environmental and Molecular Toxicology, Oregon State University, Corvallis, OR 97331, USA
| | - David Osumi-Sutherland
- European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| |
Collapse
|
20
|
Oliveira D, Butt AS, Haller A, Rebholz-Schuhmann D, Sahay R. Where to search top-K biomedical ontologies? Brief Bioinform 2020; 20:1477-1491. [PMID: 29579141 PMCID: PMC6781604 DOI: 10.1093/bib/bby015] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2017] [Revised: 02/12/2018] [Indexed: 01/08/2023] Open
Abstract
Motivation Searching for precise terms and terminological definitions in the biomedical data space is problematic, as researchers find overlapping, closely related and even equivalent concepts in a single or multiple ontologies. Search engines that retrieve ontological resources often suggest an extensive list of search results for a given input term, which leads to the tedious task of selecting the best-fit ontological resource (class or property) for the input term and reduces user confidence in the retrieval engines. A systematic evaluation of these search engines is necessary to understand their strengths and weaknesses in different search requirements. Result We have implemented seven comparable Information Retrieval ranking algorithms to search through ontologies and compared them against four search engines for ontologies. Free-text queries have been performed, the outcomes have been judged by experts and the ranking algorithms and search engines have been evaluated against the expert-based ground truth (GT). In addition, we propose a probabilistic GT that is developed automatically to provide deeper insights and confidence to the expert-based GT as well as evaluating a broader range of search queries. Conclusion The main outcome of this work is the identification of key search factors for biomedical ontologies together with search requirements and a set of recommendations that will help biomedical experts and ontology engineers to select the best-suited retrieval mechanism in their search scenarios. We expect that this evaluation will allow researchers and practitioners to apply the current search techniques more reliably and that it will help them to select the right solution for their daily work. Availability The source code (of seven ranking algorithms), ground truths and experimental results are available at https://github.com/danielapoliveira/bioont-search-benchmark
Collapse
Affiliation(s)
| | | | - Armin Haller
- Australian National University, Canberra, Australia
| | | | - Ratnesh Sahay
- Insight Centre for Data Analytics, NUI Galway, Ireland
| |
Collapse
|
21
|
Pal LR, Kundu K, Yin Y, Moult J. Matching whole genomes to rare genetic disorders: Identification of potential causative variants using phenotype-weighted knowledge in the CAGI SickKids5 clinical genomes challenge. Hum Mutat 2020; 41:347-362. [PMID: 31680375 PMCID: PMC7182498 DOI: 10.1002/humu.23933] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2019] [Revised: 09/26/2019] [Accepted: 10/13/2019] [Indexed: 02/06/2023]
Abstract
Precise identification of causative variants from whole-genome sequencing data, including both coding and noncoding variants, is challenging. The Critical Assessment of Genome Interpretation 5 SickKids clinical genome challenge provided an opportunity to assess our ability to extract such information. Participants in the challenge were required to match each of the 24 whole-genome sequences to the correct phenotypic profile and to identify the disease class of each genome. These are all rare disease cases that have resisted genetic diagnosis in a state-of-the-art pipeline. The patients have a range of eye, neurological, and connective-tissue disorders. We used a gene-centric approach to address this problem, assigning each gene a multiphenotype-matching score. Mutations in the top-scoring genes for each phenotype profile were ranked on a 6-point scale of pathogenicity probability, resulting in an approximately equal number of top-ranked coding and noncoding candidate variants overall. We were able to assign the correct disease class for 12 cases and the correct genome to a clinical profile for five cases. The challenge assessor found genes in three of these five cases as likely appropriate. In the postsubmission phase, after careful screening of the genes in the correct genome, we identified additional potential diagnostic variants, a high proportion of which are noncoding.
Collapse
Affiliation(s)
- Lipika R. Pal
- Institute for Bioscience and Biotechnology Research, University of Maryland, 9600 Gudelsky Drive, Rockville, MD 20850, USA
| | - Kunal Kundu
- Institute for Bioscience and Biotechnology Research, University of Maryland, 9600 Gudelsky Drive, Rockville, MD 20850, USA
- Computational Biology, Bioinformatics and Genomics, Biological Sciences Graduate Program, University of Maryland, College Park, MD 20742, USA
| | - Yizhou Yin
- Institute for Bioscience and Biotechnology Research, University of Maryland, 9600 Gudelsky Drive, Rockville, MD 20850, USA
| | - John Moult
- Institute for Bioscience and Biotechnology Research, University of Maryland, 9600 Gudelsky Drive, Rockville, MD 20850, USA
- Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD 20742, USA
| |
Collapse
|
22
|
Kishore R, Arnaboldi V, Van Slyke CE, Chan J, Nash RS, Urbano JM, Dolan ME, Engel SR, Shimoyama M, Sternberg PW, Genome Resources TAO. Automated generation of gene summaries at the Alliance of Genome Resources. Database (Oxford) 2020; 2020:baaa037. [PMID: 32559296 PMCID: PMC7304461 DOI: 10.1093/database/baaa037] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2020] [Revised: 04/06/2020] [Accepted: 04/29/2020] [Indexed: 12/28/2022]
Abstract
Short paragraphs that describe gene function, referred to as gene summaries, are valued by users of biological knowledgebases for the ease with which they convey key aspects of gene function. Manual curation of gene summaries, while desirable, is difficult for knowledgebases to sustain. We developed an algorithm that uses curated, structured gene data at the Alliance of Genome Resources (Alliance; www.alliancegenome.org) to automatically generate gene summaries that simulate natural language. The gene data used for this purpose include curated associations (annotations) to ontology terms from the Gene Ontology, Disease Ontology, model organism knowledgebase (MOK)-specific anatomy ontologies and Alliance orthology data. The method uses sentence templates for each data category included in the gene summary in order to build a natural language sentence from the list of terms associated with each gene. To improve readability of the summaries when numerous gene annotations are present, we developed a new algorithm that traverses ontology graphs in order to group terms by their common ancestors. The algorithm optimizes the coverage of the initial set of terms and limits the length of the final summary, using measures of information content of each ontology term as a criterion for inclusion in the summary. The automated gene summaries are generated with each Alliance release, ensuring that they reflect current data at the Alliance. Our method effectively leverages category-specific curation efforts of the Alliance member databases to create modular, structured and standardized gene summaries for seven member species of the Alliance. These automatically generated gene summaries make cross-species gene function comparisons tenable and increase discoverability of potential models of human disease. In addition to being displayed on Alliance gene pages, these summaries are also included on several MOK gene pages.
Collapse
Affiliation(s)
- Ranjana Kishore
- WormBase, Division of Biology and Biological Engineering, California Institute of Technology, 1200 East California Boulevard, Pasadena, CA 91125, USA
| | - Valerio Arnaboldi
- WormBase, Division of Biology and Biological Engineering, California Institute of Technology, 1200 East California Boulevard, Pasadena, CA 91125, USA
| | - Ceri E Van Slyke
- ZFIN, The Institute of Neuroscience, 222 Huestis Hall, University of Oregon, Eugene, OR 97403-1254, USA
| | - Juancarlos Chan
- WormBase, Division of Biology and Biological Engineering, California Institute of Technology, 1200 East California Boulevard, Pasadena, CA 91125, USA
| | - Robert S Nash
- Saccharomyces Genome Database, Department of Genetics, Stanford University, 3165 Porter Drive, Palo Alto, CA 94304, USA
| | - Jose M Urbano
- FlyBase, Department of Physiology, Development and Neuroscience, 7 Downing Pl, University of Cambridge, Cambridge CB2 3DY, UK
| | - Mary E Dolan
- MGI, The Jackson Laboratory, Bar Harbor, ME 04609, USA
| | - Stacia R Engel
- Saccharomyces Genome Database, Department of Genetics, Stanford University, 3165 Porter Drive, Palo Alto, CA 94304, USA
| | - Mary Shimoyama
- Rat Genome Database, Department of Biomedical Engineering, Medical College of Wisconsin and Marquette University, 8701 Watertown Plank Road, Milwaukee, WI 53226, USA
| | - Paul W Sternberg
- WormBase, Division of Biology and Biological Engineering, California Institute of Technology, 1200 East California Boulevard, Pasadena, CA 91125, USA
| | | |
Collapse
|
23
|
Hamm JT, Ceger P, Allen D, Stout M, Maull EA, Baker G, Zmarowski A, Padilla S, Perkins E, Planchart A, Stedman D, Tal T, Tanguay RL, Volz DC, Wilbanks MS, Walker NJ. Characterizing sources of variability in zebrafish embryo screening protocols. ALTEX 2018; 36:103-120. [PMID: 30415271 PMCID: PMC10424490 DOI: 10.14573/altex.1804162] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/26/2018] [Accepted: 10/30/2018] [Indexed: 11/23/2022]
Abstract
There is a need for fast, efficient, and cost-effective hazard identification and characterization of chemical hazards. This need is generating increased interest in the use of zebrafish embryos as both a screening tool and an alternative to mammalian test methods. A Collaborative Workshop on Aquatic Models and 21st Century Toxicology identified the lack of appropriate and consistent testing protocols as a challenge to the broader application of the zebrafish embryo model. The National Toxicology Program established the Systematic Evaluation of the Application of Zebrafish in Toxicology (SEAZIT) initiative to address the lack of consistent testing guidelines and identify sources of variability for zebrafish-based assays. This report summarizes initial SEAZIT information-gathering efforts. Investigators in academic, government, and industry laboratories that routinely use zebrafish embryos for chemical toxicity testing were asked about their husbandry practices and standard protocols. Information was collected about protocol components including zebrafish strains, feed, system water, disease surveillance, embryo exposure conditions, and endpoints. Literature was reviewed to assess issues raised by the investigators. Interviews revealed substantial variability across design parameters, data collected, and analysis procedures. The presence of the chorion and renewal of exposure media (static versus static-renewal) were identified as design parameters that could potentially influence study outcomes and should be investigated further with studies to determine chemical uptake from treatment solution into embryos. The information gathered in this effort provides a basis for future SEAZIT activities to promote more consistent practices among researchers using zebrafish embryos for toxicity evaluation.
Collapse
Affiliation(s)
- Jon T Hamm
- Integrated Laboratory Systems, Research Triangle Park, NC, USA
| | - Patricia Ceger
- Integrated Laboratory Systems, Research Triangle Park, NC, USA
| | - David Allen
- Integrated Laboratory Systems, Research Triangle Park, NC, USA
| | - Matt Stout
- Division of the National Toxicology Program, National Institute of Environmental Health Sciences, National Institutes of Health, Research Triangle Park, NC, USA
| | - Elizabeth A Maull
- National Toxicology Program Interagency Center for the Evaluation of Alternative Toxicological Methods, Division of the National Toxicology Program, National Institute of Environmental Health Sciences, National Institutes of Health, Research Triangle Park, NC, USA
| | - Greg Baker
- Battelle, Life Sciences Research, Columbus, OH, USA
| | | | - Stephanie Padilla
- Integrated Systems Toxicology Division, National Health and Environmental Effects Research Laboratory, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, NC, USA
| | - Edward Perkins
- United States Army Engineer Research and Development Center, Vicksburg, MS, USA
| | - Antonio Planchart
- Department of Biological Sciences and Center for Human Health and the Environment, North Carolina State University, Raleigh, NC, USA
| | | | - Tamara Tal
- Integrated Systems Toxicology Division, National Health and Environmental Effects Research Laboratory, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, NC, USA
| | - Robert L Tanguay
- Department of Environmental & Molecular Toxicology, Oregon State University, Corvallis, OR, USA
| | - David C Volz
- Department of Environmental Sciences, University of California, Riverside, CA, USA
| | - Mitch S Wilbanks
- United States Army Engineer Research and Development Center, Vicksburg, MS, USA
| | - Nigel J Walker
- Division of the National Toxicology Program, National Institute of Environmental Health Sciences, National Institutes of Health, Research Triangle Park, NC, USA
| |
Collapse
|
24
|
Chiu B, Pyysalo S, Vulić I, Korhonen A. Bio-SimVerb and Bio-SimLex: wide-coverage evaluation sets of word similarity in biomedicine. BMC Bioinformatics 2018; 19:33. [PMID: 29402212 PMCID: PMC5800055 DOI: 10.1186/s12859-018-2039-z] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2017] [Accepted: 01/24/2018] [Indexed: 01/10/2023] Open
Abstract
Background Word representations support a variety of Natural Language Processing (NLP) tasks. The quality of these representations is typically assessed by comparing the distances in the induced vector spaces against human similarity judgements. Whereas comprehensive evaluation resources have recently been developed for the general domain, similar resources for biomedicine currently suffer from the lack of coverage, both in terms of word types included and with respect to the semantic distinctions. Notably, verbs have been excluded, although they are essential for the interpretation of biomedical language. Further, current resources do not discern between semantic similarity and semantic relatedness, although this has been proven as an important predictor of the usefulness of word representations and their performance in downstream applications. Results We present two novel comprehensive resources targeting the evaluation of word representations in biomedicine. These resources, Bio-SimVerb and Bio-SimLex, address the previously mentioned problems, and can be used for evaluations of verb and noun representations respectively. In our experiments, we have computed the Pearson’s correlation between performances on intrinsic and extrinsic tasks using twelve popular state-of-the-art representation models (e.g. word2vec models). The intrinsic–extrinsic correlations using our datasets are notably higher than with previous intrinsic evaluation benchmarks such as UMNSRS and MayoSRS. In addition, when evaluating representation models for their abilities to capture verb and noun semantics individually, we show a considerable variation between performances across all models. Conclusion Bio-SimVerb and Bio-SimLex enable intrinsic evaluation of word representations. This evaluation can serve as a predictor of performance on various downstream tasks in the biomedical domain. The results on Bio-SimVerb and Bio-SimLex using standard word representation models highlight the importance of developing dedicated evaluation resources for NLP in biomedicine for particular word classes (e.g. verbs). These are needed to identify the most accurate methods for learning class-specific representations. Bio-SimVerb and Bio-SimLex are publicly available.
Collapse
Affiliation(s)
- Billy Chiu
- Language Technology Laboratory, DTAL, University of Cambridge, 9 West Road, Cambridge, CB39DB, UK.
| | - Sampo Pyysalo
- Language Technology Laboratory, DTAL, University of Cambridge, 9 West Road, Cambridge, CB39DB, UK
| | - Ivan Vulić
- Language Technology Laboratory, DTAL, University of Cambridge, 9 West Road, Cambridge, CB39DB, UK
| | - Anna Korhonen
- Language Technology Laboratory, DTAL, University of Cambridge, 9 West Road, Cambridge, CB39DB, UK
| |
Collapse
|
25
|
Travin D, Popov I, Guler AT, Medvedev D, van der Plas-Duivesteijn S, Varela M, Kolder ICRM, Meijer AH, Spaink HP, Palmblad M. COMICS: Cartoon Visualization of Omics Data in Spatial Context Using Anatomical Ontologies. J Proteome Res 2018; 17:739-744. [PMID: 29083911 PMCID: PMC5772887 DOI: 10.1021/acs.jproteome.7b00615] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
![]()
COMICS
is an interactive and open-access web platform for integration
and visualization of molecular expression data in anatomograms of
zebrafish, carp, and mouse model systems. Anatomical ontologies are
used to map omics data across experiments and between an experiment
and a particular visualization in a data-dependent manner. COMICS
is built on top of several existing resources. Zebrafish and mouse
anatomical ontologies with their controlled vocabulary (CV) and defined
hierarchy are used with the ontoCAT R package to aggregate data for
comparison and visualization. Libraries from the QGIS geographical
information system are used with the R packages “maps”
and “maptools” to visualize and interact with molecular
expression data in anatomical drawings of the model systems. COMICS
allows users to upload their own data from omics experiments, using
any gene or protein nomenclature they wish, as long as CV terms are
used to define anatomical regions or developmental stages. Common
nomenclatures such as the ZFIN gene names and UniProt accessions are
provided additional support. COMICS can be used to generate publication-quality
visualizations of gene and protein expression across experiments.
Unlike previous tools that have used anatomical ontologies to interpret
imaging data in several animal models, including zebrafish, COMICS
is designed to take spatially resolved data generated by dissection
or fractionation and display this data in visually clear anatomical
representations rather than large data tables. COMICS is optimized
for ease-of-use, with a minimalistic web interface and automatic selection
of the appropriate visual representation depending on the input data.
Collapse
Affiliation(s)
- Dmitrii Travin
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University , 119234 Moscow, Russian Federation
| | - Iaroslav Popov
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University , 119234 Moscow, Russian Federation
| | - Arzu Tugce Guler
- Center for Proteomics and Metabolomics, Leiden University Medical Center , PO Box 9600, 2300 RC, Leiden The Netherlands
| | - Dmitry Medvedev
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University , 119234 Moscow, Russian Federation
| | | | - Monica Varela
- Institute of Biology, Leiden University , PO Box 9502, 2300 RA, Leiden The Netherlands
| | - Iris C R M Kolder
- Institute of Biology, Leiden University , PO Box 9502, 2300 RA, Leiden The Netherlands
| | - Annemarie H Meijer
- Institute of Biology, Leiden University , PO Box 9502, 2300 RA, Leiden The Netherlands
| | - Herman P Spaink
- Institute of Biology, Leiden University , PO Box 9502, 2300 RA, Leiden The Netherlands
| | - Magnus Palmblad
- Center for Proteomics and Metabolomics, Leiden University Medical Center , PO Box 9600, 2300 RC, Leiden The Netherlands
| |
Collapse
|
26
|
Van Slyke CE, Bradford YM, Howe DG, Fashena DS, Ramachandran S, Ruzicka L. Using ZFIN: Data Types, Organization, and Retrieval. Methods Mol Biol 2018; 1757:307-347. [PMID: 29761463 PMCID: PMC6319390 DOI: 10.1007/978-1-4939-7737-6_11] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
The Zebrafish Model Organism Database (ZFIN; zfin.org) was established in 1994 as the primary genetic and genomic resource for the zebrafish research community. Some of the earliest records in ZFIN were for people and laboratories. Since that time, services and data types provided by ZFIN have grown considerably. Today, ZFIN provides the official nomenclature for zebrafish genes, mutants, and transgenics and curates many data types including gene expression, phenotypes, Gene Ontology, models of human disease, orthology, knockdown reagents, transgenic constructs, and antibodies. Ontologies are used throughout ZFIN to structure these expertly curated data. An integrated genome browser provides genomic context for genes, transgenics, mutants, and knockdown reagents. ZFIN also supports a community wiki where the research community can post new antibody records and research protocols. Data in ZFIN are accessible via web pages, download files, and the ZebrafishMine (zebrafishmine.org), an installation of the InterMine data warehousing software. Searching for data at ZFIN utilizes both parameterized search forms and a single box search for searching or browsing data quickly. This chapter aims to describe the primary ZFIN data and services, and provide insight into how to use and interpret ZFIN searches, data, and web pages.
Collapse
Affiliation(s)
- Ceri E Van Slyke
- The Zebrafish Information Network, University of Oregon, Eugene, OR, USA.
| | - Yvonne M Bradford
- The Zebrafish Information Network, University of Oregon, Eugene, OR, USA
| | - Douglas G Howe
- The Zebrafish Information Network, University of Oregon, Eugene, OR, USA
| | - David S Fashena
- The Zebrafish Information Network, University of Oregon, Eugene, OR, USA
| | | | - Leyla Ruzicka
- The Zebrafish Information Network, University of Oregon, Eugene, OR, USA
| |
Collapse
|
27
|
Abstract
Background Data-driven cell classification is becoming common and is now being implemented on a massive scale by projects such as the Human Cell Atlas. The scale of these efforts poses a challenge. How can the results be made searchable and accessible to biologists in general? How can they be related back to the rich classical knowledge of cell-types, anatomy and development? How will data from the various types of single cell analysis be made cross-searchable? Structured annotation with ontology terms provides a potential solution to these problems. In turn, there is great potential for using the outputs of data-driven cell classification to structure ontologies and integrate them with data-driven cell query systems. Results Focusing on examples from the mouse retina and Drosophila olfactory system, I present worked examples illustrating how formalization of cell ontologies can enhance querying of data-driven cell-classifications and how ontologies can be extended by integrating the outputs of data-driven cell classifications. Conclusions Annotation with ontology terms can play an important role in making data driven classifications searchable and query-able, but fulfilling this potential requires standardized formal patterns for structuring ontologies and annotations and for linking ontologies to the outputs of data-driven classification.
Collapse
Affiliation(s)
- David Osumi-Sutherland
- European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, CB10 1SD, UK.
| |
Collapse
|
28
|
White RJ, Collins JE, Sealy IM, Wali N, Dooley CM, Digby Z, Stemple DL, Murphy DN, Billis K, Hourlier T, Füllgrabe A, Davis MP, Enright AJ, Busch-Nentwich EM. A high-resolution mRNA expression time course of embryonic development in zebrafish. eLife 2017; 6. [PMID: 29144233 PMCID: PMC5690287 DOI: 10.7554/elife.30860] [Citation(s) in RCA: 200] [Impact Index Per Article: 28.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2017] [Accepted: 11/04/2017] [Indexed: 12/18/2022] Open
Abstract
We have produced an mRNA expression time course of zebrafish development across 18 time points from 1 cell to 5 days post-fertilisation sampling individual and pools of embryos. Using poly(A) pulldown stranded RNA-seq and a 3′ end transcript counting method we characterise temporal expression profiles of 23,642 genes. We identify temporal and functional transcript co-variance that associates 5024 unnamed genes with distinct developmental time points. Specifically, a class of over 100 previously uncharacterised zinc finger domain containing genes, located on the long arm of chromosome 4, is expressed in a sharp peak during zygotic genome activation. In addition, the data reveal new genes and transcripts, differential use of exons and previously unidentified 3′ ends across development, new primary microRNAs and temporal divergence of gene paralogues generated in the teleost genome duplication. To make this dataset a useful baseline reference, the data can be browsed and downloaded at Expression Atlas and Ensembl.
Collapse
Affiliation(s)
| | - John E Collins
- Wellcome Trust Sanger Institute, Hinxton, United Kingdom
| | - Ian M Sealy
- Wellcome Trust Sanger Institute, Hinxton, United Kingdom
| | - Neha Wali
- Wellcome Trust Sanger Institute, Hinxton, United Kingdom
| | | | - Zsofia Digby
- Wellcome Trust Sanger Institute, Hinxton, United Kingdom
| | | | - Daniel N Murphy
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, United Kingdom
| | - Konstantinos Billis
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, United Kingdom
| | - Thibaut Hourlier
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, United Kingdom
| | - Anja Füllgrabe
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, United Kingdom
| | - Matthew P Davis
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, United Kingdom
| | - Anton J Enright
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, United Kingdom
| | - Elisabeth M Busch-Nentwich
- Wellcome Trust Sanger Institute, Hinxton, United Kingdom.,Department of Medicine, University of Cambridge, Cambridge, United Kingdom
| |
Collapse
|
29
|
Ali NM, Khan HA, Then AYH, Ving Ching C, Gaur M, Dhillon SK. Fish Ontology framework for taxonomy-based fish recognition. PeerJ 2017; 5:e3811. [PMID: 28929028 PMCID: PMC5602685 DOI: 10.7717/peerj.3811] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2017] [Accepted: 08/25/2017] [Indexed: 11/20/2022] Open
Abstract
Life science ontologies play an important role in Semantic Web. Given the diversity in fish species and the associated wealth of information, it is imperative to develop an ontology capable of linking and integrating this information in an automated fashion. As such, we introduce the Fish Ontology (FO), an automated classification architecture of existing fish taxa which provides taxonomic information on unknown fish based on metadata restrictions. It is designed to support knowledge discovery, provide semantic annotation of fish and fisheries resources, data integration, and information retrieval. Automated classification for unknown specimens is a unique feature that currently does not appear to exist in other known ontologies. Examples of automated classification for major groups of fish are demonstrated, showing the inferred information by introducing several restrictions at the species or specimen level. The current version of FO has 1,830 classes, includes widely used fisheries terminology, and models major aspects of fish taxonomy, grouping, and character. With more than 30,000 known fish species globally, the FO will be an indispensable tool for fish scientists and other interested users.
Collapse
Affiliation(s)
- Najib M. Ali
- Institute of Biological Sciences, Faculty of Science, University of Malaya, Kuala Lumpur, Malaysia
| | - Haris A. Khan
- Institute of Biological Sciences, Faculty of Science, University of Malaya, Kuala Lumpur, Malaysia
| | - Amy Y-Hui Then
- Institute of Biological Sciences, Faculty of Science, University of Malaya, Kuala Lumpur, Malaysia
| | - Chong Ving Ching
- Institute of Biological Sciences, Faculty of Science, University of Malaya, Kuala Lumpur, Malaysia
| | - Manas Gaur
- Wright State University, Kno.e.sis Center, Dayton, OH, United States of America
| | - Sarinder Kaur Dhillon
- Institute of Biological Sciences, Faculty of Science, University of Malaya, Kuala Lumpur, Malaysia
| |
Collapse
|
30
|
Cipriani V, Kalhoro A, Arno G, Silva RS, Pontikos N, Puech V, McClements ME, Hunt DM, van Heyningen V, Michaelides M, Webster AR, Moore AT, Puech B. Genome-wide linkage and haplotype sharing analysis implicates the MCDR3 locus as a candidate region for a developmental macular disorder in association with digit abnormalities. Ophthalmic Genet 2017. [DOI: 10.1080/13816810.2017.1289544] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Affiliation(s)
- Valentina Cipriani
- Department of Ocular Biology and Therapeutics, UCL Institute of Ophthalmology, University College London, London, UK
- Moorfields Eye Hospital, London, UK
- UCL Genetics Institute, University College London, London, UK
| | - Ambreen Kalhoro
- Department of Ocular Biology and Therapeutics, UCL Institute of Ophthalmology, University College London, London, UK
- Moorfields Eye Hospital, London, UK
| | - Gavin Arno
- Department of Ocular Biology and Therapeutics, UCL Institute of Ophthalmology, University College London, London, UK
- Moorfields Eye Hospital, London, UK
| | - Raquel S. Silva
- Department of Ocular Biology and Therapeutics, UCL Institute of Ophthalmology, University College London, London, UK
- Moorfields Eye Hospital, London, UK
| | - Nikolas Pontikos
- Department of Ocular Biology and Therapeutics, UCL Institute of Ophthalmology, University College London, London, UK
- UCL Genetics Institute, University College London, London, UK
| | | | - Michelle E. McClements
- Nuffield Department of Clinical Neurosciences (Ophthalmology), University of Oxford, Oxford, UK
| | - David M. Hunt
- Lions Eye Institute and School of Animal Biology, University of Western Australia, Perth, Western Australia, Australia
| | - Veronica van Heyningen
- Department of Ocular Biology and Therapeutics, UCL Institute of Ophthalmology, University College London, London, UK
| | - Michel Michaelides
- Department of Ocular Biology and Therapeutics, UCL Institute of Ophthalmology, University College London, London, UK
- Moorfields Eye Hospital, London, UK
| | - Andrew R. Webster
- Department of Ocular Biology and Therapeutics, UCL Institute of Ophthalmology, University College London, London, UK
- Moorfields Eye Hospital, London, UK
| | - Anthony T. Moore
- Department of Ocular Biology and Therapeutics, UCL Institute of Ophthalmology, University College London, London, UK
- Moorfields Eye Hospital, London, UK
- Ophthalmology Department, University of California, San Francisco School of Medicine, San Francisco, California, USA
| | - Bernard Puech
- Service d’Exploration de la Vision CHU, Lille, France
| |
Collapse
|
31
|
Howe DG, Bradford YM, Eagle A, Fashena D, Frazer K, Kalita P, Mani P, Martin R, Moxon ST, Paddock H, Pich C, Ramachandran S, Ruzicka L, Schaper K, Shao X, Singer A, Toro S, Van Slyke C, Westerfield M. The Zebrafish Model Organism Database: new support for human disease models, mutation details, gene expression phenotypes and searching. Nucleic Acids Res 2016; 45:D758-D768. [PMID: 27899582 PMCID: PMC5210580 DOI: 10.1093/nar/gkw1116] [Citation(s) in RCA: 50] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2016] [Revised: 10/25/2016] [Accepted: 10/27/2016] [Indexed: 12/16/2022] Open
Abstract
The Zebrafish Model Organism Database (ZFIN; http://zfin.org) is the central resource for zebrafish (Danio rerio) genetic, genomic, phenotypic and developmental data. ZFIN curators provide expert manual curation and integration of comprehensive data involving zebrafish genes, mutants, transgenic constructs and lines, phenotypes, genotypes, gene expressions, morpholinos, TALENs, CRISPRs, antibodies, anatomical structures, models of human disease and publications. We integrate curated, directly submitted, and collaboratively generated data, making these available to zebrafish research community. Among the vertebrate model organisms, zebrafish are superbly suited for rapid generation of sequence-targeted mutant lines, characterization of phenotypes including gene expression patterns, and generation of human disease models. The recent rapid adoption of zebrafish as human disease models is making management of these data particularly important to both the research and clinical communities. Here, we describe recent enhancements to ZFIN including use of the zebrafish experimental conditions ontology, ‘Fish’ records in the ZFIN database, support for gene expression phenotypes, models of human disease, mutation details at the DNA, RNA and protein levels, and updates to the ZFIN single box search.
Collapse
Affiliation(s)
- Douglas G Howe
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - Yvonne M Bradford
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - Anne Eagle
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - David Fashena
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - Ken Frazer
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - Patrick Kalita
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - Prita Mani
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - Ryan Martin
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - Sierra Taylor Moxon
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - Holly Paddock
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - Christian Pich
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | | | - Leyla Ruzicka
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - Kevin Schaper
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - Xiang Shao
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - Amy Singer
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - Sabrina Toro
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - Ceri Van Slyke
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| | - Monte Westerfield
- The Institute of Neuroscience, University of Oregon, Eugene, OR 97403-1254, USA
| |
Collapse
|
32
|
Diehl AD, Meehan TF, Bradford YM, Brush MH, Dahdul WM, Dougall DS, He Y, Osumi-Sutherland D, Ruttenberg A, Sarntivijai S, Van Slyke CE, Vasilevsky NA, Haendel MA, Blake JA, Mungall CJ. The Cell Ontology 2016: enhanced content, modularization, and ontology interoperability. J Biomed Semantics 2016; 7:44. [PMID: 27377652 PMCID: PMC4932724 DOI: 10.1186/s13326-016-0088-7] [Citation(s) in RCA: 145] [Impact Index Per Article: 18.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2016] [Accepted: 06/23/2016] [Indexed: 12/04/2022] Open
Abstract
BACKGROUND The Cell Ontology (CL) is an OBO Foundry candidate ontology covering the domain of canonical, natural biological cell types. Since its inception in 2005, the CL has undergone multiple rounds of revision and expansion, most notably in its representation of hematopoietic cells. For in vivo cells, the CL focuses on vertebrates but provides general classes that can be used for other metazoans, which can be subtyped in species-specific ontologies. CONSTRUCTION AND CONTENT Recent work on the CL has focused on extending the representation of various cell types, and developing new modules in the CL itself, and in related ontologies in coordination with the CL. For example, the Kidney and Urinary Pathway Ontology was used as a template to populate the CL with additional cell types. In addition, subtypes of the class 'cell in vitro' have received improved definitions and labels to provide for modularity with the representation of cells in the Cell Line Ontology and Reagent Ontology. Recent changes in the ontology development methodology for CL include a switch from OBO to OWL for the primary encoding of the ontology, and an increasing reliance on logical definitions for improved reasoning. UTILITY AND DISCUSSION The CL is now mandated as a metadata standard for large functional genomics and transcriptomics projects, and is used extensively for annotation, querying, and analyses of cell type specific data in sequencing consortia such as FANTOM5 and ENCODE, as well as for the NIAID ImmPort database and the Cell Image Library. The CL is also a vital component used in the modular construction of other biomedical ontologies-for example, the Gene Ontology and the cross-species anatomy ontology, Uberon, use CL to support the consistent representation of cell types across different levels of anatomical granularity, such as tissues and organs. CONCLUSIONS The ongoing improvements to the CL make it a valuable resource to both the OBO Foundry community and the wider scientific community, and we continue to experience increased interest in the CL both among developers and within the user community.
Collapse
Affiliation(s)
- Alexander D. Diehl
- />Department of Neurology, University at Buffalo School of Medicine and Biomedical Sciences, Buffalo, NY 14203 USA
| | - Terrence F. Meehan
- />European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge, CB10 1SD UK
| | - Yvonne M. Bradford
- />ZFIN, the Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, OR 97403 USA
| | - Matthew H. Brush
- />Ontology Development Group, Library, Oregon Health and Science University, Portland, Oregon 97239 USA
| | - Wasila M. Dahdul
- />Department of Biology, University of South Dakota, Vermillion, SD 57069 USA
- />National Evolutionary Synthesis Center, Durham, NC 27705 USA
| | - David S. Dougall
- />Southwestern Medical Center, University of Texas, Dallas, TX 75235 USA
| | - Yongqun He
- />Unit for Laboratory Animal Medicine, University of Michigan Medical School, Ann Arbor, MI 48109 USA
| | - David Osumi-Sutherland
- />European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge, CB10 1SD UK
| | - Alan Ruttenberg
- />Oral Diagnostics Sciences, University at Buffalo School of Dental Medicine, Buffalo, NY 14210 USA
| | - Sirarat Sarntivijai
- />European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge, CB10 1SD UK
| | - Ceri E. Van Slyke
- />ZFIN, the Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, OR 97403 USA
| | - Nicole A. Vasilevsky
- />Ontology Development Group, Library, Oregon Health and Science University, Portland, Oregon 97239 USA
| | - Melissa A. Haendel
- />Ontology Development Group, Library, Oregon Health and Science University, Portland, Oregon 97239 USA
| | | | | |
Collapse
|
33
|
Clarkson MD. Representation of anatomy in online atlases and databases: a survey and collection of patterns for interface design. BMC DEVELOPMENTAL BIOLOGY 2016; 16:18. [PMID: 27206491 PMCID: PMC4875762 DOI: 10.1186/s12861-016-0116-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/11/2016] [Accepted: 05/09/2016] [Indexed: 12/17/2022]
Abstract
BACKGROUND A large number of online atlases and databases have been developed to mange the rapidly growing amount of data describing embryogenesis. As these community resources continue to evolve, it is important to understand how representations of anatomy can facilitate the sharing and integration of data. In addition, attention to the design of the interfaces is critical to make online resources useful and usable. RESULTS I first present a survey of online atlases and gene expression resources for model organisms, with a focus on methods of semantic and spatial representation of anatomy. A total of 14 anatomical atlases and 21 gene expression resources are included. This survey demonstrates how choices in semantic representation, in the form of ontologies, can enhance interface search functions and provide links between relevant information. This survey also reviews methods for spatially representing anatomy in online resources. I then provide a collection of patterns for interface design based on the atlases and databases surveyed. These patterns include methods for displaying graphics, integrating semantic and spatial representations, organizing information, and querying databases to find genes expressed in anatomical structures. CONCLUSIONS This collection of patterns for interface design will assist biologists and software developers in planning the interfaces of new atlases and databases or enhancing existing ones. They also show the benefits of standardizing semantic and spatial representations of anatomy by demonstrating how interfaces can use standardization to provide enhanced functionality.
Collapse
Affiliation(s)
- Melissa D Clarkson
- Department of Biological Structure, School of Medicine, University of Washington, Seattle, WA, USA.
| |
Collapse
|
34
|
Howe DG, Bradford YM, Eagle A, Fashena D, Frazer K, Kalita P, Mani P, Martin R, Moxon ST, Paddock H, Pich C, Ramachandran S, Ruzicka L, Schaper K, Shao X, Singer A, Toro S, Van Slyke C, Westerfield M. A scientist's guide for submitting data to ZFIN. Methods Cell Biol 2016; 135:451-81. [PMID: 27443940 DOI: 10.1016/bs.mcb.2016.04.010] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
The Zebrafish Model Organism Database (ZFIN; zfin.org) serves as the central repository for genetic and genomic data produced using zebrafish (Danio rerio). Data in ZFIN are either manually curated from peer-reviewed publications or submitted directly to ZFIN from various data repositories. Data types currently supported include mutants, transgenic lines, DNA constructs, gene expression, phenotypes, antibodies, morpholinos, TALENs, CRISPRs, disease models, movies, and images. The rapidly changing methods of genomic science have increased the production of data that cannot readily be represented in standard journal publications. These large data sets require web-based presentation. As the central repository for zebrafish research data, it has become increasingly important for ZFIN to provide the zebrafish research community with support for their data sets and guidance on what is required to submit these data to ZFIN. Regardless of their volume, all data that are submitted for inclusion in ZFIN must include a minimum set of information that describes the data. The aim of this chapter is to identify data types that fit into the current ZFIN database and explain how to provide those data in the optimal format for integration. We identify the required and optional data elements, define jargon, and present tools and templates that can help with the acquisition and organization of data as they are being prepared for submission to ZFIN. This information will also appear in the ZFIN wiki, where it will be updated as our services evolve over time.
Collapse
Affiliation(s)
- D G Howe
- University of Oregon, Eugene, OR, United States
| | | | - A Eagle
- University of Oregon, Eugene, OR, United States
| | - D Fashena
- University of Oregon, Eugene, OR, United States
| | - K Frazer
- University of Oregon, Eugene, OR, United States
| | - P Kalita
- University of Oregon, Eugene, OR, United States
| | - P Mani
- University of Oregon, Eugene, OR, United States
| | - R Martin
- University of Oregon, Eugene, OR, United States
| | - S T Moxon
- University of Oregon, Eugene, OR, United States
| | - H Paddock
- University of Oregon, Eugene, OR, United States
| | - C Pich
- University of Oregon, Eugene, OR, United States
| | | | - L Ruzicka
- University of Oregon, Eugene, OR, United States
| | - K Schaper
- University of Oregon, Eugene, OR, United States
| | - X Shao
- University of Oregon, Eugene, OR, United States
| | - A Singer
- University of Oregon, Eugene, OR, United States
| | - S Toro
- University of Oregon, Eugene, OR, United States
| | - C Van Slyke
- University of Oregon, Eugene, OR, United States
| | | |
Collapse
|
35
|
Druzinsky RE, Balhoff JP, Crompton AW, Done J, German RZ, Haendel MA, Herrel A, Herring SW, Lapp H, Mabee PM, Muller HM, Mungall CJ, Sternberg PW, Van Auken K, Vinyard CJ, Williams SH, Wall CE. Muscle Logic: New Knowledge Resource for Anatomy Enables Comprehensive Searches of the Literature on the Feeding Muscles of Mammals. PLoS One 2016; 11:e0149102. [PMID: 26870952 PMCID: PMC4752357 DOI: 10.1371/journal.pone.0149102] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2015] [Accepted: 01/27/2016] [Indexed: 01/27/2023] Open
Abstract
Background In recent years large bibliographic databases have made much of the published literature of biology available for searches. However, the capabilities of the search engines integrated into these databases for text-based bibliographic searches are limited. To enable searches that deliver the results expected by comparative anatomists, an underlying logical structure known as an ontology is required. Development and Testing of the Ontology Here we present the Mammalian Feeding Muscle Ontology (MFMO), a multi-species ontology focused on anatomical structures that participate in feeding and other oral/pharyngeal behaviors. A unique feature of the MFMO is that a simple, computable, definition of each muscle, which includes its attachments and innervation, is true across mammals. This construction mirrors the logical foundation of comparative anatomy and permits searches using language familiar to biologists. Further, it provides a template for muscles that will be useful in extending any anatomy ontology. The MFMO is developed to support the Feeding Experiments End-User Database Project (FEED, https://feedexp.org/), a publicly-available, online repository for physiological data collected from in vivo studies of feeding (e.g., mastication, biting, swallowing) in mammals. Currently the MFMO is integrated into FEED and also into two literature-specific implementations of Textpresso, a text-mining system that facilitates powerful searches of a corpus of scientific publications. We evaluate the MFMO by asking questions that test the ability of the ontology to return appropriate answers (competency questions). We compare the results of queries of the MFMO to results from similar searches in PubMed and Google Scholar. Results and Significance Our tests demonstrate that the MFMO is competent to answer queries formed in the common language of comparative anatomy, but PubMed and Google Scholar are not. Overall, our results show that by incorporating anatomical ontologies into searches, an expanded and anatomically comprehensive set of results can be obtained. The broader scientific and publishing communities should consider taking up the challenge of semantically enabled search capabilities.
Collapse
Affiliation(s)
- Robert E. Druzinsky
- Department of Oral Biology, University of Illinois at Chicago, Chicago, Illinois, United States of America
- * E-mail:
| | - James P. Balhoff
- RTI International, Research Triangle Park, North Carolina, United States of America
| | - Alfred W. Crompton
- Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts, United States of America
| | - James Done
- Division of Biology and Biological Engineering, M/C 156–29, California Institute of Technology, Pasadena, California, United States of America
| | - Rebecca Z. German
- Department of Anatomy and Neurobiology, Northeast Ohio Medical University, Rootstown, Ohio, United States of America
| | - Melissa A. Haendel
- Oregon Health and Science University, Portland, Oregon, United States of America
| | - Anthony Herrel
- Département d’Ecologie et de Gestion de la Biodiversité, Museum National d’Histoire Naturelle, Paris, France
| | - Susan W. Herring
- University of Washington, Department of Orthodontics, Seattle, Washington, United States of America
| | - Hilmar Lapp
- National Evolutionary Synthesis Center, Durham, North Carolina, United States of America
- Center for Genomic and Computational Biology, Duke University, Durham, North Carolina, United States of America
| | - Paula M. Mabee
- Department of Biology, University of South Dakota, Vermillion, South Dakota, United States of America
| | - Hans-Michael Muller
- Division of Biology and Biological Engineering, M/C 156–29, California Institute of Technology, Pasadena, California, United States of America
| | - Christopher J. Mungall
- Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, California, United States of America
| | - Paul W. Sternberg
- Division of Biology and Biological Engineering, M/C 156–29, California Institute of Technology, Pasadena, California, United States of America
- Howard Hughes Medical Institute, M/C 156–29, California Institute of Technology, Pasadena, California, United States of America
| | - Kimberly Van Auken
- Division of Biology and Biological Engineering, M/C 156–29, California Institute of Technology, Pasadena, California, United States of America
| | - Christopher J. Vinyard
- Department of Anatomy and Neurobiology, Northeast Ohio Medical University, Rootstown, Ohio, United States of America
| | - Susan H. Williams
- Department of Biomedical Sciences, Ohio University Heritage College of Osteopathic Medicine, Athens, Ohio, United States of America
| | - Christine E. Wall
- Department of Evolutionary Anthropology, Duke University, Durham, North Carolina, United States of America
| |
Collapse
|
36
|
Brozovic M, Martin C, Dantec C, Dauga D, Mendez M, Simion P, Percher M, Laporte B, Scornavacca C, Di Gregorio A, Fujiwara S, Gineste M, Lowe EK, Piette J, Racioppi C, Ristoratore F, Sasakura Y, Takatori N, Brown TC, Delsuc F, Douzery E, Gissi C, McDougall A, Nishida H, Sawada H, Swalla BJ, Yasuo H, Lemaire P. ANISEED 2015: a digital framework for the comparative developmental biology of ascidians. Nucleic Acids Res 2016; 44:D808-18. [PMID: 26420834 PMCID: PMC4702943 DOI: 10.1093/nar/gkv966] [Citation(s) in RCA: 52] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2015] [Accepted: 09/14/2015] [Indexed: 11/24/2022] Open
Abstract
Ascidians belong to the tunicates, the sister group of vertebrates and are recognized model organisms in the field of embryonic development, regeneration and stem cells. ANISEED is the main information system in the field of ascidian developmental biology. This article reports the development of the system since its initial publication in 2010. Over the past five years, we refactored the system from an initial custom schema to an extended version of the Chado schema and redesigned all user and back end interfaces. This new architecture was used to improve and enrich the description of Ciona intestinalis embryonic development, based on an improved genome assembly and gene model set, refined functional gene annotation, and anatomical ontologies, and a new collection of full ORF cDNAs. The genomes of nine ascidian species have been sequenced since the release of the C. intestinalis genome. In ANISEED 2015, all nine new ascidian species can be explored via dedicated genome browsers, and searched by Blast. In addition, ANISEED provides full functional gene annotation, anatomical ontologies and some gene expression data for the six species with highest quality genomes. ANISEED is publicly available at: http://www.aniseed.cnrs.fr.
Collapse
Affiliation(s)
- Matija Brozovic
- Centre de Recherches de Biochimie Macromoléculaire (CRBM), UMR5237, CNRS-Université de Montpellier, 1919 route de Mende, F-34090 Montpellier, France
| | - Cyril Martin
- Centre de Recherches de Biochimie Macromoléculaire (CRBM), UMR5237, CNRS-Université de Montpellier, 1919 route de Mende, F-34090 Montpellier, France
| | - Christelle Dantec
- Centre de Recherches de Biochimie Macromoléculaire (CRBM), UMR5237, CNRS-Université de Montpellier, 1919 route de Mende, F-34090 Montpellier, France
| | - Delphine Dauga
- Institut de Biologie du Développement de Marseille (IBDM), UMR7288 CNRS-Aix Marseille Université, Parc Scientifique de Luminy, Case 907, F-13288 Marseille Cedex 9, France Bioself Communication, 28 rue de la Bibliothèque, F-13001 Marseille, France
| | - Mickaël Mendez
- Centre de Recherches de Biochimie Macromoléculaire (CRBM), UMR5237, CNRS-Université de Montpellier, 1919 route de Mende, F-34090 Montpellier, France
| | - Paul Simion
- Institut des Sciences de l'Evolution de Montpellier (ISE-M), UMR 5554 CNRS-IRD-Université de Montpellier, F-34090 Montpellier, France
| | - Madeline Percher
- Centre de Recherches de Biochimie Macromoléculaire (CRBM), UMR5237, CNRS-Université de Montpellier, 1919 route de Mende, F-34090 Montpellier, France
| | - Baptiste Laporte
- Institut de Biologie du Développement de Marseille (IBDM), UMR7288 CNRS-Aix Marseille Université, Parc Scientifique de Luminy, Case 907, F-13288 Marseille Cedex 9, France
| | - Céline Scornavacca
- Institut des Sciences de l'Evolution de Montpellier (ISE-M), UMR 5554 CNRS-IRD-Université de Montpellier, F-34090 Montpellier, France
| | - Anna Di Gregorio
- Department of Basic Science and Craniofacial Biology New York University College of Dentistry, 345 E 24th Street, New York, NY 10010, USA
| | - Shigeki Fujiwara
- Department of Applied Science, Kochi University, Kochi-shi, Kochi 780-8520, Japan
| | - Mathieu Gineste
- Centre de Recherches de Biochimie Macromoléculaire (CRBM), UMR5237, CNRS-Université de Montpellier, 1919 route de Mende, F-34090 Montpellier, France
| | - Elijah K Lowe
- Department of Biology and Evolution of Marine Organisms, Stazione Zoologica Anton Dohrn, Villa Comunale, I-80121 Napoli, Italy BEACON Center for the Study of Evolution in Action, Michigan State University, East Lansing, Michigan, USA
| | - Jacques Piette
- Centre de Recherches de Biochimie Macromoléculaire (CRBM), UMR5237, CNRS-Université de Montpellier, 1919 route de Mende, F-34090 Montpellier, France
| | - Claudia Racioppi
- Center for Developmental Genetics, Department of Biology, New York University, New York, NY 10003, USA
| | - Filomena Ristoratore
- Department of Biology and Evolution of Marine Organisms, Stazione Zoologica Anton Dohrn, Villa Comunale, I-80121 Napoli, Italy
| | - Yasunori Sasakura
- Shimoda Marine Research Center, University of Tsukuba, Shimoda, Shizuoka 415-0025, Japan
| | - Naohito Takatori
- Developmental Biology Laboratory, Department of Biological Sciences, School of Science and Engineering, Tokyo Metropolitan University, 1-1 Minamioosawa, Hachiooji, Tokyo 192-0397, Japan Department of Biological Sciences, Graduate School of Science, Osaka University, 1-1 Machikaneyama-cho, Toyonaka, Osaka 560-0043, Japan
| | - Titus C Brown
- Population Health and Reproduction, UC Davis, Davis, CA 95616, USA
| | - Frédéric Delsuc
- Institut des Sciences de l'Evolution de Montpellier (ISE-M), UMR 5554 CNRS-IRD-Université de Montpellier, F-34090 Montpellier, France
| | - Emmanuel Douzery
- Institut des Sciences de l'Evolution de Montpellier (ISE-M), UMR 5554 CNRS-IRD-Université de Montpellier, F-34090 Montpellier, France
| | - Carmela Gissi
- Dipartimento di Bioscienze, Università degli Studi di Milano, Via Celoria 26, Milano 20133, Italy
| | - Alex McDougall
- Sorbonne Universités, Université Pierre et Marie Curie, CNRS, Laboratoire de Biologie du Développement de Villefranche-sur-mer, Observatoire Océanologique, F-06230 Villefranche-sur-mer, France
| | - Hiroki Nishida
- Department of Biological Sciences, Graduate School of Science, Osaka University, 1-1 Machikaneyama-cho, Toyonaka, Osaka 560-0043, Japan
| | - Hitoshi Sawada
- Sugashima Marine Biological Laboratory, Graduate School of Science, Nagoya University, 429-63 Sugashima, Toba 517-0004, Japan
| | - Billie J Swalla
- Friday Harbor Laboratories, 620 University Road, Friday Harbor, WA 98250-9299, USA
| | - Hitoyoshi Yasuo
- Sorbonne Universités, Université Pierre et Marie Curie, CNRS, Laboratoire de Biologie du Développement de Villefranche-sur-mer, Observatoire Océanologique, F-06230 Villefranche-sur-mer, France
| | - Patrick Lemaire
- Centre de Recherches de Biochimie Macromoléculaire (CRBM), UMR5237, CNRS-Université de Montpellier, 1919 route de Mende, F-34090 Montpellier, France Institut de Biologie du Développement de Marseille (IBDM), UMR7288 CNRS-Aix Marseille Université, Parc Scientifique de Luminy, Case 907, F-13288 Marseille Cedex 9, France
| |
Collapse
|
37
|
Smedley D, Jacobsen JOB, Jäger M, Köhler S, Holtgrewe M, Schubach M, Siragusa E, Zemojtel T, Buske OJ, Washington NL, Bone WP, Haendel MA, Robinson PN. Next-generation diagnostics and disease-gene discovery with the Exomiser. Nat Protoc 2015; 10:2004-15. [PMID: 26562621 DOI: 10.1038/nprot.2015.124] [Citation(s) in RCA: 238] [Impact Index Per Article: 26.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
Exomiser is an application that prioritizes genes and variants in next-generation sequencing (NGS) projects for novel disease-gene discovery or differential diagnostics of Mendelian disease. Exomiser comprises a suite of algorithms for prioritizing exome sequences using random-walk analysis of protein interaction networks, clinical relevance and cross-species phenotype comparisons, as well as a wide range of other computational filters for variant frequency, predicted pathogenicity and pedigree analysis. In this protocol, we provide a detailed explanation of how to install Exomiser and use it to prioritize exome sequences in a number of scenarios. Exomiser requires ∼3 GB of RAM and roughly 15-90 s of computing time on a standard desktop computer to analyze a variant call format (VCF) file. Exomiser is freely available for academic use from http://www.sanger.ac.uk/science/tools/exomiser.
Collapse
Affiliation(s)
- Damian Smedley
- Skarnes Faculty Group, Wellcome Trust Sanger Institute, Hinxton, UK
| | | | - Marten Jäger
- Institute for Medical and Human Genetics, Charité-Universitätsmedizin Berlin, Berlin, Germany.,Berlin Brandenburg Center for Regenerative Therapies (BCRT), Charité-Universitätsmedizin Berlin, Berlin, Germany
| | - Sebastian Köhler
- Institute for Medical and Human Genetics, Charité-Universitätsmedizin Berlin, Berlin, Germany
| | - Manuel Holtgrewe
- Institute for Medical and Human Genetics, Charité-Universitätsmedizin Berlin, Berlin, Germany.,Berlin Institute for Health, Berlin, Germany
| | - Max Schubach
- Institute for Medical and Human Genetics, Charité-Universitätsmedizin Berlin, Berlin, Germany
| | - Enrico Siragusa
- Institute for Medical and Human Genetics, Charité-Universitätsmedizin Berlin, Berlin, Germany.,Berlin Institute for Health, Berlin, Germany.,Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - Tomasz Zemojtel
- Institute for Medical and Human Genetics, Charité-Universitätsmedizin Berlin, Berlin, Germany.,Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland.,Labor Berlin - Charité Vivantes, Humangenetik, Berlin, Germany
| | - Orion J Buske
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada.,Genetics and Genome Biology, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Nicole L Washington
- Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - William P Bone
- The National Institutes of Health (NIH) Undiagnosed Diseases Program, Common Fund, Office of the Director, NIH, Bethesda, Maryland, USA
| | - Melissa A Haendel
- Department of Medical Informatics and Clinical Epidemiology, Oregon Health &Science University, Portland, Oregon, USA
| | - Peter N Robinson
- Institute for Medical and Human Genetics, Charité-Universitätsmedizin Berlin, Berlin, Germany.,Berlin Brandenburg Center for Regenerative Therapies (BCRT), Charité-Universitätsmedizin Berlin, Berlin, Germany.,Max Planck Institute for Molecular Genetics, Berlin, Germany.,Department of Mathematics and Computer Science, Institute for Bioinformatics, Freie Universität Berlin, Berlin, Germany
| |
Collapse
|
38
|
Mungall CJ, Washington NL, Nguyen-Xuan J, Condit C, Smedley D, Köhler S, Groza T, Shefchek K, Hochheiser H, Robinson PN, Lewis SE, Haendel MA. Use of model organism and disease databases to support matchmaking for human disease gene discovery. Hum Mutat 2015; 36:979-84. [PMID: 26269093 PMCID: PMC5473253 DOI: 10.1002/humu.22857] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2015] [Accepted: 07/22/2015] [Indexed: 11/10/2022]
Abstract
The Matchmaker Exchange application programming interface (API) allows searching a patient's genotypic or phenotypic profiles across clinical sites, for the purposes of cohort discovery and variant disease causal validation. This API can be used not only to search for matching patients, but also to match against public disease and model organism data. This public disease data enable matching known diseases and variant-phenotype associations using phenotype semantic similarity algorithms developed by the Monarch Initiative. The model data can provide additional evidence to aid diagnosis, suggest relevant models for disease mechanism and treatment exploration, and identify collaborators across the translational divide. The Monarch Initiative provides an implementation of this API for searching multiple integrated sources of data that contextualize the knowledge about any given patient or patient family into the greater biomedical knowledge landscape. While this corpus of data can aid diagnosis, it is also the beginning of research to improve understanding of rare human diseases.
Collapse
Affiliation(s)
| | - Nicole L. Washington
- Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Jeremy Nguyen-Xuan
- Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Christopher Condit
- San Diego Supercomputing Center, UC San Diego, La Jolla, California, USA
| | - Damian Smedley
- Wellcome Trust Sanger Institute, Mouse Informatics group, Hinxton, UK
| | - Sebastian Köhler
- Charité - Universitätsmedizin Berlin, Institute for Medical and Human Genetics, Berlin, Germany
| | - Tudor Groza
- Garvan Institute, Kinghorn Centre for Clinical Genomics, Sydney, Australia
| | - Kent Shefchek
- Department of Biomedical Informatics and Clinical Epidemiology, Oregon Health and Science University
| | - Harry Hochheiser
- Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
| | - Peter N. Robinson
- Charité - Universitätsmedizin Berlin, Institute for Medical and Human Genetics, Berlin, Germany
| | - Suzanna E. Lewis
- Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Melissa A. Haendel
- Department of Biomedical Informatics and Clinical Epidemiology, Oregon Health and Science University
| |
Collapse
|
39
|
Abstract
Mouse anatomy ontologies provide standard nomenclature for describing normal and mutant mouse anatomy, and are essential for the description and integration of data directly related to anatomy such as gene expression patterns. Building on our previous work on anatomical ontologies for the embryonic and adult mouse, we have recently developed a new and substantially revised anatomical ontology covering all life stages of the mouse. Anatomical terms are organized in complex hierarchies enabling multiple relationships between terms. Tissue classification as well as partonomic, developmental, and other types of relationships can be represented. Hierarchies for specific developmental stages can also be derived. The ontology forms the core of the eMouse Atlas Project (EMAP) and is used extensively for annotating and integrating gene expression patterns and other data by the Gene Expression Database (GXD), the eMouse Atlas of Gene Expression (EMAGE) and other database resources. Here we illustrate the evolution of the developmental and adult mouse anatomical ontologies toward one combined system. We report on recent ontology enhancements, describe the current status, and discuss future plans for mouse anatomy ontology development and application in integrating data resources.
Collapse
|
40
|
Ruzicka L, Bradford YM, Frazer K, Howe DG, Paddock H, Ramachandran S, Singer A, Toro S, Van Slyke CE, Eagle AE, Fashena D, Kalita P, Knight J, Mani P, Martin R, Moxon SAT, Pich C, Schaper K, Shao X, Westerfield M. ZFIN, The zebrafish model organism database: Updates and new directions. Genesis 2015; 53:498-509. [PMID: 26097180 DOI: 10.1002/dvg.22868] [Citation(s) in RCA: 56] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2015] [Revised: 06/16/2015] [Accepted: 06/17/2015] [Indexed: 12/19/2022]
Abstract
The Zebrafish Model Organism Database (ZFIN; http://zfin.org) is the central resource for genetic and genomic data from zebrafish (Danio rerio) research. ZFIN staff curate detailed information about genes, mutants, genotypes, reporter lines, sequences, constructs, antibodies, knockdown reagents, expression patterns, phenotypes, gene product function, and orthology from publications. Researchers can submit mutant, transgenic, expression, and phenotype data directly to ZFIN and use the ZFIN Community Wiki to share antibody and protocol information. Data can be accessed through topic-specific searches, a new site-wide search, and the data-mining resource ZebrafishMine (http://zebrafishmine.org). Data download and web service options are also available. ZFIN collaborates with major bioinformatics organizations to verify and integrate genomic sequence data, provide nomenclature support, establish reciprocal links, and participate in the development of standardized structured vocabularies (ontologies) used for data annotation and searching. ZFIN-curated gene, function, expression, and phenotype data are available for comparative exploration at several multi-species resources. The use of zebrafish as a model for human disease is increasing. ZFIN is supporting this growing area with three major projects: adding easy access to computed orthology data from gene pages, curating details of the gene expression pattern changes in mutant fish, and curating zebrafish models of human diseases.
Collapse
Affiliation(s)
| | | | - Ken Frazer
- ZFIN, 5291 University of Oregon, Eugene, Oregon
| | | | | | | | - Amy Singer
- ZFIN, 5291 University of Oregon, Eugene, Oregon
| | | | | | | | | | | | | | - Prita Mani
- ZFIN, 5291 University of Oregon, Eugene, Oregon
| | - Ryan Martin
- ZFIN, 5291 University of Oregon, Eugene, Oregon
| | | | | | | | - Xiang Shao
- ZFIN, 5291 University of Oregon, Eugene, Oregon
| | | |
Collapse
|
41
|
Abstract
Background Phenotypic data are routinely used to elucidate gene function in organisms amenable to genetic manipulation. However, previous to this work, there was no generalizable system in place for the structured storage and retrieval of phenotypic information for bacteria. Results The Ontology of Microbial Phenotypes (OMP) has been created to standardize the capture of such phenotypic information from microbes. OMP has been built on the foundations of the Basic Formal Ontology and the Phenotype and Trait Ontology. Terms have logical definitions that can facilitate computational searching of phenotypes and their associated genes. OMP can be accessed via a wiki page as well as downloaded from SourceForge. Initial annotations with OMP are being made for Escherichia coli using a wiki-based annotation capture system. New OMP terms are being concurrently developed as annotation proceeds. Conclusions We anticipate that diverse groups studying microbial genetics and associated phenotypes will employ OMP for standardizing microbial phenotype annotation, much as the Gene Ontology has standardized gene product annotation. The resulting OMP resource and associated annotations will facilitate prediction of phenotypes for unknown genes and result in new experimental characterization of phenotypes and functions.
Collapse
|
42
|
Smith CM, Finger JH, Kadin JA, Richardson JE, Ringwald M. The gene expression database for mouse development (GXD): putting developmental expression information at your fingertips. Dev Dyn 2014; 243:1176-86. [PMID: 24958384 DOI: 10.1002/dvdy.24155] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2014] [Revised: 05/16/2014] [Accepted: 06/17/2014] [Indexed: 12/15/2022] Open
Abstract
Because molecular mechanisms of development are extraordinarily complex, the understanding of these processes requires the integration of pertinent research data. Using the Gene Expression Database for Mouse Development (GXD) as an example, we illustrate the progress made toward this goal, and discuss relevant issues that apply to developmental databases and developmental research in general. Since its first release in 1998, GXD has served the scientific community by integrating multiple types of expression data from publications and electronic submissions and by making these data freely and widely available. Focusing on endogenous gene expression in wild-type and mutant mice and covering data from RNA in situ hybridization, in situ reporter (knock-in), immunohistochemistry, reverse transcriptase-polymerase chain reaction, Northern blot, and Western blot experiments, the database has grown tremendously over the years in terms of data content and search utilities. Currently, GXD includes over 1.4 million annotated expression results and over 260,000 images. All these data and images are readily accessible to many types of database searches. Here we describe the data and search tools of GXD; explain how to use the database most effectively; discuss how we acquire, curate, and integrate developmental expression information; and describe how the research community can help in this process.
Collapse
|
43
|
Haendel MA, Balhoff JP, Bastian FB, Blackburn DC, Blake JA, Bradford Y, Comte A, Dahdul WM, Dececchi TA, Druzinsky RE, Hayamizu TF, Ibrahim N, Lewis SE, Mabee PM, Niknejad A, Robinson-Rechavi M, Sereno PC, Mungall CJ. Unification of multi-species vertebrate anatomy ontologies for comparative biology in Uberon. J Biomed Semantics 2014; 5:21. [PMID: 25009735 PMCID: PMC4089931 DOI: 10.1186/2041-1480-5-21] [Citation(s) in RCA: 88] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2013] [Accepted: 03/25/2014] [Indexed: 12/25/2022] Open
Abstract
BACKGROUND Elucidating disease and developmental dysfunction requires understanding variation in phenotype. Single-species model organism anatomy ontologies (ssAOs) have been established to represent this variation. Multi-species anatomy ontologies (msAOs; vertebrate skeletal, vertebrate homologous, teleost, amphibian AOs) have been developed to represent 'natural' phenotypic variation across species. Our aim has been to integrate ssAOs and msAOs for various purposes, including establishing links between phenotypic variation and candidate genes. RESULTS Previously, msAOs contained a mixture of unique and overlapping content. This hampered integration and coordination due to the need to maintain cross-references or inter-ontology equivalence axioms to the ssAOs, or to perform large-scale obsolescence and modular import. Here we present the unification of anatomy ontologies into Uberon, a single ontology resource that enables interoperability among disparate data and research groups. As a consequence, independent development of TAO, VSAO, AAO, and vHOG has been discontinued. CONCLUSIONS The newly broadened Uberon ontology is a unified cross-taxon resource for metazoans (animals) that has been substantially expanded to include a broad diversity of vertebrate anatomical structures, permitting reasoning across anatomical variation in extinct and extant taxa. Uberon is a core resource that supports single- and cross-species queries for candidate genes using annotations for phenotypes from the systematics, biodiversity, medical, and model organism communities, while also providing entities for logical definitions in the Cell and Gene Ontologies. THE ONTOLOGY RELEASE FILES ASSOCIATED WITH THE ONTOLOGY MERGE DESCRIBED IN THIS MANUSCRIPT ARE AVAILABLE AT: http://purl.obolibrary.org/obo/uberon/releases/2013-02-21/ CURRENT ONTOLOGY RELEASE FILES ARE AVAILABLE ALWAYS AVAILABLE AT: http://purl.obolibrary.org/obo/uberon/releases/
Collapse
Affiliation(s)
- Melissa A Haendel
- Department of Medical Informatics & Epidemiology, Oregon Health & Science University, Portland, OR, USA
| | - James P Balhoff
- Department of Biology, University of North Carolina, Chapel Hill, NC 27599-3280, USA ; National Evolutionary Synthesis Center, Durham, NC, USA
| | - Frederic B Bastian
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland ; Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - David C Blackburn
- Department of Vertebrate Zoology and Anthropology, California Academy of Sciences, San Francisco, CA 94118, USA
| | | | - Yvonne Bradford
- The Zebrafish Model Organism Database, University of Oregon, Eugene, OR 97403, USA
| | - Aurelie Comte
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland ; Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Wasila M Dahdul
- National Evolutionary Synthesis Center, Durham, NC, USA ; Department of Biology, University of South Dakota, Vermillion, SD 57069, USA
| | - Thomas A Dececchi
- Department of Biology, University of South Dakota, Vermillion, SD 57069, USA
| | - Robert E Druzinsky
- Department of Oral Biology, University of Illinois-Chicago, Chicago, IL 60612, USA
| | | | - Nizar Ibrahim
- Department of Organismal Biology and Anatomy, University of Chicago, Chicago, IL 60637, USA
| | - Suzanna E Lewis
- Lawrence Berkeley National Laboratory, 1 Cyclotron Rd, Berkeley, CA 94720, USA
| | - Paula M Mabee
- Department of Biology, University of South Dakota, Vermillion, SD 57069, USA
| | - Anne Niknejad
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland ; Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Marc Robinson-Rechavi
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland ; Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Paul C Sereno
- Department of Organismal Biology and Anatomy, University of Chicago, Chicago, IL 60637, USA
| | | |
Collapse
|