1
|
Kaldunski ML, Smith JR, Brodie KC, De Pons JL, Demos WM, Gibson AC, Hayman GT, Lamers L, Laulederkind SJF, Thorat K, Thota J, Tutaj MA, Tutaj M, Vedi M, Wang SJ, Zacher S, Dwinell MR, Kwitek AE. Rare disease research resources at the Rat Genome Database. Genetics 2023; 224:iyad078. [PMID: 37119810 PMCID: PMC10411567 DOI: 10.1093/genetics/iyad078] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2023] [Revised: 04/05/2023] [Accepted: 04/19/2023] [Indexed: 05/01/2023] Open
Abstract
Rare diseases individually affect relatively few people, but as a group they impact considerable numbers of people. The Rat Genome Database (https://rgd.mcw.edu) is a knowledgebase that offers resources for rare disease research. This includes disease definitions, genes, quantitative trail loci (QTLs), genetic variants, annotations to published literature, links to external resources, and more. One important resource is identifying relevant cell lines and rat strains that serve as models for disease research. Diseases, genes, and strains have report pages with consolidated data, and links to analysis tools. Utilizing these globally accessible resources for rare disease research, potentiating discovery of mechanisms and new treatments, can point researchers toward solutions to alleviate the suffering of those afflicted with these diseases.
Collapse
Affiliation(s)
- Mary L Kaldunski
- The Rat Genome Database, Department of Physiology, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Jennifer R Smith
- The Rat Genome Database, Department of Physiology, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Kent C Brodie
- Clinical and Translational Science Institute, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Jeffrey L De Pons
- The Rat Genome Database, Department of Physiology, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Wendy M Demos
- The Rat Genome Database, Department of Physiology, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Adam C Gibson
- The Rat Genome Database, Department of Physiology, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - G Thomas Hayman
- The Rat Genome Database, Department of Physiology, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Logan Lamers
- The Rat Genome Database, Department of Physiology, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Stanley J F Laulederkind
- The Rat Genome Database, Department of Physiology, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Ketaki Thorat
- The Rat Genome Database, Department of Physiology, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Jyothi Thota
- The Rat Genome Database, Department of Physiology, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Marek A Tutaj
- The Rat Genome Database, Department of Physiology, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Monika Tutaj
- The Rat Genome Database, Department of Physiology, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Mahima Vedi
- The Rat Genome Database, Department of Physiology, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Shur-Jen Wang
- The Rat Genome Database, Department of Physiology, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Stacy Zacher
- Finance and Administration, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Melinda R Dwinell
- The Rat Genome Database, Department of Physiology, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Anne E Kwitek
- The Rat Genome Database, Department of Physiology, Medical College of Wisconsin, Milwaukee, WI 53226, USA
- Joint Department of Biomedical Engineering, Marquette University & Medical College of Wisconsin, Milwaukee, WI 53226, USA
| |
Collapse
|
2
|
Vedi M, Smith JR, Thomas Hayman G, Tutaj M, Brodie KC, De Pons JL, Demos WM, Gibson AC, Kaldunski ML, Lamers L, Laulederkind SJF, Thota J, Thorat K, Tutaj MA, Wang SJ, Zacher S, Dwinell MR, Kwitek AE. 2022 updates to the Rat Genome Database: a Findable, Accessible, Interoperable, and Reusable (FAIR) resource. Genetics 2023; 224:iyad042. [PMID: 36930729 PMCID: PMC10474928 DOI: 10.1093/genetics/iyad042] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2023] [Revised: 03/07/2023] [Accepted: 03/08/2023] [Indexed: 03/19/2023] Open
Abstract
The Rat Genome Database (RGD, https://rgd.mcw.edu) has evolved from simply a resource for rat genetic markers, maps, and genes, by adding multiple genomic data types and extensive disease and phenotype annotations and developing tools to effectively mine, analyze, and visualize the available data, to empower investigators in their hypothesis-driven research. Leveraging its robust and flexible infrastructure, RGD has added data for human and eight other model organisms (mouse, 13-lined ground squirrel, chinchilla, naked mole-rat, dog, pig, African green monkey/vervet, and bonobo) besides rat to enhance its translational aspect. This article presents an overview of the database with the most recent additions to RGD's genome, variant, and quantitative phenotype data. We also briefly introduce Virtual Comparative Map (VCMap), an updated tool that explores synteny between species as an improvement to RGD's suite of tools, followed by a discussion regarding the refinements to the existing PhenoMiner tool that assists researchers in finding and comparing quantitative data across rat strains. Collectively, RGD focuses on providing a continuously improving, consistent, and high-quality data resource for researchers while advancing data reproducibility and fulfilling Findable, Accessible, Interoperable, and Reusable (FAIR) data principles.
Collapse
Affiliation(s)
- Mahima Vedi
- The Rat Genome Database, Department of Physiology, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Jennifer R Smith
- The Rat Genome Database, Department of Physiology, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - G Thomas Hayman
- The Rat Genome Database, Department of Physiology, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Monika Tutaj
- The Rat Genome Database, Department of Physiology, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Kent C Brodie
- Clinical and Translational Science Institute, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Jeffrey L De Pons
- The Rat Genome Database, Department of Physiology, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Wendy M Demos
- The Rat Genome Database, Department of Physiology, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Adam C Gibson
- The Rat Genome Database, Department of Physiology, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Mary L Kaldunski
- The Rat Genome Database, Department of Physiology, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Logan Lamers
- The Rat Genome Database, Department of Physiology, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Stanley J F Laulederkind
- The Rat Genome Database, Department of Physiology, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Jyothi Thota
- The Rat Genome Database, Department of Physiology, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Ketaki Thorat
- The Rat Genome Database, Department of Physiology, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Marek A Tutaj
- The Rat Genome Database, Department of Physiology, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Shur-Jen Wang
- The Rat Genome Database, Department of Physiology, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Stacy Zacher
- Finance and Administration, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Melinda R Dwinell
- The Rat Genome Database, Department of Physiology, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Anne E Kwitek
- The Rat Genome Database, Department of Physiology, Medical College of Wisconsin, Milwaukee, WI 53226, USA
- Department of Biomedical Engineering, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| |
Collapse
|
3
|
Smith JR, Hayman GT, Wang SJ, Laulederkind SJF, Hoffman MJ, Kaldunski ML, Tutaj M, Thota J, Nalabolu HS, Ellanki SLR, Tutaj MA, De Pons JL, Kwitek AE, Dwinell MR, Shimoyama ME. The Year of the Rat: The Rat Genome Database at 20: a multi-species knowledgebase and analysis platform. Nucleic Acids Res 2020; 48:D731-D742. [PMID: 31713623 PMCID: PMC7145519 DOI: 10.1093/nar/gkz1041] [Citation(s) in RCA: 59] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2019] [Revised: 10/21/2019] [Accepted: 10/24/2019] [Indexed: 12/13/2022] Open
Abstract
Formed in late 1999, the Rat Genome Database (RGD, https://rgd.mcw.edu) will be 20 in 2020, the Year of the Rat. Because the laboratory rat, Rattus norvegicus, has been used as a model for complex human diseases such as cardiovascular disease, diabetes, cancer, neurological disorders and arthritis, among others, for >150 years, RGD has always been disease-focused and committed to providing data and tools for researchers doing comparative genomics and translational studies. At its inception, before the sequencing of the rat genome, RGD started with only a few data types localized on genetic and radiation hybrid (RH) maps and offered only a few tools for querying and consolidating that data. Since that time, RGD has expanded to include a wealth of structured and standardized genetic, genomic, phenotypic, and disease-related data for eight species, and a suite of innovative tools for querying, analyzing and visualizing this data. This article provides an overview of recent substantial additions and improvements to RGD's data and tools that can assist researchers in finding and utilizing the data they need, whether their goal is to develop new precision models of disease or to more fully explore emerging details within a system or across multiple systems.
Collapse
Affiliation(s)
- Jennifer R Smith
- Rat Genome Database, Department of Biomedical Engineering, Medical College of Wisconsin, Milwaukee, WI 53226, USA
- To whom correspondence should be addressed. Tel: +1 414 955 8871; Fax: +1 414 955 6595;
| | - G Thomas Hayman
- Rat Genome Database, Department of Biomedical Engineering, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Shur-Jen Wang
- Rat Genome Database, Department of Biomedical Engineering, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Stanley J F Laulederkind
- Rat Genome Database, Department of Biomedical Engineering, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Matthew J Hoffman
- Rat Genome Database, Department of Biomedical Engineering, Medical College of Wisconsin, Milwaukee, WI 53226, USA
- Genomic Sciences and Precision Medicine Center and Department of Physiology, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Mary L Kaldunski
- Rat Genome Database, Department of Biomedical Engineering, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Monika Tutaj
- Rat Genome Database, Department of Biomedical Engineering, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Jyothi Thota
- Rat Genome Database, Department of Biomedical Engineering, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Harika S Nalabolu
- Rat Genome Database, Department of Biomedical Engineering, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Santoshi L R Ellanki
- Rat Genome Database, Department of Biomedical Engineering, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Marek A Tutaj
- Rat Genome Database, Department of Biomedical Engineering, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Jeffrey L De Pons
- Rat Genome Database, Department of Biomedical Engineering, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Anne E Kwitek
- Genomic Sciences and Precision Medicine Center and Department of Physiology, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Melinda R Dwinell
- Genomic Sciences and Precision Medicine Center and Department of Physiology, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Mary E Shimoyama
- Rat Genome Database, Department of Biomedical Engineering, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| |
Collapse
|
4
|
Abstract
Resources for rat researchers are extensive, including strain repositories and databases all around the world. The Rat Genome Database (RGD) serves as the primary rat data repository, providing both manual and computationally collected data from other databases.
Collapse
|
5
|
Abstract
The laboratory rat, Rattus norvegicus, has been used in biomedical research for more than 150 years, and in many cases remains the model of choice for studies of physiology, behavior, and complex human disease. This book provides detailed information on a number of methodologies that can be used in rat. This chapter gives an introduction to rat as a species and as a biomedical model, providing historical information, a brief introduction to the current state of rat research, and a perspective on the future of rat as a model for human disease.
Collapse
Affiliation(s)
- Jennifer R Smith
- Department of Biomedical Engineering, Rat Genome Database, Medical College of Wisconsin, Milwaukee, WI, USA.
| | - Elizabeth R Bolton
- Department of Biomedical Engineering, Rat Genome Database, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Melinda R Dwinell
- Genomic Sciences and Precision Medicine Center, Medical College of Wisconsin, Milwaukee, WI, USA
- Department of Physiology, Rat Genome Database, Medical College of Wisconsin, Milwaukee, WI, USA
| |
Collapse
|
6
|
Wang SJ, Laulederkind SJF, Zhao Y, Hayman GT, Smith JR, Tutaj M, Thota J, Tutaj MA, Hoffman MJ, Bolton ER, De Pons J, Dwinell MR, Shimoyama M. Integrated curation and data mining for disease and phenotype models at the Rat Genome Database. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2019; 2019:5310053. [PMID: 30753478 PMCID: PMC6369425 DOI: 10.1093/database/baz014] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/19/2018] [Accepted: 01/18/2019] [Indexed: 11/20/2022]
Abstract
Rats have been used as research models in biomedical research for over 150 years. These disease models arise from naturally occurring mutations, selective breeding and, more recently, genome manipulation. Through the innovation of genome-editing technologies, genome-modified rats provide precision models of disease by disrupting or complementing targeted genes. To facilitate the use of these data produced from rat disease models, the Rat Genome Database (RGD) organizes rat strains and annotates these strains with disease and qualitative phenotype terms as well as quantitative phenotype measurements. From the curated quantitative data, the expected phenotype profile ranges were established through a meta-analysis pipeline using inbred rat strains in control conditions. The disease and qualitative phenotype annotations are propagated to their associated genes and alleles if applicable. Currently, RGD has curated nearly 1300 rat strains with disease/phenotype annotations and about 11% of them have known allele associations. All of the annotations (disease and phenotype) are integrated and displayed on the strain, gene and allele report pages. Finding disease and phenotype models at RGD can be done by searching for terms in the ontology browser, browsing the disease or phenotype ontology branches or entering keywords in the general search. Use cases are provided to show different targeted searches of rat strains at RGD.
Collapse
Affiliation(s)
- Shur-Jen Wang
- Department of Biomedical Engineering, Marquette University and Medical College of Wisconsin, Milwaukee, WI, USA
| | - Stanley J F Laulederkind
- Department of Biomedical Engineering, Marquette University and Medical College of Wisconsin, Milwaukee, WI, USA
| | - Yiqing Zhao
- Department of Biomedical Engineering, Marquette University and Medical College of Wisconsin, Milwaukee, WI, USA
| | - G Thomas Hayman
- Department of Biomedical Engineering, Marquette University and Medical College of Wisconsin, Milwaukee, WI, USA
| | - Jennifer R Smith
- Department of Biomedical Engineering, Marquette University and Medical College of Wisconsin, Milwaukee, WI, USA
| | - Monika Tutaj
- Department of Biomedical Engineering, Marquette University and Medical College of Wisconsin, Milwaukee, WI, USA
| | - Jyothi Thota
- Department of Biomedical Engineering, Marquette University and Medical College of Wisconsin, Milwaukee, WI, USA
| | - Marek A Tutaj
- Department of Biomedical Engineering, Marquette University and Medical College of Wisconsin, Milwaukee, WI, USA
| | - Matthew J Hoffman
- Department of Physiology, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Elizabeth R Bolton
- Department of Biomedical Engineering, Marquette University and Medical College of Wisconsin, Milwaukee, WI, USA
| | - Jeffrey De Pons
- Department of Biomedical Engineering, Marquette University and Medical College of Wisconsin, Milwaukee, WI, USA
| | - Melinda R Dwinell
- Department of Physiology, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Mary Shimoyama
- Department of Biomedical Engineering, Marquette University and Medical College of Wisconsin, Milwaukee, WI, USA
| |
Collapse
|
7
|
Davis AP, Wiegers TC, Wiegers J, Johnson RJ, Sciaky D, Grondin CJ, Mattingly CJ. Chemical-Induced Phenotypes at CTD Help Inform the Predisease State and Construct Adverse Outcome Pathways. Toxicol Sci 2018; 165:145-156. [PMID: 29846728 PMCID: PMC6111787 DOI: 10.1093/toxsci/kfy131] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
The Comparative Toxicogenomics Database (CTD; http://ctdbase.org) is a public resource that manually curates the scientific literature to provide content that illuminates the molecular mechanisms by which environmental exposures affect human health. We introduce our new chemical-phenotype module that describes how chemicals can affect molecular, cellular, and physiological phenotypes. At CTD, we operationally distinguish between phenotypes and diseases, wherein a phenotype refers to a nondisease biological event: eg, decreased cell cycle arrest (phenotype) versus liver cancer (disease), increased fat cell proliferation (phenotype) versus morbid obesity (disease), etc. Chemical-phenotype interactions are expressed in a formal structured notation using controlled terms for chemicals, phenotypes, taxon, and anatomical descriptors. Combining this information with CTD's chemical-disease module allows inferences to be made between phenotypes and diseases, yielding potential insight into the predisease state. Integration of all 4 CTD modules furnishes unique opportunities for toxicologists to generate computationally predictive adverse outcome pathways, linking chemical-gene molecular initiating events with phenotypic key events, adverse diseases, and population-level health outcomes. As examples, we present 3 diverse case studies discerning the effect of vehicle emissions on altered leukocyte migration, the role of cadmium in influencing phenotypes preceding Alzheimer disease, and the connection of arsenic-induced glucose metabolic phenotypes with diabetes. To date, CTD contains over 165 000 interactions that connect more than 6400 chemicals to 3900 phenotypes for 760 anatomical terms in 215 species, from over 19 000 scientific articles. To our knowledge, this is the first comprehensive set of manually curated, literature-based, contextualized, chemical-induced, nondisease phenotype data provided to the public.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Carolyn J Mattingly
- Department of Biological Sciences
- Center for Human Health and the Environment, North Carolina State University, Raleigh, North Carolina 27695
| |
Collapse
|
8
|
Shimoyama M, Smith JR, Bryda E, Kuramoto T, Saba L, Dwinell M. Rat Genome and Model Resources. ILAR J 2017; 58:42-58. [PMID: 28838068 PMCID: PMC6057551 DOI: 10.1093/ilar/ilw041] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2016] [Indexed: 11/25/2022] Open
Abstract
Rats remain a major model for studying disease mechanisms and discovery, validation, and testing of new compounds to improve human health. The rat’s value continues to grow as indicated by the more than 1.4 million publications (second to human) at PubMed documenting important discoveries using this model. Advanced sequencing technologies, genome modification techniques, and the development of embryonic stem cell protocols ensure the rat remains an important mammalian model for disease studies. The 2004 release of the reference genome has been followed by the production of complete genomes for more than two dozen individual strains utilizing NextGen sequencing technologies; their analyses have identified over 80 million variants. This explosion in genomic data has been accompanied by the ability to selectively edit the rat genome, leading to hundreds of new strains through multiple technologies. A number of resources have been developed to provide investigators with access to precision rat models, comprehensive datasets, and sophisticated software tools necessary for their research. Those profiled here include the Rat Genome Database, PhenoGen, Gene Editing Rat Resource Center, Rat Resource and Research Center, and the National BioResource Project for the Rat in Japan.
Collapse
Affiliation(s)
- Mary Shimoyama
- Department of Biomedical Engineering, Marquette University and the Medical College of Wisconsin, Milwaukee, Wisconsin. Rat Genome Database, Department of Biomedical Engineering at Marquette University and the Medical College of Wisconsin, Milwaukee, Wisconsin. Department of Veterinary Pathobiology, College of Veterinary Medicine, University of Missouri, Columbia, Missouri. Institute of Laboratory Animals, Graduate School of Medicine, Kyoto University, Kyoto, Japan. Department of Pharmaceutical Sciences, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of Colorado Anschutz Medical Campus, Aurora, Colorado. Department of Physiology, Medical College of Wisconsin, Milwaukee, Wisconsin
| | - Jennifer R Smith
- Department of Biomedical Engineering, Marquette University and the Medical College of Wisconsin, Milwaukee, Wisconsin. Rat Genome Database, Department of Biomedical Engineering at Marquette University and the Medical College of Wisconsin, Milwaukee, Wisconsin. Department of Veterinary Pathobiology, College of Veterinary Medicine, University of Missouri, Columbia, Missouri. Institute of Laboratory Animals, Graduate School of Medicine, Kyoto University, Kyoto, Japan. Department of Pharmaceutical Sciences, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of Colorado Anschutz Medical Campus, Aurora, Colorado. Department of Physiology, Medical College of Wisconsin, Milwaukee, Wisconsin
| | - Elizabeth Bryda
- Department of Biomedical Engineering, Marquette University and the Medical College of Wisconsin, Milwaukee, Wisconsin. Rat Genome Database, Department of Biomedical Engineering at Marquette University and the Medical College of Wisconsin, Milwaukee, Wisconsin. Department of Veterinary Pathobiology, College of Veterinary Medicine, University of Missouri, Columbia, Missouri. Institute of Laboratory Animals, Graduate School of Medicine, Kyoto University, Kyoto, Japan. Department of Pharmaceutical Sciences, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of Colorado Anschutz Medical Campus, Aurora, Colorado. Department of Physiology, Medical College of Wisconsin, Milwaukee, Wisconsin
| | - Takashi Kuramoto
- Department of Biomedical Engineering, Marquette University and the Medical College of Wisconsin, Milwaukee, Wisconsin. Rat Genome Database, Department of Biomedical Engineering at Marquette University and the Medical College of Wisconsin, Milwaukee, Wisconsin. Department of Veterinary Pathobiology, College of Veterinary Medicine, University of Missouri, Columbia, Missouri. Institute of Laboratory Animals, Graduate School of Medicine, Kyoto University, Kyoto, Japan. Department of Pharmaceutical Sciences, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of Colorado Anschutz Medical Campus, Aurora, Colorado. Department of Physiology, Medical College of Wisconsin, Milwaukee, Wisconsin
| | - Laura Saba
- Department of Biomedical Engineering, Marquette University and the Medical College of Wisconsin, Milwaukee, Wisconsin. Rat Genome Database, Department of Biomedical Engineering at Marquette University and the Medical College of Wisconsin, Milwaukee, Wisconsin. Department of Veterinary Pathobiology, College of Veterinary Medicine, University of Missouri, Columbia, Missouri. Institute of Laboratory Animals, Graduate School of Medicine, Kyoto University, Kyoto, Japan. Department of Pharmaceutical Sciences, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of Colorado Anschutz Medical Campus, Aurora, Colorado. Department of Physiology, Medical College of Wisconsin, Milwaukee, Wisconsin
| | - Melinda Dwinell
- Department of Biomedical Engineering, Marquette University and the Medical College of Wisconsin, Milwaukee, Wisconsin. Rat Genome Database, Department of Biomedical Engineering at Marquette University and the Medical College of Wisconsin, Milwaukee, Wisconsin. Department of Veterinary Pathobiology, College of Veterinary Medicine, University of Missouri, Columbia, Missouri. Institute of Laboratory Animals, Graduate School of Medicine, Kyoto University, Kyoto, Japan. Department of Pharmaceutical Sciences, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of Colorado Anschutz Medical Campus, Aurora, Colorado. Department of Physiology, Medical College of Wisconsin, Milwaukee, Wisconsin
| |
Collapse
|
9
|
Chen Q, Zobel J, Verspoor K. Duplicates, redundancies and inconsistencies in the primary nucleotide databases: a descriptive study. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2017; 2017:baw163. [PMID: 28077566 PMCID: PMC5225397 DOI: 10.1093/database/baw163] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/10/2016] [Revised: 11/17/2016] [Accepted: 11/21/2016] [Indexed: 01/22/2023]
Abstract
GenBank, the EMBL European Nucleotide Archive and the DNA DataBank of Japan, known collectively as the International Nucleotide Sequence Database Collaboration or INSDC, are the three most significant nucleotide sequence databases. Their records are derived from laboratory work undertaken by different individuals, by different teams, with a range of technologies and assumptions and over a period of decades. As a consequence, they contain a great many duplicates, redundancies and inconsistencies, but neither the prevalence nor the characteristics of various types of duplicates have been rigorously assessed. Existing duplicate detection methods in bioinformatics only address specific duplicate types, with inconsistent assumptions; and the impact of duplicates in bioinformatics databases has not been carefully assessed, making it difficult to judge the value of such methods. Our goal is to assess the scale, kinds and impact of duplicates in bioinformatics databases, through a retrospective analysis of merged groups in INSDC databases. Our outcomes are threefold: (1) We analyse a benchmark dataset consisting of duplicates manually identified in INSDC—a dataset of 67 888 merged groups with 111 823 duplicate pairs across 21 organisms from INSDC databases – in terms of the prevalence, types and impacts of duplicates. (2) We categorize duplicates at both sequence and annotation level, with supporting quantitative statistics, showing that different organisms have different prevalence of distinct kinds of duplicate. (3) We show that the presence of duplicates has practical impact via a simple case study on duplicates, in terms of GC content and melting temperature. We demonstrate that duplicates not only introduce redundancy, but can lead to inconsistent results for certain tasks. Our findings lead to a better understanding of the problem of duplication in biological databases. Database URL: the merged records are available at https://cloudstor.aarnet.edu.au/plus/index.php/s/Xef2fvsebBEAv9w
Collapse
Affiliation(s)
- Qingyu Chen
- Department of Computing and Information Systems, The University of Melbourne, Parkville, VIC, 3010, Australia
| | - Justin Zobel
- Department of Computing and Information Systems, The University of Melbourne, Parkville, VIC, 3010, Australia
| | - Karin Verspoor
- Department of Computing and Information Systems, The University of Melbourne, Parkville, VIC, 3010, Australia
| |
Collapse
|
10
|
Wang SJ, Laulederkind SJF, Hayman GT, Petri V, Smith JR, Tutaj M, Nigam R, Dwinell MR, Shimoyama M. Comprehensive coverage of cardiovascular disease data in the disease portals at the Rat Genome Database. Physiol Genomics 2016; 48:589-600. [PMID: 27287925 DOI: 10.1152/physiolgenomics.00046.2016] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2016] [Accepted: 06/08/2016] [Indexed: 01/18/2023] Open
Abstract
Cardiovascular diseases are complex diseases caused by a combination of genetic and environmental factors. To facilitate progress in complex disease research, the Rat Genome Database (RGD) provides the community with a disease portal where genome objects and biological data related to cardiovascular diseases are systematically organized. The purpose of this study is to present biocuration at RGD, including disease, genetic, and pathway data. The RGD curation team uses controlled vocabularies/ontologies to organize data curated from the published literature or imported from disease and pathway databases. These organized annotations are associated with genes, strains, and quantitative trait loci (QTLs), thus linking functional annotations to genome objects. Screen shots from the web pages are used to demonstrate the organization of annotations at RGD. The human cardiovascular disease genes identified by annotations were grouped according to data sources and their annotation profiles were compared by in-house tools and other enrichment tools available to the public. The analysis results show that the imported cardiovascular disease genes from ClinVar and OMIM are functionally different from the RGD manually curated genes in terms of pathway and Gene Ontology annotations. The inclusion of disease genes from other databases enriches the collection of disease genes not only in quantity but also in quality.
Collapse
Affiliation(s)
- Shur-Jen Wang
- Department of Surgery, Medical College of Wisconsin, Milwaukee, Wisconsin; and
| | | | - G Thomas Hayman
- Department of Surgery, Medical College of Wisconsin, Milwaukee, Wisconsin; and
| | - Victoria Petri
- Department of Surgery, Medical College of Wisconsin, Milwaukee, Wisconsin; and
| | - Jennifer R Smith
- Department of Surgery, Medical College of Wisconsin, Milwaukee, Wisconsin; and
| | - Marek Tutaj
- Department of Surgery, Medical College of Wisconsin, Milwaukee, Wisconsin; and
| | - Rajni Nigam
- Department of Surgery, Medical College of Wisconsin, Milwaukee, Wisconsin; and
| | - Melinda R Dwinell
- Department of Physiology, Medical College of Wisconsin, Milwaukee, Wisconsin
| | - Mary Shimoyama
- Department of Surgery, Medical College of Wisconsin, Milwaukee, Wisconsin; and
| |
Collapse
|
11
|
Petri V, Hayman GT, Tutaj M, Smith JR, Laulederkind S, Wang SJ, Nigam R, De Pons J, Shimoyama M, Dwinell MR. Disease, Models, Variants and Altered Pathways-Journeying RGD Through the Magnifying Glass. Comput Struct Biotechnol J 2015; 14:35-48. [PMID: 27602200 PMCID: PMC4700298 DOI: 10.1016/j.csbj.2015.11.006] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2015] [Revised: 10/28/2015] [Accepted: 11/20/2015] [Indexed: 12/12/2022] Open
Abstract
Understanding the pathogenesis of disease is instrumental in delineating its progression mechanisms and for envisioning ways to counteract it. In the process, animal models represent invaluable tools for identifying disease-related loci and their genetic components. Amongst them, the laboratory rat is used extensively in the study of many conditions and disorders. The Rat Genome Database (RGD—http://rgd.mcw.edu) has been established to house rat genetic, genomic and phenotypic data. Since its inception, it has continually expanded the depth and breadth of its content. Currently, in addition to rat genes, QTLs and strains, RGD houses mouse and human genes and QTLs and offers pertinent associated data, acquired through manual literature curation and imported via pipelines. A collection of controlled vocabularies and ontologies is employed for the standardized extraction and provision of biological data. The vocabularies/ontologies allow the capture of disease and phenotype associations of rat strains and QTLs, as well as disease and pathway associations of rat, human and mouse genes. A suite of tools enables the retrieval, manipulation, viewing and analysis of data. Genes associated with particular conditions or with altered networks underlying disease pathways can be retrieved. Genetic variants in humans or in sequenced rat strains can be searched and compared. Lists of rat strains and species-specific genes and QTLs can be generated for selected ontology terms and then analyzed, downloaded or sent to other tools. From many entry points, data can be accessed and results retrieved. To illustrate, diabetes is used as a case study to initiate and embark upon an exploratory journey.
Collapse
Affiliation(s)
- Victoria Petri
- Human and Molecular Genetics Center, Medical College of Wisconsin, USA
| | - G Thomas Hayman
- Human and Molecular Genetics Center, Medical College of Wisconsin, USA
| | - Marek Tutaj
- Human and Molecular Genetics Center, Medical College of Wisconsin, USA
| | - Jennifer R Smith
- Human and Molecular Genetics Center, Medical College of Wisconsin, USA
| | - Stan Laulederkind
- Human and Molecular Genetics Center, Medical College of Wisconsin, USA
| | - Shur-Jen Wang
- Human and Molecular Genetics Center, Medical College of Wisconsin, USA
| | - Rajni Nigam
- Human and Molecular Genetics Center, Medical College of Wisconsin, USA
| | - Jeff De Pons
- Human and Molecular Genetics Center, Medical College of Wisconsin, USA
| | - Mary Shimoyama
- Human and Molecular Genetics Center, Medical College of Wisconsin, USA
| | - Melinda R Dwinell
- Human and Molecular Genetics Center, Medical College of Wisconsin, USA
| |
Collapse
|
12
|
Zhang JG, Tan LJ, Xu C, He H, Tian Q, Zhou Y, Qiu C, Chen XD, Deng HW. Integrative Analysis of Transcriptomic and Epigenomic Data to Reveal Regulation Patterns for BMD Variation. PLoS One 2015; 10:e0138524. [PMID: 26390436 PMCID: PMC4577125 DOI: 10.1371/journal.pone.0138524] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2015] [Accepted: 09/01/2015] [Indexed: 01/16/2023] Open
Abstract
Integration of multiple profiling data and construction of functional gene networks may provide additional insights into the molecular mechanisms of complex diseases. Osteoporosis is a worldwide public health problem, but the complex gene-gene interactions, post-transcriptional modifications and regulation of functional networks are still unclear. To gain a comprehensive understanding of osteoporosis etiology, transcriptome gene expression microarray, epigenomic miRNA microarray and methylome sequencing were performed simultaneously in 5 high hip BMD (Bone Mineral Density) subjects and 5 low hip BMD subjects. SPIA (Signaling Pathway Impact Analysis) and PCST (Prize Collecting Steiner Tree) algorithm were used to perform pathway-enrichment analysis and construct the interaction networks. Through integrating the transcriptomic and epigenomic data, firstly we identified 3 genes (FAM50A, ZNF473 and TMEM55B) and one miRNA (hsa-mir-4291) which showed the consistent association evidence from both gene expression and methylation data; secondly in network analysis we identified an interaction network module with 12 genes and 11 miRNAs including AKT1, STAT3, STAT5A, FLT3, hsa-mir-141 and hsa-mir-34a which have been associated with BMD in previous studies. This module revealed the crosstalk among miRNAs, mRNAs and DNA methylation and showed four potential regulatory patterns of gene expression to influence the BMD status. In conclusion, the integration of multiple layers of omics can yield in-depth results than analysis of individual omics data respectively. Integrative analysis from transcriptomics and epigenomic data improves our ability to identify causal genetic factors, and more importantly uncover functional regulation pattern of multi-omics for osteoporosis etiology.
Collapse
Affiliation(s)
- Ji-Gang Zhang
- Center of Genomics and Bioinformatics, Tulane University, New Orleans, Louisiana, 70112, United States of America
- Department of Biostatistics and Bioinformatics, Tulane University, New Orleans, Louisiana, 70112, United States of America
| | - Li-Jun Tan
- Laboratory of Molecular and Statistical Genetics, Hunan Normal University, Changsha, Hunan, 410081, China
- * E-mail: (HWD); (LJT)
| | - Chao Xu
- Center of Genomics and Bioinformatics, Tulane University, New Orleans, Louisiana, 70112, United States of America
- Department of Biostatistics and Bioinformatics, Tulane University, New Orleans, Louisiana, 70112, United States of America
| | - Hao He
- Center of Genomics and Bioinformatics, Tulane University, New Orleans, Louisiana, 70112, United States of America
- Department of Biostatistics and Bioinformatics, Tulane University, New Orleans, Louisiana, 70112, United States of America
| | - Qing Tian
- Center of Genomics and Bioinformatics, Tulane University, New Orleans, Louisiana, 70112, United States of America
- Department of Biostatistics and Bioinformatics, Tulane University, New Orleans, Louisiana, 70112, United States of America
| | - Yu Zhou
- Center of Genomics and Bioinformatics, Tulane University, New Orleans, Louisiana, 70112, United States of America
- Department of Biostatistics and Bioinformatics, Tulane University, New Orleans, Louisiana, 70112, United States of America
| | - Chuan Qiu
- Center of Genomics and Bioinformatics, Tulane University, New Orleans, Louisiana, 70112, United States of America
- Department of Biostatistics and Bioinformatics, Tulane University, New Orleans, Louisiana, 70112, United States of America
| | - Xiang-Ding Chen
- Laboratory of Molecular and Statistical Genetics, Hunan Normal University, Changsha, Hunan, 410081, China
| | - Hong-Wen Deng
- Center of Genomics and Bioinformatics, Tulane University, New Orleans, Louisiana, 70112, United States of America
- Department of Biostatistics and Bioinformatics, Tulane University, New Orleans, Louisiana, 70112, United States of America
- Laboratory of Molecular and Statistical Genetics, Hunan Normal University, Changsha, Hunan, 410081, China
- * E-mail: (HWD); (LJT)
| |
Collapse
|
13
|
Flister MJ, Prokop JW, Lazar J, Shimoyama M, Dwinell M, Geurts A. 2015 Guidelines for Establishing Genetically Modified Rat Models for Cardiovascular Research. J Cardiovasc Transl Res 2015; 8:269-77. [PMID: 25920443 PMCID: PMC4475456 DOI: 10.1007/s12265-015-9626-4] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/11/2015] [Accepted: 04/15/2015] [Indexed: 12/24/2022]
Abstract
The rat has long been a key physiological model for cardiovascular research, most of the inbred strains having been previously selected for susceptibility or resistance to various cardiovascular diseases (CVD). These CVD rat models offer a physiologically relevant background on which candidates of human CVD can be tested in a more clinically translatable experimental setting. However, a diverse toolbox for genetically modifying the rat genome to test molecular mechanisms has only recently become available. Here, we provide a high-level description of several strategies for developing genetically modified rat models of CVD.
Collapse
Affiliation(s)
- Michael J Flister
- Human and Molecular Genetics Center, Medical College of Wisconsin, 8701 Watertown Plank Road, Milwaukee, 53226, WI, USA,
| | | | | | | | | | | |
Collapse
|
14
|
Wang SJ, Laulederkind SJF, Hayman GT, Petri V, Liu W, Smith JR, Nigam R, Dwinell MR, Shimoyama M. PhenoMiner: a quantitative phenotype database for the laboratory rat, Rattus norvegicus. Application in hypertension and renal disease. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2015; 2015:bau128. [PMID: 25632109 PMCID: PMC4309021 DOI: 10.1093/database/bau128] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Rats have been used extensively as animal models to study physiological and pathological processes involved in human diseases. Numerous rat strains have been selectively bred for certain biological traits related to specific medical interests. Recently, the Rat Genome Database (http://rgd.mcw.edu) has initiated the PhenoMiner project to integrate quantitative phenotype data from the PhysGen Program for Genomic Applications and the National BioResource Project in Japan as well as manual annotations from biomedical literature. PhenoMiner, the search engine for these integrated phenotype data, facilitates mining of data sets across studies by searching the database with a combination of terms from four different ontologies/vocabularies (Rat Strain Ontology, Clinical Measurement Ontology, Measurement Method Ontology and Experimental Condition Ontology). In this study, salt-induced hypertension was used as a model to retrieve blood pressure records of Brown Norway, Fawn-Hooded Hypertensive (FHH) and Dahl salt-sensitive (SS) rat strains. The records from these three strains served as a basis for comparing records from consomic/congenic/mutant offspring derived from them. We examined the cardiovascular and renal phenotypes of consomics derived from FHH and SS, and of SS congenics and mutants. The availability of quantitative records across laboratories in one database, such as these provided by PhenoMiner, can empower researchers to make the best use of publicly available data. Database URL:http://rgd.mcw.edu
Collapse
Affiliation(s)
- Shur-Jen Wang
- Human and Molecular Genetics Center, Medical College of Wisconsin, Department of Physiology, Medical College of Wisconsin and Department of Surgery, Medical College of Wisconsin, 8701 Watertown Plank Road, Milwaukee, WI53226, USA
| | - Stanley J F Laulederkind
- Human and Molecular Genetics Center, Medical College of Wisconsin, Department of Physiology, Medical College of Wisconsin and Department of Surgery, Medical College of Wisconsin, 8701 Watertown Plank Road, Milwaukee, WI53226, USA
| | - G Thomas Hayman
- Human and Molecular Genetics Center, Medical College of Wisconsin, Department of Physiology, Medical College of Wisconsin and Department of Surgery, Medical College of Wisconsin, 8701 Watertown Plank Road, Milwaukee, WI53226, USA
| | - Victoria Petri
- Human and Molecular Genetics Center, Medical College of Wisconsin, Department of Physiology, Medical College of Wisconsin and Department of Surgery, Medical College of Wisconsin, 8701 Watertown Plank Road, Milwaukee, WI53226, USA
| | - Weisong Liu
- Human and Molecular Genetics Center, Medical College of Wisconsin, Department of Physiology, Medical College of Wisconsin and Department of Surgery, Medical College of Wisconsin, 8701 Watertown Plank Road, Milwaukee, WI53226, USA
| | - Jennifer R Smith
- Human and Molecular Genetics Center, Medical College of Wisconsin, Department of Physiology, Medical College of Wisconsin and Department of Surgery, Medical College of Wisconsin, 8701 Watertown Plank Road, Milwaukee, WI53226, USA
| | - Rajni Nigam
- Human and Molecular Genetics Center, Medical College of Wisconsin, Department of Physiology, Medical College of Wisconsin and Department of Surgery, Medical College of Wisconsin, 8701 Watertown Plank Road, Milwaukee, WI53226, USA
| | - Melinda R Dwinell
- Human and Molecular Genetics Center, Medical College of Wisconsin, Department of Physiology, Medical College of Wisconsin and Department of Surgery, Medical College of Wisconsin, 8701 Watertown Plank Road, Milwaukee, WI53226, USA
| | - Mary Shimoyama
- Human and Molecular Genetics Center, Medical College of Wisconsin, Department of Physiology, Medical College of Wisconsin and Department of Surgery, Medical College of Wisconsin, 8701 Watertown Plank Road, Milwaukee, WI53226, USA Human and Molecular Genetics Center, Medical College of Wisconsin, Department of Physiology, Medical College of Wisconsin and Department of Surgery, Medical College of Wisconsin, 8701 Watertown Plank Road, Milwaukee, WI53226, USA
| |
Collapse
|
15
|
Liu W, Laulederkind SJF, Hayman GT, Wang SJ, Nigam R, Smith JR, De Pons J, Dwinell MR, Shimoyama M. OntoMate: a text-mining tool aiding curation at the Rat Genome Database. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2015; 2015:bau129. [PMID: 25619558 PMCID: PMC4305386 DOI: 10.1093/database/bau129] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Abstract
The Rat Genome Database (RGD) is the premier repository of rat genomic, genetic and physiologic data. Converting data from free text in the scientific literature to a structured format is one of the main tasks of all model organism databases. RGD spends considerable effort manually curating gene, Quantitative Trait Locus (QTL) and strain information. The rapidly growing volume of biomedical literature and the active research in the biological natural language processing (bioNLP) community have given RGD the impetus to adopt text-mining tools to improve curation efficiency. Recently, RGD has initiated a project to use OntoMate, an ontology-driven, concept-based literature search engine developed at RGD, as a replacement for the PubMed (http://www.ncbi.nlm.nih.gov/pubmed) search engine in the gene curation workflow. OntoMate tags abstracts with gene names, gene mutations, organism name and most of the 16 ontologies/vocabularies used at RGD. All terms/ entities tagged to an abstract are listed with the abstract in the search results. All listed terms are linked both to data entry boxes and a term browser in the curation tool. OntoMate also provides user-activated filters for species, date and other parameters relevant to the literature search. Using the system for literature search and import has streamlined the process compared to using PubMed. The system was built with a scalable and open architecture, including features specifically designed to accelerate the RGD gene curation process. With the use of bioNLP tools, RGD has added more automation to its curation workflow. Database URL:http://rgd.mcw.edu
Collapse
Affiliation(s)
- Weisong Liu
- Human and Molecular Genetics Center, Medical College of Wisconsin, Department of Quantitative Health Sciences, University of Massachusetts Medical School, Department of Physiology, Medical College of Wisconsin and Department of Surgery, Medical College of Wisconsin, 8701 Watertown Plank Rd, Milwaukee, WI 53226-3548, USA Human and Molecular Genetics Center, Medical College of Wisconsin, Department of Quantitative Health Sciences, University of Massachusetts Medical School, Department of Physiology, Medical College of Wisconsin and Department of Surgery, Medical College of Wisconsin, 8701 Watertown Plank Rd, Milwaukee, WI 53226-3548, USA
| | - Stanley J F Laulederkind
- Human and Molecular Genetics Center, Medical College of Wisconsin, Department of Quantitative Health Sciences, University of Massachusetts Medical School, Department of Physiology, Medical College of Wisconsin and Department of Surgery, Medical College of Wisconsin, 8701 Watertown Plank Rd, Milwaukee, WI 53226-3548, USA
| | - G Thomas Hayman
- Human and Molecular Genetics Center, Medical College of Wisconsin, Department of Quantitative Health Sciences, University of Massachusetts Medical School, Department of Physiology, Medical College of Wisconsin and Department of Surgery, Medical College of Wisconsin, 8701 Watertown Plank Rd, Milwaukee, WI 53226-3548, USA
| | - Shur-Jen Wang
- Human and Molecular Genetics Center, Medical College of Wisconsin, Department of Quantitative Health Sciences, University of Massachusetts Medical School, Department of Physiology, Medical College of Wisconsin and Department of Surgery, Medical College of Wisconsin, 8701 Watertown Plank Rd, Milwaukee, WI 53226-3548, USA
| | - Rajni Nigam
- Human and Molecular Genetics Center, Medical College of Wisconsin, Department of Quantitative Health Sciences, University of Massachusetts Medical School, Department of Physiology, Medical College of Wisconsin and Department of Surgery, Medical College of Wisconsin, 8701 Watertown Plank Rd, Milwaukee, WI 53226-3548, USA
| | - Jennifer R Smith
- Human and Molecular Genetics Center, Medical College of Wisconsin, Department of Quantitative Health Sciences, University of Massachusetts Medical School, Department of Physiology, Medical College of Wisconsin and Department of Surgery, Medical College of Wisconsin, 8701 Watertown Plank Rd, Milwaukee, WI 53226-3548, USA
| | - Jeff De Pons
- Human and Molecular Genetics Center, Medical College of Wisconsin, Department of Quantitative Health Sciences, University of Massachusetts Medical School, Department of Physiology, Medical College of Wisconsin and Department of Surgery, Medical College of Wisconsin, 8701 Watertown Plank Rd, Milwaukee, WI 53226-3548, USA
| | - Melinda R Dwinell
- Human and Molecular Genetics Center, Medical College of Wisconsin, Department of Quantitative Health Sciences, University of Massachusetts Medical School, Department of Physiology, Medical College of Wisconsin and Department of Surgery, Medical College of Wisconsin, 8701 Watertown Plank Rd, Milwaukee, WI 53226-3548, USA Human and Molecular Genetics Center, Medical College of Wisconsin, Department of Quantitative Health Sciences, University of Massachusetts Medical School, Department of Physiology, Medical College of Wisconsin and Department of Surgery, Medical College of Wisconsin, 8701 Watertown Plank Rd, Milwaukee, WI 53226-3548, USA
| | - Mary Shimoyama
- Human and Molecular Genetics Center, Medical College of Wisconsin, Department of Quantitative Health Sciences, University of Massachusetts Medical School, Department of Physiology, Medical College of Wisconsin and Department of Surgery, Medical College of Wisconsin, 8701 Watertown Plank Rd, Milwaukee, WI 53226-3548, USA Human and Molecular Genetics Center, Medical College of Wisconsin, Department of Quantitative Health Sciences, University of Massachusetts Medical School, Department of Physiology, Medical College of Wisconsin and Department of Surgery, Medical College of Wisconsin, 8701 Watertown Plank Rd, Milwaukee, WI 53226-3548, USA
| |
Collapse
|
16
|
Shimoyama M, De Pons J, Hayman GT, Laulederkind SJF, Liu W, Nigam R, Petri V, Smith JR, Tutaj M, Wang SJ, Worthey E, Dwinell M, Jacob H. The Rat Genome Database 2015: genomic, phenotypic and environmental variations and disease. Nucleic Acids Res 2014; 43:D743-50. [PMID: 25355511 PMCID: PMC4383884 DOI: 10.1093/nar/gku1026] [Citation(s) in RCA: 167] [Impact Index Per Article: 16.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open
Abstract
The Rat Genome Database (RGD, http://rgd.mcw.edu) provides the most comprehensive data repository and informatics platform related to the laboratory rat, one of the most important model organisms for disease studies. RGD maintains and updates datasets for genomic elements such as genes, transcripts and increasingly in recent years, sequence variations, as well as map positions for multiple assemblies and sequence information. Functional annotations for genomic elements are curated from published literature, submitted by researchers and integrated from other public resources. Complementing the genomic data catalogs are those associated with phenotypes and disease, including strains, QTL and experimental phenotype measurements across hundreds of strains. Data are submitted by researchers, acquired through bulk data pipelines or curated from published literature. Innovative software tools provide users with an integrated platform to query, mine, display and analyze valuable genomic and phenomic datasets for discovery and enhancement of their own research. This update highlights recent developments that reflect an increasing focus on: (i) genomic variation, (ii) phenotypes and diseases, (iii) data related to the environment and experimental conditions and (iv) datasets and software tools that allow the user to explore and analyze the interactions among these and their impact on disease.
Collapse
Affiliation(s)
- Mary Shimoyama
- Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, WI 53226, USA Department of Surgery, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Jeff De Pons
- Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - G Thomas Hayman
- Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | | | - Weisong Liu
- Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Rajni Nigam
- Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Victoria Petri
- Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Jennifer R Smith
- Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Marek Tutaj
- Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Shur-Jen Wang
- Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Elizabeth Worthey
- Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, WI 53226, USA Department of Pediatrics, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Melinda Dwinell
- Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, WI 53226, USA Department of Physiology, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Howard Jacob
- Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, WI 53226, USA Department of Physiology, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| |
Collapse
|
17
|
Abstract
The use of model organisms as tools for the investigation of human genetic variation has significantly and rapidly advanced our understanding of the aetiologies underlying hereditary traits. However, while equivalences in the DNA sequence of two species may be readily inferred through evolutionary models, the identification of equivalence in the phenotypic consequences resulting from comparable genetic variation is far from straightforward, limiting the value of the modelling paradigm. In this review, we provide an overview of the emerging statistical and computational approaches to objectively identify phenotypic equivalence between human and model organisms with examples from the vertebrate models, mouse and zebrafish. Firstly, we discuss enrichment approaches, which deem the most frequent phenotype among the orthologues of a set of genes associated with a common human phenotype as the orthologous phenotype, or phenolog, in the model species. Secondly, we introduce and discuss computational reasoning approaches to identify phenotypic equivalences made possible through the development of intra- and interspecies ontologies. Finally, we consider the particular challenges involved in modelling neuropsychiatric disorders, which illustrate many of the remaining difficulties in developing comprehensive and unequivocal interspecies phenotype mappings.
Collapse
Affiliation(s)
- Peter N. Robinson
- Institute for Medical Genetics and Human Genetics, Charité-Universitätsmedizin Berlin, Berlin, Germany
- Berlin Brandenburg Center for Regenerative Therapies (BCRT), Charité-Universitätsmedizin Berlin, Berlin, Germany
- Max Planck Institute for Molecular Genetics, Berlin, Germany
- Institute for Bioinformatics, Department of Mathematics and Computer Science, Freie Universität Berlin, Berlin, Germany
- * E-mail: (PNR); (CW)
| | - Caleb Webber
- MRC Functional Genomics Unit, Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford, United Kingdom
- * E-mail: (PNR); (CW)
| |
Collapse
|
18
|
Nigam R, Munzenmaier DH, Worthey EA, Dwinell MR, Shimoyama M, Jacob HJ. Rat Strain Ontology: structured controlled vocabulary designed to facilitate access to strain data at RGD. J Biomed Semantics 2013; 4:36. [PMID: 24267899 PMCID: PMC4177145 DOI: 10.1186/2041-1480-4-36] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2013] [Accepted: 10/02/2013] [Indexed: 11/10/2022] Open
Abstract
Background The Rat Genome Database (RGD) (
http://rgd.mcw.edu/) is the premier site for comprehensive data on the different strains of the laboratory rat (Rattus norvegicus). The strain data are collected from various publications, direct submissions from individual researchers, and rat providers worldwide. Rat strain, substrain designation and nomenclature follow the Guidelines for Nomenclature of Mouse and Rat Strains, instituted by the International Committee on Standardized Genetic Nomenclature for Mice. While symbols and names aid in identifying strains correctly, the flat nature of this information prohibits easy search and retrieval, as well as other data mining functions. In order to improve these functionalities, particularly in ontology-based tools, the Rat Strain Ontology (RS) was developed. Results The Rat Strain Ontology (RS) reflects the breeding history, parental background, and genetic manipulation of rat strains. This controlled vocabulary organizes strains by type: inbred, outbred, chromosome altered, congenic, mutant and so on. In addition, under the chromosome altered category, strains are organized by chromosome, and further by type of manipulations, such as mutant or congenic. This allows users to easily retrieve strains of interest with modifications in specific genomic regions. The ontology was developed using the Open Biological and Biomedical Ontology (OBO) file format, and is organized on the Directed Acyclic Graph (DAG) structure. Rat Strain Ontology IDs are included as part of the strain report (RS: ######). Conclusions As rat researchers are often unaware of the number of substrains or altered strains within a breeding line, this vocabulary now provides an easy way to retrieve all substrains and accompanying information. Its usefulness is particularly evident in tools such as the PhenoMiner at RGD, where users can now easily retrieve phenotype measurement data for related strains, strains with similar backgrounds or those with similar introgressed regions. This controlled vocabulary also allows better retrieval and filtering for QTLs and in genomic tools such as the GViewer. The Rat Strain Ontology has been incorporated into the RGD Ontology Browser (
http://rgd.mcw.edu/rgdweb/ontology/view.html?acc_id=RS:0000457#s) and is available through the National Center for Biomedical Ontology (
http://bioportal.bioontology.org/ontologies/1150) or the RGD ftp site (
ftp://rgd.mcw.edu/pub/ontology/rat_strain/).
Collapse
Affiliation(s)
- Rajni Nigam
- Human and Molecular Genetics Center, Medical College of Wisconsin, 8701 Watertown Plank Rd, Milwaukee 53226-3548, WI, USA.
| | | | | | | | | | | |
Collapse
|
19
|
Smith JR, Park CA, Nigam R, Laulederkind SJF, Hayman GT, Wang SJ, Lowry TF, Petri V, Pons JD, Tutaj M, Liu W, Worthey EA, Shimoyama M, Dwinell MR. The clinical measurement, measurement method and experimental condition ontologies: expansion, improvements and new applications. J Biomed Semantics 2013; 4:26. [PMID: 24103152 PMCID: PMC3882879 DOI: 10.1186/2041-1480-4-26] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2013] [Accepted: 10/01/2013] [Indexed: 11/17/2022] Open
Abstract
BACKGROUND The Clinical Measurement Ontology (CMO), Measurement Method Ontology (MMO), and Experimental Condition Ontology (XCO) were originally developed at the Rat Genome Database (RGD) to standardize quantitative rat phenotype data in order to integrate results from multiple studies into the PhenoMiner database and data mining tool. These ontologies provide the framework for presenting what was measured, how it was measured, and under what conditions it was measured. RESULTS There has been a continuing expansion of subdomains in each ontology with a parallel 2-3 fold increase in the total number of terms, substantially increasing the size and improving the scope of the ontologies. The proportion of terms with textual definitions has increased from ~60% to over 80% with greater synchronization of format and content throughout the three ontologies. Representation of definition source Uniform Resource Identifiers (URI) has been standardized, including the removal of all non-URI characters, and systematic versioning of all ontology files has been implemented. The continued expansion and success of these ontologies has facilitated the integration of more than 60,000 records into the RGD PhenoMiner database. In addition, new applications of these ontologies, such as annotation of Quantitative Trait Loci (QTL), have been added at the sites actively using them, including RGD and the Animal QTL Database. CONCLUSIONS The improvements to these three ontologies have been substantial, and development is ongoing. New terms and expansions to the ontologies continue to be added as a result of active curation efforts at RGD and the Animal QTL database. Use of these vocabularies to standardize data representation for quantitative phenotypes and quantitative trait loci across databases for multiple species has demonstrated their utility for integrating diverse data types from multiple sources. These ontologies are freely available for download and use from the NCBO BioPortal website at http://bioportal.bioontology.org/ontologies/1583 (CMO), http://bioportal.bioontology.org/ontologies/1584 (MMO), and http://bioportal.bioontology.org/ontologies/1585 (XCO), or from the RGD ftp site at ftp://rgd.mcw.edu/pub/ontology/.
Collapse
Affiliation(s)
- Jennifer R Smith
- Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Carissa A Park
- Department of Animal Science, Iowa State University, Ames, IA, USA
| | - Rajni Nigam
- Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, WI, USA
| | | | - G Thomas Hayman
- Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Shur-Jen Wang
- Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Timothy F Lowry
- Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Victoria Petri
- Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Jeff De Pons
- Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Marek Tutaj
- Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Weisong Liu
- Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Elizabeth A Worthey
- Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, WI, USA
- Department of Pediatrics, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Mary Shimoyama
- Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, WI, USA
- Department of Surgery, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Melinda R Dwinell
- Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, WI, USA
- Department of Physiology, Medical College of Wisconsin, Milwaukee, WI, USA
| |
Collapse
|
20
|
Vasilevsky NA, Brush MH, Paddock H, Ponting L, Tripathy SJ, Larocca GM, Haendel MA. On the reproducibility of science: unique identification of research resources in the biomedical literature. PeerJ 2013; 1:e148. [PMID: 24032093 PMCID: PMC3771067 DOI: 10.7717/peerj.148] [Citation(s) in RCA: 148] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2013] [Accepted: 08/12/2013] [Indexed: 12/24/2022] Open
Abstract
Scientific reproducibility has been at the forefront of many news stories and there exist numerous initiatives to help address this problem. We posit that a contributor is simply a lack of specificity that is required to enable adequate research reproducibility. In particular, the inability to uniquely identify research resources, such as antibodies and model organisms, makes it difficult or impossible to reproduce experiments even where the science is otherwise sound. In order to better understand the magnitude of this problem, we designed an experiment to ascertain the “identifiability” of research resources in the biomedical literature. We evaluated recent journal articles in the fields of Neuroscience, Developmental Biology, Immunology, Cell and Molecular Biology and General Biology, selected randomly based on a diversity of impact factors for the journals, publishers, and experimental method reporting guidelines. We attempted to uniquely identify model organisms (mouse, rat, zebrafish, worm, fly and yeast), antibodies, knockdown reagents (morpholinos or RNAi), constructs, and cell lines. Specific criteria were developed to determine if a resource was uniquely identifiable, and included examining relevant repositories (such as model organism databases, and the Antibody Registry), as well as vendor sites. The results of this experiment show that 54% of resources are not uniquely identifiable in publications, regardless of domain, journal impact factor, or reporting requirements. For example, in many cases the organism strain in which the experiment was performed or antibody that was used could not be identified. Our results show that identifiability is a serious problem for reproducibility. Based on these results, we provide recommendations to authors, reviewers, journal editors, vendors, and publishers. Scientific efficiency and reproducibility depend upon a research-wide improvement of this substantial problem in science today.
Collapse
Affiliation(s)
- Nicole A Vasilevsky
- Ontology Development Group, Library, Oregon Health & Science University , Portland, OR , USA
| | | | | | | | | | | | | |
Collapse
|
21
|
Nigam R, Laulederkind SJF, Hayman GT, Smith JR, Wang SJ, Lowry TF, Petri V, De Pons J, Tutaj M, Liu W, Jayaraman P, Munzenmaier DH, Worthey EA, Dwinell MR, Shimoyama M, Jacob HJ. Rat Genome Database: a unique resource for rat, human, and mouse quantitative trait locus data. Physiol Genomics 2013; 45:809-16. [PMID: 23881287 DOI: 10.1152/physiolgenomics.00065.2013] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
The rat has been widely used as a disease model in a laboratory setting, resulting in an abundance of genetic and phenotype data from a wide variety of studies. These data can be found at the Rat Genome Database (RGD, http://rgd.mcw.edu/), which provides a platform for researchers interested in linking genomic variations to phenotypes. Quantitative trait loci (QTLs) form one of the earliest and core datasets, allowing researchers to identify loci harboring genes associated with disease. These QTLs are not only important for those using the rat to identify genes and regions associated with disease, but also for cross-organism analyses of syntenic regions on the mouse and the human genomes to identify potential regions for study in these organisms. Currently, RGD has data on >1,900 rat QTLs that include details about the methods and animals used to determine the respective QTL along with the genomic positions and markers that define the region. RGD also curates human QTLs (>1,900) and houses>4,000 mouse QTLs (imported from Mouse Genome Informatics). Multiple ontologies are used to standardize traits, phenotypes, diseases, and experimental methods to facilitate queries, analyses, and cross-organism comparisons. QTLs are visualized in tools such as GBrowse and GViewer, with additional tools for analysis of gene sets within QTL regions. The QTL data at RGD provide valuable information for the study of mapped phenotypes and identification of candidate genes for disease associations.
Collapse
Affiliation(s)
- Rajni Nigam
- Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, Wisconsin
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|